Skip to content

Introduction

Data engineering processes in real life are inherently iterative, and non-linear:

Fig 1-1 from the data science at the command line book

This means that the model is never "done," and that it is likely impossible to get to any perfectly scrubbed state before data migration, for example.

That said, there will by definition be two separate workstreams involved in moving to the model described here:

  1. Data Migration
  2. Incremental update via the UTA data management application, and via the ADM API

Confidential. For internal use only.