From the course: Data Curation Foundations
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Curating study vs. production data
From the course: Data Curation Foundations
Curating study vs. production data
- [Instructor] In this video, I'm going to highlight the differences between curating study data and curating production data. The reason curation for study and production data can be so different is because the data structures are typically different. Let's start by thinking of the structure of production data. There are usually a few big main tables with dynamic data that keep getting updated. Imagine a store database. There probably is a table of transactions that keeps getting updated each time someone buys something. But there are also a bunch of little picklist tables, these are what control the choices in the dropdown list. Sometimes, there are also skinny audit-trail tables that keep track of the history of values in different fields. All these mini tables are related in a complex way. You need a diagram to keep track of all that. A romanticized version of the kind of diagram that is on the slide. Now, let's see…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
Data dictionary basics5m 36s
-
(Locked)
Curating study vs. production data6m 5s
-
(Locked)
Entity-attribute-value (EAV) structure6m 8s
-
(Locked)
Indexes5m 18s
-
(Locked)
Entity-relationship diagrams (ERDs)5m 34s
-
(Locked)
Main tables in a data dictionary5m 17s
-
(Locked)
Picklists in a data dictionary5m 6s
-
(Locked)
Leveraging picklists for crosswalks5m 36s
-
(Locked)
Advanced crosswalks5m 29s
-
-
-
-
-
-
-