From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Creating a data prep to-do list

- [Presenter] One way to think about data understanding is that it's a way of preparing a checklist for data prep. By doing a thorough job, you know what deserves your attention and you have developed a sense of what needs to be done. There are five data prep tasks. Let's discuss how they are influenced by the work you've done during data understanding. Remember that the phases overlap and iterate. They don't come to a complete stop before you move on. And a lot of formatting has already been done during data loading, so it isn't a major issue as you transition to data prep. Data integration is a different story. It's a major data prep task, and it has such a big impact on the data that it forces you to revisit some of the data understanding tasks. Missing data, in particular, is often generated during data integration: any ID field that's found in one table but is missing from the other is going to cause problems. But don't…
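To make the point about integration concrete, here is a minimal sketch in plain Python of how joining two tables on an ID can generate missing values. The tables, field names, and the helpers `left_join` and `unmatched_ids` are hypothetical, invented for illustration; they are not from the course.

```python
# Hypothetical tables keyed by customer ID (all names and values are made up).
customers = {
    101: {"name": "Ana"},
    102: {"name": "Ben"},
    103: {"name": "Cy"},
}
orders = {
    101: {"total": 250.0},
    103: {"total": 80.0},
    104: {"total": 40.0},  # ID with no matching customer record
}


def left_join(left, right, right_fields):
    """Keep every row of `left`; fill `right_fields` with None when the ID is absent from `right`."""
    joined = {}
    for key, row in left.items():
        match = right.get(key)
        merged = dict(row)
        for field in right_fields:
            merged[field] = match[field] if match is not None else None
        joined[key] = merged
    return joined


def unmatched_ids(left, right):
    """Return the IDs found in only one of the two tables, sorted for stable output."""
    return sorted(set(left) - set(right)), sorted(set(right) - set(left))


joined = left_join(customers, orders, ["total"])
only_customers, only_orders = unmatched_ids(customers, orders)

# Rows whose "total" is missing only because of the join, not because of the source data.
missing_total = [key for key, row in joined.items() if row["total"] is None]

print(joined)
print(only_customers, only_orders, missing_total)
```

Neither source table contains a missing value, yet the joined table does: customer 102 has no order, so its `total` is empty, and order 104 is dropped because it has no customer. Listing the unmatched IDs before joining is one way to put these cases on the data prep to-do list.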
