From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
How to organize your work with the four data understanding tasks
From the course: Data Science Foundations: Data Assessment for Predictive Modeling
How to organize your work with the four data understanding tasks
- We're back in the Christie M document, looking at figure three, the task list. Now let's talk about the four data understanding tasks. These tasks and even their names will be important to us because they will help us structure the course. In fact you may notice that the chapter titles for the most part refer explicitly back to these task names. So let's talk about each of them in turn. The first task is Collect Initial Data and that's going to include getting the data into your software environment of choice. So there might be a little bit of cleaning and formatting but only in support of data loading. Also keep in mind that we have not integrated the data yet. We have to give the individual data sources a little bit of attention before we integrate them. Integration conceals some missing data problems but it creates some new ones. So you must do an initial exploration first. I love this phrase, gross or surface properties…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
(Locked)
Clarifying how data understanding differs from data visualization3m 13s
-
Introducing the critical data understanding phase of CRISP-DM3m 59s
-
Data assessment in CRISP-DM alternatives: The IBM ASUM-DM and Microsoft TDSP3m 55s
-
(Locked)
Navigating the transition from business understanding to data understanding4m 6s
-
(Locked)
How to organize your work with the four data understanding tasks3m 42s
-
(Locked)
-
-
-
-
-
-
-
-
-
-
-
-