From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

How to organize your work with the four data understanding tasks

How to organize your work with the four data understanding tasks

From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Start my 1-month free trial

How to organize your work with the four data understanding tasks

- We're back in the Christie M document, looking at figure three, the task list. Now let's talk about the four data understanding tasks. These tasks and even their names will be important to us because they will help us structure the course. In fact you may notice that the chapter titles for the most part refer explicitly back to these task names. So let's talk about each of them in turn. The first task is Collect Initial Data and that's going to include getting the data into your software environment of choice. So there might be a little bit of cleaning and formatting but only in support of data loading. Also keep in mind that we have not integrated the data yet. We have to give the individual data sources a little bit of attention before we integrate them. Integration conceals some missing data problems but it creates some new ones. So you must do an initial exploration first. I love this phrase, gross or surface properties…

Contents