From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Unit analysis decisions

Unit analysis decisions

From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Start my 1-month free trial

Unit analysis decisions

- [Narrator] I want to talk about a terribly important issue, and that is, how to structure your data set in terms of what you want your row to be. Now, I'm looking at the Census Income Data Set in it's excel version, in excel with the header row, so that we can focus on the data. So we don't have to worry about setting up the variable names, everything's all set and Census Income Data Set in its excel form is found in the originals folder. So this is set up currently, so that we have a single individual, top row, for instance, is a 17 year old that is never been married. That is the child of the household, or apparently in the household is female. Actually had a large capital gains, which is unusual if you look at this column, there's a lot of zeros there. And total income is under 50,000. Okay, so it's set up so that every row is an individual. And we're trying to determine if that individual is going to make less…

Contents