From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Unlock the full course today
Join today to access over 22,500 courses taught by industry experts or purchase this course individually.
How many potential variables (columns) will I have?
From the course: Data Science Foundations: Data Assessment for Predictive Modeling
How many potential variables (columns) will I have?
- [Instructor] Okay, we're going to continue in the same dataset. Let's talk about a slightly different but related issue now, which is trying to get a sense of how many of these variables in this customer file will be potentially good input variables in a predictive model, down the line. Well, our ID field isn't going to help us at all in the modeling phase, even though it's critical now. These new variables, these features, we usually call them feature engineering, is the phrase of generating this new information from the original information, those are probably going to be terribly helpful. Gender, possibly, but look, all of this stuff really can't help us at all. The only thing we could hope to do with City is that maybe we have certain promotions going on in some cities and not others. Same kind of thing maybe with State, but look at this, people tend to to think, "Wow, I have dozens of variables. My model is…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
Reviewing basic concepts in the level of measurement3m 15s
-
(Locked)
What is dummy coding?2m 31s
-
(Locked)
Expanding our definition of level of measurement5m 44s
-
(Locked)
Taking an initial look at possible key variables2m 51s
-
(Locked)
Dealing with duplicate IDs and transactional data3m 49s
-
(Locked)
How many potential variables (columns) will I have?4m 53s
-
(Locked)
How to deal with high-order multiple nominals2m 30s
-
(Locked)
Challenge: Identifying the level of measurement1m 39s
-
(Locked)
Solution: Identifying the level of measurement3m 59s
-
-
-
-
-
-
-
-
-
-
-