From the course: Data Science Foundations: Data Assessment for Predictive Modeling
Is the missing data worth saving?
- [Instructor] Okay, we're back in KNIME with the same data set and the same workflow, but I've added several nodes because what we're about to do is a bit elaborate. It'll be easier to walk you through it with the work done in advance. So our mission at the moment is to deal with the fact that we have several sections within our data set, and when one variable in a section is missing, it usually means a dozen variables are missing. So let's take a closer look. I've added a row filter, and I've chosen one variable out of this section about magazines and so on, MBCRAFT, and I've checked to see if that's missing. And I'm going to exclude the rows where it's missing. So if our understanding of the data is correct, I'm not just excluding the cases where MBCRAFT is missing. Those cases will be missing on that whole section of variables. So in the column filter, so that I can focus on what I'm doing, I've got that whole section and our target…
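
For readers following along outside KNIME, the row filter and column filter steps described so far can be sketched in pandas. This is only an illustration under stated assumptions, not the instructor's workflow: the toy data is made up, and every column name other than MBCRAFT (MBGARDEN, MBBOOKS, AGE, TARGET) is a hypothetical stand-in for the real section and target variables.

```python
import numpy as np
import pandas as pd

# Toy data, invented for illustration. Only MBCRAFT is named in the
# transcript; the other column names are hypothetical stand-ins.
df = pd.DataFrame({
    "MBCRAFT":  [1.0, np.nan, 0.0, np.nan, 2.0],
    "MBGARDEN": [0.0, np.nan, 1.0, np.nan, 0.0],
    "MBBOOKS":  [3.0, np.nan, 0.0, np.nan, 1.0],
    "AGE":      [60.0, 45.0, np.nan, 70.0, 52.0],
    "TARGET":   [0, 1, 0, 0, 1],
})

# The block of variables assumed to go missing together.
section = ["MBCRAFT", "MBGARDEN", "MBBOOKS"]

# Row filter: exclude the rows where MBCRAFT is missing.
kept = df[df["MBCRAFT"].notna()]

# Column filter: keep only the section and the target.
kept = kept[section + ["TARGET"]]

# Check the assumption: in the excluded rows, is the whole section missing?
excluded = df[df["MBCRAFT"].isna()]
section_all_missing = bool(excluded[section].isna().all(axis=1).all())

# After filtering on one variable, how many missing values remain in the section?
remaining_missing = int(kept[section].isna().sum().sum())

print(kept)
print("Whole section missing in excluded rows:", section_all_missing)
print("Missing values left in the section:", remaining_missing)
```

If the check on the excluded rows came back false on real data, it would suggest that the section's variables do not always go missing together, and filtering on MBCRAFT alone would not be enough.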