From the course: Supervised Learning Essential Training

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Steps to prepare data for modeling

Steps to prepare data for modeling - Python Tutorial

From the course: Supervised Learning Essential Training

Start my 1-month free trial

Steps to prepare data for modeling

- [Instructor] Let's prepare a dataset before feeding it into a linear regression model. Our example code will predict a student's final grades, given their behavioral attributes in prior grades. When you first start a machine learning project, you want to have a strong grasp on the skew and the patterns in your data set. In order to do this, we explore our data as part of preparing it, to know if we need to deal with missing values or manually fix column names or typos. We can easily check to see if we have no columns, as well. Let's get started. First we'll load in our important libraries, navigate to our data and since we have two CSVs, we're going to store this as student one and student two before combining them. Let's have a look at what student one has in it. This is about what we expected. We have a couple response variables and then a couple features that tell us details about each…

Contents