From the course: Data Science Foundations: Data Assessment for Predictive Modeling

Tips and tricks to consider during data loading

- [Instructor] Okay, I want to talk a little bit about data loading. We're going to be working a lot with the Census Income Data Set from the UCI Machine Learning Repository. This is a comma-separated file. Now, let's face it, a lot of the information you're going to be pulling in is going to come from a data table, a data warehouse, or something like that, but there's still a lot of information that you're going to get from scraping or pulling down from websites, and comma-separated files are still around. So we want to talk a little bit about the challenges of using them. Tab-delimited files, of course, would be similar, but we'll talk just about the comma-separated case. So here's the top line, and we see that there's no header. That, unfortunately, is commonplace for this kind of file. Everything is separated by commas. So there are a couple of things that I always keep my eye out for. Strange characters, like…
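
As a rough illustration of the "no header" situation described above, here is a minimal Python sketch that reads a headerless comma-separated file and attaches column names by hand. The column names, the sample rows, and the helper `load_headerless_csv` are all hypothetical stand-ins, not the actual Census Income file or the instructor's method. The sketch also assumes that fields may have a space after each comma.

```python
import csv
import io

# Hypothetical column names supplied by hand, because the file has no header row.
COLUMNS = ["age", "workclass", "education", "hours_per_week", "income"]

# Made-up sample rows for illustration only (not the real data file).
SAMPLE = (
    "39, State-gov, Bachelors, 40, <=50K\n"
    "50, Self-emp, HS-grad, 13, <=50K\n"
    "\n"
)


def load_headerless_csv(text, columns):
    """Parse comma-separated text with no header into a list of dicts."""
    # skipinitialspace=True drops the space that follows each comma.
    reader = csv.reader(io.StringIO(text), skipinitialspace=True)
    rows = []
    for line_no, fields in enumerate(reader, start=1):
        if not fields:
            continue  # blank line, nothing to load
        if len(fields) != len(columns):
            # A wrong field count often signals a stray delimiter or odd character.
            raise ValueError(
                f"line {line_no}: expected {len(columns)} fields, got {len(fields)}"
            )
        rows.append(dict(zip(columns, (f.strip() for f in fields))))
    return rows


rows = load_headerless_csv(SAMPLE, COLUMNS)
for row in rows:
    print(row)
```

Checking the field count on every line is a deliberate choice here: when a file has no header, a miscounted row is often the first visible sign that something unexpected is hiding in the data.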
