From the course: Data Science Foundations: Data Mining in Python
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Iliad dataset - Python Tutorial
From the course: Data Science Foundations: Data Mining in Python
Iliad dataset
- [Instructor] Text mining is just a whole different kind of data science, and consequently, we're going to be working with a different kind of data. To introduce us to this section on text mining, I want to show you the data set that we're going to be using, which is the text, "The Iliad," by Homer written forever ago. We're getting "The Iliad" from Project Gutenberg, which provides over 60,000 free eBooks. There are a lot of other options there and we'll try some for the challenge, but right now let's use the plain text version of "The Iliad" that I've already downloaded and saved for us in our data folder. All we need to do to see this is to import Pandas, and then we're going to read CSV, but we're actually reading a text file. And we're going to separate things, we're going to drop the empty lines, and we're going to remove what's called the Gutenberg ID, and we'll look at just the first 10 lines, and what you…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.