From the course: Data Science Foundations: Data Mining in Python

Unlock the full course today

Join today to access over 22,500 courses taught by industry experts or purchase this course individually.

Spambase dataset

Spambase dataset - Python Tutorial

From the course: Data Science Foundations: Data Mining in Python

Start my 1-month free trial

Spambase dataset

- [Instructor] To demonstrate several methods for classification in Python, we're going to be using a dataset called Spambase which contained information about spam emails and not spam emails and describes each email according to a range of things like for instance do they use all caps, exclamation points and do they use this particular word, yes or no? Things that can be used to diagnose whether something is spam or not. To do this we're going to first import some common packages including psychic learn that we'll be able to do the test train split. And then we're going to import a data set. This data set comes from the machine learning repository at the University of California, Irvine, and you can access it directly at this URL if you want more information on it but I've saved a local CSV to make life easier. And you can run this command to import that CSV file. Let's take a quick look at what we've imported. What you…

Contents