From the course: Data Science Foundations: Data Mining in Python

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Groceries dataset

Groceries dataset - Python Tutorial

From the course: Data Science Foundations: Data Mining in Python

Start my 1-month free trial

Groceries dataset

- [Instructor] To begin our demonstrations of association analysis, I want to introduce you to the dataset we're going to use, which is the groceries dataset. This is a CSV file that originally comes from a package in R, called A Rules, which is for association rules. Now, what's unusual about this is it's in transactional format as opposed to a standard tabular format. Let me actually just show you what it looks like. To do this, we'll start by loading Pandas so we can open and work with the data frame, and then we'll come down and simply open the data set. I'm going to run this one command, it's going to open the groceries.csv file from the data folder, and let's take a quick look at this. What it shows us is a number of transactions, and it starts at transaction zero, and it shows us that the first thing this person purchased was citrus fruit, then semi-finished bread, then margarine, then ready soups. And then…

Contents