From the course: Data Science on Google Cloud Platform: Exploratory Data Analytics
Unlock the full course today
Join today to access over 22,500 courses taught by industry experts or purchase this course individually.
Loading data into a DataFrame - Google Cloud Tutorial
From the course: Data Science on Google Cloud Platform: Exploratory Data Analytics
Loading data into a DataFrame
- [Narrator] I want to show you how to execute a simple and complete exploratory data analysis project using Datalab. In this course, we have focused on the specifics of using GCP Technologies for data exploration. Other than that, we use standard Python capabilities, like Pandas, NumPy, SciPy, and scikit-learn for this use case. In our earlier exercises, we have already loaded the email campaign's data set into BigQuery. We will use that data set for this exercise. The exercise file for this EDA example is 06_XX_edawithdatalabexamplefile. First, we want to load data from the campaign stable into the data lab and convert it into a data frame. We select all the data in the table through Sequel. Please note that if the table is too large, you might want to filter or aggregate it beforehand, through Dataflow. We execute the query, and store the results in the sourcedf DataFrame variable. The data scheme are loaded, and the sample data in the DataFrame are also printed here. In the next…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.