From the course: Data Science on Google Cloud Platform: Exploratory Data Analytics

Unlock the full course today

Join today to access over 22,500 courses taught by industry experts or purchase this course individually.

Loading data into a DataFrame

Loading data into a DataFrame - Google Cloud Tutorial

From the course: Data Science on Google Cloud Platform: Exploratory Data Analytics

Start my 1-month free trial

Loading data into a DataFrame

- [Narrator] I want to show you how to execute a simple and complete exploratory data analysis project using Datalab. In this course, we have focused on the specifics of using GCP Technologies for data exploration. Other than that, we use standard Python capabilities, like Pandas, NumPy, SciPy, and scikit-learn for this use case. In our earlier exercises, we have already loaded the email campaign's data set into BigQuery. We will use that data set for this exercise. The exercise file for this EDA example is 06_XX_edawithdatalabexamplefile. First, we want to load data from the campaign stable into the data lab and convert it into a data frame. We select all the data in the table through Sequel. Please note that if the table is too large, you might want to filter or aggregate it beforehand, through Dataflow. We execute the query, and store the results in the sourcedf DataFrame variable. The data scheme are loaded, and the sample data in the DataFrame are also printed here. In the next…

Contents