From the course: Amazon Web Services: Data Analytics
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Use AWS Glue for ETL - Amazon Web Services (AWS) Tutorial
From the course: Amazon Web Services: Data Analytics
Use AWS Glue for ETL
- [Narrator] AWS Glue is a new service at the time of this recording, and one that I'm really excited about. In a nutshell, it's ETL, or extract, transform, and load, or prepare your data, for analytics as a service. So, what does that mean? It means several services that work together that help you to do common data preparation steps. So on the left side of this diagram you have potential data sources, like Redshift, S3, RDS, or proprietary databases that are running on EC2 like Cassandra or Mongo DB. In the middle you have the Glue capabilities. On the right you have the analytics output, Athena for ad hoc SQL queries, Redshift Spectrum for ad hoc Redshift queries, Amazon Elastic Map Reduce, managed, Hadoop, and Spark, for further analysis often using Spark. Also, Glue ETL, so further processing from Glue and then being visualized with one of many tools. The one they're showing here is Amazon's own Quicksight. You could use of course Tableau or ClickView or some custom tool. So…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
Query AWS public datasets6m 15s
-
(Locked)
Use AWS Glue for ETL10m 11s
-
(Locked)
Understanding ETL options7m 25s
-
(Locked)
Use AWS QuickSight for visualizations5m 24s
-
(Locked)
Use the AWS Marketplace for visualization tools3m 54s
-
(Locked)
Summary of tools3m 20s
-
(Locked)
Common analytics architecture patterns4m 12s
-
-