From the course: Amazon Web Services: Data Analytics


Use AWS Glue for ETL


- [Narrator] AWS Glue is a new service at the time of this recording, and one that I'm really excited about. In a nutshell, it's ETL (extract, transform, and load), that is, preparing your data for analytics, as a service. So, what does that mean? It means several services that work together to help you do common data preparation steps. On the left side of this diagram you have potential data sources, like Redshift, S3, RDS, or other databases that are running on EC2, like Cassandra or MongoDB. In the middle you have the Glue capabilities. On the right you have the analytics output: Athena for ad hoc SQL queries, Redshift Spectrum for ad hoc Redshift queries, and Amazon Elastic MapReduce (managed Hadoop and Spark) for further analysis, often using Spark. There's also Glue ETL, so further processing from Glue, with the results then being visualized with one of many tools. The one they're showing here is Amazon's own QuickSight. You could, of course, use Tableau, QlikView, or some custom tool. So…
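
To make the extract, transform, and load steps described above concrete, here is a minimal sketch in plain Python. It does not use AWS Glue or any AWS API; the sample data, the column names, and the `extract`, `transform`, and `load` function names are all assumptions made up for illustration. It only shows the general shape of a data preparation step: read raw records, clean and type them, and write them out in a format an analytics tool could query.

```python
import csv
import io
import json

# Hypothetical raw input standing in for a file from a source such as S3.
RAW_CSV = """order_id,customer,amount
1,alice,19.99
2,bob,
3,carol,5.00
"""


def extract(text):
    """Extract: parse raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount and cast column types."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append({
            "order_id": int(row["order_id"]),
            "customer": row["customer"].title(),
            "amount": float(row["amount"]),
        })
    return cleaned


def load(rows):
    """Load: serialize the cleaned rows as JSON Lines text."""
    return "\n".join(json.dumps(row) for row in rows)


if __name__ == "__main__":
    print(load(transform(extract(RAW_CSV))))
```

In a managed service the same three stages would run against real data stores rather than in-memory strings, but the division of work is the same idea.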
