From the course: Amazon Web Services: Data Analytics
Unlock the full course today
Join today to access over 22,400 courses taught by industry experts or purchase this course individually.
Understanding ETL options - Amazon Web Services (AWS) Tutorial
From the course: Amazon Web Services: Data Analytics
Understanding ETL options
- [Instructor] So the previous movie, we took a look at the new serverless ETL AWS glue service, and the landscape here is getting kind of confusing. So I wanted to pull this all together. And you might wonder, why does Amazon have so many services that support ETL? And really, ETL is such a core part of working with data in preparation for analysis. It's complex, it's costly. So Amazon is trying to move its server base solutions into the world of serverless, and that's what they've been doing with Glue. So Glue is a service that supports Serverless Spark ETL or extract transform and load jobs, and has an integrated data catalog. In addition to that, there are several other services, which are commonly used for ETL. Amazon Elastic MapReduce, which is server-based Hadoop and Spark. It's used in situations where you want more control over the configuration of the key services, such as the Spark configuration file. Also, you might want to have persistent servers in situations where you…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.