From the course: Amazon Web Services: Data Analytics

Understanding ETL options

- [Instructor] So in the previous movie, we took a look at the new serverless ETL service, AWS Glue, and the landscape here is getting kind of confusing, so I wanted to pull this all together. You might wonder, why does Amazon have so many services that support ETL? Really, ETL is such a core part of working with data in preparation for analysis, and it's complex and costly. So Amazon is trying to move its server-based solutions into the world of serverless, and that's what they've been doing with Glue. Glue is a service that supports serverless Spark ETL (extract, transform, and load) jobs and has an integrated data catalog. In addition to that, there are several other services that are commonly used for ETL. One is Amazon Elastic MapReduce (EMR), which is server-based Hadoop and Spark. It's used in situations where you want more control over the configuration of the key services, such as the Spark configuration file. Also, you might want to have persistent servers in situations where you…
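
To make the point about controlling the Spark configuration file a little more concrete, here is a minimal sketch, not taken from the course. It assumes the cluster is Amazon Elastic MapReduce (EMR), where Spark defaults can be supplied as a configuration object with the `spark-defaults` classification. The helper name `spark_defaults_configuration` and the property values are hypothetical and chosen only for illustration.

```python
import json


def spark_defaults_configuration(properties):
    """Build one EMR configuration object for the spark-defaults classification.

    Hypothetical helper for illustration; EMR expects property values as strings.
    """
    return {
        "Classification": "spark-defaults",
        "Properties": {key: str(value) for key, value in properties.items()},
    }


# Illustrative values only (assumptions, not recommendations from the course).
emr_configurations = [
    spark_defaults_configuration(
        {
            "spark.executor.memory": "4g",
            "spark.executor.cores": 2,
            "spark.dynamicAllocation.enabled": "true",
        }
    )
]

# Serialize to JSON so the settings can be saved to a file or inspected.
configurations_json = json.dumps(emr_configurations, indent=2)
print(configurations_json)
```

A list shaped like this is the kind of value that could be supplied as cluster configurations when an EMR cluster is created. The sketch only builds and prints the structure and does not call any AWS API.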
