From the course: Amazon Web Services: Data Services

Unlock the full course today

Join today to access over 22,400 courses taught by industry experts or purchase this course individually.

Explore AWS EMR with Hadoop and Spark

Explore AWS EMR with Hadoop and Spark - Amazon Web Services (AWS) Tutorial

From the course: Amazon Web Services: Data Services

Start my 1-month free trial

Explore AWS EMR with Hadoop and Spark

- [Instructor] In this next section we'll be looking at EMR and several Data Lake and huge data processing services and there are additional course and exercise files in the Data Lake section of my GitHub repo for this course. Now as we've done with other data services, I've already created a cluster because it can take between five and 15 minutes for the managed virtual machines to be set up in the Amazon EMR ecosystem. Do notice here, they have a banner talking about using Spot Instances, a common pattern for production and you can save lots of service charges by using those Spot Instances. To create a cluster, we click the blue button and we have a number of choices. Now we have a standard interface and we have an advanced interface. In the advanced interface, you can see that we have some libraries selected by default and we have a large number of versions because Hadoop's been around for a long time and there are a lot…

Contents