From the course: Amazon Web Services: Data Services
Run Spark job on a Jupyter Notebook on AWS EMR - Amazon Web Services (AWS) Tutorial
- [Instructor] Notice that over here on the left side, we have a Notebooks tab. I'm going to go back to Clusters, and here I have that notebook section open. This is a relatively new capability, and the idea is that you can use a Jupyter notebook as an alternative client to the terminal. As you'll see in just a second, I'll click Create notebook and call it Demo Thursday. We're going to choose our existing cluster and accept all the defaults here. Just check them. Yeah. And this places our notebooks in this S3 bucket. It's usually pretty quick to spin up this notebook instance. Once it's available, we can connect either in JupyterLab, which is a server-based environment for when multiple people are going to be editing, or in Jupyter if it's a single person, since it's just a single Jupyter environment. And then we can run our Spark job inside of a Jupyter…
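The transcript cuts off before the Spark job itself is shown, so what follows is only a minimal sketch of the kind of job that could be run in a notebook cell once the notebook is attached to the cluster. It is not the instructor's code. It assumes a PySpark kernel in which a `SparkSession` named `spark` is already available; the names `tokenize`, `word_count`, `word_count_local`, and `SAMPLE` are hypothetical and were chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical sample data; a real job would more likely read from S3.
SAMPLE = [
    "Spark runs on the EMR cluster",
    "The notebook sends code to Spark",
]


def tokenize(line):
    """Lowercase a line and split it into words made of letters and apostrophes."""
    return re.findall(r"[a-z']+", line.lower())


def word_count(spark, lines):
    """Count words across `lines` with the Spark RDD API.

    `spark` is assumed to be an existing SparkSession, such as the one a
    PySpark notebook kernel typically provides.
    """
    rdd = spark.sparkContext.parallelize(lines)
    pairs = rdd.flatMap(tokenize).map(lambda word: (word, 1))
    counts = pairs.reduceByKey(lambda a, b: a + b)
    return dict(counts.collect())


def word_count_local(lines):
    """Plain-Python equivalent, handy for checking results on small inputs."""
    return dict(Counter(word for line in lines for word in tokenize(line)))
```

In a notebook cell, calling `word_count(spark, SAMPLE)` would run the job on the cluster, and on small inputs its result can be compared against `word_count_local(SAMPLE)`, which needs no cluster at all.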