From the course: Amazon Web Services: Data Services


Run Spark job on a Jupyter Notebook on AWS EMR


- [Instructor] If you look at the left side of the console, you'll notice a Notebooks tab. I'm going to go back to Clusters, and here I have that Notebooks section open. This is a relatively new capability, and the idea is that you can use a Jupyter notebook as an alternative client to the terminal. I'll click Create notebook and call it Demo Thursday. We'll choose our existing cluster and accept all the defaults here, after checking them. This places our notebooks in this S3 bucket. It's usually pretty quick to spin up this notebook instance. Once it's available, we can connect in one of two ways: in JupyterLab, which is a server-based environment for when multiple people are going to be editing, or in Jupyter, which is a single Jupyter environment for a single person. And then we can run our Spark job inside of a Jupyter…
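
As a rough sketch of what a small Spark job in a notebook cell might look like (this is an illustration, not the instructor's code): the helper below counts words using Spark's RDD API. The function name `word_counts` and the sample input are hypothetical. It assumes a `spark` session object is already available, which is typically the case when the notebook is attached to the cluster with a PySpark kernel.

```python
# Hypothetical helper for illustration; not taken from the course.
# Assumes `spark` is an existing SparkSession (a PySpark notebook kernel
# attached to a cluster normally provides one under that name).

def word_counts(spark, lines):
    """Count how often each word occurs in a list of text lines."""
    # Distribute the input lines across the cluster as an RDD.
    rdd = spark.sparkContext.parallelize(lines)
    counts = (
        rdd.flatMap(lambda line: line.lower().split())  # one record per word
           .map(lambda word: (word, 1))                 # pair each word with 1
           .reduceByKey(lambda a, b: a + b)             # sum the 1s per word
    )
    # Bring the (word, count) pairs back to the driver as a dict.
    return dict(counts.collect())
```

In a notebook cell, calling `word_counts(spark, ["to be or not to be"])` should return a dictionary mapping "to" and "be" to 2, and "or" and "not" to 1. Collecting to the driver is only reasonable for small results like this one; a larger job would more likely write its output back to S3.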
