From the course: Azure Spark Databricks Essential Training
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Use a notebook with scikit-learn
From the course: Azure Spark Databricks Essential Training
Use a notebook with scikit-learn
- [Instructor] Now up until this point, we've been looking at a lot of the mechanics of how to get started with Azure Databricks. So we've set up the cluster, we've set up the environment, we brought in notebooks, we've looked at how notebooks work, what are the mechanics of working with them, how we attach to a cluster. We've looked at how different languages work in notebooks, we've looked at visualizations. The whole core of why we would use this product is to process massive amounts of data in a complex way, because it's distributed, so how does that work, what does that look like? It can be used in standard analytics, so such as just aggregates, grouping, I call it business reporting, basically. However, more and more, we're seeing that these types of workloads include machine learning, and although there are machine learning libraries that are built native for Spark, what I'm finding in my work as a cloud architect, is a lot of the data's science teams come from what I call…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
(Locked)
Review Databricks Azure cluster setup3m 39s
-
(Locked)
Use a Python notebook with dashboards6m 1s
-
(Locked)
Use an R notebook4m
-
(Locked)
Use a Scala notebook for visualization6m 37s
-
(Locked)
Use a notebook with scikit-learn11m 29s
-
(Locked)
Use a Spark Streaming notebook8m 53s
-
Use an external Scala library: variant-spark10m 26s
-
(Locked)
-
-
-
-