From the course: Azure Spark Databricks Essential Training

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Meet Databricks Apache Spark clusters

Meet Databricks Apache Spark clusters

From the course: Azure Spark Databricks Essential Training

Start my 1-month free trial

Meet Databricks Apache Spark clusters

- [Instructor] To get us started working with Azure Databricks let's consider this quote from one of the Databricks company founders, Matei Zaharia. He said, I think that by 2020 most data will be in either public clouds or cloud-like private environments. And the key words here are not only cloud but also most data. The volume of data that we need to work with is growing exponentially because of new methods of collection. Whether it's informational data from our phones or genetic data from our bodily fluids we are having more and more data that is interesting and we need to process. So Matei vision started actually when he was a student back at UC Berkeley in California, and he envisioned distributed computing that could help us to process these volumes of data, and that's called Apache Spark. So when we're thinking about this new world of data we go beyond the typical relational databases and we think in terms of streaming systems, data lakes and data warehouses. And as we'll see in…

Contents