From the course: Azure Spark Databricks Essential Training
Understand Databricks Delta
- [Instructor] As we look at running complex workloads, let me remind you that there are two primary methods of inputting data: batch or stream. We've mostly been working with batch; we did have one earlier video where we showed stream, but what I'm seeing as an architect in the real world is a combination of both, and that leads into where we're going next. So, batch, as a reminder, is a one-time run. You can partition the input data, you can use other optimizations such as compression, and you can partition the output data. A stream is continual. It can also be partitioned and compressed, but traditionally you have to set up those optimizations yourself, which requires more work to build the pipeline. Now, as we start thinking about taking our workflows and putting them into pipelines, we are of course working with Azure Databricks. So let's consider the Azure data ecosystem. The ingest capability for data can be handled by services such as Azure Data…
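The batch-versus-stream contrast above can be sketched in PySpark. This is a minimal illustration, not code from the course: the `/mnt/events/...` paths and the `event_date` column are hypothetical placeholders, and running it requires a Spark runtime with the Delta format available (as on Databricks).

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("batch-vs-stream").getOrCreate()

# Batch: a one-time read of the input data.
batch_df = spark.read.parquet("/mnt/events/input")  # hypothetical path

# Batch output can be partitioned and compressed, as described above.
(batch_df.write
    .mode("overwrite")
    .partitionBy("event_date")           # hypothetical partition column
    .option("compression", "snappy")
    .parquet("/mnt/events/output"))

# Stream: a continual read of the same kind of data. File-based
# streaming sources need the schema supplied up front, which is part
# of the extra setup work streams traditionally require.
stream_df = (spark.readStream
    .schema(batch_df.schema)
    .parquet("/mnt/events/input"))

# Write the stream to a Delta table, checkpointing progress so the
# pipeline can recover where it left off.
query = (stream_df.writeStream
    .format("delta")
    .outputMode("append")
    .option("checkpointLocation", "/mnt/events/_checkpoints")
    .start("/mnt/events/delta_output"))
```

Note that the streaming half differs from the batch half only in `readStream`/`writeStream` and the checkpoint option, which is what makes combining both modes over the same storage practical.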
Contents
- Use Databricks jobs and role-based control (5m 37s)
- Use Databricks Runtime ML (2m 52s)
- Understand ML Pipelines API (4m 16s)
- Use ML Pipelines API (8m 39s)
- Use distributed ML training (9m 59s)
- Understand Databricks Delta (3m 41s)
- Use Databricks Delta (5m 10s)
- Use Azure Blob storage (2m 41s)
- Understand MLflow (7m 34s)