From the course: Data Science on Google Cloud Platform: Building Data Pipelines
Unlock the full course today
Join today to access over 22,400 courses taught by industry experts or purchase this course individually.
Beam pipelines - Google Cloud Tutorial
From the course: Data Science on Google Cloud Platform: Building Data Pipelines
Beam pipelines
- [Instructor] A beam pipeline defines a single data processing pipeline. It is usually sequential but you can have multiple branches, multiple inputs, and multiple outputs. The pipeline consists of data and a series of steps that transform the data. That is an example pipeline shown in the diagram here. It has one source, a couple of transformations, temporary data structures, and one data sink. A pipeline once defined can be executed in any of the runners available with Apache Beam.
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.