From the course: Data Science on Google Cloud Platform: Building Data Pipelines
Unlock the full course today
Join today to access over 22,400 courses taught by industry experts or purchase this course individually.
Topics and messages - Google Cloud Tutorial
From the course: Data Science on Google Cloud Platform: Building Data Pipelines
Topics and messages
- [Narrator] A topic is a named data queue in Pub/Sub. It is a collection of similar data. Typically, you will have one topic per type of data like logs, KIP streams, transactions, et cetera. A unit of data that is published and read from the topic is called a message. A message can be considered equivalent to a record in a database table. There can be multiple publishers and subscribers for a given topic. Multiple publishers can push data into the topic in parallel. Pub/Sub takes care of collection and consistency of data and publishes them to any number of subscribers for this topic. Topics are not strictly first in first out. Sequencing is maintained most of the time but it is not guaranteed. If sequencing is required the publisher should explicitly use a sequencing attribute which subscribers can then use to order them. Topics are asynchronous. Subscribers need not be active when publishers push in data. Whenever the subscriber comes up it fetches unread data from the topic. Topic…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.