Back to Resources

Yelp has built a robust stream processing ecosystem called Data Pipeline. As part of this system we created a Cassandra Source Connector, which streams data updates made to Cassandra into Kafka in real time. We use Cassandra CDC and leverage the stateful stream processing of Apache Flink to produce a Kafka stream containing the full content of each modified row, as well as its previous value.

WATCH

Speakers

Andrew Prudhomme

Distributed Systems Engineer at Yelp

Abrar Sheikh

Data Engineer at Yelp