홈페이지Data + AI Summit 2022 로고
Watch on demand

Optimizing Speed and Scale of User-Facing Analytics Using Apache Kafka and Pinot

On Demand

Type

  • Session

Format

  • Hybrid

Track

  • 데이터 엔지니어링

Difficulty

  • Beginner

Room

  • Moscone South | Upper Mezzanine | 155

Duration

  • 35 min
Download session slides

개요

Apache Kafka is the de facto standard for real-time event streaming, but what do you do if you want to perform user-facing, ad-hoc, real-time analytics too? That's where Apache Pinot comes in.

Apache Pinot is a realtime distributed OLAP datastore, which is used to deliver scalable real time analytics with low latency. It can ingest data from batch data sources (S3, HDFS, Azure Data Lake, Google Cloud Storage) as well as streaming sources such as Kafka. Pinot is used extensively at LinkedIn and Uber to power many analytical applications such as Who Viewed My Profile, Ad Analytics, Talent Analytics, Uber Eats and many more serving 100k+ queries per second while ingesting 1Million+ events per second.

Apache Kafka's highly performant, distributed, fault-tolerant, real-time publish-subscribe messaging platform powers big data solutions at Airbnb, LinkedIn, MailChimp, Netflix, the New York Times, Oracle, PayPal, Pinterest, Spotify, Twitter, Uber, Wikimedia Foundation, and countless other businesses.

Come hear from Neha Power, Founding Engineer at a StarTree and PMC and committer of Apache Pinot, and Karin Wolok, Head of Developer Community at StarTree, on an introduction to both systems and a view of how they work together.

Session Speakers

Headshot of Karin Wolok

Karin Wolok

Head of Developer Community

StarTree

Headshot of Neha Pawar

Neha Pawar

Founding Engineer

StarTree

Data+AI Summit 하이라이트 보기

Watch on demand