We all love real-time data — clicks, payments, rides, messages — but most of it comes with a catch: it contains personal information we’re not supposed to leak, such as names, emails, locations, or even small clues that can identify someone. The challenge: how do we keep streaming data useful and safe at the same time? In this talk, we’ll explore practical ways to protect privacy in streaming systems using Apache Kafka, Apache Flink, and Apache Iceberg. We’ll cover: - simple tricks like masking and tokenizing PII; - why “anonymous” data often isn’t anonymous (the re-identification problem); - techniques like bucketing, k-anonymity, and adding noise; - how to balance privacy with data utility (too much hiding makes data useless). Along the way, we’ll look at real-world stories: from public data leaks to surprising deanonymization attacks, and show live demos of pipelines that anonymize data before it’s written to storage. If you’ve ever wondered how to build privacy-aware pipelines, this talk will give you practical patterns you can use right away.

Session Speakers

Olena Kutsenko

Staff Developer Advocate at Confluent

Session Details

Start: June 16, 2026
Time: 3:15 PM - 4:05 PM
Room: Room 2
Track: AI, Big Data, ML, Python
Level: Intermediate

Tags:

privacy security data breach

Vote on OpenFeedback Get Tickets

More Talks from This Track

01

11:25 AM - 12:15 PM |
Room 3

The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend

David vonThenen

AI/ML Leader | Keynote Speaker | OSS Engineer & Developer Advocate | Agentic AI, Deep Learning, Production AI | Python, Go, C++

View Session Details

02

12:25 PM - 12:45 PM |
Room 3

Autonomy Without a Spec Is Just Hope With a GPU Bill

Archie Sharma

COO at Zencoder

Will Fleury

Head of Engineering, Zencoder

View Session Details

03

12:55 PM - 1:15 PM |
Room 3

Building AI Avatars for Speaking Practice

Gerard París

Back-end Engineer at Preply

Serge Harb

Software Engineer, Preply

View Session Details

04

2:15 PM - 3:05 PM |
Room 3

Iceberg for Agents - Turning Lakehouse Data Into AI-Ready Context

Andrew Madson

Head of Developer Relations at Fivetran | Author of "Apache Polaris - The Definitive Guide". Authoring "AI-Ready Data" for Wiley and "Data Transformation" for O'Reilly

View Session Details

05

4:35 PM - 5:25 PM |
Room 1 🎥

Coding a Multi-Agent Game Master with Strands Agents

Olivier Leplus

Developer Advocate at AWS & Google Developer Expert on Web Technologies

Tiffany Souterre

Senior Developer Advocate @AWS

View Session Details

-18
Days

-18
Hours

-18
Minutes

-27
Seconds

Buy Ticket

Session Speakers

Olena Kutsenko

Staff Developer Advocate at Confluent

Session Details

Start: June 16, 2026
Time: 3:15 PM - 4:05 PM
Room: Room 2
Track: AI, Big Data, ML, Python
Level: Intermediate

Tags:

privacy security data breach

Vote on OpenFeedback Get Tickets

More Talks from This Track

01

11:25 AM - 12:15 PM |
Room 3

The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend

David vonThenen

AI/ML Leader | Keynote Speaker | OSS Engineer & Developer Advocate | Agentic AI, Deep Learning, Production AI | Python, Go, C++

View Session Details

02

12:25 PM - 12:45 PM |
Room 3

Autonomy Without a Spec Is Just Hope With a GPU Bill

Archie Sharma

COO at Zencoder

Will Fleury

Head of Engineering, Zencoder

View Session Details

03

12:55 PM - 1:15 PM |
Room 3

Building AI Avatars for Speaking Practice

Gerard París

Back-end Engineer at Preply

Serge Harb

Software Engineer, Preply

View Session Details

04

2:15 PM - 3:05 PM |
Room 3

Iceberg for Agents - Turning Lakehouse Data Into AI-Ready Context

Andrew Madson

Head of Developer Relations at Fivetran | Author of "Apache Polaris - The Definitive Guide". Authoring "AI-Ready Data" for Wiley and "Data Transformation" for O'Reilly

View Session Details

05

4:35 PM - 5:25 PM |
Room 1 🎥

Coding a Multi-Agent Game Master with Strands Agents

Olivier Leplus

Developer Advocate at AWS & Google Developer Expert on Web Technologies

Tiffany Souterre

Senior Developer Advocate @AWS

View Session Details

-18
Days

-18
Hours

-18
Minutes

-27
Seconds

Buy Ticket

Contact Info

Venue

Social Links

Keeping data private in real-time pipelines

Session Speakers

Olena Kutsenko

Session Details

Tags:

More Talks from This Track

01

02

03

04

05

Quick Links

Contact Us

Contact Info

Venue

Social Links

Keeping data private in real-time pipelines

Session Speakers

Olena Kutsenko

Session Details

Tags:

More Talks from This Track

01

02

03

04

05