Agenda

Explore the full two-day session lineup, and stay tuned for additional exciting sessions and speakers.
11:15 AM PDT
2:15 PM EDT

Teaching large language models to zip their lips

Andrew Carr

Andrew Carr

Senior applied research scientist
Gretel AI

Language models, despite their rapid advancements, can still leak sensitive information due to their training process. To address this, researchers have explored differential privacy (DP), which offers a mathematical privacy guarantee but faces challenges in implementation. Reinforcement Learning from Privacy Feedback (RLPF) is a novel approach that replaces human feedback with privacy measures and uses reinforcement learning to enhance a model’s multi-task capabilities while reducing data leakage. This research opens up opportunities for applying RLPF to various problems and paves the way for more secure AI systems.

Image

Watch on demand

Watch all of the live sessions on-demand and discover the latest developments in data-centric AI.
Watch on demand