Home Podcasts The Information Bottleneck
The Information Bottleneck

The Information Bottleneck

Ravid Shwartz-Ziv & Allen Roush 42 Episodes Jul 2, 2026

Two AI researchers, Ravid Shwartz-Ziv and Allen Roush, discuss the latest trends, news, and research in Generative AI, LLMs, GPUs, and Cloud Systems. The podcast covers cutting-edge developments in artificial intelligence and machine learning, offering insights from experts in the field.

Episodes

Why All Models Learn the Same Thing with Phillip Isola (MIT) Jul 2, 2026 01:11:28 Phillip Isola, professor at MIT, joins us to talk about representation learning: what makes a representation good, why different models seem to converge on similar representations, and whether pre-training is really over.We discuss the platonic representation hypothesis and its limits, why clustering structure matters more than global geometry, and Phillip's new neural thickets paper arguing that
AI for Science with Qichao Hu (Molecular Universe / SES AI) Jun 29, 2026 01:00:56 Most AI-for-science companies are selling shovels. Qichao Hu wants the gold.In this episode, we talk with Qichao, the founder and CEO of Molecular Universe, the AI-for-science platform that grew out of SES AI, a high-energy-density battery developer he's run for fourteen years. His core distinction is that companies from the AI world build tools, such as foundation models that predict properties,
Infrastructure for AI at Scale - With Benny Chen (Fireworks AI) Jun 24, 2026 01:05:50 We talk a lot on this show about RL, agents, and the move between pre-training and post-training, but not enough about the layer everything actually runs on. Benny Chen, co-founder of Fireworks AI, one of the largest inference platforms around, walks us through what it takes to serve models at scale: sourcing GPUs, writing the kernels, the runtime, and the routing layer that lets a customer hit on
Broken Peer Review, AI, and Worms — with Oded Rechavi Jun 21, 2026 01:18:04 Oded Rechavi is a biologist at Tel Aviv University and the co-founder of QED, a company building AI to review scientific work. He's also spent years studying worms.We start with what's wrong with peer review and grant funding: why it takes years to publish, why reviewers are often your own competitors, and why the whole thing is locked to an economic model that rewards publishing more papers, not
Will AI Take Our Jobs? With Alex Imas (Google/University of Chicago) Jun 16, 2026 01:29:01 Will AI take our jobs? We put the question to Alex Imas, the new Director of AGI Economics at Google DeepMind and a professor at Chicago Booth, whose entire job now is studying how frontier AI reshapes the economy. His short answer: probably some of them, but the popular story is mostly wrong about which jobs and how fast.Alex makes the case that a job is a bundle of tasks, not a single thing AI e
Why AI Benchmarks Are Lying to You - with Wenhu Chen (Meta/University of Waterloo) Jun 13, 2026 01:19:03 In this episode, we sit down with Wenhu Chen, research scientist at Meta MSL, assistant professor at the University of Waterloo, and the person behind MMLU-Pro and MMMU. If you've read a frontier model release in the last two years, you've seen his benchmarks. That makes him one of the best people to answer the question everyone dances around: when a model jumps from 40% to 90% on your benchmark,
Jürgen Schmidhuber - Part 2: JEPA, the Road to AGI, and Who Really Invented Modern AI Jun 7, 2026 01:29:29 In the second half of our conversation with Jürgen Schmidhuber, we focus on the key ideas he's pursued since the early 1990s and discuss why he believes these concepts are only now being rediscovered.We start with JEPA. Jürgen argues that the method LeCun named in 2022 is the same family he published in 1992 as Predictability Maximization. From there he traces the adversarial lineage back further
Jürgen Schmidhuber - World Models, RL, and the Year that changed AI (Part 1) Jun 4, 2026 01:37:56 In this episode, we host Jürgen Schmidhuber - the man, the legend, one of the godfathers of modern AI. His lab worked out many ideas behind today’s systems (LSTM, world models, artificial curiosity, Transformer variants, and even GAN-style setups) decades before they became fashionable, and he’s just as well known for making sure people remember who did what first. This is the first of two conve
AI for Science and the Thermodynamics of Generative AI - with Max Welling (UvA, CuspAI) May 29, 2026 01:13:46 In this episode, we sit with Max Welling, Professor of Machine Learning at the University of Amsterdam, co-founder and CTO of CuspAI, and a foundational figure behind variational autoencoders (VAEs), equivariant networks, and Bayesian deep learning. We talk about AI for science, the physics underneath generative models, and what's still missing on the road to real intelligence.Max starts with what
After Math Falls, What's Next? with Julia Kempe (NYU/Meta) May 25, 2026 01:14:43 Julia Kempe on Why Math Will Fall Next, Superhuman Provers, and the Return of the Renaissance ResearcherIn this episode, we sit down with Julia Kempe, a Professor at NYU's Center for Data Science and researcher at Meta FAIR's Foundations of Reasoning team,  for a wide-ranging conversation on the future of AI research.We dig into why verifiable domains like mathematics may be on track to "fall" the
Language, Cognition, and the Limits of LLMs - with Tal Linzen (NYU/Google) May 17, 2026 01:23:26 We host Tal Linzen, Associate Professor at NYU and Research Scientist at Google, for a conversation on the intersection of cognitive science and large language models.We discussed why children can learn language from around 100 million words while LLMs need trillions, and the surprising finding that as models get better at predicting the next word, they become worse models of how humans actually p
Intelligence in an Open World - with Mengye Ren (NYU) May 20, 2026 00:59:16 We talk with Mengye Ren, Assistant Professor at NYU's Center for Data Science, about what intelligence actually means once you step outside a benchmark, and why scaling a single centralized model isn't the whole story.We get into why intelligence has to be defined in open environments, not closed ones, and what that means for how we measure progress. We push on the creativity question: today's mod

Recommended