The Test Set by Posit

Posit, PBC 23 Episodes Jun 29, 2026

A Posit podcast for data science junkies, anomaly hunters, and those who play outside the confidence interval. Hosted by Michael Chow, with co-hosts Wes McKinney & Hadley Wickham.

Episodes

Confidently Incorrect — with Caitlin Colgrove Jun 29, 2026 3609 Caitlin Colgrove is the CTO of Hex, the data workspace for building and sharing data projects using SQL and Python that somehow counts a Sweetgreen chef as a power user. She joins Michael, Hadley, and Isabel to talk about what AI agents actually get wrong in data work (it's not the hallucinations, it's supreme overconfidence), why data teams aren't going anywhere, and how she thinks
The Bothness of It — with Alex Hillman Jun 15, 2026 4442 Alex Hillman built one of America's first co-working spaces, wrote a business book in tweets, and recently handed his inbox to a Claude Code agent — not to draft emails, but to notice when a friendship is going cold. In this episode, Alex, Michael, Wes, and Hadley dig into marketing for people who hate marketing, what 20 years of email reveals about your relationships, and why the hardest par
The Code Doesn't Lie — with Mike Bostock Jun 1, 2026 4108 Mike Bostock made D3 when the browser was still a joke. He built bl.ocks when people needed somewhere to share their work. Now he's building Observable — reactive notebooks with an AI that actually looks at what it made. In this episode: the three-GIF bar chart that launched 25 years of viz, why open source needs both intrinsic and extrinsic motivation, and why an agent that can't see it
The Wonder-Driven Builder — with Paige Bailey May 18, 2026 2743 Paige Bailey is a developer relations engineering lead at Google DeepMind. She's a geophysicist-turned-AI-engineer who was once told by her professors that building open-source libraries was a waste of time. We talk about her path from planetary science to TensorFlow, why statisticians have a hidden edge in the age of AI, and what it means to be a curious generalist when the cost of building
Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam May 4, 2026 4547 Vincent Warmerdam has been the first full-time hire at a startup, a spacey punster who accidentally got himself a job, a bartender at an Amsterdam comedy theater, and a Dutch bike tour guide — and he'll tell you all of it was career development. Now doing DevRel at Marimo, Vincent makes the case for reactive notebooks, Lego-brick widgets, and why "number go up" is not a data science
Everything's a Fad (Including This Podcast) — with Benn Stancil Apr 20, 2026 5709 Benn Stancil built Mode Analytics, spent a decade in the data trenches, and now writes some of the sharpest, funniest essays in the data world. On The Test Set, he talks about the cultural shift from Nate Silver to Rick Rubin, why AI might kill the analytics dashboard, and what happens when a thousand startups all build the same thing. Plus: boy bands as a model for collaboration, and why the best
Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy Apr 6, 2026 3931 dbt Labs CEO Tristan Handy drops into The Test Set to map the fault lines between the data science world and the enterprise data world — and explain why analytics engineers are basically pissed-off data analysts who decided to organize the bookshelf. We get into SQL's glow-up, the community magic of dbt Slack, what AI agents mean for data warehouses, and why everyone's building iOS apps
Your VP Is Doing a Rogue Analysis in Cursor Right Now — with Nell Thomas Mar 23, 2026 4964 Nell Thomas has spent two decades in data — from equity research to the DNC to Facebook to leading a 400-person data org at Shopify. She walks Michael and Wes through the modern data stack role by role, gets honest about what AI is and isn't changing about data work, and admits the semantic layer has been her greatest leadership failure. Plus: Sneakers gets the respect it deserves.
Sleeping Rats and Sociopathic Agents — with Phillip Cloud Mar 9, 2026 3387 Phillip Cloud has been shaping the Python data ecosystem since the early pandas days — and he has *opinions*. Now a principal engineer at NVIDIA leading the Ibis project, Phillip talks about how he stumbled into open source via an eye movement lab, why he prefers his coding agents cold and emotionless, and what happens when you ask an LLM for woodworking trig. Plus: terminal user interfaces, the f
More productive but a lot less fun — with Charlie Marsh Feb 23, 2026 5714 Charlie Marsh built Ruff, uv, and Ty — the tools that mass-fixed Python's worst pain points. Now he's grappling with what happens when agents start writing most of the code. In this episode, Charlie gets real about his team trusting his PRs less, the gnarly middle of coding with agents, and whether Python is even the right language for an agentic future. It's honest, a wee existenti
Alenka Frim: What yoga teaches us about discipline and collaboration in data science Feb 9, 2026 3674 Alenka Frim went from teaching yoga full-time to becoming a committer and PMC Member on Apache Arrow. In this episode, Alenka joins The Test Set hosts to talk about how Arrow grew from spec to critical infrastructure, and why she started contributing to a project she had never even used. She reflects on imposter syndrome, the discipline of showing up (on the mat and in GitHub), and how agents are
Emily Riederer: Column selectors, data quality, and learning in public Jan 26, 2026 3499 Emily Riederer writes Python with an R accent, and we’re all comfortable with it. In this episode, Emily reflects on her journey through R, Python, and SQL — from lessons learned in averaging default values (oops, we're not all rich!) to discovering that column selectors are way cooler than they sound. She weighs in on the delicate art of learning in public, why frustration often makes the be