Home Podcasts The Test Set by Posit
The Test Set by Posit

The Test Set by Posit

Posit, PBC 23 episodes Latest Jun 1, 2026

A Posit podcast for data science junkies, anomaly hunters, and those who play outside the confidence interval. Hosted by Michael Chow, with co-hosts Wes McKinney & Hadley Wickham.

Episodes

The Code Doesn't Lie — with Mike Bostock Jun 1, 2026 4108 Mike Bostock made D3 when the browser was still a joke. He built bl.ocks when people needed somewhere to share their work. Now he's building Observable — reactive notebooks with an AI that actually looks at what it made. In this episode: the three-GIF bar chart that launched 25 years of viz, why open source needs both intrinsic and extrinsic motivation, and why an agent that can't see it
The Wonder-Driven Builder — with Paige Bailey May 18, 2026 2743 Paige Bailey is a developer relations engineering lead at Google DeepMind. She's a geophysicist-turned-AI-engineer who was once told by her professors that building open-source libraries was a waste of time. We talk about her path from planetary science to TensorFlow, why statisticians have a hidden edge in the age of AI, and what it means to be a curious generalist when the cost of building
Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam May 4, 2026 4547 Vincent Warmerdam has been the first full-time hire at a startup, a spacey punster who accidentally got himself a job, a bartender at an Amsterdam comedy theater, and a Dutch bike tour guide — and he'll tell you all of it was career development. Now doing DevRel at Marimo, Vincent makes the case for reactive notebooks, Lego-brick widgets, and why "number go up" is not a data science
Everything's a Fad (Including This Podcast) — with Benn Stancil Apr 20, 2026 5709 Benn Stancil built Mode Analytics, spent a decade in the data trenches, and now writes some of the sharpest, funniest essays in the data world. On The Test Set, he talks about the cultural shift from Nate Silver to Rick Rubin why AI might kill the analytics dashboard, and what happens when a thousand startups all build the same thing. Plus: boy bands as a model for collaboration, and why the best
Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy Apr 6, 2026 3931 dbt Labs CEO Tristan Handy drops into The Test Set to map the fault lines between the data science world and the enterprise data world — and explain why analytics engineers are basically pissed-off data analysts who decided to organize the bookshelf. We get into SQL's glow-up, the community magic of dbt Slack, what AI agents mean for data warehouses, and why everyone's building iOS apps
Your VP Is Doing a Rogue Analysis in Cursor Right Now — with Nell Thomas Mar 23, 2026 4964 Nell Thomas has spent two decades in data — from equity research to the DNC to Facebook to leading a 400-person data org at Shopify. She walks Michael and Wes through the modern data stack role by role, gets honest about what AI is and isn't changing about data work, and admits the semantic layer has been her greatest leadership failure. Plus: Sneakers gets the respect it deserves.Episode Not
Sleeping Rats and Sociopathic Agents — with Phillip Cloud Mar 9, 2026 3387 Phillip Cloud has been shaping the Python data ecosystem since the early pandas days — and he has *opinions*. Now a principal engineer at NVIDIA leading the Ibis project, Phillip talks about how he stumbled into open source via an eye movement lab, why he prefers his coding agents cold and emotionless, and what happens when you ask an LLM for woodworking trig. Plus: terminal user interfaces, the f
More productive but a lot less fun — with Charlie Marsh Feb 23, 2026 5714 Charlie Marsh built Ruff, uv, and Ty — the tools that mass-fixed Python's worst pain points. Now he's grappling with what happens when agents start writing most of the code. In this episode, Charlie gets real about his team trusting his PRs less, the gnarly middle of coding with agents, and whether Python is even the right language for an agentic future. It's honest, a wee existenti
Alenka Frim: What yoga teaches us about discipline and collaboration in data science Feb 9, 2026 3674 Alenka Frim went from teaching yoga full-time to becoming a committer and PMC Member on Apache Arrow. In this episode, Alenka joins The Test Set hosts to talk about how Arrow grew from spec to critical infrastructure, and why she started contributing to a project she had never even used. She reflects on imposter syndrome, the discipline of showing up (on the mat and in GitHub), and how agents are
Emily Riederer: Column selectors, data quality, and learning in public Jan 26, 2026 3499 Emily Riederer writes Python with an R accent, and we’re all comfortable with it. In this episode, Emily reflects on her journey through R, Python, and SQL — from lessons learned in averaging default values (oops, we're not all rich!) to discovering that column selectors are way cooler than they sound. She weighs in on the delicate art of learning in public, why frustration often makes the be
Rebecca Barter: Persistent learning, tool building, and ‘Will code even exist?’ Jan 12, 2026 3407 Rebecca Barter, senior data scientist at Arine and adjunct assistant professor at the University of Utah, refuses to work on things she doesn’t care about. Lucky for us, she cares about a lot, most of all impact. In this episode, Rebecca joins The Test Set to talk about learning fast, building better tools, and staying motivated and adaptable.She shares how moving between R, Python, SQL, and dashb
Marco Gorelli: Narwhals, ecosystem glue, and the value of boring work Dec 15, 2025 3101 You’ve probably used Narwhals without realizing it. It’s the compatibility layer helping apps and libraries like Plotly play nice with Pandas, Polars, Arrow, and more — while keeping computation native instead of converting everything to Pandas. In this episode, Marco Gorelli explains how his weekend experiment turned into essential ecosystem infrastructure and why data types, not APIs, are where

Recommended

Playing