Name: LLMs as Data Scientists: What Actually Works
Start: 2026-05-15T14:00:00-0500
End: 2026-05-15T14:30:00-0500

LLMs as Data Scientists: What Actually Works

Friday May 15, 2026 2:00pm - 2:30pm CDT

(a) Theater

Large language models are increasingly being used to write SQL, summarize datasets, and automate parts of the data science workflow. In some cases, they perform remarkably well. In others, they fail in subtle but systematic ways, including misinterpreting metrics, hallucinating joins, or reasoning inconsistently over schemas. This talk examines what actually happens when LLMs are applied to real structured data, using concrete examples to separate surface fluency from reliable analytical behavior.

We will explore a practical approach to improving reliability by providing LLMs with interpretable, structured components derived directly from the data itself. I will demo how exposing explicit statistical structure changes model behavior and reduces hallucination while improving reasoning over real datasets. The goal is to clarify where LLMs add value today, where they remain brittle, and which system design choices materially improve performance in analytics workflows.

Speakers

Ben Lengerich, PhD, MS

Founder/CEO, Intelligible AI

Ben Lengerich is an Assistant Professor at the University of Wisconsin–Madison and the founder of Intelligible. His research connects statistical modeling and foundation models. He received his PhD from Carnegie Mellon and postdoc at MIT.

Friday May 15, 2026 2:00pm - 2:30pm CDT
(a) Theater Best Buy HQ, 7700 Knox Ave S, Richfield, MN 55423

4 - More Technical

Data Tech 2026

Ben Lengerich, PhD, MS

Get help with the event

Data Tech 2026

Ben Lengerich, PhD, MS

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Get help with the event