Omar Khattab – Building More Reliable and Scalable AI Systems with Language Model Programming

Thursday, March 21
10:00 am - 11:00 am

Grier B 34-401B

Abstract
It is now easy to build impressive demos with language models (LMs) but turning these into reliable systems currently requires brittle combinations of prompting, chaining, and finetuning LMs. I will present LM programming, a systematic way to address this by defining and improving three layers of the LM stack. I start with how to adapt LMs to search for information most effectively (ColBERT) and how to scale such search to billions of tokens (PLAID). I then discuss the right architectures and supervision strategies (e.g., ColBERT-QA, Baleen, Hindsight) for allowing LMs to retrieve and cite verifiable sources in their responses. This leads to DSPy, a programming model that replaces ad-hoc techniques for using LMs with composable modules for building and automatically supervising controllable programs built with LMs. Even simple systems expressed in DSPy routinely outperform large standalone LMs and standard hand-crafted prompting pipelines, in some cases while using only small models. I highlight how ColBERT and DSPy have sparked applications at dozens of leading tech companies and academic labs. I then conclude by discussing how DSPy enables a new degree of research modularity around LMs, one that stands to allow open research to again lead the development of AI systems.

Bio
Omar is a fifth-year CS Ph.D. candidate at Stanford NLP, whose work spans Natural Language Processing (NLP), information retrieval (IR), and ML systems. Omar’s research creates models, systems, supervision strategies, and programming abstractions for building reliable, transparent, and scalable NLP systems. Omar is the author of the ColBERT retrieval model, which has helped shape the modern landscape of neural information retrieval. Omar’s lines of work on ColBERT and DSPy form the basis of influential open-source projects (e.g., exceeding 500,000 downloads per month) and have sparked applications at Google, Amazon, IBM, VMware, Databricks, Baidu, AliExpress, and numerous startups. Omar’s Ph.D. has been supported by the Eltoukhy Family Graduate Fellowship and the Apple Scholars in AI/ML PhD Fellowship.

Details

  • Date: Thursday, March 21
  • Time: 10:00 am - 11:00 am
  • Category:
  • Location: Grier B 34-401B

Host