Modern Paradigms in Generalization

About

Generalization, broadly construed, is the ability of machine learning methods to perform well in scenarios outside their training data.

Although generalization is a well-developed field with a rich history, contemporary phenomena, in particular those arising from deep learning and most specifically from large image and language models, are well beyond our current mathematical tool kit and vocabulary. It is not merely that analyses are too loose to be effective; rather, the settings have drastically evolved from the standard statistical setting of similar training and testing data, as the following examples illustrate: self-driving cars may need to navigate unfamiliar and even private or inaccessible roads; image generation software is expected to provide compelling images from essentially arbitrary input strings, with human operators actively enjoying breaking the training data mold; AlphaFold and related software make protein predictions for species unrelated to those in their training set. The list goes on, without even scratching the surface of large language models and algorithmic tasks.

This program's goal is to bring together remote and local researchers, in both academia and industry, as well as across mathematical and applied disciplines, with the common goals of (a) organizing and crystallizing gaps between the theory and practice of generalization, and (b) sparking collaboration toward a concerted effort to close these gaps.

This research program is funded in part by an award from the ONR.

Organizers

Matus Telgarsky (New York University; chair)

Peter Bartlett (Simons Institute, UC Berkeley)

Daniel Hsu (Columbia University)

Po-Ling Loh (University of Cambridge)

Toni Pitassi (Columbia University)

Andrej Risteski (Carnegie Mellon University)

Rich Zemel (Columbia University)

Long-Term Participants (including Organizers)

Matus Telgarsky (New York University)

Peter Bartlett (Simons Institute, UC Berkeley)

Daniel Hsu (Columbia University)

Po-Ling Loh (University of Cambridge)

Toni Pitassi (Columbia University)

Andrej Risteski (Carnegie Mellon University)

Rich Zemel (Columbia University)

Sivaraman Balakrishnan (Carnegie Mellon University)

Misha Belkin (UC San Diego)

Peter Bickel (UC Berkeley)

Eunsol Choi (The University of Texas at Austin)

Spencer Frei (UC Davis)

Surbhi Goel (University of Pennsylvania)

Shafi Goldwasser (Simons Institute, UC Berkeley)

Nika Haghtalab (UC Berkeley)

Zaid Harchaoui (University of Washington)

Wei Hu (University of Michigan)

Russell Impagliazzo (UC San Diego)

Varun Jog (University of Cambridge)

Ramya Korlakai Vinayak (University of Wisconsin-Madison)

Samory Kpotufe (Columbia University)

Jason Lee (Princeton University)

Zhiyuan Li (Toyota Technological Institute at Chicago)

Jennifer Listgarten (UC Berkeley)

Benjamin Recht (UC Berkeley)

Anant Sahai (UC Berkeley)

Johannes Schmidt-Hieber (University of Twente)

Vatsal Sharan (University of Southern California)

Mahdi Soltanolkotabi (University of Southern California)

Nati Srebro (Toyota Technological Institute at Chicago)

Ryan Tibshirani (UC Berkeley)

Yusu Wang (UC San Diego)

Fanny Yang (ETH Zurich)

Han Zhao (University of Illinois Urbana-Champaign)

Nikita Zhivotovskiy (UC Berkeley)

Shuheng Zhou (University of California, Riverside)

Research Fellows

Keaton Ellis (UC Berkeley)

Margalit Glasgow (Stanford University)

Gautam Goel (UC Berkeley)

Soufiane Hayou (UC Berkeley)

Bingbin Liu (Carnegie Mellon University)

Binghui Peng (UC Berkeley)

Ankit Pensia (Simons Institute)

Max Simchowitz (Carnegie Mellon University)

Arsen Vasilyan (Simons Institute)

Yuqing Wang (Simons Institute, UC Berkeley)

Jingfeng Wu (UC Berkeley)

Lydia Zakynthinou (UC Berkeley)

Visiting Graduate Students and Postdocs

Medha Agarwal (University of Washington)

Anastasios Angelopoulos (UC Berkeley)

Navid Ardeshir (Columbia University)

Julian Asilis (University of Southern California)

Mriganka Basu Roy Chowdhury (UC Berkeley)

Erez Buchweitz (UC Berkeley)

Yuhang Cai (UC Berkeley)

Juntong Chen (University of Twente)

Sam Chen (UC San Diego)

Weixin Chen (University of Illinois Urbana-Champaign)

Samuel Deng (Columbia University)

Siddartha Devic (University of Southern California)

Tiffany Ding (UC Berkeley)

Daniil Dmitriev (ETH Zurich)

Deqing Fu (University of Southern California)

Ishan Gaur (UC Berkeley)

Isaac Gibbs (UC Berkeley)

Jeremy Goldwasser (UC Berkeley)

Pulkit Gopalani (University of Michigan)

Yuzheng Hu (University of Illinois Urbana-Champaign)

Kasra Jalaldoust (Columbia University)

Julia Kostin (ETH Zurich)

Brian Lee (UC Berkeley)

Justin Li (New York University)

Yuchen Li (Carnegie Mellon University)

Jingwen Liu (Columbia University)

Zeyu Liu (The University of Texas at Austin)

Andrei Marchis (University of Cambridge)

Ronak Mehta (University of Washington)

Parsa Mirtaheri (UC San Diego)

Mohamad Amin Mohamadi (Toyota Technological Institute at Chicago)

Riley Nerem (UC San Diego)

Kazusato Oko (UC Berkeley)

Seunghoon Paik (UC Berkeley)

Junhyung Park (ETH Zurich)

Reese Pathak (UC Berkeley)

Pratik Patil (UC Berkeley)

João Vitor Romano (UC Berkeley)

Sam Schapiro (University of Illinois Urbana-Champaign)

Xueda Shen (UC Berkeley)

Abhishek Shetty (UC Berkeley)

Danny Son (New York University)

Berk Tinaz (University of Southern California)

Bhavya Vasudeva (University of Southern California)

Tobias Wegel (ETH Zurich)

Ruicheng Xian (University of Illinois Urbana-Champaign)

Shuo Xie (Toyota Technological Institute at Chicago)

Fangyuan Xu (The University of Texas at Austin)

Zhiwei Xu (University of Michigan)

Kunhe Yang (UC Berkeley)

Yongyi Yang (University of Michigan)

Margaux Zaffran (UC Berkeley)

Cindy Zeng (University of Illinois Urbana-Champaign)

Michael Zhang (The University of Texas at Austin)

Eric Zhao (UC Berkeley)