Toward Understanding In-context Learning

Workshop

Special Year on Large Language Models and Transformers, Part 1 Boot Camp

Speaker(s)

Tengyu Ma (Stanford University)

Location

Calvin Lab Auditorium

Date

Wednesday, Sept. 4, 2024

Time

2:30 – 4 p.m. PT

Abstract

I will introduce the in-context learning capability of large language models: the ability to learn to solve a downstream task simply by conditioning on a prompt of input-output examples, without any parameter updates. I will then present a few papers that aim to explain the mechanisms of in-context learning theoretically on simplified data distributions.
