"Calibeating": Beating Forecasters at Their Own Game

Workshop

Multi-Agent Reinforcement Learning and Bandit Learning

Speaker(s)

Sergiu Hart (Hebrew University of Jerusalem)

Location

Calvin Lab Auditorium

Date

Monday, May 2, 2022

Time

9:05 – 9:45 a.m. PT

Abstract

Forecasters should be tested by the Brier score and not just by the calibration score, which can always be made arbitrarily small. The Brier score is the sum of the calibration score and the refinement score; the latter measures how good the sorting into bins with the same forecast is, and thus attests to "expertise." This raises the question of whether one can gain calibration without losing expertise, which we refer to as "calibeating." We provide an easy way to calibeat any forecast, by a deterministic online procedure. We moreover show that calibeating can be achieved by a stochastic procedure that is itself calibrated, and then extend the results to simultaneously calibeating multiple procedures, and to deterministic procedures that are continuously calibrated.

http://www.ma.huji.ac.il/hart/publ.html#calib-beat

Attachment

Slides

"Calibeating": Beating Forecasters at Their Own Game

Abstract

Attachment

Video Recording