MO2.R5.1

Generalization and Informativeness of Conformal Prediction

Matteo Zecchin, Sangwoo Park, Osvaldo Simeone, King’s College London, United Kingdom; Fredrik Hellström, University College London, United Kingdom

Session:
Estimation and Prediction

Track:
11: Information Theory and Statistics

Location:
Omikron I

Presentation Time:
Mon, 8 Jul, 11:50 - 12:10

Session Chair:
Osvaldo Simeone, King's College, London
Abstract
The safe integration of machine learning modules in decision-making processes hinges on their ability to quantify uncertainty. A popular technique to achieve this goal is conformal prediction (CP), which transforms an arbitrary base predictor into a set predictor with coverage guarantees. While CP certifies the predicted set to contain the target quantity with a user-defined tolerance, it does not provide control over the average size of the predicted sets, i.e., over the informativeness of the prediction. In this work, a theoretical connection is established between the generalization properties of the base predictor and the informativeness of the resulting CP prediction sets. To this end, an upper bound is derived on the expected size of the CP set predictor that builds on generalization error bounds for the base predictor. The derived upper bound provides insights into the dependence of the average size of the CP set predictor on the amount of calibration data, the target reliability, and the generalization performance of the base predictor. The theoretical insights are validated using simple numerical regression and classification tasks.
Resources