Publication

Conformal Prediction with Large Language Models for Multi-Choice Question Answering

July 25, 2023

Kumar, B.*, Lu, C.*, Gupta, G., Palepu, A., Bellamy, D., Raskar, R., & Beam, A. "Conformal Prediction with Large Language Models for Multi-Choice Question Answering." Neural Conversational AI Workshop at ICML 2023.

Abstract

As large language models continue to be widely developed, robust uncertainty quantification techniques will become crucial for their safe deployment in high-stakes scenarios. In this work, we explore how conformal prediction can be used to provide uncertainty quantification in language models for the specific task of multiple-choice question answering. We find that the uncertainty estimates from conformal prediction are tightly correlated with prediction accuracy. This observation can be useful for downstream applications such as selective classification and filtering out low-quality predictions. We also investigate whether the exchangeability assumption required by conformal prediction holds for out-of-subject questions, which may be a more realistic scenario for many practical applications. Our work contributes towards more trustworthy and reliable usage of large language models in safety-critical situations, where robust guarantees on the error rate are required.
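
For readers unfamiliar with the technique, the following is a minimal sketch of split conformal prediction applied to a multiple-choice setting. It is an illustration under stated assumptions, not the authors' implementation: it assumes the model yields a softmax probability for each answer option, and it uses one minus the probability of the correct option as the conformity score, which is a common choice. The function names `conformal_threshold` and `prediction_set` are hypothetical.

```python
import math
import numpy as np

def conformal_threshold(cal_probs, cal_labels, alpha):
    """Compute the score threshold from a held-out calibration set.

    cal_probs:  (n, k) array of softmax probabilities over k answer options
    cal_labels: (n,) array of correct option indices
    alpha:      target error rate, e.g. 0.1 for 90% coverage
    """
    cal_probs = np.asarray(cal_probs, dtype=float)
    cal_labels = np.asarray(cal_labels, dtype=int)
    n = len(cal_labels)
    # Assumed conformity score: 1 - probability assigned to the correct option.
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Finite-sample corrected rank: the ceil((n + 1)(1 - alpha))-th smallest score.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: every option must be included.
        return float("inf")
    return float(np.sort(scores)[k - 1])

def prediction_set(probs, qhat):
    """Return the indices of all options whose score is within the threshold."""
    return [i for i, p in enumerate(probs) if 1.0 - p <= qhat]
```

If calibration and test questions are exchangeable, sets built this way contain the correct option with probability at least 1 - alpha. The size of the set then serves as the uncertainty estimate: a single-option set signals a confident prediction, while a set containing most options signals that the model is unsure.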

via Neural Conversational AI Workshop @ ICML 2023

Federated Conformal Predictors for Distributed Uncertainty Quantification

Data Acquisition via Experimental Design for Data Markets

A Perspective on Decentralizing AI

CoDream: Exchanging dreams instead of models for federated aggregation with heterogeneous models