“Why don’t machine learning and large language model evaluations report uncertainty?”

Statistical Modeling, Causal Inference, and Social Science 2025-02-22

Summary:

Ilan Strauss and Tim O’Reilly ask: Why don’t ML and LLM model evaluations report uncertainty? Rarely see an interval of some kind. – Because the models are too big (LLMs)? – Or because their ML metrics (Accuracy, recall, precision) are … Continue reading →

Authors:

Andrew

Date tagged:

02/22/2025, 14:20

Date published:

02/22/2025, 09:53

“Why don’t machine learning and large language model evaluations report uncertainty?”

Statistical Modeling, Causal Inference, and Social Science 2025-02-22

Summary:

Link:

From feeds:

Tags:

Authors:

Date tagged:

Date published: