“Why don’t machine learning and large language model evaluations report uncertainty?”

Statistical Modeling, Causal Inference, and Social Science 2025-02-22

Summary:

Ilan Strauss and Tim O’Reilly ask: Why don’t ML and LLM model evaluations report uncertainty? Rarely see an interval of some kind. – Because the models are too big (LLMs)? – Or because their ML metrics (Accuracy, recall, precision) are … Continue reading

Link:

https://statmodeling.stat.columbia.edu/2025/02/22/why-dont-machine-learning-and-large-language-model-evaluations-report-uncertainty/

From feeds:

Statistics and Visualization » Statistical Modeling, Causal Inference, and Social Science

Tags:

computing

Authors:

Andrew

Date tagged:

02/22/2025, 14:20

Date published:

02/22/2025, 09:53