Softmax is on the log, not the logit scale

Statistical Modeling, Causal Inference, and Social Science 2024-12-26

Summary:

Bad Stan naming I realized recently that we followed the confusing terminological convention of ML in our description of Stan’s categorical_logit function. In Stan, if there’s a suffix to a distribution, it describes the scale of one or more of … Continue reading

Link:

https://statmodeling.stat.columbia.edu/2024/12/26/those-are-unnormalized-log-probabilities-not-logits-in-your-neural-networks-final-layer/

From feeds:

Statistics and Visualization » Statistical Modeling, Causal Inference, and Social Science

Tags:

computing

Authors:

Bob Carpenter

Date tagged:

12/26/2024, 20:01

Date published:

12/26/2024, 15:00