Investigating Human-Aligned Large Language Model Uncertainty

Authors

  • Kyle Moore Vanderbilt University
  • Jesse Roberts Tennessee Technological University
  • Daryl Watson Tennessee Technological University
  • Pamela Wisniewski STIR Labs

DOI:

https://doi.org/10.32473/flairs.39.1.141835

Keywords:

Large Language Models, uncertainty quantification, human ai agreement, AI Alignment, uncertainty, Natural Language Processing

Abstract

Recent work has sought to quantify large language model uncertainty to facilitate model control and modulate user trust. Previous works focus on measures of uncertainty that are theoretically grounded or reflect the average overt behavior of the model. In this work, we investigate a variety of uncertainty measures, in order to identify measures that correlate with human group-level uncertainty. We find that Bayesian measures and a variation on entropy measures, top k entropy, tend to agree with human behavior as a function of model size. We find that some strong measures decrease in human-similarity with model size, but, by multiple linear regression, we find that combining multiple uncertainty measures provide comparable human-alignment with reduced size-dependency.

Downloads

Published

06-05-2026

How to Cite

Moore, K., Roberts, J., Watson, D., & Wisniewski, P. (2026). Investigating Human-Aligned Large Language Model Uncertainty. The International FLAIRS Conference Proceedings, 39(1). https://doi.org/10.32473/flairs.39.1.141835