Abstract
The widespread application of LLMs across various tasks and fields has necessitated the alignment of these models with human values and preferences. Given the variety of approaches to human value alignment, there is an urgent need to understand the scope and nature of the human values injected into these LLMs before their deployment and adoption. We propose UniVaR, a high-dimensional neural representation of symbolic human value distributions in LLMs, orthogonal to model architecture and training data. This continuous and scalable representation is learned in a self-supervised manner from the value-relevant outputs of 8 LLMs and evaluated on 15 open-source and commercial LLMs. Through UniVaR, we visualize and explore how LLMs prioritize different values across 25 languages and cultures, shedding light on the complex interplay between human values and language modeling.
Citation
Cahyawijaya, S., Chen, D., Bang, Y., Khalatbari, L., Wilie, B., Ji, Z., … Fung, P. (2025). High-Dimension Human Value Representation in Large Language Models. In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies: Long Papers, NAACL-HLT 2025 (Vol. 1, pp. 5303–5330). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2025.naacl-long.274