Jongwook Han

PhD Student, Graduate School of Data Science, Seoul National University

Pluralistic value alignment, psychometric evaluation, and LLM behavior

I'm a PhD student at Seoul National University, where I work with Prof. Yohan Jo on how language models represent, express, and can be aligned with diverse human values.

My recent work studies pluralistic value alignment, contamination in psychometric evaluation, and robust ways to measure value expression in language models.

View publications Email Scholar X

Current HOLI Lab, Graduate School of Data Science, Seoul National University

Focus Value alignment, evaluation, and behavior in LLMs

Base Seoul, South Korea

Research Focus

Pluralistic value alignment for language models
Psychometric and behavioral evaluation of LLMs

Background

PhD student, Seoul National University
M.S. in Electrical Engineering, KAIST
B.S. in Integrated Technology, Yonsei University

Representative Work

ACL 2025

Value Portrait

Introduces a psychometrically validated benchmark built from real user-LLM interactions, making value assessment more reliable and ecologically grounded than annotation-heavy alternatives.

Across 44 language models, it shows consistent emphasis on Benevolence, Security, and Self-Direction, while also surfacing demographic biases in how models express values.

Read paper

ICML 2026

Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in LLMs

Separates intrinsic value expression from prompted value expression and studies them mechanistically through value vectors in the residual stream and value neurons in the MLP layers.

The analysis shows that the two mechanisms partly overlap but diverge in practice: prompted values are more steerable, while intrinsic values preserve greater response diversity.

Read paper

Contact

Email johnhan00@snu.ac.kr Google Scholar Publication profile X @jwhansnu

Selected Publications

ICML

Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in LLMs

Jongwook Han^*, Jongwon Lim^*, Injin Kong, and Yohan Jo

2026

Accepted to ICML 2026

arXiv
EACL

Quantifying Data Contamination in Psychometric Evaluations of LLMs

Jongwook Han^*, Woojung Song^*, Jonggeun Lee^*, and Yohan Jo

In Findings of the Association for Computational Linguistics: EACL 2026, 2026

HTML
ACL

Value Portrait: Assessing Language Models’ Values through Psychometrically and Ecologically Valid Items

Jongwook Han^*, Dongmin Choi^*, Woojung Song^*, Eun-Ju Lee, and Yohan Jo

In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025

HTML PDF Video Poster
ACL

PVP: An Image Dataset for Personalized Visual Persuasion with Persuasiveness Ratings, Persuasion Strategies, and Viewer Characteristics

Junseo Kim, Jongwook Han, Dongmin Choi, Jongwook Yoon, Eun-Ju Lee, and Yohan Jo

In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025

HTML PDF Poster

Academic Service

Silver Reviewer, ICML 2026
Registration Chair Assistant, Festival of Learning 2026