Picture for Xiaoyuan Yi

Xiaoyuan Yi

MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?

Add code
Jun 16, 2025
Viaarxiv icon

Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

Add code
Jun 06, 2025
Viaarxiv icon

CAReDiO: Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization

Add code
Apr 09, 2025
Viaarxiv icon

Leveraging Implicit Sentiments: Enhancing Reliability and Validity in Psychological Trait Evaluation of LLMs

Add code
Mar 26, 2025
Viaarxiv icon

Research on Superalignment Should Advance Now with Parallel Optimization of Competence and Conformity

Add code
Mar 08, 2025
Viaarxiv icon

PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning

Add code
Feb 21, 2025
Viaarxiv icon

Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values

Add code
Jan 13, 2025
Viaarxiv icon

The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment

Add code
Dec 24, 2024
Figure 1 for The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment
Figure 2 for The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment
Figure 3 for The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment
Figure 4 for The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment
Viaarxiv icon

Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization

Add code
Oct 16, 2024
Figure 1 for Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization
Figure 2 for Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization
Figure 3 for Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization
Figure 4 for Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization
Viaarxiv icon

CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses

Add code
Jul 15, 2024
Figure 1 for CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Figure 2 for CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Figure 3 for CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Figure 4 for CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Viaarxiv icon