Picture for Yihong Liu

Yihong Liu

Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations

Add code
Jan 12, 2026
Viaarxiv icon

Left, Right, or Center? Evaluating LLM Framing in News Classification and Generation

Add code
Jan 09, 2026
Viaarxiv icon

Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners

Add code
Jan 06, 2026
Viaarxiv icon

Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation

Add code
Jan 01, 2026
Viaarxiv icon

Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors

Add code
Oct 10, 2025
Viaarxiv icon

A Comprehensive Evaluation of Multilingual Chain-of-Thought Reasoning: Performance, Consistency, and Faithfulness Across Languages

Add code
Oct 10, 2025
Viaarxiv icon

Enhancing Robustness of Autoregressive Language Models against Orthographic Attacks via Pixel-based Approach

Add code
Aug 28, 2025
Viaarxiv icon

Refusal Direction is Universal Across Safety-Aligned Languages

Add code
May 22, 2025
Viaarxiv icon

Tracing Multilingual Factual Knowledge Acquisition in Pretraining

Add code
May 20, 2025
Viaarxiv icon

HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization

Add code
Apr 21, 2025
Viaarxiv icon