Picture for Shujian Huang

Shujian Huang

Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data

Add code
Apr 20, 2025
Viaarxiv icon

Could Thinking Multilingually Empower LLM Reasoning?

Add code
Apr 16, 2025
Viaarxiv icon

Elucidating the Design Space of Multimodal Protein Language Models

Add code
Apr 16, 2025
Viaarxiv icon

Understanding LLMs' Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From

Add code
Apr 15, 2025
Viaarxiv icon

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Add code
Apr 02, 2025
Viaarxiv icon

R-PRM: Reasoning-Driven Process Reward Modeling

Add code
Mar 27, 2025
Viaarxiv icon

Process-based Self-Rewarding Language Models

Add code
Mar 05, 2025
Viaarxiv icon

Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation

Add code
Feb 28, 2025
Viaarxiv icon

Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning

Add code
Feb 21, 2025
Viaarxiv icon

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon