Picture for Yongyu Mu

Yongyu Mu

DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering

Add code
Mar 19, 2026
Viaarxiv icon

Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning

Add code
Mar 17, 2026
Viaarxiv icon

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Add code
Nov 16, 2025
Viaarxiv icon

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Add code
Oct 24, 2025
Figure 1 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 2 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 3 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 4 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Viaarxiv icon

GRAM: A Generative Foundation Reward Model for Reward Generalization

Add code
Jun 18, 2025
Viaarxiv icon

Dissecting Long Reasoning Models: An Empirical Study

Add code
Jun 05, 2025
Figure 1 for Dissecting Long Reasoning Models: An Empirical Study
Figure 2 for Dissecting Long Reasoning Models: An Empirical Study
Figure 3 for Dissecting Long Reasoning Models: An Empirical Study
Figure 4 for Dissecting Long Reasoning Models: An Empirical Study
Viaarxiv icon

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Add code
Mar 09, 2025
Figure 1 for Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Figure 2 for Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Figure 3 for Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Figure 4 for Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Viaarxiv icon

Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models

Add code
Jan 13, 2025
Viaarxiv icon

SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment

Add code
Jan 07, 2025
Figure 1 for SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Figure 2 for SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Figure 3 for SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Figure 4 for SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Viaarxiv icon

LRHP: Learning Representations for Human Preferences via Preference Pairs

Add code
Oct 06, 2024
Figure 1 for LRHP: Learning Representations for Human Preferences via Preference Pairs
Figure 2 for LRHP: Learning Representations for Human Preferences via Preference Pairs
Figure 3 for LRHP: Learning Representations for Human Preferences via Preference Pairs
Figure 4 for LRHP: Learning Representations for Human Preferences via Preference Pairs
Viaarxiv icon