Picture for Deyi Xiong

Deyi Xiong

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Add code
Nov 08, 2025
Viaarxiv icon

Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search

Add code
Jun 11, 2025
Viaarxiv icon

FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation

Add code
May 20, 2025
Viaarxiv icon

AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation

Add code
Mar 18, 2025
Figure 1 for AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Figure 2 for AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Figure 3 for AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Figure 4 for AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation
Viaarxiv icon

Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation

Add code
Mar 14, 2025
Viaarxiv icon

N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning

Add code
Mar 12, 2025
Figure 1 for N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning
Figure 2 for N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning
Figure 3 for N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning
Figure 4 for N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning
Viaarxiv icon

Evaluating Discourse Cohesion in Pre-trained Language Models

Add code
Mar 08, 2025
Figure 1 for Evaluating Discourse Cohesion in Pre-trained Language Models
Figure 2 for Evaluating Discourse Cohesion in Pre-trained Language Models
Figure 3 for Evaluating Discourse Cohesion in Pre-trained Language Models
Figure 4 for Evaluating Discourse Cohesion in Pre-trained Language Models
Viaarxiv icon

Tgea: An error-annotated dataset and benchmark tasks for text generation from pretrained language models

Add code
Mar 06, 2025
Viaarxiv icon

The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation

Add code
Mar 05, 2025
Viaarxiv icon

ProBench: Benchmarking Large Language Models in Competitive Programming

Add code
Feb 28, 2025
Viaarxiv icon