Picture for Bei Li

Bei Li

MemoSight: Unifying Context Compression and Multi Token Prediction for Reasoning Acceleration

Add code
Apr 16, 2026
Viaarxiv icon

MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning

Add code
Mar 26, 2026
Viaarxiv icon

On the Emotion Understanding of Synthesized Speech

Add code
Mar 17, 2026
Viaarxiv icon

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Add code
Jan 30, 2026
Viaarxiv icon

Causal Autoregressive Diffusion Language Model

Add code
Jan 29, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search

Add code
Jan 16, 2026
Viaarxiv icon

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Add code
Nov 16, 2025
Viaarxiv icon

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Add code
Nov 10, 2025
Figure 1 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 2 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 3 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 4 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Viaarxiv icon

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Add code
Oct 24, 2025
Figure 1 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 2 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 3 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Figure 4 for MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Viaarxiv icon