Picture for Tong Yu

Tong Yu

Sam

WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning

Add code
Feb 19, 2026
Viaarxiv icon

AMPS: Adaptive Modality Preference Steering via Functional Entropy

Add code
Feb 13, 2026
Viaarxiv icon

ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces

Add code
Feb 12, 2026
Viaarxiv icon

Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models

Add code
Jan 20, 2026
Viaarxiv icon

Yuan3.0 Flash: An Open Multimodal Large Language Model for Enterprise Applications

Add code
Jan 05, 2026
Viaarxiv icon

Five Years of SciCap: What We Learned and Future Directions for Scientific Figure Captioning

Add code
Dec 25, 2025
Viaarxiv icon

Yuan-TecSwin: A text conditioned Diffusion model with Swin-transformer blocks

Add code
Dec 18, 2025
Viaarxiv icon

Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model

Add code
Dec 10, 2025
Viaarxiv icon

Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners

Add code
Oct 06, 2025
Figure 1 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 2 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 3 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 4 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Viaarxiv icon

VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding

Add code
Aug 10, 2025
Viaarxiv icon