Picture for Bei Yu

Bei Yu

From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models

Add code
Oct 06, 2025
Viaarxiv icon

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Add code
Aug 11, 2025
Viaarxiv icon

DreamVE: Unified Instruction-based Image and Video Editing

Add code
Aug 08, 2025
Viaarxiv icon

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Add code
Jul 17, 2025
Viaarxiv icon

Deep-Learning-Based Pre-Layout Parasitic Capacitance Prediction on SRAM Designs

Add code
Jul 09, 2025
Viaarxiv icon

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Add code
Jun 17, 2025
Viaarxiv icon

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Add code
May 29, 2025
Viaarxiv icon

RTime-QA: A Benchmark for Atomic Temporal Event Understanding in Large Multi-modal Models

Add code
May 25, 2025
Viaarxiv icon

PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval

Add code
May 23, 2025
Figure 1 for PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval
Figure 2 for PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval
Figure 3 for PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval
Figure 4 for PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval
Viaarxiv icon

On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding

Add code
May 19, 2025
Viaarxiv icon