Bei Li

One Size Does Not Fit All: A Distribution-Aware Sparsification for More Precise Model Merging

Aug 08, 2025
Via arXiv

Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction

Jul 30, 2025
Via arXiv

GRAM: A Generative Foundation Reward Model for Reward Generalization

Jun 18, 2025
Via arXiv

TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

Jun 11, 2025
Via arXiv

Dissecting Long Reasoning Models: An Empirical Study

Jun 05, 2025
Via arXiv

Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching

Jun 05, 2025
Via arXiv

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Mar 09, 2025
Via arXiv

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Feb 20, 2025
Via arXiv

Optimizing Speech Multi-View Feature Fusion through Conditional Computation

Jan 14, 2025
Via arXiv

SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment

Jan 07, 2025
Via arXiv