Picture for Bei Li

Bei Li

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Add code
Oct 24, 2025
Viaarxiv icon

IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method

Add code
Sep 26, 2025
Figure 1 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 2 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 3 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 4 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Viaarxiv icon

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Add code
Sep 10, 2025
Figure 1 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 2 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 3 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 4 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Viaarxiv icon

SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement

Add code
Aug 28, 2025
Viaarxiv icon

One Size Does Not Fit All: A Distribution-Aware Sparsification for More Precise Model Merging

Add code
Aug 08, 2025
Viaarxiv icon

Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction

Add code
Jul 30, 2025
Viaarxiv icon

GRAM: A Generative Foundation Reward Model for Reward Generalization

Add code
Jun 18, 2025
Viaarxiv icon

TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

Add code
Jun 11, 2025
Figure 1 for TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration
Figure 2 for TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration
Figure 3 for TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration
Figure 4 for TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration
Viaarxiv icon

Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching

Add code
Jun 05, 2025
Figure 1 for Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching
Figure 2 for Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching
Figure 3 for Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching
Figure 4 for Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching
Viaarxiv icon

Dissecting Long Reasoning Models: An Empirical Study

Add code
Jun 05, 2025
Figure 1 for Dissecting Long Reasoning Models: An Empirical Study
Figure 2 for Dissecting Long Reasoning Models: An Empirical Study
Figure 3 for Dissecting Long Reasoning Models: An Empirical Study
Figure 4 for Dissecting Long Reasoning Models: An Empirical Study
Viaarxiv icon