Picture for Yu Bao

Yu Bao

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Add code
Aug 26, 2025
Viaarxiv icon

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Add code
Aug 20, 2025
Viaarxiv icon

Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice

Add code
Jul 24, 2025
Viaarxiv icon

Permutation Randomization on Nonsmooth Nonconvex Optimization: A Theoretical and Experimental Study

Add code
May 16, 2025
Viaarxiv icon

HOME-3: High-Order Momentum Estimator with Third-Power Gradient for Convex and Smooth Nonconvex Optimization

Add code
May 16, 2025
Viaarxiv icon

Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data

Add code
Apr 20, 2025
Viaarxiv icon

Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models

Add code
Mar 13, 2025
Figure 1 for Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models
Figure 2 for Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models
Figure 3 for Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models
Figure 4 for Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models
Viaarxiv icon

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Add code
Sep 27, 2024
Figure 1 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 2 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 3 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 4 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Viaarxiv icon

An Adaptive Gradient Regularization Method

Add code
Jul 24, 2024
Viaarxiv icon

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Add code
Jul 19, 2024
Viaarxiv icon