Picture for Yu Bao

Yu Bao

Permutation Randomization on Nonsmooth Nonconvex Optimization: A Theoretical and Experimental Study

Add code
May 16, 2025
Viaarxiv icon

HOME-3: High-Order Momentum Estimator with Third-Power Gradient for Convex and Smooth Nonconvex Optimization

Add code
May 16, 2025
Viaarxiv icon

Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data

Add code
Apr 20, 2025
Viaarxiv icon

Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models

Add code
Mar 13, 2025
Viaarxiv icon

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Add code
Sep 27, 2024
Figure 1 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 2 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 3 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 4 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Viaarxiv icon

An Adaptive Gradient Regularization Method

Add code
Jul 24, 2024
Viaarxiv icon

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Add code
Jul 19, 2024
Viaarxiv icon

EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling

Add code
Mar 21, 2024
Viaarxiv icon

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

Add code
Aug 25, 2023
Viaarxiv icon

Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation

Add code
Mar 31, 2023
Viaarxiv icon