Picture for Mingxuan Wang

Mingxuan Wang

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Add code
Jul 03, 2025
Viaarxiv icon

Truncated Proximal Policy Optimization

Add code
Jun 18, 2025
Viaarxiv icon

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Add code
May 26, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Add code
Apr 08, 2025
Viaarxiv icon

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Viaarxiv icon

OpenAI o1 System Card

Add code
Dec 21, 2024
Figure 1 for OpenAI o1 System Card
Figure 2 for OpenAI o1 System Card
Figure 3 for OpenAI o1 System Card
Figure 4 for OpenAI o1 System Card
Viaarxiv icon

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement

Add code
Dec 16, 2024
Viaarxiv icon

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Add code
Dec 10, 2024
Viaarxiv icon

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

Add code
Feb 20, 2024
Figure 1 for SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion
Figure 2 for SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion
Figure 3 for SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion
Figure 4 for SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion
Viaarxiv icon