Guangtao Zeng

BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization

Dec 29, 2025
Via arXiv

Tailored Primitive Initialization is the Secret Key to Reinforcement Learning

Nov 16, 2025
Via arXiv

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

May 29, 2025
Via arXiv

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Feb 04, 2025
Via arXiv

SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages

Dec 02, 2024
Via arXiv

Scaling up Masked Diffusion Models on Text

Oct 24, 2024
Via arXiv

Effi-Code: Unleashing Code Efficiency in Language Models

Oct 14, 2024
Via arXiv

RegMix: Data Mixture as Regression for Language Model Pre-training

Jul 01, 2024
Via arXiv

Long Context Transfer from Language to Vision

Jun 24, 2024
Via arXiv

Sailor: Open Language Models for South-East Asia

Apr 04, 2024
Via arXiv