Picture for Shuo Tang

Shuo Tang

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Scaling Reinforcement Learning for Content Moderation with Large Language Models

Add code
Dec 23, 2025
Figure 1 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 2 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 3 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 4 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Viaarxiv icon

Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains

Add code
Nov 10, 2025
Viaarxiv icon

InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents

Add code
Oct 02, 2025
Viaarxiv icon

BrowseMaster: Towards Scalable Web Browsing via Tool-Augmented Programmatic Agent Pair

Add code
Aug 12, 2025
Viaarxiv icon

MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction

Add code
Aug 09, 2025
Viaarxiv icon

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

Add code
May 29, 2025
Viaarxiv icon

MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems

Add code
Mar 05, 2025
Figure 1 for MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Figure 2 for MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Figure 3 for MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Figure 4 for MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Viaarxiv icon

Self-Evolving Multi-Agent Collaboration Networks for Software Development

Add code
Oct 22, 2024
Figure 1 for Self-Evolving Multi-Agent Collaboration Networks for Software Development
Figure 2 for Self-Evolving Multi-Agent Collaboration Networks for Software Development
Figure 3 for Self-Evolving Multi-Agent Collaboration Networks for Software Development
Figure 4 for Self-Evolving Multi-Agent Collaboration Networks for Software Development
Viaarxiv icon

Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation

Add code
Oct 18, 2024
Viaarxiv icon