Picture for Shuyue Hu

Shuyue Hu

Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs

Add code
Apr 15, 2025
Viaarxiv icon

Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute

Add code
Apr 02, 2025
Viaarxiv icon

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Add code
Mar 12, 2025
Viaarxiv icon

Nature-Inspired Population-Based Evolution of Large Language Models

Add code
Mar 03, 2025
Viaarxiv icon

If Multi-Agent Debate is the Answer, What is the Question?

Add code
Feb 12, 2025
Viaarxiv icon

EvoFlow: Evolving Diverse Agentic Workflows On The Fly

Add code
Feb 11, 2025
Figure 1 for EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Figure 2 for EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Figure 3 for EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Figure 4 for EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Viaarxiv icon

Understanding When and Why Graph Attention Mechanisms Work via Node Classification

Add code
Dec 20, 2024
Viaarxiv icon

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Add code
Nov 26, 2024
Figure 1 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 2 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 3 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 4 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Viaarxiv icon

OASIS: Open Agents Social Interaction Simulations on One Million Agents

Add code
Nov 21, 2024
Figure 1 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 2 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 3 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 4 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Viaarxiv icon

Configurable Mirror Descent: Towards a Unification of Decision Making

Add code
May 20, 2024
Figure 1 for Configurable Mirror Descent: Towards a Unification of Decision Making
Figure 2 for Configurable Mirror Descent: Towards a Unification of Decision Making
Figure 3 for Configurable Mirror Descent: Towards a Unification of Decision Making
Figure 4 for Configurable Mirror Descent: Towards a Unification of Decision Making
Viaarxiv icon