Ion Stoica

Why Do Multi-Agent LLM Systems Fail?

Mar 17, 2025
Via arXiv

WorldModelBench: Judging Video Generation Models As World Models

Feb 28, 2025
Via arXiv

Optimizing Model Selection for Compound AI Systems

Feb 20, 2025
Via arXiv

S*: Test Time Scaling for Code Generation

Feb 20, 2025
Via arXiv

Prompt-to-Leaderboard

Feb 20, 2025
Via arXiv

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Feb 19, 2025
Via arXiv

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Feb 12, 2025
Via arXiv

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Feb 11, 2025
Via arXiv

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Feb 10, 2025
Via arXiv

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

Feb 06, 2025
Via arXiv