Picture for Silvio Savarese

Silvio Savarese

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

Add code
Dec 19, 2025
Viaarxiv icon

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

Add code
Nov 17, 2025
Figure 1 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 2 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 3 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 4 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Viaarxiv icon

SSR: Socratic Self-Refine for Large Language Model Reasoning

Add code
Nov 13, 2025
Figure 1 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 2 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 3 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 4 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Viaarxiv icon

Echoing: Identity Failures when LLM Agents Talk to Each Other

Add code
Nov 12, 2025
Viaarxiv icon

Moirai 2.0: When Less Is More for Time Series Forecasting

Add code
Nov 12, 2025
Figure 1 for Moirai 2.0: When Less Is More for Time Series Forecasting
Figure 2 for Moirai 2.0: When Less Is More for Time Series Forecasting
Figure 3 for Moirai 2.0: When Less Is More for Time Series Forecasting
Figure 4 for Moirai 2.0: When Less Is More for Time Series Forecasting
Viaarxiv icon

Grounded Test-Time Adaptation for LLM Agents

Add code
Nov 06, 2025
Figure 1 for Grounded Test-Time Adaptation for LLM Agents
Figure 2 for Grounded Test-Time Adaptation for LLM Agents
Figure 3 for Grounded Test-Time Adaptation for LLM Agents
Figure 4 for Grounded Test-Time Adaptation for LLM Agents
Viaarxiv icon

Reasoning Curriculum: Bootstrapping Broad LLM Reasoning from Math

Add code
Oct 30, 2025
Viaarxiv icon

ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning

Add code
Oct 09, 2025
Figure 1 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 2 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 3 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 4 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Viaarxiv icon

WALT: Web Agents that Learn Tools

Add code
Oct 01, 2025
Viaarxiv icon

SCUBA: Salesforce Computer Use Benchmark

Add code
Sep 30, 2025
Figure 1 for SCUBA: Salesforce Computer Use Benchmark
Figure 2 for SCUBA: Salesforce Computer Use Benchmark
Figure 3 for SCUBA: Salesforce Computer Use Benchmark
Figure 4 for SCUBA: Salesforce Computer Use Benchmark
Viaarxiv icon