Picture for Shijie Chen

Shijie Chen

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Add code
Jun 26, 2025
Viaarxiv icon

AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

Add code
Jun 09, 2025
Viaarxiv icon

Completing A Systematic Review in Hours instead of Months with Interactive AI Agents

Add code
Apr 21, 2025
Viaarxiv icon

The R2D2 Deep Neural Network Series for Scalable Non-Cartesian Magnetic Resonance Imaging

Add code
Mar 13, 2025
Viaarxiv icon

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Add code
Oct 07, 2024
Figure 1 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 2 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 3 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 4 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Viaarxiv icon

Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Add code
Oct 03, 2024
Viaarxiv icon

Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System

Add code
Jul 29, 2023
Viaarxiv icon

Mind2Web: Towards a Generalist Agent for the Web

Add code
Jun 15, 2023
Figure 1 for Mind2Web: Towards a Generalist Agent for the Web
Figure 2 for Mind2Web: Towards a Generalist Agent for the Web
Figure 3 for Mind2Web: Towards a Generalist Agent for the Web
Figure 4 for Mind2Web: Towards a Generalist Agent for the Web
Viaarxiv icon

Error Detection for Text-to-SQL Semantic Parsing

Add code
May 23, 2023
Figure 1 for Error Detection for Text-to-SQL Semantic Parsing
Figure 2 for Error Detection for Text-to-SQL Semantic Parsing
Figure 3 for Error Detection for Text-to-SQL Semantic Parsing
Figure 4 for Error Detection for Text-to-SQL Semantic Parsing
Viaarxiv icon

Text-to-SQL Error Correction with Language Models of Code

Add code
May 22, 2023
Viaarxiv icon