Fangyu Lei

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Apr 28, 2026

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Apr 20, 2026

OpenCUA: Open Foundations for Computer-Use Agents

Aug 12, 2025

Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN

May 22, 2025

GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks

Feb 20, 2025

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Nov 12, 2024

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models

Oct 09, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jul 15, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024

Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent

Mar 01, 2024