Picture for Yujia Qin

Yujia Qin

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Add code
Dec 14, 2025
Viaarxiv icon

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Add code
Nov 12, 2025
Figure 1 for Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Figure 2 for Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Figure 3 for Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Figure 4 for Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Add code
Apr 15, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Add code
Jan 21, 2025
Viaarxiv icon

GUICourse: From General Vision Language Models to Versatile GUI Agents

Add code
Jun 17, 2024
Figure 1 for GUICourse: From General Vision Language Models to Versatile GUI Agents
Figure 2 for GUICourse: From General Vision Language Models to Versatile GUI Agents
Figure 3 for GUICourse: From General Vision Language Models to Versatile GUI Agents
Figure 4 for GUICourse: From General Vision Language Models to Versatile GUI Agents
Viaarxiv icon

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

Add code
Mar 13, 2024
Figure 1 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 2 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 3 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 4 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Viaarxiv icon

RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation

Add code
Feb 26, 2024
Figure 1 for RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation
Figure 2 for RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation
Figure 3 for RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation
Figure 4 for RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation
Viaarxiv icon

Large Language Model-based Human-Agent Collaboration for Complex Task Solving

Add code
Feb 20, 2024
Figure 1 for Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Figure 2 for Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Figure 3 for Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Figure 4 for Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Viaarxiv icon