Picture for Dayuan Fu

Dayuan Fu

DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery

Add code
Aug 09, 2025
Viaarxiv icon

DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments

Add code
Apr 07, 2025
Viaarxiv icon

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Add code
Jan 03, 2025
Viaarxiv icon

MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making

Add code
Sep 25, 2024
Viaarxiv icon

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Add code
Sep 05, 2024
Figure 1 for How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Figure 2 for How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Figure 3 for How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Figure 4 for How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Viaarxiv icon

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Add code
Jun 12, 2024
Figure 1 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 2 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 3 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 4 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Viaarxiv icon

DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations

Add code
Mar 31, 2024
Figure 1 for DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Figure 2 for DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Figure 3 for DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Figure 4 for DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Viaarxiv icon

On Large Language Models' Hallucination with Regard to Known Facts

Add code
Mar 29, 2024
Figure 1 for On Large Language Models' Hallucination with Regard to Known Facts
Figure 2 for On Large Language Models' Hallucination with Regard to Known Facts
Figure 3 for On Large Language Models' Hallucination with Regard to Known Facts
Figure 4 for On Large Language Models' Hallucination with Regard to Known Facts
Viaarxiv icon

BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses

Add code
Mar 02, 2024
Viaarxiv icon

PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability

Add code
Feb 18, 2024
Viaarxiv icon