Picture for Dexun Li

Dexun Li

Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning

Add code
Feb 10, 2026
Viaarxiv icon

SRR-Judge: Step-Level Rating and Refinement for Enhancing Search-Integrated Reasoning in Search Agents

Add code
Feb 08, 2026
Viaarxiv icon

Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research

Add code
Oct 24, 2025
Viaarxiv icon

Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger

Add code
Feb 18, 2025
Figure 1 for Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Figure 2 for Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Figure 3 for Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Figure 4 for Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Viaarxiv icon

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Add code
Jan 15, 2025
Figure 1 for MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents
Figure 2 for MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents
Figure 3 for MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents
Figure 4 for MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents
Viaarxiv icon

ToolACE: Winning the Points of LLM Function Calling

Add code
Sep 02, 2024
Figure 1 for ToolACE: Winning the Points of LLM Function Calling
Figure 2 for ToolACE: Winning the Points of LLM Function Calling
Figure 3 for ToolACE: Winning the Points of LLM Function Calling
Figure 4 for ToolACE: Winning the Points of LLM Function Calling
Viaarxiv icon

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

Add code
Jun 20, 2024
Figure 1 for EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Figure 2 for EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Figure 3 for EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Figure 4 for EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Viaarxiv icon

Meta-Task Planning for Language Agents

Add code
May 28, 2024
Figure 1 for Meta-Task Planning for Language Agents
Figure 2 for Meta-Task Planning for Language Agents
Figure 3 for Meta-Task Planning for Language Agents
Figure 4 for Meta-Task Planning for Language Agents
Viaarxiv icon

Aligning Crowd Feedback via Distributional Preference Reward Modeling

Add code
Feb 21, 2024
Figure 1 for Aligning Crowd Feedback via Distributional Preference Reward Modeling
Figure 2 for Aligning Crowd Feedback via Distributional Preference Reward Modeling
Figure 3 for Aligning Crowd Feedback via Distributional Preference Reward Modeling
Figure 4 for Aligning Crowd Feedback via Distributional Preference Reward Modeling
Viaarxiv icon

A Hierarchical Approach to Environment Design with Generative Trajectory Modeling

Add code
Sep 30, 2023
Figure 1 for A Hierarchical Approach to Environment Design with Generative Trajectory Modeling
Figure 2 for A Hierarchical Approach to Environment Design with Generative Trajectory Modeling
Figure 3 for A Hierarchical Approach to Environment Design with Generative Trajectory Modeling
Figure 4 for A Hierarchical Approach to Environment Design with Generative Trajectory Modeling
Viaarxiv icon