Qiguang Chen

OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

May 29, 2025
Via arXiv

CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models

May 25, 2025
Via arXiv

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought

May 21, 2025
Via arXiv

X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System

May 21, 2025
Via arXiv

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning

May 19, 2025
Via arXiv

Efficient Process Reward Model Training via Active Learning

Apr 14, 2025
Via arXiv

DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective

Mar 17, 2025
Via arXiv

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

Mar 13, 2025
Via arXiv

Text2World: Benchmarking Large Language Models for Symbolic World Model Generation

Feb 18, 2025
Via arXiv

DivIL: Unveiling and Addressing Over-Invariance for Out-of-Distribution Generalization

Feb 18, 2025
Via arXiv