Picture for Chengcheng Han

Chengcheng Han

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Add code
May 27, 2026
Viaarxiv icon

GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

Add code
May 27, 2026
Viaarxiv icon

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Add code
Apr 02, 2026
Viaarxiv icon

TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training

Add code
Mar 02, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Add code
Sep 30, 2025
Viaarxiv icon

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models

Add code
Sep 18, 2025
Viaarxiv icon

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Add code
Sep 17, 2025
Figure 1 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 2 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 3 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 4 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Viaarxiv icon

RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation

Add code
May 30, 2025
Viaarxiv icon

Length Desensitization in Directed Preference Optimization

Add code
Sep 10, 2024
Figure 1 for Length Desensitization in Directed Preference Optimization
Figure 2 for Length Desensitization in Directed Preference Optimization
Figure 3 for Length Desensitization in Directed Preference Optimization
Figure 4 for Length Desensitization in Directed Preference Optimization
Viaarxiv icon