Picture for Hao Chen

Hao Chen

Charlie

Unleashing Hour-Scale Video Training for Long Video-Language Understanding

Add code
Jun 05, 2025
Viaarxiv icon

ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation

Add code
May 30, 2025
Viaarxiv icon

Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation

Add code
May 29, 2025
Viaarxiv icon

Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems

Add code
May 29, 2025
Viaarxiv icon

Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics

Add code
May 29, 2025
Viaarxiv icon

VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

Hierarchical Instruction-aware Embodied Visual Tracking

Add code
May 27, 2025
Viaarxiv icon

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology

Add code
May 26, 2025
Figure 1 for PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology
Figure 2 for PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology
Figure 3 for PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology
Figure 4 for PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology
Viaarxiv icon

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Add code
May 26, 2025
Viaarxiv icon