Picture for Ran He

Ran He

AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards

Add code
Dec 23, 2025
Viaarxiv icon

Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models

Add code
Dec 17, 2025
Viaarxiv icon

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Add code
Oct 10, 2025
Viaarxiv icon

Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification

Add code
Sep 19, 2025
Viaarxiv icon

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

Add code
Sep 04, 2025
Viaarxiv icon

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Add code
Aug 19, 2025
Figure 1 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 2 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 3 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 4 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Viaarxiv icon

Adapting Vision-Language Models Without Labels: A Comprehensive Survey

Add code
Aug 07, 2025
Figure 1 for Adapting Vision-Language Models Without Labels: A Comprehensive Survey
Figure 2 for Adapting Vision-Language Models Without Labels: A Comprehensive Survey
Figure 3 for Adapting Vision-Language Models Without Labels: A Comprehensive Survey
Figure 4 for Adapting Vision-Language Models Without Labels: A Comprehensive Survey
Viaarxiv icon

Test-Time Immunization: A Universal Defense Framework Against Jailbreaks for (Multimodal) Large Language Models

Add code
May 28, 2025
Viaarxiv icon

HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling

Add code
May 27, 2025
Viaarxiv icon

T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search

Add code
May 26, 2025
Viaarxiv icon