Picture for Ruichuan An

Ruichuan An

Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval

Add code
Oct 26, 2025
Viaarxiv icon

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning

Add code
Oct 16, 2025
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Add code
Jun 05, 2025
Figure 1 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 2 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 3 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Figure 4 for Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
Viaarxiv icon

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

Add code
May 26, 2025
Viaarxiv icon

SpikeGen: Generative Framework for Visual Spike Stream Processing

Add code
May 23, 2025
Viaarxiv icon

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts

Add code
May 20, 2025
Viaarxiv icon

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

Add code
May 20, 2025
Viaarxiv icon

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization

Add code
Mar 17, 2025
Viaarxiv icon

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders

Add code
Jan 03, 2025
Figure 1 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 2 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 3 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 4 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Viaarxiv icon