Yuankai Qi

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

May 21, 2025
Via arXiv

Learning to Reason and Navigate: Parameter Efficient Action Planning with Large Language Models

May 12, 2025

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

May 02, 2025

SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting

Apr 24, 2025

ProgRoCC: A Progressive Approach to Rough Crowd Counting

Apr 18, 2025

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

Mar 31, 2025

Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning

Mar 29, 2025

Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing

Mar 15, 2025

Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models

Jan 25, 2025

Adapter-Enhanced Semantic Prompting for Continual Learning

Dec 15, 2024