Picture for Qiuhong Ke

Qiuhong Ke

Omni2Sound: Towards Unified Video-Text-to-Audio Generation

Add code
Jan 06, 2026
Viaarxiv icon

Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation

Add code
Dec 12, 2025
Viaarxiv icon

DynaPURLS: Dynamic Refinement of Part-aware Representations for Skeleton-based Zero-Shot Action Recognition

Add code
Dec 12, 2025
Viaarxiv icon

TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition

Add code
Dec 12, 2025
Viaarxiv icon

LatentMove: Towards Complex Human Movement Video Generation

Add code
May 28, 2025
Viaarxiv icon

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Add code
May 23, 2025
Viaarxiv icon

3D Surface Reconstruction with Enhanced High-Frequency Details

Add code
May 06, 2025
Viaarxiv icon

TSTMotion: Training-free Scene-aware Text-to-motion Generation

Add code
May 05, 2025
Viaarxiv icon

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Add code
Mar 15, 2025
Viaarxiv icon

Unified Prompt Attack Against Text-to-Image Generation Models

Add code
Feb 23, 2025
Viaarxiv icon