Yuxin Peng

Scan-and-Print: Patch-level Data Summarization and Augmentation for Content-aware Layout Generation in Poster Design

May 27, 2025
Via arXiv

PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation

May 06, 2025
Via arXiv

Benchmarking Large Vision-Language Models on Fine-Grained Image Tasks: A Comprehensive Evaluation

Apr 21, 2025
Via arXiv

DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding

Apr 21, 2025
Via arXiv

ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer

Apr 03, 2025
Via arXiv

STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding

Mar 20, 2025
Via arXiv

SCAP: Transductive Test-Time Adaptation via Supportive Clique-based Attribute Prompting

Mar 17, 2025
Via arXiv

Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models

Jan 25, 2025
Via arXiv

DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification

Dec 12, 2024
Via arXiv

EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing

Dec 12, 2024
Via arXiv