Wenhan Luo

UNIC: Unified In-Context Video Editing

Jun 04, 2025

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

May 28, 2025

LlamaSeg: Image Segmentation via Autoregressive Mask Generation

May 26, 2025

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Guided Iterative Policy Optimization

May 25, 2025

CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation

May 11, 2025

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

May 08, 2025

Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

May 03, 2025

VideoVista-CulturalLingo: 360$^\circ$ Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension

Apr 23, 2025

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Mar 13, 2025

FedDyMem: Efficient Federated Learning with Dynamic Memory and Memory-Reduce for Unsupervised Image Anomaly Detection

Feb 28, 2025