Wenming Yang

VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis

Feb 01, 2026

TLDiffGAN: A Latent Diffusion-GAN Framework with Temporal Information Fusion for Anomalous Sound Detection

Feb 01, 2026

SPAN: Spatial-Projection Alignment for Monocular 3D Object Detection

Nov 10, 2025

TACO: Think-Answer Consistency for Optimized Long-Chain Reasoning and Efficient Data Learning via Reinforcement Learning in LVLMs

May 27, 2025

PathoSCOPE: Few-Shot Pathology Detection via Self-Supervised Contrastive Learning and Pathology-Informed Synthetic Embeddings

May 23, 2025

UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval

Apr 14, 2025

VISTA: Unsupervised 2D Temporal Dependency Representations for Time Series Anomaly Detection

Apr 03, 2025

DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval

Mar 06, 2025

GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning

Mar 04, 2025

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Jan 18, 2025