Picture for Xiangyang Luo

Xiangyang Luo

When Vision Misleads, Let Location Speak: A Worldwide Image Geo-Localization Method via Location Attention Mechanism and Large Multimodal Models

Add code
Jun 08, 2026
Viaarxiv icon

IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation

Add code
May 28, 2026
Viaarxiv icon

DualGeo: A Dual-View Framework for Worldwide Image Geo-localization

Add code
Apr 28, 2026
Viaarxiv icon

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Add code
Apr 21, 2026
Viaarxiv icon

Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training

Add code
Mar 26, 2026
Viaarxiv icon

AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models

Add code
Feb 06, 2026
Viaarxiv icon

CIEC: Coupling Implicit and Explicit Cues for Multimodal Weakly Supervised Manipulation Localization

Add code
Feb 03, 2026
Viaarxiv icon

Lossless Copyright Protection via Intrinsic Model Fingerprinting

Add code
Jan 29, 2026
Viaarxiv icon

Mining Forgery Traces from Reconstruction Error: A Weakly Supervised Framework for Multimodal Deepfake Temporal Localization

Add code
Jan 29, 2026
Viaarxiv icon

MARE: Multimodal Alignment and Reinforcement for Explainable Deepfake Detection via Vision-Language Models

Add code
Jan 29, 2026
Viaarxiv icon