Yaxiong Wang

CanonSLR: Canonical-View Guided Multi-View Continuous Sign Language Recognition

Apr 20, 2026

Pretrain-then-Adapt: Uncertainty-Aware Test-Time Adaptation for Text-based Person Search

Apr 07, 2026

Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation

Apr 02, 2026

Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting

Mar 29, 2026

Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection

Mar 02, 2026

OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL

Feb 12, 2026

Tears or Cheers? Benchmarking LLMs via Culturally Elicited Distinct Affective Responses

Jan 19, 2026

TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation

Sep 26, 2025

Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations

Sep 16, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Aug 09, 2025