Picture for Rui Yang

Rui Yang

InterDigital, Inc

AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models

Add code
Mar 24, 2026
Viaarxiv icon

CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training

Add code
Mar 23, 2026
Viaarxiv icon

Multimodal OCR: Parse Anything from Documents

Add code
Mar 13, 2026
Viaarxiv icon

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

DefenseSplat: Enhancing the Robustness of 3D Gaussian Splatting via Frequency-Aware Filtering

Add code
Feb 22, 2026
Viaarxiv icon

SnapMLA: Efficient Long-Context MLA Decoding via Hardware-Aware FP8 Quantized Pipelining

Add code
Feb 12, 2026
Viaarxiv icon

CIEC: Coupling Implicit and Explicit Cues for Multimodal Weakly Supervised Manipulation Localization

Add code
Feb 03, 2026
Viaarxiv icon

How do Visual Attributes Influence Web Agents? A Comprehensive Evaluation of User Interface Design Factors

Add code
Jan 29, 2026
Viaarxiv icon

Mining Forgery Traces from Reconstruction Error: A Weakly Supervised Framework for Multimodal Deepfake Temporal Localization

Add code
Jan 29, 2026
Viaarxiv icon

DualShield: Safe Model Predictive Diffusion via Reachability Analysis for Interactive Autonomous Driving

Add code
Jan 22, 2026
Viaarxiv icon