Haoyu Zhang

SMC-ITA: Sequential Monte Carlo Inference-Time Alignment for Video-to-Audio Generation

Jun 07, 2026
Via arXiv

Beyond Waypoints: A Trajectory-Centric Waypointing Paradigm for Vision-Language Navigation

Jun 05, 2026
Via arXiv

Gaussian-Voxel Duet: A Dual-Scaffolding Hybrid Representation for Fast and Accurate Monocular Surface Reconstruction

May 26, 2026
Via arXiv

JFAA: Technical Report for the EPIC-KITCHENS-100 Action Anticipation Challenge at EgoVis 2026

May 20, 2026
Via arXiv

OSGNet with MLLM Reranking @ Ego4D Episodic Memory Challenge 2026

May 20, 2026
Via arXiv

VISTA: Technical Report for the Ego4D Short-Term Object Interaction Anticipation at EgoVis 2026

May 20, 2026
Via arXiv

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

May 18, 2026
Via arXiv

Large-Small Model Collaboration for Farmland Semantic Change Detection

May 12, 2026
Via arXiv

Kinetic-Optimal Scheduling with Moment Correction for Metric-Induced Discrete Flow Matching in Zero-Shot Text-to-Speech

May 10, 2026
Via arXiv

Exposing LLM Safety Gaps Through Mathematical Encoding: New Attacks and Systematic Analysis

May 05, 2026
Via arXiv