Picture for Yuheng Zhang

Yuheng Zhang

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction

Add code
Mar 12, 2026
Viaarxiv icon

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments

Add code
Mar 10, 2026
Viaarxiv icon

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies

Add code
Mar 03, 2026
Viaarxiv icon

Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback

Add code
Feb 09, 2026
Viaarxiv icon

TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding

Add code
Jan 11, 2026
Viaarxiv icon

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Add code
Dec 31, 2025
Viaarxiv icon

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation

Add code
Nov 10, 2025
Viaarxiv icon

Identifying and Calibrating Overconfidence in Noisy Speech Recognition

Add code
Sep 08, 2025
Viaarxiv icon

ArtVIP: Articulated Digital Assets of Visual Realism, Modular Interaction, and Physical Fidelity for Robot Learning

Add code
Jun 06, 2025
Viaarxiv icon

CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis

Add code
May 29, 2025
Viaarxiv icon