Hao Shi

Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization

Mar 13, 2026
Via arXiv

Streaming Translation and Transcription Through Speech-to-Text Causal Alignment

Mar 12, 2026
Via arXiv

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction

Mar 12, 2026
Via arXiv

OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras

Mar 09, 2026
Via arXiv

RMBench: Memory-Dependent Robotic Manipulation Benchmark with Insights into Policy Design

Mar 01, 2026
Via arXiv

Training-Free Intelligibility-Guided Observation Addition for Noisy ASR

Feb 24, 2026
Via arXiv

ExoGS: A 4D Real-to-Sim-to-Real Framework for Scalable Manipulation Data Collection

Jan 26, 2026
Via arXiv

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Dec 30, 2025
Via arXiv

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Nov 12, 2025
Via arXiv

OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera

Nov 05, 2025
Via arXiv