Haiyang Sun

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Jun 09, 2025
Via arXiv

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Jun 09, 2025
Via arXiv

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

May 26, 2025
Via arXiv

PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth

May 03, 2025
Via arXiv

SIDME: Self-supervised Image Demoiréing via Masked Encoder-Decoder Reconstruction

Apr 16, 2025
Via arXiv

Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis

Apr 14, 2025
Via arXiv

CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving

Mar 28, 2025
Via arXiv

Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios

Mar 11, 2025
Via arXiv

BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal Correlation

Mar 05, 2025
Via arXiv

CMamba: Learned Image Compression with State Space Models

Feb 07, 2025
Via arXiv