Zhou Zhao

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

May 14, 2025
Via arXiv

Rejoining fragmented ancient bamboo slips with physics-driven deep learning

May 13, 2025
Via arXiv

LiftFeat: 3D Geometry-Aware Local Feature Matching

May 06, 2025
Via arXiv

RoboGround: Robotic Manipulation with Grounded Vision-Language Priors

Apr 30, 2025
Via arXiv

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Apr 29, 2025
Via arXiv

Versatile Framework for Song Generation with Prompt-based Control

Apr 29, 2025
Via arXiv

Unleashing the Power of Natural Audio Featuring Multiple Sound Sources

Apr 24, 2025
Via arXiv

OmniAudio: Generating Spatial Audio from 360-Degree Video

Apr 21, 2025
Via arXiv

Continual Cross-Modal Generalization

Apr 01, 2025
Via arXiv

Pathological Prior-Guided Multiple Instance Learning For Mitigating Catastrophic Forgetting in Breast Cancer Whole Slide Image Classification

Mar 08, 2025
Via arXiv