Yan Zhou

Department of Radiology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China

A Physics-Driven Neural Network with Parameter Embedding for Generating Quantitative MR Maps from Weighted Images

Aug 11, 2025

DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera

Aug 08, 2025

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Jun 16, 2025

AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models

May 29, 2025

Can Multimodal Large Language Models Understand Spatial Relations?

May 25, 2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

May 05, 2025

SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain

Mar 26, 2025

Accessing the Effect of Phyllotaxy and Planting Density on Light Use Efficiency in Field-Grown Maize using 3D Reconstructions

Mar 10, 2025

ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis

Mar 09, 2025

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Nov 25, 2024