Picture for Ziyang Song

Ziyang Song

MV-S2V: Multi-View Subject-Consistent Video Generation

Add code
Jan 27, 2026
Viaarxiv icon

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Add code
Dec 28, 2025
Viaarxiv icon

Depth Anything in $360^\circ$: Towards Scale Invariance in the Wild

Add code
Dec 28, 2025
Viaarxiv icon

Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation

Add code
Dec 24, 2025
Viaarxiv icon

CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Add code
Dec 22, 2025
Figure 1 for CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization
Figure 2 for CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization
Figure 3 for CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization
Figure 4 for CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization
Viaarxiv icon

Cross-modal Retrieval Models for Stripped Binary Analysis

Add code
Dec 11, 2025
Figure 1 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 2 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 3 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 4 for Cross-modal Retrieval Models for Stripped Binary Analysis
Viaarxiv icon

Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation

Add code
Aug 07, 2025
Viaarxiv icon

FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

Add code
Jun 09, 2025
Viaarxiv icon

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation

Add code
Jan 05, 2025
Figure 1 for DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Figure 2 for DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Figure 3 for DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Figure 4 for DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Viaarxiv icon

DeepSeek-V3 Technical Report

Add code
Dec 27, 2024
Figure 1 for DeepSeek-V3 Technical Report
Figure 2 for DeepSeek-V3 Technical Report
Figure 3 for DeepSeek-V3 Technical Report
Figure 4 for DeepSeek-V3 Technical Report
Viaarxiv icon