Picture for Zhaoyang Huang

Zhaoyang Huang

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

RelightMaster: Precise Video Relighting with Multi-plane Light Images

Add code
Nov 09, 2025
Viaarxiv icon

CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model

Add code
Apr 11, 2025
Figure 1 for CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Figure 2 for CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Figure 3 for CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Figure 4 for CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Viaarxiv icon

GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Add code
Jan 05, 2025
Figure 1 for GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Figure 2 for GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Figure 3 for GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Figure 4 for GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Viaarxiv icon

A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding

Add code
Nov 04, 2024
Figure 1 for A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
Figure 2 for A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
Figure 3 for A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
Figure 4 for A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
Viaarxiv icon

ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses

Add code
Oct 31, 2024
Figure 1 for ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses
Figure 2 for ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses
Figure 3 for ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses
Figure 4 for ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses
Viaarxiv icon

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Add code
Oct 27, 2024
Viaarxiv icon

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

Add code
Oct 09, 2024
Viaarxiv icon

BlinkTrack: Feature Tracking over 100 FPS via Events and Images

Add code
Sep 26, 2024
Viaarxiv icon

Phased Consistency Model

Add code
May 28, 2024
Viaarxiv icon