Picture for Zhantao Yang

Zhantao Yang

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Add code
Sep 09, 2025
Viaarxiv icon

STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision

Add code
Aug 12, 2025
Viaarxiv icon

Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

Add code
Mar 12, 2025
Viaarxiv icon

The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control

Add code
Dec 04, 2024
Figure 1 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 2 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 3 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 4 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Viaarxiv icon

Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce

Add code
Oct 28, 2024
Figure 1 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 2 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 3 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 4 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Viaarxiv icon

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Add code
Jul 03, 2024
Figure 1 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 2 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 3 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 4 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Viaarxiv icon

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection

Add code
May 30, 2024
Figure 1 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 2 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 3 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 4 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Viaarxiv icon

Eliminating Lipschitz Singularities in Diffusion Models

Add code
Jun 20, 2023
Viaarxiv icon

Dimensionality-Varying Diffusion Process

Add code
Nov 29, 2022
Figure 1 for Dimensionality-Varying Diffusion Process
Figure 2 for Dimensionality-Varying Diffusion Process
Figure 3 for Dimensionality-Varying Diffusion Process
Figure 4 for Dimensionality-Varying Diffusion Process
Viaarxiv icon