Picture for Ye Shi

Ye Shi

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

Add code
Feb 09, 2025
Figure 1 for UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
Figure 2 for UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
Figure 3 for UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
Figure 4 for UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
Viaarxiv icon

Evaluating Image Caption via Cycle-consistent Text-to-Image Generation

Add code
Jan 08, 2025
Figure 1 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 2 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 3 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 4 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Viaarxiv icon

AffordDP: Generalizable Diffusion Policy with Transferable Affordance

Add code
Dec 04, 2024
Figure 1 for AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Figure 2 for AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Figure 3 for AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Figure 4 for AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Viaarxiv icon

SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model

Add code
Dec 02, 2024
Figure 1 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 2 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 3 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 4 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Viaarxiv icon

NLPrompt: Noise-Label Prompt Learning for Vision-Language Models

Add code
Dec 02, 2024
Figure 1 for NLPrompt: Noise-Label Prompt Learning for Vision-Language Models
Figure 2 for NLPrompt: Noise-Label Prompt Learning for Vision-Language Models
Figure 3 for NLPrompt: Noise-Label Prompt Learning for Vision-Language Models
Figure 4 for NLPrompt: Noise-Label Prompt Learning for Vision-Language Models
Viaarxiv icon

Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective

Add code
Oct 30, 2024
Viaarxiv icon

Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method

Add code
Sep 29, 2024
Figure 1 for Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method
Figure 2 for Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method
Figure 3 for Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method
Figure 4 for Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method
Viaarxiv icon

Monocular Human-Object Reconstruction in the Wild

Add code
Jul 31, 2024
Viaarxiv icon

StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset

Add code
Jul 30, 2024
Figure 1 for StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset
Figure 2 for StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset
Figure 3 for StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset
Figure 4 for StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset
Viaarxiv icon

Uniform Transformation: Refining Latent Representation in Variational Autoencoders

Add code
Jul 02, 2024
Figure 1 for Uniform Transformation: Refining Latent Representation in Variational Autoencoders
Figure 2 for Uniform Transformation: Refining Latent Representation in Variational Autoencoders
Figure 3 for Uniform Transformation: Refining Latent Representation in Variational Autoencoders
Figure 4 for Uniform Transformation: Refining Latent Representation in Variational Autoencoders
Viaarxiv icon