Picture for Yezhou Yang

Yezhou Yang

Arizona State University

$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

Add code
Feb 07, 2024
Figure 1 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 2 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 3 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 4 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Viaarxiv icon

Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping

Add code
Jan 16, 2024
Figure 1 for Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
Figure 2 for Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
Figure 3 for Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
Figure 4 for Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
Viaarxiv icon

Open-TI: Open Traffic Intelligence with Augmented Language Model

Add code
Dec 30, 2023
Viaarxiv icon

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

Add code
Dec 07, 2023
Figure 1 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 2 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 3 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 4 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Viaarxiv icon

SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras

Add code
Sep 04, 2023
Figure 1 for SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras
Figure 2 for SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras
Figure 3 for SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras
Figure 4 for SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras
Viaarxiv icon

Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding

Add code
Sep 01, 2023
Viaarxiv icon

Adversarial Bayesian Augmentation for Single-Source Domain Generalization

Add code
Jul 18, 2023
Figure 1 for Adversarial Bayesian Augmentation for Single-Source Domain Generalization
Figure 2 for Adversarial Bayesian Augmentation for Single-Source Domain Generalization
Figure 3 for Adversarial Bayesian Augmentation for Single-Source Domain Generalization
Figure 4 for Adversarial Bayesian Augmentation for Single-Source Domain Generalization
Viaarxiv icon

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Figure 1 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 2 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 3 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 4 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Viaarxiv icon

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Viaarxiv icon

End-to-end Knowledge Retrieval with Multi-modal Queries

Add code
Jun 01, 2023
Figure 1 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 2 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 3 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 4 for End-to-end Knowledge Retrieval with Multi-modal Queries
Viaarxiv icon