Picture for Zhenguo Li

Zhenguo Li

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Add code
Mar 22, 2024
Figure 1 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 2 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 3 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 4 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Viaarxiv icon

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Add code
Mar 20, 2024
Figure 1 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 2 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 3 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 4 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Viaarxiv icon

Editing Massive Concepts in Text-to-Image Diffusion Models

Add code
Mar 20, 2024
Figure 1 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 2 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 3 for Editing Massive Concepts in Text-to-Image Diffusion Models
Figure 4 for Editing Massive Concepts in Text-to-Image Diffusion Models
Viaarxiv icon

Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization

Add code
Mar 14, 2024
Viaarxiv icon

Efficient Transferability Assessment for Selection of Pre-trained Detectors

Add code
Mar 14, 2024
Viaarxiv icon

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Add code
Mar 07, 2024
Figure 1 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 2 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 3 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 4 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Viaarxiv icon

Accelerating Diffusion Sampling with Optimized Time Steps

Add code
Feb 27, 2024
Figure 1 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 2 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 3 for Accelerating Diffusion Sampling with Optimized Time Steps
Figure 4 for Accelerating Diffusion Sampling with Optimized Time Steps
Viaarxiv icon

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

Add code
Feb 23, 2024
Viaarxiv icon

On the Expressive Power of a Variant of the Looped Transformer

Add code
Feb 21, 2024
Figure 1 for On the Expressive Power of a Variant of the Looped Transformer
Figure 2 for On the Expressive Power of a Variant of the Looped Transformer
Figure 3 for On the Expressive Power of a Variant of the Looped Transformer
Figure 4 for On the Expressive Power of a Variant of the Looped Transformer
Viaarxiv icon

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data

Add code
Feb 14, 2024
Viaarxiv icon