Picture for Zhenguo Li

Zhenguo Li

GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing

Add code
Jul 08, 2024
Figure 1 for GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Figure 2 for GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Figure 3 for GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Figure 4 for GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Viaarxiv icon

Jailbreaking as a Reward Misspecification Problem

Add code
Jun 20, 2024
Viaarxiv icon

QuickLLaMA: Query-aware Inference Acceleration for Large Language Models

Add code
Jun 11, 2024
Figure 1 for QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
Figure 2 for QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
Figure 3 for QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
Figure 4 for QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
Viaarxiv icon

Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

Add code
Jun 06, 2024
Figure 1 for Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Figure 2 for Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Figure 3 for Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Figure 4 for Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Viaarxiv icon

Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion

Add code
May 24, 2024
Viaarxiv icon

Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Add code
May 24, 2024
Figure 1 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 2 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 3 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Figure 4 for Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Viaarxiv icon

Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model

Add code
May 24, 2024
Viaarxiv icon

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes

Add code
May 23, 2024
Figure 1 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 2 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 3 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 4 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Viaarxiv icon

Proving Theorems Recursively

Add code
May 23, 2024
Figure 1 for Proving Theorems Recursively
Figure 2 for Proving Theorems Recursively
Figure 3 for Proving Theorems Recursively
Figure 4 for Proving Theorems Recursively
Viaarxiv icon

CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

Add code
May 23, 2024
Figure 1 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 2 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 3 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 4 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Viaarxiv icon