Zhenguo Li

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Feb 12, 2024
Via arXiv

Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

Feb 08, 2024
Via arXiv

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Jan 30, 2024
Via arXiv

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

Jan 18, 2024
Via arXiv

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Jan 10, 2024
Via arXiv

A Survey of Reasoning with Foundation Models

Dec 26, 2023
Via arXiv

SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields

Dec 26, 2023
Via arXiv

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Dec 18, 2023
Via arXiv

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Dec 12, 2023
Via arXiv

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction

Dec 05, 2023
Via arXiv