Zhenguo Li

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Jan 10, 2024
Via arXiv

A Survey of Reasoning with Foundation Models

Dec 26, 2023
Via arXiv

SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields

Dec 26, 2023
Via arXiv

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Dec 18, 2023
Via arXiv

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Dec 12, 2023
Via arXiv

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction

Dec 05, 2023
Via arXiv

Animate124: Animating One Image to 4D Dynamic Scene

Nov 24, 2023
Via arXiv

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

Oct 20, 2023
Via arXiv

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Oct 16, 2023
Via arXiv

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models

Oct 13, 2023
Via arXiv