Picture for Zhenguo Li

Zhenguo Li

SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields

Add code
Dec 26, 2023
Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Dec 26, 2023
Figure 1 for A Survey of Reasoning with Foundation Models
Figure 2 for A Survey of Reasoning with Foundation Models
Figure 3 for A Survey of Reasoning with Foundation Models
Figure 4 for A Survey of Reasoning with Foundation Models
Viaarxiv icon

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Add code
Dec 18, 2023
Figure 1 for G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Figure 2 for G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Figure 3 for G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Figure 4 for G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Viaarxiv icon

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Add code
Dec 12, 2023
Figure 1 for Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Figure 2 for Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Figure 3 for Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Figure 4 for Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Viaarxiv icon

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction

Add code
Dec 05, 2023
Figure 1 for Drag-A-Video: Non-rigid Video Editing with Point-based Interaction
Figure 2 for Drag-A-Video: Non-rigid Video Editing with Point-based Interaction
Figure 3 for Drag-A-Video: Non-rigid Video Editing with Point-based Interaction
Figure 4 for Drag-A-Video: Non-rigid Video Editing with Point-based Interaction
Viaarxiv icon

Animate124: Animating One Image to 4D Dynamic Scene

Add code
Nov 24, 2023
Figure 1 for Animate124: Animating One Image to 4D Dynamic Scene
Figure 2 for Animate124: Animating One Image to 4D Dynamic Scene
Figure 3 for Animate124: Animating One Image to 4D Dynamic Scene
Figure 4 for Animate124: Animating One Image to 4D Dynamic Scene
Viaarxiv icon

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

Add code
Oct 20, 2023
Figure 1 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 2 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 3 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 4 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Viaarxiv icon

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Add code
Oct 16, 2023
Figure 1 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 2 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 3 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 4 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Viaarxiv icon

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models

Add code
Oct 13, 2023
Viaarxiv icon

MagicDrive: Street View Generation with Diverse 3D Geometry Control

Add code
Oct 13, 2023
Figure 1 for MagicDrive: Street View Generation with Diverse 3D Geometry Control
Figure 2 for MagicDrive: Street View Generation with Diverse 3D Geometry Control
Figure 3 for MagicDrive: Street View Generation with Diverse 3D Geometry Control
Figure 4 for MagicDrive: Street View Generation with Diverse 3D Geometry Control
Viaarxiv icon