Picture for Ziwei Liu

Ziwei Liu

Nanyang Technological University

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Add code
Dec 10, 2024
Figure 1 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 2 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 3 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 4 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Viaarxiv icon

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Add code
Dec 04, 2024
Figure 1 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 2 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 3 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 4 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Viaarxiv icon

Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos

Add code
Dec 04, 2024
Figure 1 for Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
Figure 2 for Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
Figure 3 for Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
Figure 4 for Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
Viaarxiv icon

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Add code
Nov 29, 2024
Viaarxiv icon

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Add code
Nov 27, 2024
Figure 1 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 2 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 3 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 4 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Viaarxiv icon

Material Anything: Generating Materials for Any 3D Object via Diffusion

Add code
Nov 22, 2024
Figure 1 for Material Anything: Generating Materials for Any 3D Object via Diffusion
Figure 2 for Material Anything: Generating Materials for Any 3D Object via Diffusion
Figure 3 for Material Anything: Generating Materials for Any 3D Object via Diffusion
Figure 4 for Material Anything: Generating Materials for Any 3D Object via Diffusion
Viaarxiv icon

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Add code
Nov 22, 2024
Figure 1 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 2 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 3 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 4 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Viaarxiv icon

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Add code
Nov 22, 2024
Figure 1 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 2 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 3 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 4 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Viaarxiv icon

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Add code
Nov 21, 2024
Figure 1 for Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Figure 2 for Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Figure 3 for Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Figure 4 for Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Viaarxiv icon

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Add code
Nov 20, 2024
Figure 1 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 2 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 3 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Figure 4 for VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models
Viaarxiv icon