
Dahua Lin

Eric

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Dec 16, 2024
Via arXiv

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Dec 12, 2024
Via arXiv

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Dec 10, 2024
Via arXiv

Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians

Dec 10, 2024
Via arXiv

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Dec 10, 2024
Via arXiv

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Dec 06, 2024
Via arXiv

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Dec 04, 2024
Via arXiv

Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes

Dec 02, 2024
Via arXiv

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Dec 02, 2024
Via arXiv

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Nov 20, 2024
Via arXiv