Picture for Yue Wang

Yue Wang

Zhongguancun Academy

STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes

Add code
Dec 31, 2024
Figure 1 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 2 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 3 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 4 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Viaarxiv icon

Natural Language Fine-Tuning

Add code
Dec 29, 2024
Viaarxiv icon

Aria-UI: Visual Grounding for GUI Instructions

Add code
Dec 20, 2024
Figure 1 for Aria-UI: Visual Grounding for GUI Instructions
Figure 2 for Aria-UI: Visual Grounding for GUI Instructions
Figure 3 for Aria-UI: Visual Grounding for GUI Instructions
Figure 4 for Aria-UI: Visual Grounding for GUI Instructions
Viaarxiv icon

Learning from Massive Human Videos for Universal Humanoid Pose Control

Add code
Dec 18, 2024
Figure 1 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 2 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 3 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 4 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Viaarxiv icon

How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation

Add code
Dec 12, 2024
Viaarxiv icon

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models

Add code
Dec 10, 2024
Figure 1 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 2 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 3 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Figure 4 for LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models
Viaarxiv icon

Extrapolated Urban View Synthesis Benchmark

Add code
Dec 10, 2024
Figure 1 for Extrapolated Urban View Synthesis Benchmark
Figure 2 for Extrapolated Urban View Synthesis Benchmark
Figure 3 for Extrapolated Urban View Synthesis Benchmark
Figure 4 for Extrapolated Urban View Synthesis Benchmark
Viaarxiv icon

Wavelet Diffusion Neural Operator

Add code
Dec 06, 2024
Figure 1 for Wavelet Diffusion Neural Operator
Figure 2 for Wavelet Diffusion Neural Operator
Figure 3 for Wavelet Diffusion Neural Operator
Figure 4 for Wavelet Diffusion Neural Operator
Viaarxiv icon

Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset

Add code
Dec 05, 2024
Figure 1 for Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset
Figure 2 for Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset
Figure 3 for Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset
Figure 4 for Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset
Viaarxiv icon

InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models

Add code
Dec 05, 2024
Figure 1 for InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Figure 2 for InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Figure 3 for InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Figure 4 for InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Viaarxiv icon