Picture for David Huang

David Huang

TurboVGGT: Fast Visual Geometry Reconstruction with Adaptive Alternating Attention

Add code
May 14, 2026
Viaarxiv icon

Language and Geometry Grounded Sparse Voxel Representations for Holistic Scene Understanding

Add code
Feb 17, 2026
Viaarxiv icon

MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation

Add code
Aug 20, 2025
Figure 1 for MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Figure 2 for MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Figure 3 for MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Figure 4 for MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Viaarxiv icon

Robo-DM: Data Management For Large Robot Datasets

Add code
May 21, 2025
Figure 1 for Robo-DM: Data Management For Large Robot Datasets
Figure 2 for Robo-DM: Data Management For Large Robot Datasets
Figure 3 for Robo-DM: Data Management For Large Robot Datasets
Figure 4 for Robo-DM: Data Management For Large Robot Datasets
Viaarxiv icon

Measuring General Intelligence with Generated Games

Add code
May 12, 2025
Viaarxiv icon

Improving LLM Safety Alignment with Dual-Objective Optimization

Add code
Mar 05, 2025
Figure 1 for Improving LLM Safety Alignment with Dual-Objective Optimization
Figure 2 for Improving LLM Safety Alignment with Dual-Objective Optimization
Figure 3 for Improving LLM Safety Alignment with Dual-Objective Optimization
Figure 4 for Improving LLM Safety Alignment with Dual-Objective Optimization
Viaarxiv icon

Accelerated Preference Elicitation with LLM-Based Proxies

Add code
Jan 24, 2025
Viaarxiv icon

Enhanced Momentum with Momentum Transformers

Add code
Dec 17, 2024
Figure 1 for Enhanced Momentum with Momentum Transformers
Viaarxiv icon

A Hybrid Deep Learning Classification of Perimetric Glaucoma Using Peripapillary Nerve Fiber Layer Reflectance and Other OCT Parameters from Three Anatomy Regions

Add code
Jun 06, 2024
Figure 1 for A Hybrid Deep Learning Classification of Perimetric Glaucoma Using Peripapillary Nerve Fiber Layer Reflectance and Other OCT Parameters from Three Anatomy Regions
Figure 2 for A Hybrid Deep Learning Classification of Perimetric Glaucoma Using Peripapillary Nerve Fiber Layer Reflectance and Other OCT Parameters from Three Anatomy Regions
Viaarxiv icon

SpeechVerse: A Large-scale Generalizable Audio Language Model

Add code
May 14, 2024
Figure 1 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 2 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 3 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 4 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Viaarxiv icon