James Hays

GaussianFormer3D: Multi-Modal Gaussian-based Semantic Occupancy Prediction with 3D Deformable Attention

May 15, 2025

Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers

Sep 03, 2024

What Matters in Range View 3D Object Detection

Jul 25, 2024

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Jul 11, 2024

Granular Privacy Control for Geolocation with Vision Language Models

Jul 06, 2024

SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas

Jun 27, 2024

Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection

Jun 14, 2024

Personalized Residuals for Concept-Driven Text-to-Image Generation

May 21, 2024

I Can't Believe It's Not Scene Flow!

Mar 07, 2024

Lidar Panoptic Segmentation and Tracking without Bells and Whistles

Oct 19, 2023