Alan Yuille

Johns Hopkins University

Embracing Massive Medical Data

Jul 05, 2024

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Jun 28, 2024

ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

Jun 13, 2024

Autoregressive Pretraining with Mamba in Vision

Jun 11, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Jun 08, 2024

DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data

Jun 07, 2024

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Jun 02, 2024

Quality Sentinel: Estimating Label Quality and Errors in Medical Segmentation Datasets

Jun 01, 2024

Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography

May 28, 2024

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

May 27, 2024