Alan Yuille

Johns Hopkins University

Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data

Jul 18, 2024

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Jul 12, 2024

CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model

Jul 09, 2024

Embracing Massive Medical Data

Jul 05, 2024

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Jun 28, 2024

ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

Jun 13, 2024

Autoregressive Pretraining with Mamba in Vision

Jun 11, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Jun 08, 2024

DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data

Jun 07, 2024

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Jun 02, 2024