Dylan Campbell

Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels

Dec 09, 2024 (via arXiv)

SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark

Dec 01, 2024 (via arXiv)

HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images

Nov 07, 2024 (via arXiv)

Unobserved Object Detection using Generative Models

Oct 08, 2024 (via arXiv)

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

Oct 02, 2024 (via arXiv)

Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

Jun 06, 2024 (via arXiv)

Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods

Apr 01, 2024 (via arXiv)

An Empirical Study Into What Matters for Calibrating Vision-Language Models

Feb 12, 2024 (via arXiv)

SCENES: Subpixel Correspondence Estimation With Epipolar Supervision

Jan 19, 2024 (via arXiv)

Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images

Dec 07, 2023 (via arXiv)