Matthew Gwilliam

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Jul 09, 2025
Via arXiv

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Jun 18, 2025
Via arXiv

Utilization of Neighbor Information for Image Classification with Different Levels of Supervision

Mar 18, 2025
Via arXiv

NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields

Nov 04, 2024
Via arXiv

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Aug 05, 2024
Via arXiv

Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions

Jan 18, 2024
Via arXiv

A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

Nov 30, 2023
Via arXiv

Do text-free diffusion models learn discriminative visual representations?

Nov 30, 2023
Via arXiv

Diffusion Models Beat GANs on Image Classification

Jul 17, 2023
Via arXiv

Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning

Jun 16, 2022
Via arXiv