Picture for Kyle Sargent

Kyle Sargent

VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression

Add code
Dec 17, 2025
Figure 1 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 2 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 3 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 4 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Viaarxiv icon

Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization

Add code
Mar 14, 2025
Viaarxiv icon

View-Invariant Policy Learning via Zero-Shot Novel View Synthesis

Add code
Sep 05, 2024
Viaarxiv icon

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Add code
May 23, 2024
Figure 1 for Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Figure 2 for Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Figure 3 for Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Figure 4 for Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Viaarxiv icon

WonderJourney: Going from Anywhere to Everywhere

Add code
Dec 06, 2023
Figure 1 for WonderJourney: Going from Anywhere to Everywhere
Figure 2 for WonderJourney: Going from Anywhere to Everywhere
Figure 3 for WonderJourney: Going from Anywhere to Everywhere
Figure 4 for WonderJourney: Going from Anywhere to Everywhere
Viaarxiv icon

ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image

Add code
Oct 27, 2023
Figure 1 for ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Figure 2 for ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Figure 3 for ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Figure 4 for ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Viaarxiv icon

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Add code
Jun 15, 2023
Figure 1 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 2 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 3 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 4 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Viaarxiv icon

VQ3D: Learning a 3D-Aware Generative Model on ImageNet

Add code
Feb 14, 2023
Figure 1 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 2 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 3 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 4 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Viaarxiv icon

Self-supervised AutoFlow

Add code
Dec 08, 2022
Figure 1 for Self-supervised AutoFlow
Figure 2 for Self-supervised AutoFlow
Figure 3 for Self-supervised AutoFlow
Figure 4 for Self-supervised AutoFlow
Viaarxiv icon

Pyramid Adversarial Training Improves ViT Performance

Add code
Nov 30, 2021
Figure 1 for Pyramid Adversarial Training Improves ViT Performance
Figure 2 for Pyramid Adversarial Training Improves ViT Performance
Figure 3 for Pyramid Adversarial Training Improves ViT Performance
Figure 4 for Pyramid Adversarial Training Improves ViT Performance
Viaarxiv icon