Christoph Feichtenhofer

Jack

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Apr 17, 2025

Perception Encoder: The best visual embeddings are not at the output of the network

Apr 17, 2025

An Empirical Study of Autoregressive Pre-training from Videos

Jan 09, 2025

Gaussian Masked Autoencoders

Jan 06, 2025

Altogether: Image Captioning via Re-aligning Alt-text

Oct 22, 2024

SAM 2: Segment Anything in Images and Videos

Aug 01, 2024

The Llama 3 Herd of Models

Jul 31, 2024

Window Attention is Bugged: How not to Interpolate Position Embeddings

Nov 09, 2023

Demystifying CLIP Data

Oct 02, 2023

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Jun 01, 2023