Picture for Chao-Yuan Wu

Chao-Yuan Wu

SAM 2: Segment Anything in Images and Videos

Add code
Aug 01, 2024
Figure 1 for SAM 2: Segment Anything in Images and Videos
Figure 2 for SAM 2: Segment Anything in Images and Videos
Figure 3 for SAM 2: Segment Anything in Images and Videos
Figure 4 for SAM 2: Segment Anything in Images and Videos
Viaarxiv icon

PointInfinity: Resolution-Invariant Point Diffusion Models

Add code
Apr 04, 2024
Figure 1 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 2 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 3 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 4 for PointInfinity: Resolution-Invariant Point Diffusion Models
Viaarxiv icon

Reversible Vision Transformers

Add code
Feb 09, 2023
Figure 1 for Reversible Vision Transformers
Figure 2 for Reversible Vision Transformers
Figure 3 for Reversible Vision Transformers
Figure 4 for Reversible Vision Transformers
Viaarxiv icon

Multiview Compressive Coding for 3D Reconstruction

Add code
Jan 19, 2023
Figure 1 for Multiview Compressive Coding for 3D Reconstruction
Figure 2 for Multiview Compressive Coding for 3D Reconstruction
Figure 3 for Multiview Compressive Coding for 3D Reconstruction
Figure 4 for Multiview Compressive Coding for 3D Reconstruction
Viaarxiv icon

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Add code
Jan 20, 2022
Figure 1 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 2 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 3 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 4 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Viaarxiv icon

A ConvNet for the 2020s

Add code
Jan 10, 2022
Figure 1 for A ConvNet for the 2020s
Figure 2 for A ConvNet for the 2020s
Figure 3 for A ConvNet for the 2020s
Figure 4 for A ConvNet for the 2020s
Viaarxiv icon

Masked Feature Prediction for Self-Supervised Visual Pre-Training

Add code
Dec 16, 2021
Figure 1 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 2 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 3 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Figure 4 for Masked Feature Prediction for Self-Supervised Visual Pre-Training
Viaarxiv icon

Improved Multiscale Vision Transformers for Classification and Detection

Add code
Dec 02, 2021
Figure 1 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 2 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 3 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 4 for Improved Multiscale Vision Transformers for Classification and Detection
Viaarxiv icon

Towards Long-Form Video Understanding

Add code
Jun 21, 2021
Figure 1 for Towards Long-Form Video Understanding
Figure 2 for Towards Long-Form Video Understanding
Figure 3 for Towards Long-Form Video Understanding
Figure 4 for Towards Long-Form Video Understanding
Viaarxiv icon

Memory Optimization for Deep Networks

Add code
Oct 29, 2020
Figure 1 for Memory Optimization for Deep Networks
Figure 2 for Memory Optimization for Deep Networks
Figure 3 for Memory Optimization for Deep Networks
Figure 4 for Memory Optimization for Deep Networks
Viaarxiv icon