Brais Martinez

EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers

May 06, 2022

SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition

Apr 10, 2022

SAIC_Cambridge-HuPBA-FBK Submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021

Oct 06, 2021

Space-time Mixing Attention for Video Transformer

Jun 11, 2021

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization

Mar 30, 2021

Few-shot Action Recognition with Prototype-centered Attentive Learning

Feb 03, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos

Nov 24, 2020

High-Capacity Expert Binary Networks

Oct 07, 2020

Towards practical lipreading with distilled and efficient models

Jul 13, 2020

Egocentric Action Recognition by Video Attention and Temporal Context

Jul 03, 2020