Picture for Mamshad Nayeem Rizve

Mamshad Nayeem Rizve

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Add code
Jul 18, 2024
Viaarxiv icon

Open Vocabulary Multi-Label Video Classification

Add code
Jul 12, 2024
Viaarxiv icon

VidLA: Video-Language Alignment at Scale

Add code
Mar 21, 2024
Figure 1 for VidLA: Video-Language Alignment at Scale
Figure 2 for VidLA: Video-Language Alignment at Scale
Figure 3 for VidLA: Video-Language Alignment at Scale
Figure 4 for VidLA: Video-Language Alignment at Scale
Viaarxiv icon

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

Add code
Oct 12, 2023
Figure 1 for Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Figure 2 for Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Figure 3 for Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Figure 4 for Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Viaarxiv icon

CDFSL-V: Cross-Domain Few-Shot Learning for Videos

Add code
Sep 15, 2023
Figure 1 for CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Figure 2 for CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Figure 3 for CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Figure 4 for CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Viaarxiv icon

Preserving Modality Structure Improves Multi-Modal Learning

Add code
Aug 24, 2023
Figure 1 for Preserving Modality Structure Improves Multi-Modal Learning
Figure 2 for Preserving Modality Structure Improves Multi-Modal Learning
Figure 3 for Preserving Modality Structure Improves Multi-Modal Learning
Figure 4 for Preserving Modality Structure Improves Multi-Modal Learning
Viaarxiv icon

TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition

Add code
Mar 28, 2023
Figure 1 for TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition
Figure 2 for TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition
Figure 3 for TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition
Figure 4 for TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition
Viaarxiv icon

Towards Realistic Semi-Supervised Learning

Add code
Jul 05, 2022
Figure 1 for Towards Realistic Semi-Supervised Learning
Figure 2 for Towards Realistic Semi-Supervised Learning
Figure 3 for Towards Realistic Semi-Supervised Learning
Figure 4 for Towards Realistic Semi-Supervised Learning
Viaarxiv icon

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning

Add code
Jul 05, 2022
Figure 1 for OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
Figure 2 for OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
Figure 3 for OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
Figure 4 for OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
Viaarxiv icon

UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning

Add code
Apr 05, 2022
Figure 1 for UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning
Figure 2 for UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning
Figure 3 for UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning
Figure 4 for UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning
Viaarxiv icon