Picture for Mubarak Shah

Mubarak Shah

FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition

Add code
Sep 02, 2024
Figure 1 for FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
Figure 2 for FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
Figure 3 for FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
Figure 4 for FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
Viaarxiv icon

Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets

Add code
Sep 02, 2024
Figure 1 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 2 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 3 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 4 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Viaarxiv icon

GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers

Add code
Aug 05, 2024
Figure 1 for GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
Figure 2 for GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
Figure 3 for GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
Figure 4 for GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
Viaarxiv icon

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Add code
Jul 18, 2024
Figure 1 for X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
Figure 2 for X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
Figure 3 for X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
Figure 4 for X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
Viaarxiv icon

Open Vocabulary Multi-Label Video Classification

Add code
Jul 12, 2024
Figure 1 for Open Vocabulary Multi-Label Video Classification
Figure 2 for Open Vocabulary Multi-Label Video Classification
Figure 3 for Open Vocabulary Multi-Label Video Classification
Figure 4 for Open Vocabulary Multi-Label Video Classification
Viaarxiv icon

Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density

Add code
Jul 05, 2024
Figure 1 for Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density
Figure 2 for Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density
Figure 3 for Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density
Figure 4 for Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density
Viaarxiv icon

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Add code
Jul 03, 2024
Figure 1 for SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Figure 2 for SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Figure 3 for SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Figure 4 for SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Viaarxiv icon

Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images

Add code
Jul 02, 2024
Figure 1 for Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images
Figure 2 for Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images
Figure 3 for Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images
Figure 4 for Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images
Viaarxiv icon

Surgical Triplet Recognition via Diffusion Model

Add code
Jun 19, 2024
Figure 1 for Surgical Triplet Recognition via Diffusion Model
Figure 2 for Surgical Triplet Recognition via Diffusion Model
Figure 3 for Surgical Triplet Recognition via Diffusion Model
Figure 4 for Surgical Triplet Recognition via Diffusion Model
Viaarxiv icon

Xi-Net: Transformer Based Seismic Waveform Reconstructor

Add code
Jun 14, 2024
Figure 1 for Xi-Net: Transformer Based Seismic Waveform Reconstructor
Figure 2 for Xi-Net: Transformer Based Seismic Waveform Reconstructor
Figure 3 for Xi-Net: Transformer Based Seismic Waveform Reconstructor
Figure 4 for Xi-Net: Transformer Based Seismic Waveform Reconstructor
Viaarxiv icon