Picture for Mohammadreza Salehi

Mohammadreza Salehi

ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition

Add code
Oct 08, 2024
Viaarxiv icon

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Add code
Sep 25, 2024
Viaarxiv icon

SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery

Add code
Aug 26, 2024
Viaarxiv icon

NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency

Add code
Aug 20, 2024
Figure 1 for NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
Figure 2 for NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
Figure 3 for NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
Figure 4 for NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
Viaarxiv icon

SIGMA: Sinkhorn-Guided Masked Video Modeling

Add code
Jul 22, 2024
Viaarxiv icon

GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features

Add code
Jul 17, 2024
Viaarxiv icon

CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement

Add code
Oct 21, 2023
Viaarxiv icon

SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks

Add code
Oct 18, 2023
Figure 1 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 2 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 3 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 4 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Viaarxiv icon

Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

Add code
Aug 22, 2023
Viaarxiv icon

InForecaster: Forecasting Influenza Hemagglutinin Mutations Through the Lens of Anomaly Detection

Add code
Oct 25, 2022
Viaarxiv icon