Picture for Munawar Hayat

Munawar Hayat

PosSAM: Panoptic Open-vocabulary Segment Anything

Add code
Mar 14, 2024
Figure 1 for PosSAM: Panoptic Open-vocabulary Segment Anything
Figure 2 for PosSAM: Panoptic Open-vocabulary Segment Anything
Figure 3 for PosSAM: Panoptic Open-vocabulary Segment Anything
Figure 4 for PosSAM: Panoptic Open-vocabulary Segment Anything
Viaarxiv icon

EraseDiff: Erasing Data Influence in Diffusion Models

Add code
Jan 11, 2024
Viaarxiv icon

DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

Add code
Jan 01, 2024
Viaarxiv icon

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Add code
Nov 26, 2023
Figure 1 for AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Figure 2 for AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Figure 3 for AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Figure 4 for AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Viaarxiv icon

Unified Open-Vocabulary Dense Visual Prediction

Add code
Jul 17, 2023
Figure 1 for Unified Open-Vocabulary Dense Visual Prediction
Figure 2 for Unified Open-Vocabulary Dense Visual Prediction
Figure 3 for Unified Open-Vocabulary Dense Visual Prediction
Figure 4 for Unified Open-Vocabulary Dense Visual Prediction
Viaarxiv icon

Open-Vocabulary Object Detection via Scene Graph Discovery

Add code
Jul 07, 2023
Figure 1 for Open-Vocabulary Object Detection via Scene Graph Discovery
Figure 2 for Open-Vocabulary Object Detection via Scene Graph Discovery
Figure 3 for Open-Vocabulary Object Detection via Scene Graph Discovery
Figure 4 for Open-Vocabulary Object Detection via Scene Graph Discovery
Viaarxiv icon

"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

Add code
May 05, 2023
Figure 1 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 2 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 3 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 4 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Viaarxiv icon

Real-time Trajectory-based Social Group Detection

Add code
Apr 12, 2023
Figure 1 for Real-time Trajectory-based Social Group Detection
Figure 2 for Real-time Trajectory-based Social Group Detection
Figure 3 for Real-time Trajectory-based Social Group Detection
Figure 4 for Real-time Trajectory-based Social Group Detection
Viaarxiv icon

ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning

Add code
Mar 22, 2023
Figure 1 for ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning
Figure 2 for ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning
Figure 3 for ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning
Figure 4 for ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning
Viaarxiv icon

MARLIN: Masked Autoencoder for facial video Representation LearnINg

Add code
Nov 12, 2022
Figure 1 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 2 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 3 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 4 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Viaarxiv icon