Srijan Das

Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer

Jul 17, 2024

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Jul 04, 2024

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Jun 27, 2024

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Jun 13, 2024

BAMM: Bidirectional Autoregressive Motion Model

Apr 01, 2024

Analysis and Detection of Multilingual Hate Speech Using Transformer Based Deep Learning

Jan 19, 2024

SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology

Dec 22, 2023

Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?

Dec 07, 2023

Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living

Nov 30, 2023

Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders

Oct 31, 2023