Action Recognition In Videos


Action recognition in videos is the process of identifying and categorizing human actions or activities in video sequences.

STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving

Add code
Jun 06, 2025
Figure 1 for STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving
Figure 2 for STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving
Figure 3 for STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving
Figure 4 for STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving
Viaarxiv icon

BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors

Add code
Mar 26, 2025
Viaarxiv icon

Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition

Add code
May 23, 2025
Viaarxiv icon

3DPyranet Features Fusion for Spatio-temporal Feature Learning

Add code
Apr 26, 2025
Viaarxiv icon

Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities

Add code
Apr 11, 2025
Viaarxiv icon

DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework

Add code
Mar 19, 2025
Figure 1 for DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Figure 2 for DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Figure 3 for DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Figure 4 for DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Viaarxiv icon

Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization

Add code
Apr 19, 2025
Figure 1 for Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization
Figure 2 for Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization
Figure 3 for Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization
Figure 4 for Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization
Viaarxiv icon

MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition

Add code
Apr 03, 2025
Figure 1 for MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition
Figure 2 for MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition
Figure 3 for MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition
Figure 4 for MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition
Viaarxiv icon

POET: Prompt Offset Tuning for Continual Human Action Adaptation

Add code
Apr 25, 2025
Viaarxiv icon

MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling

Add code
Feb 06, 2025
Figure 1 for MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
Figure 2 for MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
Figure 3 for MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
Figure 4 for MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
Viaarxiv icon