Open Set Action Recognition


MGCA-Net: Multi-Grained Category-Aware Network for Open-Vocabulary Temporal Action Localization

Add code
Nov 17, 2025
Viaarxiv icon

ThaiOCRBench: A Task-Diverse Benchmark for Vision-Language Understanding in Thai

Add code
Nov 06, 2025
Viaarxiv icon

What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos

Add code
Aug 29, 2025
Figure 1 for What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
Figure 2 for What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
Figure 3 for What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
Figure 4 for What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
Viaarxiv icon

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

Add code
Aug 28, 2025
Viaarxiv icon

Medical Image De-Identification Benchmark Challenge

Add code
Jul 31, 2025
Figure 1 for Medical Image De-Identification Benchmark Challenge
Figure 2 for Medical Image De-Identification Benchmark Challenge
Figure 3 for Medical Image De-Identification Benchmark Challenge
Figure 4 for Medical Image De-Identification Benchmark Challenge
Viaarxiv icon

Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning

Add code
Apr 09, 2025
Figure 1 for Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Figure 2 for Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Figure 3 for Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Figure 4 for Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Viaarxiv icon

ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection

Add code
Mar 28, 2025
Figure 1 for ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
Figure 2 for ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
Figure 3 for ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
Figure 4 for ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
Viaarxiv icon

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Add code
Jan 25, 2025
Figure 1 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 2 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 3 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 4 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Viaarxiv icon

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Add code
Mar 20, 2025
Figure 1 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 2 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 3 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 4 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Viaarxiv icon

From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes

Add code
Mar 09, 2025
Figure 1 for From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes
Figure 2 for From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes
Figure 3 for From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes
Figure 4 for From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes
Viaarxiv icon