Open Set Action Recognition


Medical Image De-Identification Benchmark Challenge

Jul 31, 2025
Via arXiv

Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning

Apr 09, 2025
Via arXiv

ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection

Mar 28, 2025
Via arXiv

Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

Mar 25, 2025
Via arXiv

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Mar 20, 2025
Via arXiv

From Motion Signals to Insights: A Unified Framework for Student Behavior Analysis and Feedback in Physical Education Classes

Mar 09, 2025
Via arXiv

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Jan 25, 2025
Via arXiv

Robustness-enhanced Myoelectric Control with GAN-based Open-set Recognition

Dec 20, 2024
Via arXiv

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Dec 10, 2024
Via arXiv

Temporally Consistent Dynamic Scene Graphs: An End-to-End Approach for Action Tracklet Generation

Dec 03, 2024
Via arXiv