Title: ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization

May 23, 2025

Yuchen He, Jianbing Lv, Liqi Cheng, Lingyu Meng, Dazhen Deng, Yingcai Wu

Abstract: Temporal Action Localization (TAL) aims to detect the start and end timestamps of actions in a video. However, training TAL models requires a substantial amount of manually annotated data. Data programming is an efficient way to create training labels from a series of human-defined labeling functions, but applying it to TAL is difficult because complex actions are hard to define in the context of temporal video frames. In this paper, we propose ProTAL, a drag-and-link video programming framework for TAL. ProTAL enables users to define key events by dragging nodes that represent body parts and objects and linking them to constrain their relations (direction, distance, etc.). These definitions are used to generate action labels for large-scale unlabelled videos. A semi-supervised method is then employed to train TAL models with these labels. We demonstrate the effectiveness of ProTAL through a usage scenario and a user study, providing insights into the design of video programming frameworks.
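
To make the idea of a labeling function over key events more concrete, here is a minimal Python sketch. It is not ProTAL's implementation or API: the node names (`hand`, `ball`), the normalized 2D coordinates, the `max_dist` threshold, and all function names are hypothetical and chosen only for illustration. It assumes per-frame positions are already available, applies one distance constraint and one direction constraint, and merges consecutive matching frames into start and end timestamps.

```python
from math import hypot

def near(a, b, max_dist):
    """Distance constraint: True when points a and b are within max_dist."""
    return hypot(a[0] - b[0], a[1] - b[1]) <= max_dist

def above(a, b):
    """Direction constraint: True when a is above b (image y grows downward)."""
    return a[1] < b[1]

def label_frames(frames, max_dist=0.1):
    """Hypothetical labeling function: hand is near the ball and above it."""
    return [
        near(f["hand"], f["ball"], max_dist) and above(f["hand"], f["ball"])
        for f in frames
    ]

def to_segments(flags, fps):
    """Merge runs of positive frames into (start_sec, end_sec) segments."""
    segments, start = [], None
    for i, flag in enumerate(flags):
        if flag and start is None:
            start = i  # a run of positive frames begins
        elif not flag and start is not None:
            segments.append((start / fps, i / fps))  # the run ends
            start = None
    if start is not None:  # the run extends to the last frame
        segments.append((start / fps, len(flags) / fps))
    return segments

# Toy input: normalized (x, y) positions per frame, made up for this sketch.
frames = [
    {"hand": (0.50, 0.20), "ball": (0.50, 0.80)},  # far apart
    {"hand": (0.50, 0.72), "ball": (0.50, 0.80)},  # near and above
    {"hand": (0.50, 0.75), "ball": (0.50, 0.80)},  # near and above
    {"hand": (0.50, 0.30), "ball": (0.50, 0.80)},  # far apart
]
flags = label_frames(frames)
segments = to_segments(flags, fps=2)
print(segments)
```

Labels produced this way are noisy by design, which is why the abstract pairs them with a semi-supervised training step rather than treating them as ground truth.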

* Accepted at CHI'25
