Action Recognition in Temporally Untrimmed Videos!
Automatically recognizing and localizing a large number of action categories from videos in the wild of significant importance for video understanding and multimedia event detection. THUMOS workshop and challenge aims at exploring new challenges and approaches for large-scale action recognition with large number of classes from open source videos in a realistic setting.
Most of the existing action recognition datasets are composed of videos that have been manually trimmed to bound the action of interest. This has been identified to be a considerable limitation as it poorly matches how action recognition is applied in practical settings. Therefore, THUMOS 2015 will conduct the challenge on temporally untrimmed videos. The participants may train their methods using trimmed clips but will be required to test their systems on untrimmed data.
A new forward-looking dataset containing over 430 hours of video data and 45 million frames (70% larger than THUMOS'14) with the following components is made available under this challenge:
Training Set: over 13,000 temporally trimmed videos from 101 action classes.
Validation Set: Over 2100 temporally untrimmed videos with temporal annotations of actions.
Background Set: Approximately 3000 relevant videos guaranteed to not include any instance of the 101 actions.
Test Set: Over 5600 temporally untrimmed videos with withheld ground truth.
All videos are collected from YouTube. We will evaluate the success of the proposed methods based on their performance on the new THUMOS 2015 Dataset in two tasks:
Action Classification: this task accepts submissions for whole-clip action classification on 101 action classes.
Temporal Action Localization: this task accepts submissions on action recognition and temporal localization on a subset of 20 action classes.
Participants may either submit a notebook paper that briefly describes their system, or a research paper detailing their approach. All of the submission results will be summarized during the workshop and included in the workshopconference proceedings. Additionally, the top performers will be invited to give oral presentations, with remaining entries encouraged to present their work in the poster session.