Smart Surveillance Annotation Tool (SSAT), as its name indicates, is a free, interactive annotation tool for the computer vision community. It eases the way researchers annotate massive video datasets. The tool is still under development; in the future it will be possible to create and manipulate bounding boxes in the video.
Contact: Rafael Vareto (firstname.lastname@example.org)
Video extracted from the Interactive Dataset, CPR contest on Semantic Description of Human Activities (SDHA)
There are a few requirements for SSAT to work properly.
- No operation can be executed until a video has been loaded. To load a video, press “File” in the menu bar and then “Open” (or use the shortcut CTRL + O). An open dialog will pop up, where you can select any video in .mp4 or .avi format.
- Note that in the image all features are locked because no video has been loaded.
Image extracted from Interactive Dataset, CPR contest on Semantic Description of Human Activities (SDHA)
After the video is loaded, all functionalities will work properly.
- Before beginning the annotation, the user must establish the attributes on which the annotations will be based. To do this, select “Annotation” in the menu bar followed by “Attributes”. Another dialog will pop up, where the user can create a hierarchy of categories (filters) and labels.
- The attribute box, shown in the image above, will initially be empty. Attributes are defined following a hierarchy of categories and labels; this hierarchy was created to simplify the use of the tool in situations with a large number of different labels. A category can be understood as a collection of labels that have something in common. It is not possible to create a label that does not belong to a category, so the “insert label” button remains deactivated until a category is selected. The user can also edit an existing annotation by double-clicking it in the table and making the desired changes.
- There are two types of annotation: frame-based and bounding-box-based (bounding-box-based annotation is not yet available in this version). The former covers the whole video frame, whereas the latter specifies a region (box) of the video picture. To annotate, right-click on the video and then choose between frame-based and bounding-box-based annotation.
1. Frame boxes hold the following info: category, label, initial frame, final frame and general information.
2. Bounding boxes encompass information like: category, label, initial frame, final frame, x-axis position, y-axis position, width, height, and general information.
Image extracted from Interactive Dataset, CPR contest on Semantic Description of Human Activities (SDHA)
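The category/label hierarchy and the two annotation records described above can be sketched as plain data structures. This is only an illustration: the class and field names (`AttributeHierarchy`, `FrameAnnotation`, `BoundingBoxAnnotation`, `info`, and so on) are hypothetical and are not taken from SSAT's source code.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class AttributeHierarchy:
    """Hypothetical model: each category owns a list of labels."""
    categories: Dict[str, List[str]] = field(default_factory=dict)

    def add_category(self, category: str) -> None:
        # Creating a category that already exists leaves it unchanged.
        self.categories.setdefault(category, [])

    def add_label(self, category: str, label: str) -> None:
        # A label cannot exist outside a category, mirroring the
        # disabled "insert label" button when no category is selected.
        if category not in self.categories:
            raise KeyError(f"unknown category: {category!r}")
        if label not in self.categories[category]:
            self.categories[category].append(label)

    def has(self, category: str, label: str) -> bool:
        return label in self.categories.get(category, [])


@dataclass
class FrameAnnotation:
    """Fields listed in the text for a frame box."""
    category: str
    label: str
    initial_frame: int
    final_frame: int
    info: str = ""  # "general information"

    def __post_init__(self) -> None:
        # Assumed sanity check; the text does not state this rule.
        if self.final_frame < self.initial_frame:
            raise ValueError("final_frame must not precede initial_frame")


@dataclass
class BoundingBoxAnnotation(FrameAnnotation):
    """A frame annotation plus the region fields listed in the text."""
    x: int = 0
    y: int = 0
    width: int = 0
    height: int = 0
```

For example, after `hierarchy.add_category("interaction")` and `hierarchy.add_label("interaction", "handshake")`, a frame annotation could be created as `FrameAnnotation("interaction", "handshake", 10, 50)`.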
Presently, bounding-box-based annotations are still being developed and do not yet work properly: the user can create boxes on the screen and they are saved, but the bounding boxes cannot be removed or changed in the core of SSAT.
- To import or export a file, the user must click on “File” and select the import or export JSON option (or use the shortcut CTRL + E to export and CTRL + I to import), then specify either the file to import or the folder to export to. Note that every 10 seconds a security file is exported to the root directory; if the user opens a video that has been opened before, the user will be asked whether to import this security copy.
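The JSON export/import and the security (backup) copy described above could look roughly like the following. This is a minimal sketch under stated assumptions: the JSON layout (a top-level `"annotations"` list), the backup file naming scheme, and all function names are hypothetical, since SSAT's actual file format is not documented here.

```python
import json
from pathlib import Path
from typing import Dict, List, Optional

# Interval stated in the text; a real tool would trigger the backup on a timer.
BACKUP_INTERVAL_SECONDS = 10


def export_annotations(annotations: List[Dict], target: Path) -> None:
    """Write annotations to a JSON file (assumed layout)."""
    payload = {"annotations": annotations}
    target.write_text(json.dumps(payload, indent=2), encoding="utf-8")


def import_annotations(source: Path) -> List[Dict]:
    """Read annotations back from a file written by export_annotations."""
    return json.loads(source.read_text(encoding="utf-8"))["annotations"]


def backup_path(video: Path, root: Path) -> Path:
    """Hypothetical naming scheme: one backup file per video name."""
    return root / f"{video.stem}.backup.json"


def find_backup(video: Path, root: Path) -> Optional[Path]:
    """Return the backup for a previously opened video, if one exists."""
    candidate = backup_path(video, root)
    return candidate if candidate.exists() else None
```

When a video is opened, a check such as `find_backup(video, root)` would decide whether to offer the security copy for import.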