We introduce Slovo, a large-scale video dataset for the Russian Sign Language recognition task. The dataset is about 16 GB in size and contains 20400 RGB videos covering 1000 sign language gestures performed by 194 signers. Each class has 20 samples. The dataset is split into a training set and a test set by subject (user_id). The training set includes 15300 videos, and the test set includes 5100 videos. The total video recording time is ~9.2 hours. About 35% of the videos are recorded in HD and 65% in FullHD resolution. The average length of a video with a gesture is 50 frames.
The annotation file, annotations.csv, contains one row per video with the following columns:
| | attachment_id | user_id | width | height | length | text | train | begin | end |
---|---|---|---|---|---|---|---|---|---|
0 | de81cc1c-... | 1b... | 1440 | 1920 | 14 | привет | True | 30 | 45 |
1 | 3c0cec5a-... | 64... | 1440 | 1920 | 32 | утро | False | 43 | 66 |
2 | d17ca986-... | cf... | 1920 | 1080 | 44 | улица | False | 12 | 31 |
where:
- attachment_id - video file name
- user_id - unique anonymized user ID
- width - video width
- height - video height
- length - video length
- text - gesture class in Russian
- train - boolean flag marking whether the video belongs to the train or test set
- begin - start of the gesture (for original dataset)
- end - end of the gesture (for original dataset)
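
The columns described above can be used to reproduce the train/test split and to check that no signer appears in both subsets. The following is a minimal sketch rather than the dataset's official loader: it assumes the file is tab-delimited (pass a different `delimiter` if it is not), and the helper names `load_annotations`, `split_by_flag`, and `shared_users`, as well as the sample IDs, are hypothetical.

```python
import csv
import io


def load_annotations(stream, delimiter="\t"):
    """Parse annotation rows from an open text stream.

    The tab delimiter is an assumption about the file layout;
    columns are looked up by name, so an extra index column is harmless.
    """
    rows = []
    for row in csv.DictReader(stream, delimiter=delimiter):
        rows.append({
            "attachment_id": row["attachment_id"],
            "user_id": row["user_id"],
            "text": row["text"],
            # The flag is stored as the text "True" / "False".
            "train": row["train"].strip().lower() == "true",
            "begin": int(row["begin"]),
            "end": int(row["end"]),
        })
    return rows


def split_by_flag(rows):
    """Split rows into (train, test) using the boolean train column."""
    train = [r for r in rows if r["train"]]
    test = [r for r in rows if not r["train"]]
    return train, test


def shared_users(train, test):
    """Return user IDs present in both subsets (empty for a subject-wise split)."""
    return {r["user_id"] for r in train} & {r["user_id"] for r in test}


# Tiny in-memory sample with made-up IDs, standing in for annotations.csv.
SAMPLE = (
    "attachment_id\tuser_id\twidth\theight\tlength\ttext\ttrain\tbegin\tend\n"
    "vid-a\tuser-1\t1440\t1920\t14\tпривет\tTrue\t30\t45\n"
    "vid-b\tuser-2\t1440\t1920\t32\tутро\tFalse\t43\t66\n"
    "vid-c\tuser-3\t1920\t1080\t44\tулица\tFalse\t12\t31\n"
)

sample_rows = load_annotations(io.StringIO(SAMPLE))
sample_train, sample_test = split_by_flag(sample_rows)
print(len(sample_train), len(sample_test), shared_users(sample_train, sample_test))
```

To run this on the real file, open it with `open("annotations.csv", encoding="utf-8", newline="")` and pass the file object to `load_annotations`.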