Video object detection is more challenging than image object detection because of the deteriorated frame quality. To enhance the feature representation, state-of-the-art methods propagate temporal information into the deteriorated frame by aligning and aggregating entire feature maps from multiple nearby frames... (read more)
PDFMETHOD | TYPE | |
---|---|---|
![]() |
Working Memory Models |