1 code implementation • 8 Jul 2022 • Chuong H. Nguyen, Su Huynh, Vinh Nguyen, Ngoc Nguyen
Since being introduced in 2020, Vision Transformers (ViT) has been steadily breaking the record for many vision tasks and are often described as ``all-you-need" to replace ConvNet.
4 code implementations • 24 Aug 2021 • Chuong H. Nguyen, Thuy C. Nguyen, Tuan N. Tang, Nam L. H. Phan
Using PAA-ResNet50 as a teacher, our LAD techniques can improve detectors PAA-ResNet101 and PAA-ResNeXt101 to $46 \rm AP$ and $47. 5\rm AP$ on the COCO test-dev set.
no code implementations • 12 Jun 2021 • Thuy C. Nguyen, Tuan N. Tang, Nam LH. Phan, Chuong H. Nguyen, Masayuki Yamazaki, Masao Yamanaka
Video Instance Segmentation (VIS) is a multi-task problem performing detection, segmentation, and tracking simultaneously.
Ranked #20 on Video Instance Segmentation on YouTube-VIS validation
1 code implementation • 25 Apr 2021 • Chuong H. Nguyen, Thuy C. Nguyen, Anh H. Vo, Yamazaki Masayuki
While being simple and flexible, our proposed SSCOD built upon ATSSNet performs significantly better than the baseline of the standard object detection, while still be able to match objects of unknown categories.