Search Results for author: Kaiyou Song

Found 6 papers, 5 papers with code

Bootstrap Masked Visual Modeling via Hard Patches Mining

1 code implementation21 Dec 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tiancai Wang, Xiangyu Zhang, Zhaoxiang Zhang

To empower the model as a teacher, we propose Hard Patches Mining (HPM), predicting patch-wise losses and subsequently determining where to mask.

Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning

1 code implementation16 Dec 2023 Kaiyou Song, Shan Zhang, Tong Wang

In this study, inspired by human beings' way of grasping an image, i. e., focusing on the main object first, we present a semantic-aware autoregressive image modeling (SemAIM) method to tackle this challenge.

Image Classification Instance Segmentation +4

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

1 code implementation NeurIPS 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang

As it is empirically observed that Vision Transformers (ViTs) are quite insensitive to the order of input tokens, the need for an appropriate self-supervised pretext task that enhances the location awareness of ViTs is becoming evident.

Position

Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning

1 code implementation CVPR 2023 Kaiyou Song, Jin Xie, Shan Zhang, Zimeng Luo

Different from existing SSL-KD methods that transfer knowledge from a static pre-trained teacher to a student, in MOKD, two different models learn collaboratively in a self-supervised manner.

Knowledge Distillation Representation Learning +1

Hard Patches Mining for Masked Image Modeling

1 code implementation CVPR 2023 Haochen Wang, Kaiyou Song, Junsong Fan, Yuxi Wang, Jin Xie, Zhaoxiang Zhang

We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.

Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning

no code implementations ICCV 2023 Kaiyou Song, Shan Zhang, Zihao An, Zimeng Luo, Tong Wang, Jin Xie

In contrastive self-supervised learning, the common way to learn discriminative representation is to pull different augmented "views" of the same image closer while pushing all other images further apart, which has been proven to be effective.

Representation Learning Self-Supervised Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.