Search Results for author: WonJun Moon

Found 9 papers, 9 papers with code

VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting

1 code implementation27 Dec 2023 Seunggu Kang, WonJun Moon, Euiyeon Kim, Jae-Pil Heo

Zero-Shot Object Counting (ZSOC) aims to count referred instances of arbitrary classes in a query image without human-annotated exemplars.

Decoder Object Counting +1

Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding

2 code implementations15 Nov 2023 WonJun Moon, Sangeek Hyun, SuBeen Lee, Jae-Pil Heo

Dummy tokens conditioned by text query take portions of the attention weights, preventing irrelevant video clips from being represented by the text query.

Highlight Detection Moment Retrieval +3

Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification

1 code implementation28 Jul 2023 SuBeen Lee, WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM.

Cross-Domain Few-Shot Fine-Grained Image Classification

Leveraging Hidden Positives for Unsupervised Semantic Segmentation

1 code implementation CVPR 2023 Hyun Seok Seong, WonJun Moon, SuBeen Lee, Jae-Pil Heo

Specifically, we add the loss propagating to local hidden positives, semantically similar nearby patches, in proportion to the predefined similarity scores.

Contrastive Learning Unsupervised Semantic Segmentation

Query-Dependent Video Representation for Moment Retrieval and Highlight Detection

1 code implementation CVPR 2023 WonJun Moon, Sangeek Hyun, Sanguk Park, Dongchan Park, Jae-Pil Heo

As we observe the insignificant role of a given query in transformer architectures, our encoding module starts with cross-attention layers to explicitly inject the context of text query into video representation.

Highlight Detection Moment Retrieval +4

Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition

1 code implementation24 Nov 2022 WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

A dramatic increase in real-world video volume with extremely diverse and emerging topics naturally forms a long-tailed video distribution in terms of their categories, and it spotlights the need for Video Long-Tailed Recognition (VLTR).

Difficulty-Aware Simulator for Open Set Recognition

1 code implementation20 Jul 2022 WonJun Moon, Junho Park, Hyun Seok Seong, Cheol-Ho Cho, Jae-Pil Heo

Furthermore, moderate- and easy-difficulty samples are also yielded by our modified GAN and Copycat, respectively.

Generative Adversarial Network Open Set Learning

Tailoring Self-Supervision for Supervised Learning

1 code implementation20 Jul 2022 WonJun Moon, Ji-Hwan Kim, Jae-Pil Heo

Our exhaustive experiments validate the merits of LoRot as a pretext task tailored for supervised learning in terms of robustness and generalization capability.

Adversarial Robustness Data Augmentation +3

Task Discrepancy Maximization for Fine-grained Few-Shot Classification

1 code implementation CVPR 2022 SuBeen Lee, WonJun Moon, Jae-Pil Heo

Specifically, TDM learns task-specific channel weights based on two novel components: Support Attention Module (SAM) and Query Attention Module (QAM).

Ranked #9 on Few-Shot Image Classification on CUB 200 5-way 5-shot (using extra training data)

Classification Few-Shot Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.