Search Results for author: WonJun Moon

Found 9 papers, 9 papers with code

VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting

1 code implementation • 27 Dec 2023 • Seunggu Kang, WonJun Moon, Euiyeon Kim, Jae-Pil Heo

Zero-Shot Object Counting (ZSOC) aims to count referred instances of arbitrary classes in a query image without human-annotated exemplars.

Ranked #2 on Zero-Shot Counting on FSC147

Decoder, Object Counting +1

Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding

2 code implementations • 15 Nov 2023 • WonJun Moon, Sangeek Hyun, SuBeen Lee, Jae-Pil Heo

Dummy tokens conditioned by text query take portions of the attention weights, preventing irrelevant video clips from being represented by the text query.

Ranked #1 on Highlight Detection on TvSum

Highlight Detection, Moment Retrieval +3

Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification

1 code implementation • 28 Jul 2023 • SuBeen Lee, WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM.

Cross-Domain Few-Shot, Fine-Grained Image Classification

Leveraging Hidden Positives for Unsupervised Semantic Segmentation

1 code implementation • CVPR 2023 • Hyun Seok Seong, WonJun Moon, SuBeen Lee, Jae-Pil Heo

Specifically, we add the loss propagating to local hidden positives, semantically similar nearby patches, in proportion to the predefined similarity scores.

Ranked #2 on Unsupervised Semantic Segmentation on Potsdam-3

Contrastive Learning, Unsupervised Semantic Segmentation

Query-Dependent Video Representation for Moment Retrieval and Highlight Detection

1 code implementation • CVPR 2023 • WonJun Moon, Sangeek Hyun, Sanguk Park, Dongchan Park, Jae-Pil Heo

As we observe the insignificant role of a given query in transformer architectures, our encoding module starts with cross-attention layers to explicitly inject the context of text query into video representation.

Ranked #2 on Highlight Detection on TvSum

Highlight Detection, Moment Retrieval +4

Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition

1 code implementation • 24 Nov 2022 • WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

A dramatic increase in real-world video volume with extremely diverse and emerging topics naturally forms a long-tailed video distribution in terms of their categories, and it spotlights the need for Video Long-Tailed Recognition (VLTR).

Difficulty-Aware Simulator for Open Set Recognition

1 code implementation • 20 Jul 2022 • WonJun Moon, Junho Park, Hyun Seok Seong, Cheol-Ho Cho, Jae-Pil Heo

Furthermore, moderate- and easy-difficulty samples are also yielded by our modified GAN and Copycat, respectively.

Generative Adversarial Network, Open Set Learning

Tailoring Self-Supervision for Supervised Learning

1 code implementation • 20 Jul 2022 • WonJun Moon, Ji-Hwan Kim, Jae-Pil Heo

Our exhaustive experiments validate the merits of LoRot as a pretext task tailored for supervised learning in terms of robustness and generalization capability.

Ranked #9 on Data Augmentation on ImageNet

Adversarial Robustness, Data Augmentation +3

Task Discrepancy Maximization for Fine-grained Few-Shot Classification

1 code implementation • CVPR 2022 • SuBeen Lee, WonJun Moon, Jae-Pil Heo

Specifically, TDM learns task-specific channel weights based on two novel components: Support Attention Module (SAM) and Query Attention Module (QAM).

Ranked #9 on Few-Shot Image Classification on CUB 200 5-way 5-shot (using extra training data)

Classification, Few-Shot Image Classification
