Search Results for author: Joonseok Lee

Found 31 papers, 14 papers with code

Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval

no code implementations 10 Jan 2024 Eunyi Lyou, Doyeon Lee, Jooeun Kim, Joonseok Lee

Zero-shot learning offers an efficient solution for a machine learning model to treat unseen categories, avoiding exhaustive data collection.

Representation Learning Retrieval +2

Activity Grammars for Temporal Action Segmentation

1 code implementation NeurIPS 2023 Dayoung Gong, Joonseok Lee, Deunsol Jung, Suha Kwak, Minsu Cho

Sequence prediction on temporal data requires the ability to understand compositional structures of multi-level semantics beyond individual and contextual properties.

Action Segmentation Segmentation

V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

no code implementations 11 May 2023 Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk

Video-to-music generation demands both a temporally localized high-quality listening experience and globally aligned video-acoustic signatures.

Music Generation

Perspective Projection-Based 3D CT Reconstruction from Biplanar X-rays

1 code implementation 9 Mar 2023 Daeun Kyung, Kyungmin Jo, Jaegul Choo, Joonseok Lee, Edward Choi

X-ray computed tomography (CT) is one of the most common imaging techniques used to diagnose various diseases in the medical field.

Computed Tomography (CT)

MAQA: A Multimodal QA Benchmark for Negation

no code implementations 9 Jan 2023 Judith Yue Li, Aren Jansen, Qingqing Huang, Joonseok Lee, Ravi Ganti, Dima Kuzmin

Multimodal learning can benefit from the representation power of pretrained Large Language Models (LLMs).

Negation Question Answering

A Unified Framework for Robustness on Diverse Sampling Errors

no code implementations ICCV 2023 Myeongho Jeon, Myungjoo Kang, Joonseok Lee

Instead of assuming and fixing the target distribution at training, our proposed approach allows adjusting the feature space the model refers to at every prediction, i.e., instance-wise adaptive inference.

Domain Generalization

Towards Efficient Neural Scene Graphs by Learning Consistency Fields

no code implementations 9 Oct 2022 Yeji Song, Chaerin Kong, Seoyoung Lee, Nojun Kwak, Joonseok Lee

Neural Radiance Fields (NeRF) achieves photo-realistic image rendering from novel views, and Neural Scene Graphs (NSG) (Ost et al., 2021) extends it to dynamic scenes (video) with multiple objects.

MuLan: A Joint Embedding of Music Audio and Natural Language

1 code implementation 26 Aug 2022 Qingqing Huang, Aren Jansen, Joonseok Lee, Ravi Ganti, Judith Yue Li, Daniel P. W. Ellis

Music tagging and content-based retrieval systems have traditionally been constructed using pre-defined ontologies covering a rigid set of music attributes or text queries.

Cross-Modal Retrieval Music Tagging +2

Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer

1 code implementation 15 Apr 2022 Hyungyung Lee, Sungjin Park, Joonseok Lee, Edward Choi

To learn a multimodal semantic correlation in a quantized space, we combine VQ-VAE with a Transformer encoder and apply an input masking strategy.

multimodal generation Quantization

Boundary-aware Self-supervised Learning for Video Scene Segmentation

1 code implementation 14 Jan 2022 Jonghwan Mun, Minchul Shin, Gunsoo Han, Sangho Lee, Seongsu Ha, Joonseok Lee, Eun-Sol Kim

Inspired by this, we tackle video scene segmentation, the task of temporally localizing scene boundaries in a video, with a self-supervised learning framework in which we mainly focus on designing effective pretext tasks.

Scene Segmentation Self-Supervised Learning

A Conservative Approach for Unbiased Learning on Unknown Biases

1 code implementation CVPR 2022 Myeongho Jeon, Daekyung Kim, Woochul Lee, Myungjoo Kang, Joonseok Lee

Although convolutional neural networks (CNNs) achieve state-of-the-art in image classification, recent works address their unreliable predictions due to their excessive dependence on biased training data.

Image Classification

Exploration into Translation-Equivariant Image Quantization

2 code implementations 1 Dec 2021 Woncheol Shin, Gyubok Lee, Jiyoung Lee, Eunyi Lyou, Joonseok Lee, Edward Choi

This is an exploratory study that discovers that current image quantization (vector quantization) methods do not satisfy translation equivariance in the quantized space due to aliasing.

Quantization Text Generation +2

Boundary-aware Pre-training for Video Scene Segmentation

no code implementations 29 Sep 2021 Jonghwan Mun, Minchul Shin, Gunsoo Han, Sangho Lee, Seongsu Ha, Joonseok Lee, Eun-Sol Kim

Inspired by this, we tackle video scene segmentation, the task of temporally localizing scene boundaries in a video, with a self-supervised learning framework in which we mainly focus on designing effective pretext tasks.

Scene Segmentation Self-Supervised Learning

Session-aware Linear Item-Item Models for Session-based Recommendation

3 code implementations 30 Mar 2021 Minjin Choi, Jinhong Kim, Joonseok Lee, Hyunjung Shim, Jongwuk Lee

Session-based recommendation aims at predicting the next item given a sequence of previous items consumed in the session, e.g., on e-commerce or multimedia streaming services.

Session-Based Recommendations

Local Collaborative Autoencoders

1 code implementation 30 Mar 2021 Minjin Choi, Yoonki Jeong, Joonseok Lee, Jongwuk Lee

Top-N recommendation is a challenging problem because complex and sparse user-item interactions should be adequately addressed to achieve high-quality recommendation results.

A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus

no code implementations 18 Nov 2020 Bowen Zhang, Hexiang Hu, Joonseok Lee, Ming Zhao, Sheide Chammas, Vihan Jain, Eugene Ie, Fei Sha

Identifying a short segment in a long video that semantically matches a text query is a challenging task that has important application potentials in language-based video search, browsing, and navigation.

Language Modelling Masked Language Modeling +3

Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation

1 code implementation 16 Oct 2020 Sunghyun Park, Kangyeol Kim, Junsoo Lee, Jaegul Choo, Joonseok Lee, Sookyung Kim, Edward Choi

Video generation models often operate under the assumption of fixed frame rates, which leads to suboptimal performance when it comes to handling flexible frame rates (e.g., increasing the frame rate of the more dynamic portion of the video as well as handling missing video frames).

Video Generation

Saving Face: Investigating the Ethical Concerns of Facial Recognition Auditing

no code implementations 3 Jan 2020 Inioluwa Deborah Raji, Timnit Gebru, Margaret Mitchell, Joy Buolamwini, Joonseok Lee, Emily Denton

Although essential to revealing biased performance, well-intentioned attempts at algorithmic auditing can have effects that may harm the very populations these measures are meant to protect.

Computers and Society

N-GCN: Multi-scale Graph Convolution for Semi-supervised Node Classification

1 code implementation 24 Feb 2018 Sami Abu-El-Haija, Amol Kapoor, Bryan Perozzi, Joonseok Lee

Graph Convolutional Networks (GCNs) have shown significant improvements in semi-supervised learning on graph-structured data.

General Classification Node Classification

Network of Graph Convolutional Networks Trained on Random Walks

no code implementations ICLR 2018 Sami Abu-El-Haija, Amol Kapoor, Bryan Perozzi, Joonseok Lee

Graph Convolutional Networks (GCNs) are a recently proposed architecture which has had success in semi-supervised learning on graph-structured data.

General Classification Node Classification

Personalized Academic Research Paper Recommendation System

no code implementations 19 Apr 2013 Joonseok Lee, Kisung Lee, Jennifer G. Kim

To ease this difficulty, we propose a Personalized Academic Research Paper Recommendation System, which recommends related articles, for each researcher, that may be interesting to her/him.

Collaborative Filtering Recommendation Systems +1

Automatic Feature Induction for Stagewise Collaborative Filtering

no code implementations NeurIPS 2012 Joonseok Lee, Mingxuan Sun, Seungyeon Kim, Guy Lebanon

Recent approaches to collaborative filtering have concentrated on estimating an algebraic or statistical model, and using the model for predicting missing ratings.

Collaborative Filtering
