no code implementations • 12 Dec 2023 • Jayoung Kim, Yehjin Shin, Jeongwhan Choi, Hyowon Wi, Noseong Park
Structured data, which constitutes a significant portion of existing data types, has been a long-standing research topic in the field of machine learning.
no code implementations • 7 Dec 2023 • Jeongwhan Choi, Hyowon Wi, Jayoung Kim, Yehjin Shin, Kookjin Lee, Nathaniel Trask, Noseong Park
Transformers, renowned for their self-attention mechanism, have achieved state-of-the-art performance across various tasks in natural language processing, computer vision, time-series modeling, etc.
1 code implementation • 25 Apr 2023 • Chaejeong Lee, Jayoung Kim, Noseong Park
With growing attention to tabular data these days, the attempt to apply a synthetic table to various tasks has been expanded toward various scenarios.
1 code implementation • 8 Oct 2022 • Jayoung Kim, Chaejeong Lee, Noseong Park
Our proposed training strategy includes a self-paced learning technique and a fine-tuning strategy, which further increases the sampling quality and diversity by stabilizing the denoising score matching training.
1 code implementation • 17 Aug 2022 • Jihyeon Hyeong, Jayoung Kim, Noseong Park, Sushil Jajodia
Tabular data typically contains private and important information; thus, precautions must be taken before they are shared with others.
1 code implementation • 17 Jun 2022 • Jayoung Kim, Chaejeong Lee, Yehjin Shin, Sewon Park, Minjung Kim, Noseong Park, Jihoon Cho
To our knowledge, we are the first presenting a score-based tabular data oversampling method.
no code implementations • 19 Apr 2022 • Sheo Yon Jhin, Jaehoon Lee, Minju Jo, Seungji Kook, Jinsung Jeon, Jihyeon Hyeong, Jayoung Kim, Noseong Park
Deep learning inspired by differential equations is a recent research trend and has marked the state of the art performance for many machine learning tasks.
1 code implementation • ICLR 2022 • Jaehoon Lee, Jinsung Jeon, Sheo Yon Jhin, Jihyeon Hyeong, Jayoung Kim, Minju Jo, Kook Seungji, Noseong Park
The problem of processing very long time-series data (e. g., a length of more than 10, 000) is a long-standing research problem in machine learning.
1 code implementation • 31 May 2021 • Jayoung Kim, Jinsung Jeon, Jaehoon Lee, Jihyeon Hyeong, Noseong Park
Synthesizing tabular data is attracting much attention these days for various purposes.