Search Results for author: Sercan O. Arik

Found 32 papers, 14 papers with code

PAITS: Pretraining and Augmentation for Irregularly-Sampled Time Series

1 code implementation · 25 Aug 2023 · Nicasia Beebe-Wang, Sayna Ebrahimi, Jinsung Yoon, Sercan O. Arik, Tomas Pfister

In this paper, we present PAITS (Pretraining and Augmentation for Irregularly-sampled Time Series), a framework for identifying suitable pretraining strategies for sparse and irregularly sampled time series datasets.

Time Series

Business Metric-Aware Forecasting for Inventory Management

no code implementations · 24 Aug 2023 · Helen Zhou, Sercan O. Arik, Jingtao Wang

We explore a wide range of plausible cost trade-off scenarios, and empirically demonstrate that end-to-end optimization often outperforms optimization of standard business-agnostic forecasting metrics (by up to 45.7% for a simple scaling model, and up to 54.0% for an LSTM encoder-decoder model).

Management, Time Series
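The abstract above argues for optimizing business costs end to end rather than business-agnostic accuracy metrics. As a minimal sketch of that general idea (illustrative costs and model, not the paper's formulation), one can train a forecaster directly on an asymmetric inventory cost in which unmet demand is penalized more heavily than leftover stock:

```python
import torch

def inventory_cost(forecast, demand, holding_cost=1.0, stockout_cost=9.0):
    """Asymmetric business cost: leftover stock is cheap, unmet demand is expensive.
    The cost constants are illustrative, not values from the paper."""
    overstock = torch.clamp(forecast - demand, min=0.0)
    shortage = torch.clamp(demand - forecast, min=0.0)
    return (holding_cost * overstock + stockout_cost * shortage).mean()

# Toy end-to-end training of a simple linear forecaster on the business cost.
torch.manual_seed(0)
history = torch.rand(256, 8)                                  # past 8 demand values per series
demand = history.mean(dim=1, keepdim=True) * 1.2 + 0.1 * torch.randn(256, 1)

model = torch.nn.Linear(8, 1)
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
for _ in range(200):
    opt.zero_grad()
    inventory_cost(model(history), demand).backward()
    opt.step()
```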

LANISTR: Multimodal Learning from Structured and Unstructured Data

no code implementations · 26 May 2023 · Sayna Ebrahimi, Sercan O. Arik, Yihe Dong, Tomas Pfister

Multimodal large-scale pretraining has shown impressive performance for unstructured data including language, image, audio, and video.

Time Series

Universal Self-Adaptive Prompting

no code implementations · 24 May 2023 · Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Hanjun Dai, Julian Martin Eisenschlos, Sercan O. Arik, Tomas Pfister

A hallmark of modern large language models (LLMs) is their impressive general zero-shot and few-shot abilities, often elicited through in-context learning (ICL) via prompting.

In-Context Learning, Natural Language Understanding +2

Better Zero-Shot Reasoning with Self-Adaptive Prompting

no code implementations · 23 May 2023 · Xingchen Wan, Ruoxi Sun, Hanjun Dai, Sercan O. Arik, Tomas Pfister

Modern large language models (LLMs) have demonstrated impressive capabilities at sophisticated tasks, often through step-by-step reasoning similar to humans.

SLM: End-to-end Feature Selection via Sparse Learnable Masks

no code implementations · 6 Apr 2023 · Yihe Dong, Sercan O. Arik

Feature selection has been widely used to alleviate compute requirements during training, elucidate model interpretability, and improve model generalizability.

feature selection
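The entry above is about learning which features to keep as part of training. A generic sketch of the underlying idea (a learnable soft mask with a sparsity penalty, trained jointly with the predictor; not the SLM mechanism itself) might look like this:

```python
import torch

class MaskedFeatureSelector(torch.nn.Module):
    """Soft feature gates learned jointly with a linear predictor; an L1 penalty
    on the gates encourages sparsity. Hyperparameters are illustrative."""

    def __init__(self, num_features):
        super().__init__()
        self.mask_logits = torch.nn.Parameter(torch.zeros(num_features))
        self.head = torch.nn.Linear(num_features, 1)

    def forward(self, x):
        mask = torch.sigmoid(self.mask_logits)      # gates in [0, 1]
        return self.head(x * mask), mask

torch.manual_seed(0)
x = torch.randn(512, 20)
y = x[:, :3].sum(dim=1, keepdim=True)               # only the first 3 features matter

model = MaskedFeatureSelector(20)
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
for _ in range(500):
    opt.zero_grad()
    pred, mask = model(x)
    loss = torch.nn.functional.mse_loss(pred, y) + 1e-2 * mask.sum()
    loss.backward()
    opt.step()

selected = (torch.sigmoid(model.mask_logits) > 0.5).nonzero().flatten()
print(selected)                                      # expected to contain features 0, 1, 2
```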

TSMixer: An All-MLP Architecture for Time Series Forecasting

2 code implementations · 10 Mar 2023 · Si-An Chen, Chun-Liang Li, Nate Yoder, Sercan O. Arik, Tomas Pfister

Extending them, in this paper, we investigate the capabilities of linear models for time-series forecasting and present Time-Series Mixer (TSMixer), a novel architecture designed by stacking multi-layer perceptrons (MLPs).

Time Series, Time Series Forecasting
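The description above (stacking MLPs across the time and feature dimensions) lends itself to a compact sketch. The block below alternates time-mixing and feature-mixing MLPs with residual connections; layer sizes, norms, and other details follow common practice rather than the exact TSMixer recipe:

```python
import torch

class MixerBlock(torch.nn.Module):
    """One time-mixing + feature-mixing block, a loose sketch of the stacked-MLP idea."""

    def __init__(self, seq_len, num_features):
        super().__init__()
        self.time_mlp = torch.nn.Linear(seq_len, seq_len)
        self.feat_mlp = torch.nn.Sequential(
            torch.nn.Linear(num_features, num_features * 2),
            torch.nn.ReLU(),
            torch.nn.Linear(num_features * 2, num_features),
        )

    def forward(self, x):                                          # x: (batch, seq_len, features)
        x = x + self.time_mlp(x.transpose(1, 2)).transpose(1, 2)   # mix over time
        x = x + self.feat_mlp(x)                                    # mix over features
        return x

class TinyMixerForecaster(torch.nn.Module):
    def __init__(self, seq_len, num_features, horizon, depth=2):
        super().__init__()
        self.blocks = torch.nn.Sequential(
            *[MixerBlock(seq_len, num_features) for _ in range(depth)])
        self.proj = torch.nn.Linear(seq_len, horizon)               # temporal projection

    def forward(self, x):
        x = self.blocks(x)
        return self.proj(x.transpose(1, 2)).transpose(1, 2)         # (batch, horizon, features)

y = TinyMixerForecaster(seq_len=96, num_features=7, horizon=24)(torch.randn(4, 96, 7))
print(y.shape)  # torch.Size([4, 24, 7])
```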

Neural Spline Search for Quantile Probabilistic Modeling

no code implementations · 12 Jan 2023 · Ruoxi Sun, Chun-Liang Li, Sercan O. Arik, Michael W. Dusenberry, Chen-Yu Lee, Tomas Pfister

Accurate estimation of output quantiles is crucial in many use cases, where it is desired to model the range of possibility.

Attribute regression +2
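For context on quantile probabilistic modeling, the standard quantile (pinball) loss below is what such models are typically trained with; it is generic background, not the neural spline parameterization proposed in the paper:

```python
import torch

def pinball_loss(pred, target, quantile):
    """Standard quantile (pinball) loss: the q-th quantile minimizes this objective."""
    diff = target - pred
    return torch.mean(torch.maximum(quantile * diff, (quantile - 1.0) * diff))

# Example: recover the 0.1 / 0.5 / 0.9 quantiles of a noisy signal with constant predictors.
torch.manual_seed(0)
target = torch.randn(10_000)
for q in (0.1, 0.5, 0.9):
    est = torch.zeros(1, requires_grad=True)
    opt = torch.optim.Adam([est], lr=0.05)
    for _ in range(500):
        opt.zero_grad()
        pinball_loss(est, target, q).backward()
        opt.step()
    print(f"q={q:.1f}: estimate={est.item():.3f}")
```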

Provable Membership Inference Privacy

no code implementations · 12 Nov 2022 · Zachary Izzo, Jinsung Yoon, Sercan O. Arik, James Zou

However, DP's strong theoretical guarantees often come at the cost of a large drop in its utility for machine learning, and DP guarantees themselves can be difficult to interpret.

Test-Time Adaptation for Visual Document Understanding

no code implementations · 15 Jun 2022 · Sayna Ebrahimi, Sercan O. Arik, Tomas Pfister

For visual document understanding (VDU), self-supervised pretraining has been shown to successfully generate transferable representations; yet effective adaptation of such representations to distribution shifts at test time remains an unexplored area.

document understanding, Language Modelling +5

Self-Adaptive Forecasting for Improved Deep Learning on Non-Stationary Time-Series

no code implementations · 4 Feb 2022 · Sercan O. Arik, Nathanael C. Yoder, Tomas Pfister

Real-world time-series datasets often violate the assumptions of standard supervised learning for forecasting -- their distributions evolve over time, rendering the conventional training and model selection procedures suboptimal.

Model Selection, Self-Supervised Learning +2

Controlling Neural Networks with Rule Representations

1 code implementation · NeurIPS 2021 · Sungyong Seo, Sercan O. Arik, Jinsung Yoon, Xiang Zhang, Kihyuk Sohn, Tomas Pfister

The key aspect of DeepCTRL is that it does not require retraining to adapt the rule strength -- at inference, the user can adjust it based on the desired operation point on accuracy vs. rule verification ratio.

Decision Making
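The abstract highlights adjusting rule strength at inference without retraining. A toy illustration of that control knob (the two-branch blending here is our simplification, not the actual DeepCTRL coupling) is:

```python
import torch

class RuleControlledModel(torch.nn.Module):
    """Illustrative two-branch model: one branch fits the data, the other encodes a
    rule-based prior. The blend weight `alpha` is chosen by the user at inference,
    which is the spirit of the rule-strength control described above."""

    def __init__(self, dim):
        super().__init__()
        self.data_branch = torch.nn.Linear(dim, 1)
        self.rule_branch = torch.nn.Linear(dim, 1)

    def forward(self, x, alpha):
        return alpha * self.rule_branch(x) + (1.0 - alpha) * self.data_branch(x)

model = RuleControlledModel(dim=5)
x = torch.randn(3, 5)
for alpha in (0.0, 0.5, 1.0):        # operating point picked at inference, no retraining
    print(alpha, model(x, alpha).detach().flatten())
```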

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

6 code implementations · 26 May 2021 · Zizhao Zhang, Han Zhang, Long Zhao, Ting Chen, Sercan O. Arik, Tomas Pfister

Hierarchical structures are popular in recent vision transformers, however, they require sophisticated designs and massive datasets to work well.

Image Classification, Image Generation

Explaining Deep Neural Networks using Unsupervised Clustering

no code implementations · 15 Jul 2020 · Yu-Han Liu, Sercan O. Arik

We propose a novel method to explain trained deep neural networks (DNNs), by distilling them into surrogate models using unsupervised clustering.

Clustering
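A hedged sketch of the general recipe implied above, namely clustering a trained network's hidden activations and inspecting how its predictions distribute over the clusters, could use scikit-learn's KMeans (our choice of tooling, not necessarily the paper's):

```python
import numpy as np
import torch
from sklearn.cluster import KMeans

# A trained network to explain (random weights here, purely for illustration).
torch.manual_seed(0)
net = torch.nn.Sequential(torch.nn.Linear(10, 32), torch.nn.ReLU(), torch.nn.Linear(32, 3))

x = torch.randn(1000, 10)
with torch.no_grad():
    hidden = net[:2](x)                 # activations of the penultimate layer
    preds = net(x).argmax(dim=1)

# Cluster the hidden representations; each cluster acts as a coarse surrogate unit.
clusters = KMeans(n_clusters=5, n_init=10, random_state=0).fit_predict(hidden.numpy())

# Inspect how the network's predictions distribute over clusters.
for c in range(5):
    counts = np.bincount(preds.numpy()[clusters == c], minlength=3)
    print(f"cluster {c}: prediction histogram {counts}")
```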

Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting

34 code implementations · 19 Dec 2019 · Bryan Lim, Sercan O. Arik, Nicolas Loeff, Tomas Pfister

Multi-horizon forecasting problems often contain a complex mix of inputs -- including static (i.e. time-invariant) covariates, known future inputs, and other exogenous time series that are only observed historically -- without any prior information on how they interact with the target.

Interpretable Machine Learning, Time Series +1
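As a small illustration of the three input types named in the abstract, here is one plausible batch layout for a multi-horizon model; the shapes and field names are our own, not the TFT reference code:

```python
import torch

# One plausible batch layout for the three input types named above.
batch, lookback, horizon = 32, 168, 24
inputs = {
    "static": torch.randn(batch, 4),                            # time-invariant covariates
    "known_future": torch.randn(batch, lookback + horizon, 3),  # e.g. calendar features
    "observed_past": torch.randn(batch, lookback, 6),           # exogenous, past-only series
    "target_past": torch.randn(batch, lookback, 1),
}
# A multi-horizon quantile forecaster maps this batch to (batch, horizon, num_quantiles),
# e.g. (32, 24, 3) for the 0.1 / 0.5 / 0.9 quantiles.
for name, tensor in inputs.items():
    print(name, tuple(tensor.shape))
```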

On Completeness-aware Concept-Based Explanations in Deep Neural Networks

2 code implementations · NeurIPS 2020 · Chih-Kuan Yeh, Been Kim, Sercan O. Arik, Chun-Liang Li, Tomas Pfister, Pradeep Ravikumar

Next, we propose a concept discovery method that aims to infer a complete set of concepts that are additionally encouraged to be interpretable, which addresses the limitations of existing methods on concept explanations.

Consistency-based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

no code implementations · ECCV 2020 · Mingfei Gao, Zizhao Zhang, Guo Yu, Sercan O. Arik, Larry S. Davis, Tomas Pfister

Active learning (AL) combines data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance.

Active Learning, Image Classification +1
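A minimal sketch of a consistency-based acquisition rule, in the spirit of the entry above: score unlabeled samples by how much their predictions vary under random augmentation and query the least consistent ones. The augmentation, model, and scoring details are illustrative, and the paper's exact criterion may differ:

```python
import torch

def inconsistency_scores(model, x_unlabeled, augment, num_passes=8):
    """Score each unlabeled sample by the variance of its predicted class
    probabilities across random augmentations; higher score = better labeling candidate."""
    model.eval()
    with torch.no_grad():
        probs = torch.stack([
            torch.softmax(model(augment(x_unlabeled)), dim=1) for _ in range(num_passes)
        ])                                        # (num_passes, N, num_classes)
    return probs.var(dim=0).sum(dim=1)            # (N,)

# Toy usage with Gaussian-noise "augmentation" and a random linear classifier.
torch.manual_seed(0)
model = torch.nn.Linear(16, 4)
pool = torch.randn(100, 16)
scores = inconsistency_scores(model, pool, lambda x: x + 0.1 * torch.randn_like(x))
to_label = scores.topk(10).indices                # query the 10 least consistent samples
print(to_label)
```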

Distilling Effective Supervision from Severe Label Noise

2 code implementations · CVPR 2020 · Zizhao Zhang, Han Zhang, Sercan O. Arik, Honglak Lee, Tomas Pfister

For instance, on CIFAR100 with a $40\%$ uniform noise ratio and only 10 trusted labeled data per class, our method achieves $80.2{\pm}0.3\%$ classification accuracy, where the error rate is only $1.4\%$ higher than a neural network trained without label noise.

Image Classification

LIMIS: Locally Interpretable Modeling using Instance-wise Subsampling

1 code implementation · 26 Sep 2019 · Jinsung Yoon, Sercan O. Arik, Tomas Pfister

Understanding black-box machine learning models is crucial for their widespread adoption.

Reinforcement Learning (RL)

Data Valuation using Reinforcement Learning

1 code implementation · ICML 2020 · Jinsung Yoon, Sercan O. Arik, Tomas Pfister

To adaptively learn data values jointly with the target task predictor model, we propose a meta learning framework which we name Data Valuation using Reinforcement Learning (DVRL).

Data Valuation, Domain Adaptation +4
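A compact REINFORCE-style sketch of the idea of learning per-sample data values with validation performance as the reward. The predictor, reward shaping, and hyperparameters here are illustrative stand-ins rather than the DVRL algorithm as published:

```python
import torch

torch.manual_seed(0)

# Toy data: the last 200 training labels are flipped (low-value samples).
x_train = torch.randn(1000, 10); w_true = torch.randn(10, 1)
y_train = (x_train @ w_true > 0).float(); y_train[-200:] = 1 - y_train[-200:]
x_val = torch.randn(200, 10); y_val = (x_val @ w_true > 0).float()

value_net = torch.nn.Sequential(torch.nn.Linear(11, 32), torch.nn.ReLU(),
                                torch.nn.Linear(32, 1), torch.nn.Sigmoid())
v_opt = torch.optim.Adam(value_net.parameters(), lr=1e-3)
baseline = 0.5

for step in range(300):
    # 1) Estimate a selection probability for every (x, y) pair and sample a mask.
    probs = value_net(torch.cat([x_train, y_train], dim=1)).squeeze(1)
    mask = torch.bernoulli(probs).detach()

    # 2) Fit a fresh predictor on the selected subset (cheap logistic regression).
    predictor = torch.nn.Linear(10, 1)
    p_opt = torch.optim.SGD(predictor.parameters(), lr=0.1)
    for _ in range(20):
        p_opt.zero_grad()
        per_sample = torch.nn.functional.binary_cross_entropy_with_logits(
            predictor(x_train).squeeze(1), y_train.squeeze(1), reduction="none")
        (per_sample * mask).mean().backward()
        p_opt.step()

    # 3) Reward = validation accuracy; REINFORCE update of the value network.
    with torch.no_grad():
        acc = ((predictor(x_val).squeeze(1) > 0) == (y_val.squeeze(1) > 0.5)).float().mean()
    log_prob = (mask * torch.log(probs + 1e-8)
                + (1 - mask) * torch.log(1 - probs + 1e-8)).mean()
    v_opt.zero_grad()
    (-(acc - baseline) * log_prob).backward()
    v_opt.step()
    baseline = 0.9 * baseline + 0.1 * acc.item()

# The noisy samples should now receive lower estimated values than the clean ones.
values = value_net(torch.cat([x_train, y_train], dim=1)).squeeze(1)
print(values[:800].mean().item(), values[-200:].mean().item())
```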

Consistency-Based Semi-Supervised Active Learning: Towards Minimizing Labeling Budget

no code implementations · 25 Sep 2019 · Mingfei Gao, Zizhao Zhang, Guo Yu, Sercan O. Arik, Larry S. Davis, Tomas Pfister

Active learning (AL) aims to integrate data labeling and model training in a unified way, and to minimize the labeling budget by prioritizing the selection of high value data that can best improve model performance.

Active Learning, Representation Learning

Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning

no code implementations · ECCV 2020 · Linchao Zhu, Sercan O. Arik, Yi Yang, Tomas Pfister

We propose a novel adaptive transfer learning framework, learning to transfer learn (L2TL), to improve performance on a target dataset by careful extraction of the related information from a source dataset.

reinforcement-learning, Reinforcement Learning (RL) +1

TabNet: Attentive Interpretable Tabular Learning

19 code implementations · 20 Aug 2019 · Sercan O. Arik, Tomas Pfister

We propose a novel high-performance and interpretable canonical deep tabular data learning architecture, TabNet.

Decision Making, Poker Hand Classification +2
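Among the many community implementations, the `pytorch-tabnet` package (assumed installed here; it is not the authors' official code) exposes a scikit-learn-style interface, so a quick start on tabular data can look like this:

```python
# pip install pytorch-tabnet   (community implementation, assumed available)
import numpy as np
from pytorch_tabnet.tab_model import TabNetClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 16)).astype(np.float32)
y = (X[:, 0] + X[:, 3] > 0).astype(np.int64)
X_train, X_valid, y_train, y_valid = X[:1600], X[1600:], y[:1600], y[1600:]

clf = TabNetClassifier(n_d=8, n_a=8, n_steps=3)          # small model for the toy data
clf.fit(X_train, y_train, eval_set=[(X_valid, y_valid)], max_epochs=50, patience=10)

preds = clf.predict(X_valid)
# Per-feature importances are derived from the aggregated attentive masks, which is
# what makes the architecture interpretable.
print(clf.feature_importances_)
```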

ProtoAttend: Attention-Based Prototypical Learning

4 code implementations · 17 Feb 2019 · Sercan O. Arik, Tomas Pfister

We propose a novel inherently interpretable machine learning method that bases decisions on few relevant examples that we call prototypes.

Decision Making, General Classification +1

Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks

no code implementations · 20 Aug 2018 · Sercan O. Arik, Heewoo Jun, Gregory Diamos

We propose the multi-head convolutional neural network (MCNN) architecture for waveform synthesis from spectrograms.

speech-recognition, Speech Recognition +1

Neural Voice Cloning with a Few Samples

2 code implementations · NeurIPS 2018 · Sercan O. Arik, Jitong Chen, Kainan Peng, Wei Ping, Yanqi Zhou

Speaker adaptation is based on fine-tuning a multi-speaker generative model with a few cloning samples.

Speech Synthesis, Voice Cloning
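The abstract describes speaker adaptation as fine-tuning a multi-speaker generative model with a few cloning samples. A generic sketch of one common variant, freezing the shared backbone and optimizing only a new speaker embedding, is below; the module shapes, names, and loss are placeholders, not the paper's model:

```python
import torch

class MultiSpeakerSynthesizer(torch.nn.Module):
    """Placeholder multi-speaker synthesizer conditioned on a speaker vector."""

    def __init__(self, num_speakers, emb_dim=64):
        super().__init__()
        # Embedding table for the pretrained speakers (unused while adapting a new voice).
        self.speaker_emb = torch.nn.Embedding(num_speakers, emb_dim)
        self.backbone = torch.nn.GRU(emb_dim + 32, 128, batch_first=True)
        self.to_audio = torch.nn.Linear(128, 80)              # e.g. mel-spectrogram frames

    def forward(self, text_feats, speaker_vec):
        cond = speaker_vec.unsqueeze(1).expand(-1, text_feats.size(1), -1)
        out, _ = self.backbone(torch.cat([text_feats, cond], dim=-1))
        return self.to_audio(out)

model = MultiSpeakerSynthesizer(num_speakers=100)             # pretend this is pretrained
for p in model.parameters():
    p.requires_grad_(False)                                   # freeze shared weights

new_speaker = torch.nn.Parameter(torch.zeros(1, 64))          # embedding for the new voice
opt = torch.optim.Adam([new_speaker], lr=1e-2)
text_feats = torch.randn(1, 50, 32)                           # a few cloning samples (toy)
target_mel = torch.randn(1, 50, 80)
for _ in range(100):
    opt.zero_grad()
    loss = torch.nn.functional.l1_loss(model(text_feats, new_speaker), target_mel)
    loss.backward()
    opt.step()
```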
