Search Results for author: Yoonho Boo

Found 8 papers, 0 papers with code

Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks

no code implementations • 30 Sep 2020 • Yoonho Boo, Sungho Shin, Jungwook Choi, Wonyong Sung

In this study, we propose stochastic precision ensemble training for QDNNs (SPEQ).

Quantized Neural Networks: Characterization and Holistic Optimization

no code implementations • 31 May 2020 • Yoonho Boo, Sungho Shin, Wonyong Sung

This study proposes a holistic approach for the optimization of QDNNs, which contains QDNN training methods as well as quantization-friendly architecture design.

Model Selection • Quantization

SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks

no code implementations • 2 Feb 2020 • Sungho Shin, Yoonho Boo, Wonyong Sung

Model averaging is a promising approach for achieving good generalization in DNNs, especially when the training loss surface contains many sharp minima.

Quantization

Knowledge distillation for optimization of quantized deep neural networks

no code implementations • 4 Sep 2019 • Sungho Shin, Yoonho Boo, Wonyong Sung

Knowledge distillation (KD) is a very popular method for model size reduction.

Knowledge Distillation

Fully Neural Network Based Speech Recognition on Mobile and Embedded Devices

no code implementations • NeurIPS 2018 • Jinhwan Park, Yoonho Boo, Iksoo Choi, Sungho Shin, Wonyong Sung

The RNN implementation on embedded devices can suffer from excessive DRAM accesses because the parameter size of a neural network usually exceeds that of the cache memory and the parameters are used only once for each time step.

Automatic Speech Recognition (ASR) +1

SVD-Softmax: Fast Softmax Approximation on Large Vocabulary Neural Networks

no code implementations • NeurIPS 2017 • Kyuhong Shim, Minjae Lee, Iksoo Choi, Yoonho Boo, Wonyong Sung

The approximate probability of each word can be estimated with only a small part of the weight matrix by using a few large singular values and the corresponding elements for most of the words.

Language Modelling • Machine Translation +1

Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations

no code implementations • 1 Jul 2017 • Yoonho Boo, Wonyong Sung

Deep neural networks (DNNs) usually demand a large number of operations for real-time inference.

Fixed-point optimization of deep neural networks with adaptive step size retraining

no code implementations • 27 Feb 2017 • Sungho Shin, Yoonho Boo, Wonyong Sung

Fixed-point optimization of deep neural networks plays an important role in hardware-based design and low-power implementations.

Quantization
