Search Results for author: Yoonho Boo

Found 8 papers, 0 papers with code

Quantized Neural Networks: Characterization and Holistic Optimization

no code implementations31 May 2020 Yoonho Boo, Sungho Shin, Wonyong Sung

This study proposes a holistic approach for the optimization of QDNNs, which contains QDNN training methods as well as quantization-friendly architecture design.

Model Selection Quantization

SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks

no code implementations2 Feb 2020 Sungho Shin, Yoonho Boo, Wonyong Sung

Model averaging is a promising approach for achieving the good generalization capability of DNNs, especially when the loss surface for training contains many sharp minima.

Quantization

Fully Neural Network Based Speech Recognition on Mobile and Embedded Devices

no code implementations NeurIPS 2018 Jinhwan Park, Yoonho Boo, Iksoo Choi, Sungho Shin, Wonyong Sung

The RNN implementation on embedded devices can suffer from excessive DRAM accesses because the parameter size of a neural network usually exceeds that of the cache memory and the parameters are used only once for each time step.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

SVD-Softmax: Fast Softmax Approximation on Large Vocabulary Neural Networks

no code implementations NeurIPS 2017 Kyuhong Shim, Minjae Lee, Iksoo Choi, Yoonho Boo, Wonyong Sung

The approximate probability of each word can be estimated with only a small part of the weight matrix by using a few large singular values and the corresponding elements for most of the words.

Language Modelling Machine Translation +1

Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations

no code implementations1 Jul 2017 Yoonho Boo, Wonyong Sung

Deep neural networks (DNNs) usually demand a large amount of operations for real-time inference.

Fixed-point optimization of deep neural networks with adaptive step size retraining

no code implementations27 Feb 2017 Sungho Shin, Yoonho Boo, Wonyong Sung

Fixed-point optimization of deep neural networks plays an important role in hardware based design and low-power implementations.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.