no code implementations • 26 Feb 2024 • Daichi Haraguchi, Brian Kenji Iwana, Seiichi Uchida
In the experiment, we found that semantic information is sufficient to determine the genre; however, text design is helpful in adding more discriminative features for book genres.
1 code implementation • 13 Sep 2023 • Shinnosuke Matsuo, Xiaomeng Wu, Gantugs Atarsaikhan, Akisato Kimura, Kunio Kashino, Brian Kenji Iwana, Seiichi Uchida
Unlike other learnable models using DTW for warping, our model predicts all local correspondences between two time series and is trained based on metric learning, which enables it to learn the optimal data-dependent warping for the target task.
1 code implementation • ICCV 2023 • Wei Pan, Anna Zhu, Xinyu Zhou, Brian Kenji Iwana, Shilin Li
To better capture the local styles, a cross-attention-based style transfer module is adopted to transfer the styles of reference glyphs to the components, where the components are self-learned discrete latent codes through vector quantization without manual definition.
1 code implementation • 16 Jun 2023 • Guangtao Lyu, Kun Liu, Anna Zhu, Seiichi Uchida, Brian Kenji Iwana
To tackle these problems, we propose a novel Feature Erasing and Transferring (FET) mechanism to reconfigure the encoded features for STR in this paper.
no code implementations • 27 Apr 2023 • Yusuke Nagata, Brian Kenji Iwana, Seiichi Uchida
We propose a Transformer-based method to solve this problem and show the results of the typeface contour completion.
1 code implementation • 27 Apr 2023 • Brian Kenji Iwana, Akihiro Kusuda
Transformers are popular neural network models that use layers of self-attention and fully-connected nodes with embedded tokens.
1 code implementation • 13 Dec 2022 • Brian Kenji Iwana
We propose a novel method of normalizing the lengths of the time series in a dataset by exploiting the dynamic matching ability of Dynamic Time Warping (DTW).
1 code implementation • 5 Nov 2021 • Daisuke Oba, Shinnosuke Matsuo, Brian Kenji Iwana
We propose a neural network that dynamically selects the best combination of data augmentation methods using a mutually beneficial gating network and a feature consistency loss.
no code implementations • 29 Jun 2021 • Kaigen Tsuji, Seiichi Uchida, Brian Kenji Iwana
In this paper, we attempt to specifically find the trends in font usage using robust regression on a large collection of text images.
1 code implementation • 24 May 2021 • Wensheng Zhang, Yan Zheng, Taiga Miyazono, Seiichi Uchida, Brian Kenji Iwana
Book covers are intentionally designed and provide an introduction to a book.
1 code implementation • 19 May 2021 • Taiga Miyazono, Brian Kenji Iwana, Daichi Haraguchi, Seiichi Uchida
We propose an end-to-end neural network that inputs the book cover, a target location mask, and a desired book title and outputs stylized text suitable for the cover.
Ranked #1 on Font Generation on Book Cover Dataset
1 code implementation • 28 Mar 2021 • Shinnosuke Matsuo, Xiaomeng Wu, Gantugs Atarsaikhan, Akisato Kimura, Kunio Kashino, Brian Kenji Iwana, Seiichi Uchida
This approach adapts a parameterized attention model to time warping for greater and more adaptive temporal invariance.
no code implementations • 8 Mar 2021 • Shinnosuke Matsuo, Seiichi Uchida, Brian Kenji Iwana
To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding.
no code implementations • 23 Sep 2020 • Keisuke Kanda, Brian Kenji Iwana, Seiichi Uchida
In this study, we use a reinforcement learning (RL) framework to realize handwriting generation with the careful future planning ability.
1 code implementation • 31 Jul 2020 • Brian Kenji Iwana, Seiichi Uchida
In this paper, we survey data augmentation techniques for time series and their application to time series classification with neural networks.
no code implementations • ECCV 2020 • Hiroki Tokunaga, Brian Kenji Iwana, Yuki Teramoto, Akihiko Yoshizawa, Ryoma Bise
We propose a weakly-supervised cell tracking method that can train a convolutional neural network (CNN) by using only the annotation of "cell detection" (i. e., the coordinates of cell positions) without association information, in which cell positions can be easily obtained by nuclear staining.
no code implementations • 22 Jun 2020 • Kohei Baba, Seiichi Uchida, Brian Kenji Iwana
The purpose of this paper is to reveal the ability that Convolutional Neural Networks (CNN) have on the novel task of image-to-image language conversion.
1 code implementation • PLOS ONE 2020 • Gantugs Atarsaikhan, Brian Kenji Iwana, Seiichi Uchida
Designing logos, typefaces, and other decorated shapes can require professional skills.
2 code implementations • 19 Apr 2020 • Brian Kenji Iwana, Seiichi Uchida
In order to address this problem, we propose a novel time series data augmentation called guided warping.
no code implementations • 18 Apr 2020 • Masaya Ikoma, Brian Kenji Iwana, Seiichi Uchida
In natural scenes and documents, we can find the correlation between a text and its color.
1 code implementation • 24 Jan 2020 • Daichi Haraguchi, Shota Harada, Brian Kenji Iwana, Yuto Shinahara, Seiichi Uchida
Moreover, we analyzed the relationship between character classes and font identification accuracy.
no code implementations • 21 Jan 2020 • Gantugs Atarsaikhan, Brian Kenji Iwana, Seiichi Uchida
In our proposed method, the difference of font styles between two different fonts is found and transferred to another font using neural style transfer.
1 code implementation • 6 Aug 2019 • Brian Kenji Iwana, Ryohei Kuroki, Seiichi Uchida
Through qualitative and quantitative analysis, we demonstrate that SGLRP can successfully localize and attribute the regions on input images which contribute to a target object's classification.
no code implementations • 14 Jun 2019 • Taichi Sumi, Brian Kenji Iwana, Hideaki Hayashi, Seiichi Uchida
This research attempts to construct a network that can convert online and offline handwritten characters to each other.
1 code implementation • 26 May 2019 • Kumar Shridhar, Joonho Lee, Hideaki Hayashi, Purvanshi Mehta, Brian Kenji Iwana, Seokjun Kang, Seiichi Uchida, Sheraz Ahmed, Andreas Dengel
We show that ProbAct increases the classification accuracy by +2-3% compared to ReLU or other conventional activation functions on both original datasets and when datasets are reduced to 50% and 25% of the original size.
no code implementations • 25 Aug 2018 • Shailza Jolly, Brian Kenji Iwana, Ryohei Kuroki, Seiichi Uchida
We use LRP to explain the pixel-wise contributions of book cover design and highlight the design elements contributing towards particular genres.
1 code implementation • 2 Mar 2018 • Gantugs Atarsaikhan, Brian Kenji Iwana, Seiichi Uchida
We propose using neural style transfer with clip art and text for the creation of new and genuine logos.
no code implementations • 18 Dec 2017 • Brian Kenji Iwana, Seiichi Uchida
In this paper, we propose a method of improving temporal Convolutional Neural Networks (CNN) by determining the optimal alignment of weights and inputs using dynamic programming.
no code implementations • 25 Dec 2016 • Jinho Lee, Brian Kenji Iwana, Shouta Ide, Seiichi Uchida
Thus, we propose a new and robust tracking method using a Fully Convolutional Network (FCN) to obtain an object probability map and Dynamic Programming (DP) to seek the globally optimal path through all frames of video.
4 code implementations • 28 Oct 2016 • Brian Kenji Iwana, Syed Tahseen Raza Rizvi, Sheraz Ahmed, Andreas Dengel, Seiichi Uchida
Book covers communicate information to potential readers, but can that same information be learned by computers?
Ranked #1 on Genre classification on Book Cover Dataset