Search Results for author: Masanori Suganuma

Found 24 papers, 12 papers with code

SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers

1 code implementation • 7 Nov 2023 • Xiangyong Lu, Masanori Suganuma, Takayuki Okatani

For the first time, it achieves an ImageNet-1K top-1 accuracy of around 80% at a speed of 1. 0 frame/sec on the SBC.

Paper
Code

Visual Abductive Reasoning Meets Driving Hazard Prediction

1 code implementation • 7 Oct 2023 • Korawat Charoenpitaks, Van-Quang Nguyen, Masanori Suganuma, Masahiro Takahashi, Ryoma Niihara, Takayuki Okatani

To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created.

Anomaly Detection Visual Abductive Reasoning

Paper
Code

That's BAD: Blind Anomaly Detection by Implicit Local Feature Clustering

no code implementations • 6 Jul 2023 • Jie Zhang, Masanori Suganuma, Takayuki Okatani

They consider an unsupervised setting, specifically the one-class setting, in which we assume the availability of a set of normal (\textit{i. e.}, anomaly-free) images for training.

Anomaly Detection Clustering +1

Paper
Add Code

Contextual Affinity Distillation for Image Anomaly Detection

no code implementations • 6 Jul 2023 • Jie Zhang, Masanori Suganuma, Takayuki Okatani

The local student, which is used in previous studies mainly focuses on structural anomaly detection while the global student pays attention to logical anomalies.

Ranked #13 on Anomaly Detection on MVTec LOCO AD

Anomaly Detection Knowledge Distillation

Paper
Add Code

Reference-based Motion Blur Removal: Learning to Utilize Sharpness in the Reference Image

no code implementations • 6 Jul 2023 • Han Zou, Masanori Suganuma, Takayuki Okatani

We can utilize an alternative shot of the identical scene, just like in video deblurring, or we can even employ a distinct image from another scene.

Deblurring Image Deblurring

Paper
Add Code

RefVSR++: Exploiting Reference Inputs for Reference-based Video Super-resolution

no code implementations • 6 Jul 2023 • Han Zou, Masanori Suganuma, Takayuki Okatani

Then, we propose an improved method, RefVSR++, which can aggregate two features in parallel in the temporal direction, one for aggregating the fused LR and Ref inputs and the other for Ref inputs over time.

Reference-based Video Super-Resolution Video Super-Resolution

Paper
Add Code

GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features

2 code implementations • 20 Jul 2022 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

Current state-of-the-art methods for image captioning employ region-based features, as they provide object-level information that is essential to describe the content of images; they are usually extracted by an object detector such as Faster R-CNN.

Ranked #8 on Image Captioning on nocaps in-domain

Image Captioning

171

Paper
Code

Rectifying Open-set Object Detection: A Taxonomy, Practical Applications, and Proper Evaluation

no code implementations • 20 Jul 2022 • Yusuke Hosoya, Masanori Suganuma, Takayuki Okatani

In this paper, we first point out that the recent studies' formalization of OSOD, which generalizes open-set recognition (OSR) and thus considers an unlimited variety of unknown objects, has a fundamental issue.

Image Classification Object +3

Paper
Add Code

Single-image Defocus Deblurring by Integration of Defocus Map Prediction Tracing the Inverse Problem Computation

no code implementations • 7 Jul 2022 • Qian Ye, Masanori Suganuma, Takayuki Okatani

Considering the spatial variant property of the defocus blur and the blur level indicated in the defocus map, we employ the defocus map as conditional guidance to adjust the features from the input blurring images instead of simple concatenation.

Deblurring Image Deblurring +1

Paper
Add Code

Learning Regularized Multi-Scale Feature Flow for High Dynamic Range Imaging

no code implementations • 6 Jul 2022 • Qian Ye, Masanori Suganuma, Jun Xiao, Takayuki Okatani

Reconstructing ghosting-free high dynamic range (HDR) images of dynamic scenes from a set of multi-exposure images is a challenging task, especially with large object motion and occlusions, leading to visible artifacts using existing methods.

Vocal Bursts Intensity Prediction

Paper
Add Code

Rethinking Unsupervised Domain Adaptation for Semantic Segmentation

2 code implementations • 30 Jun 2022 • Zhijie Wang, Masanori Suganuma, Takayuki Okatani

Due to its high annotation cost, researchers have developed many UDA methods for semantic segmentation, which assume no labeled sample is available in the target domain.

Semantic Segmentation Unsupervised Domain Adaptation

268

Paper
Code

Improved Few-shot Segmentation by Redefinition of the Roles of Multi-level CNN Features

no code implementations • 14 Sep 2021 • Zhijie Wang, Masanori Suganuma, Takayuki Okatani

This study is concerned with few-shot segmentation, i. e., segmenting the region of an unseen object class in a query image, given support image(s) of its instances.

Paper
Add Code

Cross-Region Domain Adaptation for Class-level Alignment

no code implementations • 14 Sep 2021 • Zhijie Wang, Xing Liu, Masanori Suganuma, Takayuki Okatani

To cope with this, we propose a method that applies adversarial training to align two feature distributions in the target domain.

Ranked #1 on Domain Adaptation on Synscapes-to-Cityscapes

Semantic Segmentation Synthetic-to-Real Translation +1

Paper
Add Code

Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes

no code implementations • ICCV 2021 • Wenzheng Song, Masanori Suganuma, Xing Liu, Noriyuki Shimobayashi, Daisuke Maruta, Takayuki Okatani

To consider if and how well we can utilize such information stored in RAW-format images for image matching, we have created a new dataset named MID (matching in the dark).

Paper
Add Code

Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

1 code implementation • 1 Jun 2021 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It then integrates the prediction with the visual information etc., yielding the final prediction of an action and an object.

Instruction Following

Paper
Code

How Can CNNs Use Image Position for Segmentation?

no code implementations • 7 May 2020 • Rito Murase, Masanori Suganuma, Takayuki Okatani

We draw a mixed conclusion from the experimental results; the positional encoding certainly works in some cases, but the absolute image position may not be so important for segmentation tasks as we think.

Image Segmentation Medical Image Segmentation +4

Paper
Add Code

Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs

1 code implementation • ECCV 2020 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It has been a primary concern in recent studies of vision and language tasks to design an effective attention mechanism dealing with interactions between the two modalities.

Ranked #7 on Visual Dialog on Visual Dialog v1.0 test-std

Visual Dialog

Paper
Code

Analysis and a Solution of Momentarily Missed Detection for Anchor-based Object Detectors

no code implementations • 21 Oct 2019 • Yusuke Hosoya, Masanori Suganuma, Takayuki Okatani

The employment of convolutional neural networks has led to significant performance improvement on the task of object detection.

object-detection Object Detection

Paper
Add Code

Restoring Images with Unknown Degradation Factors by Recurrent Use of a Multi-branch Network

1 code implementation • 10 Jul 2019 • Xing Liu, Masanori Suganuma, Xiyang Luo, Takayuki Okatani

The employment of convolutional neural networks has achieved unprecedented performance in the task of image restoration for a variety of degradation factors.

Deblurring JPEG Artifact Removal +1

Paper
Code

Hyperparameter-Free Out-of-Distribution Detection Using Softmax of Scaled Cosine Similarity

1 code implementation • 25 May 2019 • Engkarat Techapanurak, Masanori Suganuma, Takayuki Okatani

The ability to detect out-of-distribution (OOD) samples is vital to secure the reliability of deep neural networks in real-world applications.

Metric Learning Out-of-Distribution Detection

Paper
Code

Dual Residual Networks Leveraging the Potential of Paired Operations for Image Restoration

1 code implementation • CVPR 2019 • Xing Liu, Masanori Suganuma, Zhun Sun, Takayuki Okatani

In this paper, we study design of deep neural networks for tasks of image restoration.

Image Restoration

152

Paper
Code

Attention-based Adaptive Selection of Operations for Image Restoration in the Presence of Unknown Combined Distortions

1 code implementation • CVPR 2019 • Masanori Suganuma, Xing Liu, Takayuki Okatani

There are many different types of distortion which affect image quality.

Image Restoration

Paper
Code

Exploiting the Potential of Standard Convolutional Autoencoders for Image Restoration by Evolutionary Search

1 code implementation • ICML 2018 • Masanori Suganuma, Mete Ozay, Takayuki Okatani

Researchers have applied deep neural networks to image restoration tasks, in which they proposed various network architectures, loss functions, and training methods.

Image Restoration

Paper
Code

A Genetic Programming Approach to Designing Convolutional Neural Network Architectures

5 code implementations • 3 Apr 2017 • Masanori Suganuma, Shinichi Shirakawa, Tomoharu Nagao

To evaluate the proposed method, we constructed a CNN architecture for the image classification task with the CIFAR-10 dataset.

General Classification Image Classification +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.