Search Results for author: Kyosuke Nishida

Found 25 papers, 4 papers with code

Scene-Text Aware Image and Text Retrieval with Dual-Encoder

no code implementations ACL 2022 Shumpei Miyawaki, Taku Hasegawa, Kyosuke Nishida, Takuma Kato, Jun Suzuki

We tackle the tasks of image and text retrieval using a dual-encoder model in which images and text are encoded independently.

Retrieval Text Retrieval

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

1 code implementation24 Jan 2024 Ryota Tanaka, Taichi Iki, Kyosuke Nishida, Kuniko Saito, Jun Suzuki

We study the problem of completing various visual document understanding (VDU) tasks, e. g., question answering and information extraction, on real-world documents through human-written instructions.

document understanding Question Answering +1

Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

no code implementations3 Apr 2023 Tsuyoshi Baba, Kosuke Nishida, Kyosuke Nishida

Our model represents the edit direction as a normal vector in the CLIP space obtained by training a SVM to classify positive and negative images.

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

1 code implementation12 Jan 2023 Ryota Tanaka, Kyosuke Nishida, Kosuke Nishida, Taku Hasegawa, Itsumi Saito, Kuniko Saito

Visual question answering on document images that contain textual, visual, and layout information, called document VQA, has received much attention recently.

Evidence Selection Question Answering +1

Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge

no code implementations14 Oct 2022 Kosuke Nishida, Naoki Yoshinaga, Kyosuke Nishida

Although named entity recognition (NER) helps us to extract domain-specific entities from text (e. g., artists in the music domain), it is costly to create a large amount of training data or a structured knowledge base to perform accurate NER in the target domain.

named-entity-recognition Named Entity Recognition +1

Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions

no code implementations Findings (NAACL) 2022 Kosuke Nishida, Kyosuke Nishida, Shuichi Nishioka

Our proposed model, LIDE (Learning from Image and DEscription), has a text decoder to generate the descriptions and a text encoder to obtain the text representations of machine- or user-generated descriptions.

Few-Shot Image Classification

Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction

no code implementations17 Nov 2021 Kosuke Nishida, Kyosuke Nishida, Itsumi Saito, Sen Yoshida

In this study, we define an interpretable reading comprehension (IRC) model as a pipeline model with the capability of predicting unanswerable queries.

Reading Comprehension

Task-adaptive Pre-training of Language Models with Word Embedding Regularization

no code implementations Findings (ACL) 2021 Kosuke Nishida, Kyosuke Nishida, Sen Yoshida

TAPTER runs additional pre-training by making the static word embeddings of a PTLM close to the word embeddings obtained in the target domain with fastText.

Domain Adaptation Question Answering +1

VisualMRC: Machine Reading Comprehension on Document Images

1 code implementation27 Jan 2021 Ryota Tanaka, Kyosuke Nishida, Sen Yoshida

In this study, we introduce a new visual machine reading comprehension dataset, named VisualMRC, wherein given a question and a document image, a machine reads and comprehends texts in the image to answer the question in natural language.

Machine Reading Comprehension Natural Language Understanding +2

A Transformer-based Audio Captioning Model with Keyword Estimation

no code implementations1 Jul 2020 Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito

TRACKE estimates keywords, which comprise a word set corresponding to audio events/scenes in the input audio, and generates the caption while referring to the estimated keywords to reduce word-selection indeterminacy.

Acoustic Scene Classification Audio captioning +2

Abstractive Summarization with Combination of Pre-trained Sequence-to-Sequence and Saliency Models

no code implementations29 Mar 2020 Itsumi Saito, Kyosuke Nishida, Kosuke Nishida, Junji Tomita

Experimental results showed that most of the combination models outperformed a simple fine-tuned seq-to-seq model on both the CNN/DM and XSum datasets even if the seq-to-seq model is pre-trained on large-scale corpora.

Abstractive Text Summarization Text Generation

Length-controllable Abstractive Summarization by Guiding with Summary Prototype

no code implementations21 Jan 2020 Itsumi Saito, Kyosuke Nishida, Kosuke Nishida, Atsushi Otsuka, Hisako Asano, Junji Tomita, Hiroyuki Shindo, Yuji Matsumoto

Unlike the previous models, our length-controllable abstractive summarization model incorporates a word-level extractive module in the encoder-decoder model instead of length embeddings.

Abstractive Text Summarization

Multi-style Generative Reading Comprehension

no code implementations ACL 2019 Kyosuke Nishida, Itsumi Saito, Kosuke Nishida, Kazutoshi Shinoda, Atsushi Otsuka, Hisako Asano, Junji Tomita

Second, whereas previous studies built a specific model for each answer style because of the difficulty of acquiring one general model, our approach learns multi-style answers within a model to improve the NLG capability for all styles involved.

Abstractive Text Summarization Question Answering +2

Commonsense Knowledge Base Completion and Generation

no code implementations CONLL 2018 Itsumi Saito, Kyosuke Nishida, Hisako Asano, Junji Tomita

To improve the accuracy of CKB completion and expand the size of CKBs, we formulate a new commonsense knowledge base generation task (CKB generation) and propose a joint learning method that incorporates both CKB completion and CKB generation.

Knowledge Base Completion Question Answering +1

Retrieve-and-Read: Multi-task Learning of Information Retrieval and Reading Comprehension

no code implementations31 Aug 2018 Kyosuke Nishida, Itsumi Saito, Atsushi Otsuka, Hisako Asano, Junji Tomita

Previous MRS studies, in which the IR component was trained without considering answer spans, struggled to accurately find a small number of relevant passages from a large set of passages.

Information Retrieval Multi-Task Learning +2

Automatically Extracting Variant-Normalization Pairs for Japanese Text Normalization

no code implementations IJCNLP 2017 Itsumi Saito, Kyosuke Nishida, Kugatsu Sadamitsu, Kuniko Saito, Junji Tomita

Social media texts, such as tweets from Twitter, contain many types of non-standard tokens, and the number of normalization approaches for handling such noisy text has been increasing.

Machine Translation Morphological Analysis

Learning and Detecting Concept Drift

1 code implementation Graduate School of Information Science and Technology, Hokkaido University 2008 Kyosuke Nishida

When concept drift is detected, the online classifier is reinitialized to prepare for the learning of the next concept.

Cannot find the paper you are looking for? You can Submit a new open access paper.