Search Results for author: Sadao Kurohashi

Found 159 papers, 33 papers with code

A Method for Building a Commonsense Inference Dataset based on Basic Events

no code implementations • EMNLP 2020 • Kazumasa Omura, Daisuke Kawahara, Sadao Kurohashi

We present a scalable, low-bias, and low-cost method for building a commonsense inference dataset that combines automatic extraction from a corpus and crowdsourcing.

Multiple-choice Transfer Learning

Paper
Add Code

Filtering of Noisy Web-Crawled Parallel Corpus: the Japanese-Bulgarian Language Pair

no code implementations • CLIB 2022 • Iglika Nikolova-Stoupak, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi

The corpus utilised to train machine translation models in the study is CCMatrix, provided by OPUS.

Machine Translation Translation

Paper
Add Code

Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation

no code implementations • NAACL (ACL) 2022 • Prakhar Saxena, Yin Jou Huang, Sadao Kurohashi

Each person has a unique personality which affects how they feel and convey emotions.

Ranked #19 on Emotion Recognition in Conversation on MELD

Emotion Recognition in Conversation

Paper
Add Code

Explicit Use of Topicality in Dialogue Response Generation

no code implementations • NAACL (ACL) 2022 • Takumi Yoshikoshi, Hayato Atarashi, Takashi Kodama, Sadao Kurohashi

In this study, we propose a dialogue system that responds appropriately following the topic by selecting the entity with the highest “topicality.” In topicality estimation, the model is trained through self-supervised learning that regards entities that appear in both context and response as the topic entities.

Response Generation Self-Supervised Learning

Paper
Add Code

Construction of Hierarchical Structured Knowledge-based Recommendation Dialogue Dataset and Dialogue System

no code implementations • dialdoc (ACL) 2022 • Takashi Kodama, Ribeka Tanaka, Sadao Kurohashi

We work on a recommendation dialogue system to help a user understand the appealing points of some target (e. g., a movie).

Movie Recommendation

Paper
Add Code

Meta Ensemble for Japanese-Chinese Neural Machine Translation: Kyoto-U+ECNU Participation to WAT 2020

no code implementations • AACL (WAT) 2020 • Zhuoyuan Mao, Yibin Shen, Chenhui Chu, Sadao Kurohashi, Cheqing Jin

This paper describes the Japanese-Chinese Neural Machine Translation (NMT) system submitted by the joint team of Kyoto University and East China Normal University (Kyoto-U+ECNU) to WAT 2020 (Nakazawa et al., 2020).

Denoising Machine Translation +2

Paper
Add Code

Overview of the 7th Workshop on Asian Translation

no code implementations • AACL (WAT) 2020 • Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi

This paper presents the results of the shared tasks from the 7th workshop on Asian translation (WAT2020).

Translation

Paper
Add Code

Japanese Zero Anaphora Resolution Can Benefit from Parallel Texts Through Neural Transfer Learning

no code implementations • Findings (EMNLP) 2021 • Masato Umakoshi, Yugo Murawaki, Sadao Kurohashi

Parallel texts of Japanese and a non-pro-drop language have the potential of improving the performance of Japanese zero anaphora resolution (ZAR) because pronouns dropped in the former are usually mentioned explicitly in the latter.

Cross-Lingual Transfer Language Modelling +3

Paper
Add Code

Overview of the 5th Workshop on Asian Translation

no code implementations • PACLIC 2018 • Toshiaki Nakazawa, Katsuhito Sudoh, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Sadao Kurohashi

Translation

Paper
Add Code

Improving Commonsense Contingent Reasoning by Pseudo-data and Its Application to the Related Tasks

no code implementations • COLING 2022 • Kazumasa Omura, Sadao Kurohashi

Contingent reasoning is one of the essential abilities in natural language understanding, and many language resources annotated with contingent relations have been constructed.

Natural Language Understanding Relation +1

Paper
Add Code

Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data

1 code implementation • LREC 2022 • Felix Virgo, Fei Cheng, Sadao Kurohashi

However, the amount of training data for tasks like duration question answering, i. e., McTACO, is very limited, suggesting a need for external duration information to improve this task.

Question Answering Temporal Information Extraction

Paper
Code

JaMIE: A Pipeline Japanese Medical Information Extraction System with Novel Relation Annotation

no code implementations • LREC 2022 • Fei Cheng, Shuntaro Yada, Ribeka Tanaka, Eiji Aramaki, Sadao Kurohashi

In this paper, we first propose a novel relation annotation schema for investigating the medical and temporal relations between medical entities in Japanese medical reports.

Relation Relation Extraction

Paper
Add Code

Constructing a Culinary Interview Dialogue Corpus with Video Conferencing Tool

no code implementations • LREC 2022 • Taro Okahisa, Ribeka Tanaka, Takashi Kodama, Yin Jou Huang, Sadao Kurohashi

Interview is an efficient way to elicit knowledge from experts of different domains.

Paper
Add Code

Overview of the 8th Workshop on Asian Translation

no code implementations • ACL (WAT) 2021 • Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi

This paper presents the results of the shared tasks from the 8th workshop on Asian translation (WAT2021).

Translation

Paper
Add Code

Flexible Visual Grounding

1 code implementation • ACL 2022 • Yongmin Kim, Chenhui Chu, Sadao Kurohashi

Existing visual grounding datasets are artificially made, where every query regarding an entity must be able to be grounded to a corresponding image region, i. e., answerable.

Visual Grounding

Paper
Code

Annotating a Driving Experience Corpus with Behavior and Subjectivity

no code implementations • PACLIC 2018 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Paper
Add Code

Overview of the 9th Workshop on Asian Translation

no code implementations • WAT 2022 • Toshiaki Nakazawa, Hideya Mino, Isao Goto, Raj Dabre, Shohei Higashiyama, Shantipriya Parida, Anoop Kunchukuttan, Makoto Morishita, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi

This paper presents the results of the shared tasks from the 9th workshop on Asian translation (WAT2022).

Translation

Paper
Add Code

Kyoto University MT System Description for IWSLT 2017

no code implementations • IWSLT 2017 • Raj Dabre, Fabien Cromieres, Sadao Kurohashi

We describe here our Machine Translation (MT) model and the results we obtained for the IWSLT 2017 Multilingual Shared Task.

Machine Translation NMT +1

Paper
Add Code

Improving Bridging Reference Resolution using Continuous Essentiality from Crowdsourcing

1 code implementation • COLING (CRAC) 2022 • Nobuhiro Ueda, Sadao Kurohashi

Bridging reference resolution is the task of finding nouns that complement essential information of another noun.

Paper
Code

J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

2 code implementations • 28 Mar 2024 • Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino

Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users.

Paper
Code

AcTED: Automatic Acquisition of Typical Event Duration for Semi-supervised Temporal Commonsense QA

no code implementations • 27 Mar 2024 • Felix Virgo, Fei Cheng, Lis Kanashiro Pereira, Masayuki Asahara, Ichiro Kobayashi, Sadao Kurohashi

We propose a voting-driven semi-supervised approach to automatically acquire the typical duration of an event and use it as pseudo-labeled data.

Paper
Add Code

Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese

2 code implementations • 6 Mar 2024 • Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu, Sadao Kurohashi

The creation of instruction data and evaluation benchmarks for serving Large language models often involves enormous human annotation.

Paper
Code

RecMind: Japanese Movie Recommendation Dialogue with Seeker's Internal State

no code implementations • 21 Feb 2024 • Takashi Kodama, Hirokazu Kiyomaru, Yin Jou Huang, Sadao Kurohashi

Since there are no existing annotated resources for the analysis, we constructed RecMind, a Japanese movie recommendation dialogue dataset with annotations of the seeker's internal state at the entity level.

Movie Recommendation Response Generation

Paper
Add Code

Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts

1 code implementation • 7 Nov 2023 • Haiyue Song, Raj Dabre, Chenhui Chu, Atsushi Fujita, Sadao Kurohashi

To create the parallel corpora, we propose a dynamic programming based sentence alignment algorithm which leverages the cosine similarity of machine-translated sentences.

Benchmarking Machine Translation +3

Paper
Code

Video-Helpful Multimodal Machine Translation

1 code implementation • 31 Oct 2023 • Yihang Li, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi, Wei Li

In addition to the extensive training set, EVA contains a video-helpful evaluation set in which subtitles are ambiguous, and videos are guaranteed helpful for disambiguation.

Multimodal Machine Translation Translation

Paper
Code

Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category Learning

no code implementations • Findings of the Association for Computational Linguistics 2020 • Fei Cheng, Masayuki Asahara, Ichiro Kobayashi, Sadao Kurohashi

Temporal relation classification is a pair-wise task for identifying the relation of a temporal link (TLINK) between two mentions, i. e. event, time, and document creation time (DCT).

Multi-Task Learning Relation +2

Paper
Add Code

Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise

no code implementations • 5 Oct 2023 • Zhen Wan, Yating Zhang, Yexiang Wang, Fei Cheng, Sadao Kurohashi

In the zero-shot setting of four Chinese legal tasks, our method improves accuracy by 33. 3\% compared to the direct generation by GPT-4.

Domain Adaptation

Paper
Add Code

SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation

no code implementations • 31 Jul 2023 • Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi, Eiichiro Sumita

Sub-word segmentation is an essential pre-processing step for Neural Machine Translation (NMT).

Machine Translation NMT +1

Paper
Add Code

MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting

1 code implementation • 26 May 2023 • Tatsuro Inaba, Hirokazu Kiyomaru, Fei Cheng, Sadao Kurohashi

Large language models (LLMs) have achieved impressive performance on various reasoning tasks.

Task 2

Paper
Code

Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation

no code implementations • 17 May 2023 • Zhuoyuan Mao, Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi

The language-independency of encoded representations within multilingual neural machine translation (MNMT) models is crucial for their generalization ability on zero-shot translation.

Machine Translation Translation

Paper
Add Code

Towards Speech Dialogue Translation Mediating Speakers of Different Languages

1 code implementation • 16 May 2023 • Shuichiro Shimizu, Chenhui Chu, Sheng Li, Sadao Kurohashi

We present a new task, speech dialogue translation mediating speakers of different languages.

Translation

Paper
Code

Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation

no code implementations • 16 May 2023 • Zhuoyuan Mao, Raj Dabre, Qianying Liu, Haiyue Song, Chenhui Chu, Sadao Kurohashi

This paper studies the impact of layer normalization (LayerNorm) on zero-shot translation (ZST).

Machine Translation

Paper
Add Code

SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation

1 code implementation • 15 May 2023 • Junfeng Jiang, Chengzhang Dong, Sadao Kurohashi, Akiko Aizawa

In this paper, we provide a feasible definition of dialogue segmentation points with the help of document-grounded dialogues and release a large-scale supervised dataset called SuperDialseg, containing 9, 478 dialogues based on two prevalent document-grounded dialogue corpora, and also inherit their useful dialogue-related annotations.

Segmentation

Paper
Code

Comprehensive Solution Program Centric Pretraining for Table-and-Text Hybrid Numerical Reasoning

no code implementations • 12 May 2023 • Qianying Liu, Dongsheng Yang, Wenjie Zhong, Fei Cheng, Sadao Kurohashi

Numerical reasoning over table-and-text hybrid passages, such as financial reports, poses significant challenges and has numerous potential applications.

Paper
Add Code

GPT-RE: In-context Learning for Relation Extraction using Large Language Models

1 code implementation • 3 May 2023 • Zhen Wan, Fei Cheng, Zhuoyuan Mao, Qianying Liu, Haiyue Song, Jiwei Li, Sadao Kurohashi

In spite of the potential for ground-breaking achievements offered by large language models (LLMs) (e. g., GPT-3), they still lag significantly behind fully-supervised baselines (e. g., fine-tuned BERT) in relation extraction (RE).

In-Context Learning Relation +2

Paper
Code

Textual Enhanced Contrastive Learning for Solving Math Word Problems

1 code implementation • 29 Nov 2022 • Yibin Shen, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi

Solving math word problems is the task that analyses the relation of quantities and requires an accurate understanding of contextual natural language information.

Contrastive Learning Math

Paper
Code

Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction

1 code implementation • 21 Oct 2022 • Zhen Wan, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi, Jiwei Li

Relation extraction (RE) has achieved remarkable progress with the help of pre-trained language models.

Relation Relation Extraction

Paper
Code

ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision

no code implementations • 13 Oct 2022 • Qianying Liu, Wenyu Guan, Jianhao Shen, Fei Cheng, Sadao Kurohashi

To address this problem, we propose a novel search algorithm with combinatorial strategy \textbf{ComSearch}, which can compress the search space by excluding mathematically equivalent equations.

Math

Paper
Add Code

Seeking Diverse Reasoning Logic: Controlled Equation Expression Generation for Solving Math Word Problems

1 code implementation • 21 Sep 2022 • Yibin Shen, Qianying Liu, Zhuoyuan Mao, Zhen Wan, Fei Cheng, Sadao Kurohashi

To solve Math Word Problems, human students leverage diverse reasoning logic that reaches different possible equation solutions.

Math

Paper
Code

EMS: Efficient and Effective Massively Multilingual Sentence Representation Learning

1 code implementation • 31 May 2022 • Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi

Massively multilingual sentence representation models, e. g., LASER, SBERT-distill, and LaBSE, help significantly improve cross-lingual downstream tasks.

Contrastive Learning Genre classification +4

Paper
Code

Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision

no code implementations • 18 May 2022 • Zhen Wan, Fei Cheng, Qianying Liu, Zhuoyuan Mao, Haiyue Song, Sadao Kurohashi

Contrastive pre-training on distant supervision has shown remarkable effectiveness in improving supervised relation extraction tasks.

Contrastive Learning Relation +1

Paper
Add Code

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

no code implementations • Findings (NAACL) 2022 • Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi

Meanwhile, the contrastive objective can implicitly utilize automatically learned word alignment, which has not been explored in many-to-many NMT.

Machine Translation NMT +4

Paper
Add Code

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

1 code implementation • 8 Apr 2022 • Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi

Low-resource speech recognition has been long-suffering from insufficient training data.

speech-recognition Speech Recognition

Paper
Code

Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation

1 code implementation • 20 Jan 2022 • Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi

In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English.

Low-Resource Neural Machine Translation NMT +1

Paper
Code

VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation

1 code implementation • LREC 2022 • Yihang Li, Shuichiro Shimizu, Weiqi Gu, Chenhui Chu, Sadao Kurohashi

Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations.

Multimodal Machine Translation Sentence +1

Paper
Code

Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding

no code implementations • 10 Nov 2021 • Qianying Liu, Fei Cheng, Sadao Kurohashi

Meta learning with auxiliary languages has demonstrated promising improvements for cross-lingual natural language processing.

Cross-Lingual Transfer Meta-Learning +3

Paper
Add Code

JaMIE: A Pipeline Japanese Medical Information Extraction System

1 code implementation • 8 Nov 2021 • Fei Cheng, Shuntaro Yada, Ribeka Tanaka, Eiji Aramaki, Sadao Kurohashi

We present an open-access natural language processing toolkit for Japanese medical information extraction.

Paper
Code

Video-guided Machine Translation with Spatial Hierarchical Attention Network

no code implementations • ACL 2021 • Weiqi Gu, Haiyue Song, Chenhui Chu, Sadao Kurohashi

Video-guided machine translation, as one type of multimodal machine translations, aims to engage video contents as auxiliary information to address the word sense ambiguity problem in machine translation.

Action Detection Machine Translation +2

Paper
Add Code

Contextualized and Generalized Sentence Representations by Contrastive Self-Supervised Learning: A Case Study on Discourse Relation Analysis

no code implementations • NAACL 2021 • Hirokazu Kiyomaru, Sadao Kurohashi

The model is trained to maximize the similarity between the representation of the target sentence with its context and that of the masked target sentence with the same context.

Self-Supervised Learning Sentence

Paper
Add Code

Lightweight Cross-Lingual Sentence Representation Learning

1 code implementation • ACL 2021 • Zhuoyuan Mao, Prakhar Gupta, Pei Wang, Chenhui Chu, Martin Jaggi, Sadao Kurohashi

Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks.

Contrastive Learning Document Classification +4

Paper
Code

Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model

1 code implementation • NAACL 2021 • Honai Ueoka, Yugo Murawaki, Sadao Kurohashi

With advances in neural language models, the focus of linguistic steganography has shifted from edit-based approaches to generation-based ones.

Language Modelling Linguistic steganography

Paper
Code

Extractive Summarization Considering Discourse and Coreference Relations based on Heterogeneous Graph

no code implementations • EACL 2021 • Yin Jou Huang, Sadao Kurohashi

In this paper, we propose a heterogeneous graph based model for extractive summarization that incorporates both discourse and coreference relations.

Extractive Summarization

Paper
Add Code

Modeling and Utilizing User's Internal State in Movie Recommendation Dialogue

no code implementations • 5 Dec 2020 • Takashi Kodama, Ribeka Tanaka, Sadao Kurohashi

In this paper, we model the UIS in dialogues, taking movie recommendation dialogues as examples, and construct a dialogue system that changes its response based on the UIS.

Movie Recommendation

Paper
Add Code

Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers

no code implementations • COLING 2020 • Oleksandr Harust, Yugo Murawaki, Sadao Kurohashi

We propose a novel task of native-like expression identification by contrasting texts written by native speakers and those by proficient second language speakers.

Sentence

Paper
Add Code

BERT-based Cohesion Analysis of Japanese Texts

1 code implementation • COLING 2020 • Nobuhiro Ueda, Daisuke Kawahara, Sadao Kurohashi

The meaning of natural language text is supported by cohesion among various kinds of entities, including coreference relations, predicate-argument structures, and bridging anaphora relations.

coreference-resolution

Paper
Code

Reverse Operation based Data Augmentation for Solving Math Word Problems

1 code implementation • 4 Oct 2020 • Qianying Liu, Wenyu Guan, Sujian Li, Fei Cheng, Daisuke Kawahara, Sadao Kurohashi

Automatically solving math word problems is a critical task in the field of natural language processing.

Data Augmentation Math +1

Paper
Code

Minimize Exposure Bias of Seq2Seq Models in Joint Entity and Relation Extraction

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ranran Haoran Zhang, Qianying Liu, Aysa Xuemo Fan, Heng Ji, Daojian Zeng, Fei Cheng, Daisuke Kawahara, Sadao Kurohashi

We propose a novel Sequence-to-Unordered-Multi-Tree (Seq2UMTree) model to minimize the effects of exposure bias by limiting the decoding length to three within a triplet and removing the order among triplets.

Joint Entity and Relation Extraction Relation

Paper
Code

A System for Worldwide COVID-19 Information Aggregation

no code implementations • EMNLP (NLP-COVID19) 2020 • Akiko Aizawa, Frederic Bergeron, Junjie Chen, Fei Cheng, Katsuhiko Hayashi, Kentaro Inui, Hiroyoshi Ito, Daisuke Kawahara, Masaru Kitsuregawa, Hirokazu Kiyomaru, Masaki Kobayashi, Takashi Kodama, Sadao Kurohashi, Qianying Liu, Masaki Matsubara, Yusuke Miyao, Atsuyuki Morishima, Yugo Murawaki, Kazumasa Omura, Haiyue Song, Eiichiro Sumita, Shinji Suzuki, Ribeka Tanaka, Yu Tanaka, Masashi Toyoda, Nobuhiro Ueda, Honai Ueoka, Masao Utiyama, Ying Zhong

The global pandemic of COVID-19 has made the public pay close attention to related news, covering various domains, such as sanitation, treatment, and effects on education.

Machine Translation Translation

Paper
Add Code

Pre-training via Leveraging Assisting Languages for Neural Machine Translation

no code implementations • ACL 2020 • Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi, Eiichiro Sumita

Sequence-to-sequence (S2S) pre-training using large monolingual data is known to improve performance for various S2S NLP tasks.

Machine Translation NMT +1

Paper
Add Code

Building a Japanese Typo Dataset from Wikipedia's Revision History

no code implementations • ACL 2020 • Yu Tanaka, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

User generated texts contain many typos for which correction is necessary for NLP systems to work.

Paper
Add Code

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

1 code implementation • LREC 2020 • Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi

Monolingual pre-training approaches such as MASS (MAsked Sequence to Sequence) are extremely effective in boosting NMT quality for languages with small parallel corpora.

Machine Translation NMT +2

Paper
Code

Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse Connectives

no code implementations • LREC 2020 • Yudai Kishimoto, Yugo Murawaki, Sadao Kurohashi

BERT, a neural network-based language model pre-trained on large corpora, is a breakthrough in natural language processing, significantly outperforming previous state-of-the-art models in numerous tasks.

General Classification Implicit Discourse Relation Classification +3

Paper
Add Code

Development of a Japanese Personality Dictionary based on Psychological Methods

no code implementations • LREC 2020 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

In this study, we collect personality words, using word embeddings, and construct a personality dictionary with weights for Big Five traits.

Word Embeddings

Paper
Add Code

Acquiring Social Knowledge about Personality and Driving-related Behavior

no code implementations • LREC 2020 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Using them, we automatically extracted collocations between personality descriptors and driving-related behavior from a driving behavior and subjectivity corpus (1, 803, 328 sentences after filtering) and obtained unique 5, 334 collocations.

Paper
Add Code

Towards a Versatile Medical-Annotation Guideline Feasible Without Heavy Medical Knowledge: Starting From Critical Lung Diseases

no code implementations • LREC 2020 • Shuntaro Yada, Ayami Joh, Ribeka Tanaka, Fei Cheng, Eiji Aramaki, Sadao Kurohashi

Applying natural language processing (NLP) to medical and clinical texts can bring important social benefits by mining valuable information from unstructured text.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Pre-training via Leveraging Assisting Languages and Data Selection for Neural Machine Translation

no code implementations • 23 Jan 2020 • Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi, Eiichiro Sumita

To this end, we propose to exploit monolingual corpora of other languages to complement the scarcity of monolingual corpora for the LOI.

Machine Translation NMT +1

Paper
Add Code

Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation

1 code implementation • LREC 2020 • Haiyue Song, Raj Dabre, Atsushi Fujita, Sadao Kurohashi

To address this, we examine a language independent framework for parallel corpus mining which is a quick and effective way to mine a parallel corpus from publicly available lectures at Coursera.

Benchmarking Domain Adaptation +4

Paper
Code

Emotion helps Sentiment: A Multi-task Model for Sentiment and Emotion Analysis

no code implementations • 28 Nov 2019 • Abhishek Kumar, Asif Ekbal, Daisuke Kawahra, Sadao Kurohashi

Our network also boosts the performance of emotion analysis by 5 F-score points on Stance Sentiment Emotion Corpus.

Emotion Recognition Sentiment Analysis

Paper
Add Code

Automatically Neutralizing Subjective Bias in Text

1 code implementation • 21 Nov 2019 • Reid Pryzant, Richard Diehl Martinez, Nathan Dass, Sadao Kurohashi, Dan Jurafsky, Diyi Yang

To address this issue, we introduce a novel testbed for natural language generation: automatically bringing inappropriately subjective text into a neutral point of view ("neutralizing" biased text).

Sentence Text Generation

189

Paper
Code

Machine Comprehension Improves Domain-Specific Japanese Predicate-Argument Structure Analysis

no code implementations • WS 2019 • Norio Takahashi, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

To improve the accuracy of predicate-argument structure (PAS) analysis, large-scale training data and knowledge for PAS analysis are indispensable.

Reading Comprehension

Paper
Add Code

Diversity-aware Event Prediction based on a Conditional Variational Autoencoder with Reconstruction

no code implementations • WS 2019 • Hirokazu Kiyomaru, Kazumasa Omura, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

Typical event sequences are an important class of commonsense knowledge.

Paper
Add Code

Overview of the 6th Workshop on Asian Translation

no code implementations • WS 2019 • Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Yusuke Oda, Shantipriya Parida, Ond{\v{r}}ej Bojar, Sadao Kurohashi

This paper presents the results of the shared tasks from the 6th workshop on Asian translation (WAT2019) including Ja↔En, Ja↔Zh scientific paper translation subtasks, Ja↔En, Ja↔Ko, Ja↔En patent translation subtasks, Hi↔En, My↔En, Km↔En, Ta↔En mixed domain subtasks and Ru↔Ja news commentary translation task.

Translation

Paper
Add Code

Minimally Supervised Learning of Affective Events Using Discourse Relations

no code implementations • IJCNLP 2019 • Jun Saito, Yugo Murawaki, Sadao Kurohashi

Recognizing affective events that trigger positive or negative sentiment has a wide range of natural language processing applications but remains a challenging problem mainly because the polarity of an event is not necessarily predictable from its constituent words.

Paper
Add Code

Kyoto University Participation to the WMT 2019 News Shared Task

no code implementations • WS 2019 • Fabien Cromieres, Sadao Kurohashi

We describe here the experiments we did for the the news translation shared task of WMT 2019.

Machine Translation Translation

Paper
Add Code

Applying Machine Translation to Psychology: Automatic Translation of Personality Adjectives

no code implementations • WS 2019 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Shrinking Japanese Morphological Analyzers With Neural Networks and Semi-supervised Learning

no code implementations • NAACL 2019 • Arseny Tolmachev, Daisuke Kawahara, Sadao Kurohashi

Morphological analyzers are trained on data hand-annotated with segmentation boundaries and part of speech tags.

Chinese Word Segmentation Morphological Analysis +2

Paper
Add Code

Improving Event Coreference Resolution by Learning Argument Compatibility from Unlabeled Data

no code implementations • NAACL 2019 • Yin Jou Huang, Jing Lu, Sadao Kurohashi, Vincent Ng

Argument compatibility is a linguistic condition that is frequently incorporated into modern event coreference resolution systems.

coreference-resolution Event Coreference Resolution +1

Paper
Add Code

FAQ Retrieval using Query-Question Similarity and BERT-Based Query-Answer Relevance

1 code implementation • 8 May 2019 • Wataru Sakata, Tomohide Shibata, Ribeka Tanaka, Sadao Kurohashi

On the other hand, the relevance between the query and answer can be learned by using QA pairs in a FAQ database.

Information Retrieval Question Similarity +1

Paper
Code

Juman++: A Morphological Analysis Toolkit for Scriptio Continua

1 code implementation • EMNLP 2018 • Arseny Tolmachev, Daisuke Kawahara, Sadao Kurohashi

We present a three-part toolkit for developing morphological analyzers for languages without natural word boundaries.

Art Analysis Language Modelling +2

365

Paper
Code

A Multi-task Ensemble Framework for Emotion, Sentiment and Intensity Prediction

no code implementations • 3 Aug 2018 • Md. Shad Akhtar, Deepanway Ghosal, Asif Ekbal, Pushpak Bhattacharyya, Sadao Kurohashi

In this paper, through multi-task ensemble framework we address three problems of emotion and sentiment analysis i. e. "emotion classification & intensity", "valence, arousal & dominance for emotion" and "valence & arousal} for sentiment".

Emotion Classification General Classification +1

Paper
Add Code

Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion

1 code implementation • COLING 2018 • Naoki Otani, Hirokazu Kiyomaru, Daisuke Kawahara, Sadao Kurohashi

Considerable effort has been devoted to building commonsense knowledge bases.

Knowledge Base Completion Machine Translation +1

Paper
Code

A Knowledge-Augmented Neural Network Model for Implicit Discourse Relation Classification

no code implementations • COLING 2018 • Yudai Kishimoto, Yugo Murawaki, Sadao Kurohashi

Identifying discourse relations that are not overtly marked with discourse connectives remains a challenging problem.

General Classification Implicit Discourse Relation Classification +4

Paper
Add Code

Entity-Centric Joint Modeling of Japanese Coreference Resolution and Predicate Argument Structure Analysis

no code implementations • ACL 2018 • Tomohide Shibata, Sadao Kurohashi

Our experimental results demonstrate the proposed method can improve the performance of the inter-sentential zero anaphora resolution drastically, which is a notoriously difficult task in predicate argument structure analysis.

coreference-resolution Reading Comprehension

Paper
Add Code

Neural Adversarial Training for Semi-supervised Japanese Predicate-argument Structure Analysis

no code implementations • ACL 2018 • Shuhei Kurita, Daisuke Kawahara, Sadao Kurohashi

Japanese predicate-argument structure (PAS) analysis involves zero anaphora resolution, which is notoriously difficult.

Paper
Add Code

Knowledge-enriched Two-layered Attention Network for Sentiment Analysis

no code implementations • NAACL 2018 • Abhishek Kumar, Daisuke Kawahara, Sadao Kurohashi

We propose a novel two-layered attention network based on Bidirectional Long Short-Term Memory for sentiment analysis.

Knowledge Graph Embedding Sentiment Analysis +1

Paper
Add Code

Comprehensive Annotation of Various Types of Temporal Information on the Time Axis

no code implementations • LREC 2018 • Tomohiro Sakaguchi, Daisuke Kawahara, Sadao Kurohashi

Common Sense Reasoning

Paper
Add Code

Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations

no code implementations • LREC 2018 • Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

Paper
Add Code

Overview of the 4th Workshop on Asian Translation

no code implementations • WS 2017 • Toshiaki Nakazawa, Shohei Higashiyama, Chenchen Ding, Hideya Mino, Isao Goto, Hideto Kazawa, Yusuke Oda, Graham Neubig, Sadao Kurohashi

For the WAT2017, 12 institutions participated in the shared tasks.

Machine Translation Translation

Paper
Add Code

Kyoto University Participation to WAT 2017

1 code implementation • WS 2017 • Fabien Cromieres, Raj Dabre, Toshiaki Nakazawa, Sadao Kurohashi

We describe here our approaches and results on the WAT 2017 shared translation tasks.

Language Modelling Machine Translation +1

Paper
Code

MMCR4NLP: Multilingual Multiway Corpora Repository for Natural Language Processing

1 code implementation • 3 Oct 2017 • Raj Dabre, Sadao Kurohashi

Multilinguality is gradually becoming ubiquitous in the sense that more and more researchers have successfully shown that using additional languages help improve the results in many Natural Language Processing tasks.

Machine Translation Multilingual NLP +3

Paper
Code

Automatic Extraction of High-Quality Example Sentences for Word Learning Using a Determinantal Point Process

no code implementations • WS 2017 • Arseny Tolmachev, Sadao Kurohashi

Flashcard systems are effective tools for learning words but have their limitations in teaching word usage.

Sentence Sentence Similarity

Paper
Add Code

Automatically Acquired Lexical Knowledge Improves Japanese Joint Morphological and Dependency Analysis

no code implementations • WS 2017 • Daisuke Kawahara, Yuta Hayashibe, Hajime Morita, Sadao Kurohashi

This paper presents a joint model for morphological and dependency analysis based on automatically acquired lexical knowledge.

Lemmatization Morphological Analysis +2

Paper
Add Code

Improving Shared Argument Identification in Japanese Event Knowledge Acquisition

no code implementations • WS 2017 • Yin Jou Huang, Sadao Kurohashi

Shared arguments of event knowledge encode patterns of role shifting in successive events.

Coreference Resolution Text Generation

Paper
Add Code

Neural Joint Model for Transition-based Chinese Syntactic Analysis

no code implementations • ACL 2017 • Shuhei Kurita, Daisuke Kawahara, Sadao Kurohashi

We present neural network-based joint models for Chinese word segmentation, POS tagging and dependency parsing.

Chinese Word Segmentation Dependency Parsing +4

Paper
Add Code

An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation

no code implementations • ACL 2017 • Chenhui Chu, Raj Dabre, Sadao Kurohashi

In this paper, we propose a novel domain adaptation method named {``}mixed fine tuning{''} for neural machine translation (NMT).

Domain Adaptation Machine Translation +2

Paper
Add Code

Improving Chinese Semantic Role Labeling using High-quality Surface and Deep Case Frames

no code implementations • EACL 2017 • Gongye Jin, Daisuke Kawahara, Sadao Kurohashi

To compensate the deficiency of the surface case frames, we compile deep case frames from automatic semantic roles.

Chinese Semantic Role Labeling Dependency Parsing +4

Paper
Add Code

Enabling Multi-Source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages

no code implementations • MTSummit 2017 • Raj Dabre, Fabien Cromieres, Sadao Kurohashi

In this paper, we explore a simple solution to "Multi-Source Neural Machine Translation" (MSNMT) which only relies on preprocessing a N-way multilingual corpus without modifying the Neural Machine Translation (NMT) architecture or training procedure.

Machine Translation NMT +2

Paper
Add Code

An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation

no code implementations • 12 Jan 2017 • Chenhui Chu, Raj Dabre, Sadao Kurohashi

In this paper, we propose a novel domain adaptation method named "mixed fine tuning" for neural machine translation (NMT).

Domain Adaptation Machine Translation +2

Paper
Add Code

Reading Comprehension using Entity-based Memory Network

no code implementations • 12 Dec 2016 • Xun Wang, Katsuhito Sudoh, Masaaki Nagata, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

This paper introduces a novel neural network model for question answering, the \emph{entity-based memory network}.

Question Answering Reading Comprehension

Paper
Add Code

Kyoto University Participation to WAT 2016

1 code implementation • WS 2016 • Fabien Cromieres, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

We report very good translation results, especially when using neural MT for Chinese-to-Japanese translation.

Machine Translation Spelling Correction +1

Paper
Code

Large-Scale Acquisition of Commonsense Knowledge via a Quiz Game on a Dialogue System

no code implementations • WS 2016 • Naoki Otani, Daisuke Kawahara, Sadao Kurohashi, Nobuhiro Kaji, Manabu Sassano

Commonsense knowledge is essential for fully understanding language in many situations.

Common Sense Reasoning Question Answering

Paper
Add Code

Consistent Word Segmentation, Part-of-Speech Tagging and Dependency Labelling Annotation for Chinese Language

no code implementations • COLING 2016 • Mo Shen, Wingmui Li, HyunJeong Choe, Chenhui Chu, Daisuke Kawahara, Sadao Kurohashi

In this paper, we propose a new annotation approach to Chinese word segmentation, part-of-speech (POS) tagging and dependency labelling that aims to overcome the two major issues in traditional morphology-based annotation: Inconsistency and data sparsity.

Chinese Word Segmentation Machine Translation +6

Paper
Add Code

Overview of the 3rd Workshop on Asian Translation

no code implementations • WS 2016 • Toshiaki Nakazawa, Chenchen Ding, Hideya Mino, Isao Goto, Graham Neubig, Sadao Kurohashi

For the WAT2016, 15 institutions participated in the shared tasks.

Machine Translation Translation

Paper
Add Code

SCTB: A Chinese Treebank in Scientific Domain

no code implementations • WS 2016 • Chenhui Chu, Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohashi

Treebanks are curial for natural language processing (NLP).

Chinese Word Segmentation Machine Translation +1

Paper
Add Code

IRT-based Aggregation Model of Crowdsourced Pairwise Comparison for Evaluating Machine Translations

no code implementations • EMNLP 2016 • Naoki Otani, Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohashi

Machine Translation Text Matching

Paper
Add Code

Insertion Position Selection Model for Flexible Non-Terminals in Dependency Tree-to-Tree Machine Translation

no code implementations • EMNLP 2016 • Toshiaki Nakazawa, John Richardson, Sadao Kurohashi

Machine Translation Position +2

Paper
Add Code

The Kyoto University Cross-Lingual Pronoun Translation System

no code implementations • WS 2016 • Raj Dabre, Yevgeniy Puzikov, Fabien Cromieres, Sadao Kurohashi

Lemmatization Machine Translation +2

Paper
Add Code

Cross-language Projection of Dependency Trees with Constrained Partial Parsing for Tree-to-Tree Machine Translation

no code implementations • WS 2016 • Yu Shen, Chenhui Chu, Fabien Cromieres, Sadao Kurohashi

Dependency Parsing Machine Translation +1

Paper
Add Code

Dependency Forest based Word Alignment

no code implementations • ACL 2016 • Hitoshi Otsuki, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

Machine Translation Word Alignment

Paper
Add Code

Neural Network-Based Model for Japanese Predicate Argument Structure Analysis

no code implementations • ACL 2016 • Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

Machine Translation Semantic Role Labeling

Paper
Add Code

Supervised Syntax-based Alignment between English Sentences and Abstract Meaning Representation Graphs

no code implementations • 7 Jun 2016 • Chenhui Chu, Sadao Kurohashi

As alignment links are not given between English sentences and Abstract Meaning Representation (AMR) graphs in the AMR annotation, automatic alignment becomes indispensable for training an AMR parser.

AMR Parsing

Paper
Add Code

Flexible Non-Terminals for Dependency Tree-to-Tree Reordering

no code implementations • NAACL 2016 • John Richardson, Fabien Cromi{\`e}res, Toshiaki Nakazawa, Sadao Kurohashi

Paper
Add Code

Design of Word Association Games using Dialog Systems for Acquisition of Word Association Knowledge

no code implementations • WS 2016 • Yuichiro Machida, Daisuke Kawahara, Sadao Kurohashi, Manabu Sassano

Paper
Add Code

M2L at SemEval-2016 Task 8: AMR Parsing with Neural Networks

no code implementations • SEMEVAL 2016 • Yevgeniy Puzikov, Daisuke Kawahara, Sadao Kurohashi

AMR Parsing Transition-Based Dependency Parsing

Paper
Add Code

Parallel Sentence Extraction from Comparable Corpora with Neural Network Features

no code implementations • LREC 2016 • Chenhui Chu, Raj Dabre, Sadao Kurohashi

Parallel corpora are crucial for machine translation (MT), however they are quite scarce for most language pairs and domains.

Machine Translation Sentence +1

Paper
Add Code

Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons

no code implementations • LREC 2016 • Antoine Bourlon, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

Sentence alignment is a task that consists in aligning the parallel sentences in a translated article pair.

Boundary Detection Machine Translation +2

Paper
Add Code

Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation

no code implementations • LREC 2016 • Chenhui Chu, Sadao Kurohashi

Out-of-vocabulary (OOV) word is a crucial problem in statistical machine translation (SMT) with low resources.

Machine Translation Translation +1

Paper
Add Code

ASPEC: Asian Scientific Paper Excerpt Corpus

no code implementations • LREC 2016 • Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi, Hitoshi Isahara

In this paper, we describe the details of the ASPEC (Asian Scientific Paper Excerpt Corpus), which is the first large-size parallel corpus of scientific paper domain.

Machine Translation Translation

Paper
Add Code

Proceedings of the 2nd Workshop on Asian Translation (WAT2015)

no code implementations • WS 2015 • Toshiaki Nakazawa, Hideya Mino, Isao Goto, Graham Neubig, Sadao Kurohashi, Eiichiro Sumita

Translation

Paper
Add Code

KyotoEBMT System Description for the 2nd Workshop on Asian Translation

no code implementations • WS 2015 • John Richardson, Raj Dabre, Chenhui Chu, Fabien Cromi{\`e}res, Toshiaki Nakazawa, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Overview of the 2nd Workshop on Asian Translation

no code implementations • WS 2015 • Toshiaki Nakazawa, Hideya Mino, Isao Goto, Graham Neubig, Sadao Kurohashi, Eiichiro Sumita

Machine Translation Translation

Paper
Add Code

Pivot-Based Topic Models for Low-Resource Lexicon Extraction

no code implementations • PACLIC 2015 • John Richardson, Toshiaki Nakazawa, Sadao Kurohashi

Semantic Textual Similarity Topic Models

Paper
Add Code

Large-scale Dictionary Construction via Pivot-based Statistical Machine Translation with Significance Pruning and Neural Network Features

no code implementations • PACLIC 2015 • Raj Dabre, Chenhui Chu, Fabien Cromieres, Toshiaki Nakazawa, Sadao Kurohashi

Language Modelling Machine Translation +1

Paper
Add Code

Cross-language Projection of Dependency Trees for Tree-to-tree Machine Translation

no code implementations • PACLIC 2015 • Yu Shen, Chenhui Chu, Fabien Cromieres, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Morphological Analysis for Unsegmented Languages using Recurrent Neural Network Language Model

no code implementations • EMNLP 2015 • Hajime Morita, Daisuke Kawahara, Sadao Kurohashi

Language Modelling Lemmatization +3

Paper
Add Code

Chinese Semantic Role Labeling using High-quality Syntactic Knowledge

no code implementations • WS 2015 • Gongye Jin, Daisuke Kawahara, Sadao Kurohashi

Chinese Semantic Role Labeling Dependency Parsing +4

Paper
Add Code

Location Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency

no code implementations • WS 2015 • Takashi Awamura, Daisuke Kawahara, Eiji Aramaki, Tomohide Shibata, Sadao Kurohashi

Word Sense Disambiguation

Paper
Add Code

Classification and Acquisition of Contradictory Event Pairs using Crowdsourcing

no code implementations • WS 2015 • Yu Takabatake, Hajime Morita, Daisuke Kawahara, Sadao Kurohashi, Ryuichiro Higashinaka, Yoshihiro Matsuo

Classification General Classification +1

Paper
Add Code

Leveraging Small Multilingual Corpora for SMT Using Many Pivot Languages

no code implementations • HLT 2015 • Sadao Kurohashi, Pushpak Bhattacharyya, Raj Dabre, Fabien Cromieres

Machine Translation Translation

Paper
Add Code

Improving Statistical Machine Translation Accuracy Using Bilingual Lexicon Extractionwith Paraphrases

no code implementations • PACLIC 2014 • Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

KyotoEBMT System Description for the 1st Workshop on Asian Translation

no code implementations • WS 2014 • John Richardson, Fabien Cromi{\`e}res, Toshiaki Nakazawa, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Overview of the 1st Workshop on Asian Translation

no code implementations • WS 2014 • Toshiaki Nakazawa, Hideya Mino, Isao Goto, Sadao Kurohashi, Eiichiro Sumita

Machine Translation Translation

Paper
Add Code

Translation Rules with Right-Hand Side Lattices

no code implementations • EMNLP 2014 • Fabien Cromi{\`e}res, Sadao Kurohashi

Language Modelling Machine Translation +1

Paper
Add Code

Rapid Development of a Corpus with Discourse Annotations using Two-stage Crowdsourcing

no code implementations • COLING 2014 • Daisuke Kawahara, Yuichiro Machida, Tomohide Shibata, Sadao Kurohashi, Hayato Kobayashi, Manabu Sassano

Paper
Add Code

Chinese Morphological Analysis with Character-level POS Tagging

no code implementations • ACL 2014 • Mo Shen, Hongxiao Liu, Daisuke Kawahara, Sadao Kurohashi

Chinese Word Segmentation Morphological Analysis +3

Paper
Add Code

KyotoEBMT: An Example-Based Dependency-to-Dependency Translation Framework

no code implementations • ACL 2014 • John Richardson, Fabien Cromi{\`e}res, Toshiaki Nakazawa, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations

no code implementations • LREC 2014 • Tomoko Izumi, Tomohide Shibata, Hisako Asano, Yoshihiro Matsuo, Sadao Kurohashi

We construct a large corpus of Japanese predicate phrases for synonym-antonym relations.

Binary Classification General Classification +1

Paper
Add Code

Constructing a Chinese---Japanese Parallel Corpus from Wikipedia

no code implementations • LREC 2014 • Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

Using the system, we construct a Chinese―Japanese parallel corpus with more than 126k highly accurate parallel sentences from Wikipedia.

Machine Translation Sentence +1

Paper
Add Code

A Framework for Compiling High Quality Knowledge Resources From Raw Corpora

no code implementations • LREC 2014 • Gongye Jin, Daisuke Kawahara, Sadao Kurohashi

The identification of various types of relations is a necessary step to allow computers to understand natural language text.

Dependency Parsing Machine Translation +2

Paper
Add Code

A Large Scale Database of Strongly-related Events in Japanese

no code implementations • LREC 2014 • Tomohide Shibata, Shotaro Kohama, Sadao Kurohashi

This paper presents a large scale database of strongly-related events in Japanese, which has been acquired with our proposed method (Shibata and Kurohashi, 2011).

Common Sense Reasoning coreference-resolution +1