Search Results for author: Ngoc Thang Vu

Found 104 papers, 24 papers with code

IMSurReal Too: IMS in the Surface Realization Shared Task 2020

1 code implementation • MSR (COLING) 2020 • Xiang Yu, Simon Tannert, Ngoc Thang Vu, Jonas Kuhn

We introduce the IMS contribution to the Surface Realization Shared Task 2020.

Paper
Code

Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas

no code implementations • NAACL (AmericasNLP) 2021 • Manuel Mager, Arturo Oncevay, Abteen Ebrahimi, John Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Rolando Coto-Solano, Alexis Palmer, Elisabeth Mager-Hois, Vishrav Chaudhary, Graham Neubig, Ngoc Thang Vu, Katharina Kann

This paper presents the results of the 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas.

Machine Translation Translation

Paper
Add Code

IMS’ Systems for the IWSLT 2021 Low-Resource Speech Translation Task

no code implementations • ACL (IWSLT) 2021 • Pavel Denisov, Manuel Mager, Ngoc Thang Vu

This paper describes the submission to the IWSLT 2021 Low-Resource Speech Translation Shared Task by IMS team.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

“It’s our fault!”: Insights Into Users’ Understanding and Interaction With an Explanatory Collaborative Dialog System

no code implementations • CoNLL (EMNLP) 2021 • Katharina Weitz, Lindsey Vanderlyn, Ngoc Thang Vu, Elisabeth André

We conclude that creating shared mental models between users and AI systems is important to achieving successful dialogs.

Paper
Add Code

»textklang« – Towards a Multi-Modal Exploration Platform for German Poetry

no code implementations • LREC 2022 • Nadja Schauffler, Toni Bernhart, Andre Blessing, Gunilla Eschenbach, Markus Gärtner, Kerstin Jung, Anna Kinder, Julia Koch, Sandra Richter, Gabriel Viehhauser, Ngoc Thang Vu, Lorenz Wesemann, Jonas Kuhn

We present the steps taken towards an exploration platform for a multi-modal corpus of German lyric poetry from the Romantic era developed in the project »textklang«.

Paper
Add Code

A Two-stage Model for Slot Filling in Low-resource Settings: Domain-agnostic Non-slot Reduction and Pretrained Contextual Embeddings

no code implementations • EMNLP (sustainlp) 2020 • Cennet Oguz, Ngoc Thang Vu

In this paper, we propose a novel two-stage model architecture that can be trained with only a few in-domain hand-labeled examples.

slot-filling Slot Filling +1

Paper
Add Code

“It seemed like an annoying woman”: On the Perception and Ethical Considerations of Affective Language in Text-Based Conversational Agents

no code implementations • CoNLL (EMNLP) 2021 • Lindsey Vanderlyn, Gianna Weber, Michael Neumann, Dirk Väth, Sarina Meyer, Ngoc Thang Vu

Based on statistical and qualitative analysis of the responses, we found language style played an important role in how human-like participants perceived a dialog agent as well as how likable.

Chatbot

Paper
Add Code

Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training

1 code implementation • 16 Apr 2024 • Pavel Denisov, Ngoc Thang Vu

Our zero-shot evaluation results confirm the robustness of our approach across multiple tasks, including speech translation and multilingual spoken language understanding, thereby opening new avenues for applying LLMs in the speech domain.

Paper
Code

Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering

1 code implementation • 26 Mar 2024 • Pascal Tilli, Ngoc Thang Vu

In this work, we introduce an interpretable approach for graph-based VQA and demonstrate competitive performance on the GQA dataset.

Decision Making Explainable artificial intelligence +3

Paper
Code

Towards a Zero-Data, Controllable, Adaptive Dialog System

no code implementations • 26 Mar 2024 • Dirk Väth, Lindsey Vanderlyn, Ngoc Thang Vu

Conversational Tree Search (V\"ath et al., 2023) is a recent approach to controllable dialog systems, where domain experts shape the behavior of a Reinforcement Learning agent through a dialog tree.

Language Modelling Large Language Model +1

Paper
Add Code

Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings

no code implementations • 8 Mar 2024 • Wei Zhou, Heike Adel, Hendrik Schuff, Ngoc Thang Vu

Attribution scores indicate the importance of different input parts and can, thus, explain model behaviour.

Paper
Add Code

The IMS Toucan System for the Blizzard Challenge 2023

1 code implementation • 26 Oct 2023 • Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu

For our contribution to the Blizzard Challenge 2023, we improved on the system we submitted to the Blizzard Challenge 2021.

448

Paper
Code

Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions

no code implementations • 26 Oct 2023 • Florian Lux, Pascal Tilli, Sarina Meyer, Ngoc Thang Vu

Customizing voice and speaking style in a speech synthesis system with intuitive and fine-grained controls is challenging, given that little data with appropriate labels is available.

Speech Synthesis

Paper
Add Code

Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study

no code implementations • 23 Oct 2023 • Injy Hamed, Nizar Habash, Ngoc Thang Vu

Linguistic theories and random lexical replacement prove to be effective in the lack of CSW parallel data, where both approaches achieve similar results.

Data Augmentation Machine Translation +2

Paper
Add Code

Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding

1 code implementation • 9 Oct 2023 • Pavel Denisov, Ngoc Thang Vu

A number of methods have been proposed for End-to-End Spoken Language Understanding (E2E-SLU) using pretrained models, however their evaluation often lacks multilingual setup and tasks that require prediction of lexical fillers, such as slot filling.

slot-filling Slot Filling +3

Paper
Code

Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction

no code implementations • 11 Jun 2023 • Manuel Mager, Rajat Bhatnagar, Graham Neubig, Ngoc Thang Vu, Katharina Kann

Neural models have drastically advanced state of the art for machine translation (MT) between high-resource languages.

Machine Translation Translation

Paper
Add Code

Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers

no code implementations • 31 May 2023 • Manuel Mager, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu

In recent years machine translation has become very successful for high-resource language pairs.

Machine Translation Translation

Paper
Add Code

Neighboring Words Affect Human Interpretation of Saliency Explanations

1 code implementation • 4 May 2023 • Alon Jacovi, Hendrik Schuff, Heike Adel, Ngoc Thang Vu, Yoav Goldberg

Word-level saliency explanations ("heat maps over words") are often used to communicate feature-attribution in text-based models.

Paper
Code

Modeling Speaker-Listener Interaction for Backchannel Prediction

no code implementations • 10 Apr 2023 • Daniel Ortega, Sarina Meyer, Antje Schweitzer, Ngoc Thang Vu

We present our latest findings on backchannel modeling novelly motivated by the canonical use of the minimal responses Yeah and Uh-huh in English and their correspondent tokens in German, and the effect of encoding the speaker-listener interaction.

Paper
Add Code

Oh, Jeez! or Uh-huh? A Listener-aware Backchannel Predictor on ASR Transcriptions

no code implementations • 10 Apr 2023 • Daniel Ortega, Chia-Yu Li, Ngoc Thang Vu

This paper presents our latest investigation on modeling backchannel in conversations.

Paper
Add Code

Conversational Tree Search: A New Hybrid Dialog Task

1 code implementation • 17 Mar 2023 • Dirk Väth, Lindsey Vanderlyn, Ngoc Thang Vu

Conversational interfaces provide a flexible and easy way for users to seek information that may otherwise be difficult or inconvenient to obtain.

Information Retrieval Navigate +1

Paper
Code

ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English

no code implementations • 22 Nov 2022 • Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu

We present our work on collecting ArzEn-ST, a code-switched Egyptian Arabic - English Speech Translation Corpus.

Machine Translation Translation

Paper
Add Code

Low-Resource Multilingual and Zero-Shot Multispeaker TTS

1 code implementation • 21 Oct 2022 • Florian Lux, Julia Koch, Ngoc Thang Vu

While neural methods for text-to-speech (TTS) have shown great advances in modeling multiple speakers, even in zero-shot settings, the amount of data needed for those approaches is generally not feasible for the vast majority of the world's over 6, 000 spoken languages.

Meta-Learning Voice Cloning

448

Paper
Code

Combining Contrastive and Non-Contrastive Losses for Fine-Tuning Pretrained Models in Speech Analysis

no code implementations • 21 Oct 2022 • Florian Lux, Ching-Yi Chen, Ngoc Thang Vu

This pretrained model is then finetuned to a specific task.

Emotion Classification

Paper
Add Code

Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses

no code implementations • 20 Oct 2022 • Chia-Yu Li, Ngoc Thang Vu

In this paper, we exploit the advantages from both inter-domain loss and CycleGAN to achieve better shared representation of unpaired speech and text inputs and thus improve the speech-to-text mapping.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Challenges in Explanation Quality Evaluation

no code implementations • 13 Oct 2022 • Hendrik Schuff, Heike Adel, Peng Qi, Ngoc Thang Vu

This approach assumes that explanations which reach higher proxy scores will also provide a greater benefit to human users.

Question Answering

Paper
Add Code

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

1 code implementation • 13 Oct 2022 • Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

In order to protect the privacy of speech data, speaker anonymization aims for hiding the identity of a speaker by changing the voice in speech recordings.

Generative Adversarial Network

Paper
Code

Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text

no code implementations • 11 Oct 2022 • Marwa Gaser, Manuel Mager, Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu

For extreme low-resource scenarios, a combination of frequency and morphology-based segmentations is shown to perform the best.

Machine Translation Segmentation

Paper
Add Code

The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles

no code implementations • 31 Jul 2022 • Injy Hamed, Alia El Bolock, Cornelia Herbert, Slim Abdennadher, Ngoc Thang Vu

Given that the factors giving rise to CS vary from one country to the other, as well as from one person to the other, CS is found to be a speaker-dependant behaviour, where the frequency by which the foreign language is embedded differs across speakers.

Paper
Add Code

PoeticTTS -- Controllable Poetry Reading for Literary Studies

no code implementations • 11 Jul 2022 • Julia Koch, Florian Lux, Nadja Schauffler, Toni Bernhart, Felix Dieterle, Jonas Kuhn, Sandra Richter, Gabriel Viehhauser, Ngoc Thang Vu

Speech synthesis for poetry is challenging due to specific intonation patterns inherent to poetic speech.

Speech Synthesis

Paper
Add Code

Speaker Anonymization with Phonetic Intermediate Representations

1 code implementation • 11 Jul 2022 • Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

In this work, we propose a speaker anonymization pipeline that leverages high quality automatic speech recognition and synthesis systems to generate speech conditioned on phonetic transcriptions and anonymized speaker embeddings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

2 code implementations • 24 Jun 2022 • Florian Lux, Julia Koch, Ngoc Thang Vu

The cloning of a speaker's voice using an untranscribed reference sample is one of the great advances of modern neural text-to-speech (TTS) methods.

448

Paper
Code

Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation

no code implementations • 25 May 2022 • Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu

Results show that using a predictive model results in more natural CS sentences compared to the random approach, as reported in human judgements.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Meta Learning for Natural Language Processing: A Survey

no code implementations • NAACL 2022 • Hung-Yi Lee, Shang-Wen Li, Ngoc Thang Vu

Deep learning has been the mainstream technique in natural language processing (NLP) area.

Meta-Learning

Paper
Add Code

BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages

no code implementations • Findings (ACL) 2022 • Manuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu

Morphologically-rich polysynthetic languages present a challenge for NLP systems due to data sparsity, and a common strategy to handle this issue is to apply subword segmentation.

Machine Translation Segmentation +1

Paper
Add Code

Language-Agnostic Meta-Learning for Low-Resource Text-to-Speech with Articulatory Features

1 code implementation • ACL 2022 • Florian Lux, Ngoc Thang Vu

While neural text-to-speech systems perform remarkably well in high-resource scenarios, they cannot be applied to the majority of the over 6, 000 spoken languages in the world due to a lack of appropriate training data.

Meta-Learning

448

Paper
Code

Human Interpretation of Saliency-based Explanation Over Text

1 code implementation • 27 Jan 2022 • Hendrik Schuff, Alon Jacovi, Heike Adel, Yoav Goldberg, Ngoc Thang Vu

In this work, we focus on this question through a study of saliency-based explanations over textual data.

Paper
Code

Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition

no code implementations • 19 Dec 2021 • Chia Yu Li, Ngoc Thang Vu

We investigate densely connected convolutional networks (DenseNets) and their extension with domain adversarial training for noise robust speech recognition.

Robust Speech Recognition speech-recognition

Paper
Add Code

Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching

no code implementations • 19 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu

Code-Switching (CS) is a common linguistic phenomenon in multilingual communities that consists of switching between languages while speaking.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Predicting User Code-Switching Level from Sociological and Psychological Profiles

no code implementations • 13 Dec 2021 • Injy Hamed, Alia El Bolock, Nader Rizk, Cornelia Herbert, Slim Abdennadher, Ngoc Thang Vu

Multilingual speakers tend to alternate between languages within a conversation, a phenomenon referred to as "code-switching" (CS).

Paper
Add Code

Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks

no code implementations • 12 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu

This paper presents our latest effort on improving Code-switching language models that suffer from data scarcity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Improving Speech Recognition on Noisy Speech via Speech Enhancement with Multi-Discriminators CycleGAN

no code implementations • 12 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu

This paper presents our latest investigations on improving automatic speech recognition for noisy speech via speech enhancement.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

2 code implementations • 29 Nov 2021 • Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe

However, there are few open source toolkits that can be used to generate reproducible results on different Spoken Language Understanding (SLU) benchmarks.

Spoken Language Understanding

7,851

Paper
Code

Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking

1 code implementation • EMNLP (ACL) 2021 • Dirk Väth, Pascal Tilli, Ngoc Thang Vu

On the way towards general Visual Question Answering (VQA) systems that are able to answer arbitrary questions, the need arises for evaluation beyond single-metric leaderboards for specific datasets.

Benchmarking Question Answering +1

Paper
Code

Does External Knowledge Help Explainable Natural Language Inference? Automatic Evaluation vs. Human Ratings

1 code implementation • EMNLP (BlackboxNLP) 2021 • Hendrik Schuff, Hsiu-Yu Yang, Heike Adel, Ngoc Thang Vu

For this, we investigate different sources of external knowledge and evaluate the performance of our models on in-domain data as well as on special transfer datasets that are designed to assess fine-grained reasoning capabilities.

Natural Language Inference

Paper
Code

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

no code implementations • 29 Aug 2021 • Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu

In this paper, we present our work on code-switched Egyptian Arabic-English automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Meta Learning and Its Applications to Natural Language Processing

no code implementations • ACL 2021 • Hung-Yi Lee, Ngoc Thang Vu, Shang-Wen Li

Meta-learning is one of the most important new techniques in machine learning in recent years.

Dialogue Generation Few-Shot Learning +2

Paper
Add Code

Thought Flow Nets: From Single Predictions to Trains of Model Thought

no code implementations • 26 Jul 2021 • Hendrik Schuff, Heike Adel, Ngoc Thang Vu

In addition, we conduct a qualitative analysis of thought flow correction patterns and explore how thought flow predictions affect human users within a crowdsourcing study.

Question Answering

Paper
Add Code

IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task

no code implementations • 30 Jun 2021 • Pavel Denisov, Manuel Mager, Ngoc Thang Vu

This paper describes the submission to the IWSLT 2021 Low-Resource Speech Translation Shared Task by IMS team.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages

1 code implementation • ACL 2022 • Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Meza-Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann

Continued pretraining offers improvements, with an average accuracy of 44. 05%.

Cross-Lingual Transfer Natural Language Understanding +3

Paper
Code

Few-shot Learning for Slot Tagging with Attentive Relational Network

no code implementations • EACL 2021 • Cennet Oguz, Ngoc Thang Vu

Metric-based learning is a well-known family of methods for few-shot learning, especially in computer vision.

Few-Shot Learning

Paper
Add Code

Investigations on Audiovisual Emotion Recognition in Noisy Conditions

no code implementations • 2 Mar 2021 • Michael Neumann, Ngoc Thang Vu

In this paper we explore audiovisual emotion recognition under noisy acoustic conditions with a focus on speech features.

Speech Emotion Recognition

Paper
Add Code

Meta-Learning for improving rare word recognition in end-to-end ASR

no code implementations • 25 Feb 2021 • Florian Lux, Ngoc Thang Vu

We propose a new method of generating meaningful embeddings for speech, changes to four commonly used meta learning approaches to enable them to perform keyword spotting in continuous signals and an approach of combining their outcomes into an end-to-end automatic speech recognition system to improve rare word recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

no code implementations • COLING 2020 • Daniel Grießhaber, Johannes Maucher, Ngoc Thang Vu

Recently, leveraging pre-trained Transformer based language models in down stream, task specific models has advanced state of the art results in natural language understanding tasks.

Active Learning Language Modelling +1

Paper
Add Code

Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension

no code implementations • CONLL 2020 • Ekta Sood, Simon Tannert, Diego Frassinelli, Andreas Bulling, Ngoc Thang Vu

We compare state of the art networks based on long short-term memory (LSTM), convolutional neural models (CNN) and XLNet Transformer architectures.

Machine Reading Comprehension

Paper
Add Code

F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering

1 code implementation • EMNLP 2020 • Hendrik Schuff, Heike Adel, Ngoc Thang Vu

The user study shows that our models increase the ability of the users to judge the correctness of the system and that scores like F1 are not enough to estimate the usefulness of a model in a practical setting with human users.

Model Selection Question Answering

Paper
Code

Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning

no code implementations • 3 Jul 2020 • Pavel Denisov, Ngoc Thang Vu

Spoken language understanding is typically based on pipeline architectures including speech recognition and natural language understanding steps.

Natural Language Understanding speech-recognition +2

Paper
Add Code

Ensemble Self-Training for Low-Resource Languages: Grapheme-to-Phoneme Conversion and Morphological Inflection

no code implementations • WS 2020 • Xiang Yu, Ngoc Thang Vu, Jonas Kuhn

We present an iterative data augmentation framework, which trains and searches for an optimal ensemble and simultaneously annotates new training data in a self-training style.

Data Augmentation Morphological Inflection

Paper
Add Code

Fast and Accurate Non-Projective Dependency Tree Linearization

no code implementations • ACL 2020 • Xiang Yu, Simon Tannert, Ngoc Thang Vu, Jonas Kuhn

We propose a graph-based method to tackle the dependency tree linearization task.

Traveling Salesman Problem

Paper
Add Code

ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents

1 code implementation • ACL 2020 • Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu

We present ADVISER - an open-source, multi-domain dialog system toolkit that enables the development of multi-modal (incorporating speech, text and vision), socially-engaged (e. g. emotion recognition, engagement level prediction and backchanneling) conversational agents.

BIG-bench Machine Learning Emotion Recognition

Paper
Code

ArzEn: A Speech Corpus for Code-switched Egyptian Arabic-English

no code implementations • LREC 2020 • Injy Hamed, Ngoc Thang Vu, Slim Abdennadher

In this paper, we first discuss the CS phenomenon in Egypt and the factors that gave rise to the current language.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Cairo Student Code-Switch (CSCS) Corpus: An Annotated Egyptian Arabic-English Corpus

no code implementations • LREC 2020 • Mohamed Balabel, Injy Hamed, Slim Abdennadher, Ngoc Thang Vu, {\"O}zlem {\c{C}}etino{\u{g}}lu

Code-switching has become a prevalent phenomenon across many communities.

Paper
Add Code

IMSurReal: IMS at the Surface Realization Shared Task 2019

no code implementations • WS 2019 • Xiang Yu, Agnieszka Falenska, Marina Haid, Ngoc Thang Vu, Jonas Kuhn

We introduce the IMS contribution to the Surface Realization Shared Task 2019.

Paper
Add Code

Head-First Linearization with Tree-Structured Representation

no code implementations • WS 2019 • Xiang Yu, Agnieszka Falenska, Ngoc Thang Vu, Jonas Kuhn

We present a dependency tree linearization model with two novel components: (1) a tree-structured encoder based on bidirectional Tree-LSTM that propagates information first bottom-up then top-down, which allows each token to access information from the entire tree; and (2) a linguistically motivated head-first decoder that emphasizes the central role of the head and linearizes the subtree by incrementally attaching the dependents on both sides of the head.

Paper
Add Code

Code-switching Language Modeling With Bilingual Word Embeddings: A Case Study for Egyptian Arabic-English

no code implementations • 24 Sep 2019 • Injy Hamed, Moritz Zhu, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu

Code-switching (CS) is a widespread phenomenon among bilingual and multilingual societies.

Language Modelling Word Embeddings

Paper
Add Code

To Combine or Not To Combine? A Rainbow Deep Reinforcement Learning Agent for Dialog Policies

no code implementations • WS 2019 • Dirk V{\"a}th, Ngoc Thang Vu

In this paper, we explore state-of-the-art deep reinforcement learning methods for dialog policy training such as prioritized experience replay, double deep Q-Networks, dueling network architectures and distributional learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

IMS-Speech: A Speech to Text Tool

no code implementations • 13 Aug 2019 • Pavel Denisov, Ngoc Thang Vu

We present the IMS-Speech, a web based tool for German and English speech transcription aiming to facilitate research in various disciplines which require accesses to lexical information in spoken language materials.

Ranked #4 on Speech Recognition on TUDA (using extra training data)

speech-recognition Speech Recognition

Paper
Add Code

End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning

no code implementations • 13 Aug 2019 • Pavel Denisov, Ngoc Thang Vu

This paper presents our latest investigation on end-to-end automatic speech recognition (ASR) for overlapped speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Learning the Dyck Language with Attention-based Seq2Seq Models

no code implementations • WS 2019 • Xiang Yu, Ngoc Thang Vu, Jonas Kuhn

The generalized Dyck language has been used to analyze the ability of Recurrent Neural Networks (RNNs) to learn context-free grammars (CFGs).

Paper
Add Code

ADVISER: A Dialog System Framework for Education \& Research

no code implementations • ACL 2019 • Daniel Ortega, Dirk V{\"a}th, Gianna Weber, V, Lindsey erlyn, Maximilian Schmidt, Moritz V{\"o}lkel, Zorica Karacevic, Ngoc Thang Vu

In this paper, we present ADVISER - an open source dialog system framework for education and research purposes.

Paper
Add Code

Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions

no code implementations • 28 Feb 2019 • Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu

This paper presents our latest investigations on dialog act (DA) classification on automatically generated transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning

no code implementations • WS 2018 • Xiang Yu, Ngoc Thang Vu, Jonas Kuhn

We present a general approach with reinforcement learning (RL) to approximate dynamic oracles for transition systems where exact dynamic oracles are difficult to derive.

Dependency Parsing Imitation Learning +4

Paper
Add Code

Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity

no code implementations • WS 2018 • Glorianna Jagfeld, Sabrina Jenne, Ngoc Thang Vu

We present a comparison of word-based and character-based sequence-to-sequence models for data-to-text natural language generation, which generate natural language descriptions for structured inputs.

Text Generation

Paper
Add Code

Comparing Attention-based Convolutional and Recurrent Neural Networks: Success and Limitations in Machine Reading Comprehension

1 code implementation • CONLL 2018 • Matthias Blohm, Glorianna Jagfeld, Ekta Sood, Xiang Yu, Ngoc Thang Vu

We propose a machine reading comprehension model based on the compare-aggregate framework with two-staged attention that achieves state-of-the-art results on the MovieQA question answering dataset.

Machine Reading Comprehension Question Answering

Paper
Code

Densely Connected Convolutional Networks for Speech Recognition

no code implementations • 10 Aug 2018 • Chia Yu Li, Ngoc Thang Vu

This paper presents our latest investigation on Densely Connected Convolutional Networks (DenseNets) for acoustic modelling (AM) in automatic speech recognition.

Acoustic Modelling Automatic Speech Recognition +2

Paper
Add Code

Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition

no code implementations • 30 Jul 2018 • Pavel Denisov, Ngoc Thang Vu, Marc Ferras Font

In this paper, we investigate the use of adversarial learning for unsupervised adaptation to unseen recording conditions, more specifically, single microphone far-field speech.

Robust Speech Recognition speech-recognition +1

Paper
Add Code

Low-Resource Text Classification using Domain-Adversarial Learning

no code implementations • 13 Jul 2018 • Daniel Grießhaber, Ngoc Thang Vu, Johannes Maucher

Deep learning techniques have recently shown to be successful in many natural language processing tasks forming state-of-the-art systems.

General Classification text-classification +1

Paper
Add Code

Addressing Low-Resource Scenarios with Character-aware Embeddings

no code implementations • WS 2018 • Sean Papay, Sebastian Pad{\'o}, Ngoc Thang Vu

Most modern approaches to computing word embeddings assume the availability of text corpora with billions of words.

Morphological Analysis Word Embeddings

Paper
Add Code

Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

no code implementations • 14 May 2018 • Sabrina Stehwien, Ngoc Thang Vu, Antje Schweitzer

Pitch accent detection often makes use of both acoustic and lexical features based on the fact that pitch accents tend to correlate with certain words.

Cross-corpus Word Embeddings

Paper
Add Code

Investigations on End-to-End Audiovisual Fusion

no code implementations • 30 Apr 2018 • Michael Wand, Ngoc Thang Vu, Juergen Schmidhuber

Audiovisual speech recognition (AVSR) is a method to alleviate the adverse effect of noise in the acoustic signal.

speech-recognition Speech Recognition

Paper
Add Code

Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

no code implementations • NAACL 2018 • Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

We present two novel datasets for the low-resource language Vietnamese to assess models of semantic similarity: ViCon comprises pairs of synonyms and antonyms across word classes, thus offering data to distinguish between similarity and dissimilarity.

Paper
Add Code

Lexico-acoustic Neural-based Models for Dialog Act Classification

no code implementations • 2 Mar 2018 • Daniel Ortega, Ngoc Thang Vu

We propose a neural model that processes both lexical and acoustic features for classification.

Classification Dialog Act Classification +1

Paper
Add Code

Cross-lingual and Multilingual Speech Emotion Recognition on English and French

no code implementations • 1 Mar 2018 • Michael Neumann, Ngoc Thang Vu

Research on multilingual speech emotion recognition faces the problem that most available speech corpora differ from each other in important ways, such as annotation methods or interaction scenarios.

Speech Emotion Recognition

Paper
Add Code

Syntactic and Semantic Features For Code-Switching Factored Language Models

no code implementations • 4 Oct 2017 • Heike Adel, Ngoc Thang Vu, Katrin Kirchhoff, Dominic Telaar, Tanja Schultz

The experimental results reveal that Brown word clusters, part-of-speech tags and open-class words are the most effective at reducing the perplexity of factored language models on the Mandarin-English Code-Switching corpus SEAME.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Enriching ASR Lattices with POS Tags for Dependency Parsing

no code implementations • WS 2017 • Moritz Stiefel, Ngoc Thang Vu

Parsing speech requires a richer representation than 1-best or n-best hypotheses, e. g. lattices.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Neural-based Context Representation Learning for Dialog Act Classification

no code implementations • WS 2017 • Daniel Ortega, Ngoc Thang Vu

We explore context representation learning methods in neural-based models for dialog act classification.

Classification Dialog Act Classification +2

Paper
Add Code

Improving coreference resolution with automatically predicted prosodic information

no code implementations • WS 2017 • Ina Rösiger, Sabrina Stehwien, Arndt Riester, Ngoc Thang Vu

Adding manually annotated prosodic information, specifically pitch accents and phrasing, to the typical text-based feature set for coreference resolution has previously been shown to have a positive effect on German data.

coreference-resolution

Paper
Add Code

Hierarchical Embeddings for Hypernymy Detection and Directionality

no code implementations • EMNLP 2017 • Kim Anh Nguyen, Maximilian Köper, Sabine Schulte im Walde, Ngoc Thang Vu

We present a novel neural model HyperVec to learn hierarchical embeddings for hypernymy detection and directionality.

Lexical Entailment

Paper
Add Code

Encoding Word Confusion Networks with Recurrent Neural Networks for Dialog State Tracking

no code implementations • WS 2017 • Glorianna Jagfeld, Ngoc Thang Vu

This paper presents our novel method to encode word confusion networks, which can represent a rich hypothesis space of automatic speech recognition systems, via recurrent neural networks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

A General-Purpose Tagger with Convolutional Neural Networks

1 code implementation • WS 2017 • Xiang Yu, Agnieszka Faleńska, Ngoc Thang Vu

We present a general-purpose tagger based on convolutional neural networks (CNN), used for both composing word vectors and encoding context information.

Morphological Tagging Part-Of-Speech Tagging

Paper
Code

Prosodic Event Recognition using Convolutional Neural Networks with Context Information

no code implementations • 2 Jun 2017 • Sabrina Stehwien, Ngoc Thang Vu

This paper demonstrates the potential of convolutional neural networks (CNN) for detecting and classifying prosodic events on words, specifically pitch accents and phrase boundary tones, from frame-based acoustic features.

Position

Paper
Add Code

Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech

no code implementations • 2 Jun 2017 • Michael Neumann, Ngoc Thang Vu

Speech emotion recognition is an important and challenging task in the realm of human-computer interaction.

MULTI-VIEW LEARNING Speech Emotion Recognition

Paper
Add Code

Character Composition Model with Convolutional Neural Networks for Dependency Parsing on Morphologically Rich Languages

1 code implementation • ACL 2017 • Xiang Yu, Ngoc Thang Vu

We present a transition-based dependency parser that uses a convolutional neural network to compose word representations from characters.

Dependency Parsing Word Embeddings

Paper
Code

Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

1 code implementation • EACL 2017 • Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Distinguishing between antonyms and synonyms is a key task to achieve high performance in NLP systems.

General Classification

Paper
Code

Challenges of Computational Processing of Code-Switching

no code implementations • WS 2016 • Özlem Çetinoğlu, Sarah Schulz, Ngoc Thang Vu

This paper addresses challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +7

Paper
Add Code

Neural-based Noise Filtering from Word Embeddings

1 code implementation • COLING 2016 • Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Word embeddings have been demonstrated to benefit NLP tasks impressively.

Denoising Word Embeddings

Paper
Code

Towards a text analysis system for political debates

no code implementations • WS 2016 • Dieu-Thu Le, Ngoc Thang Vu, Andre Blessing

Time Series Analysis

Paper
Add Code

Sequential Convolutional Neural Networks for Slot Filling in Spoken Language Understanding

no code implementations • 24 Jun 2016 • Ngoc Thang Vu

We investigate the usage of convolutional neural networks (CNNs) for the slot filling task in spoken language understanding.

General Classification slot-filling +2

Paper
Add Code

Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

no code implementations • ACL 2016 • Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

We propose a novel vector representation that integrates lexical contrast into distributional vectors and strengthens the most salient features for determining degrees of word similarity.

Word Embeddings Word Similarity

Paper
Add Code

Combining Recurrent and Convolutional Neural Networks for Relation Classification

no code implementations • NAACL 2016 • Ngoc Thang Vu, Heike Adel, Pankaj Gupta, Hinrich Schütze

This paper investigates two different neural architectures for the task of relation classification: convolutional neural networks and recurrent neural networks.

Classification General Classification +2