Search Results for author: Elman Mansimov

Found 20 papers, 9 papers with code

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

no code implementations • 10 Jan 2024 • Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Justin Sun, Xibin Gao, Yi Zhang

This metric is used to filter the generated conversational data that is fed back into the LLM for training.

Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

1 code implementation • 24 May 2023 • Mujeen Sung, James Gung, Elman Mansimov, Nikolaos Pappas, Raphael Shu, Salvatore Romeo, Yi Zhang, Vittorio Castelli

Intent classification (IC) plays an important role in task-oriented dialogue systems.

Contrastive Learning intent-classification +2

Conversation Style Transfer using Few-Shot Learning

no code implementations • 16 Feb 2023 • Shamik Roy, Raphael Shu, Nikolaos Pappas, Elman Mansimov, Yi Zhang, Saab Mansour, Dan Roth

Conventional text style transfer approaches focus on sentence-level style transfer without considering contextual information, and the style is described with attributes (e.g., formality).

Few-Shot Learning In-Context Learning +5

Improving Prediction Backward-Compatibility in NLP Model Upgrade with Gated Fusion

no code implementations • 4 Feb 2023 • Yi-An Lai, Elman Mansimov, Yuqing Xie, Yi Zhang

When upgrading neural models to a newer version, new errors that were not encountered in the legacy version can be introduced, known as regression errors.

regression

Backward Compatibility During Data Updates by Weight Interpolation

no code implementations • 25 Jan 2023 • Raphael Schumann, Elman Mansimov, Yi-An Lai, Nikolaos Pappas, Xibin Gao, Yi Zhang

This method interpolates between the weights of the old and new model and we show in extensive experiments that it reduces negative flips without sacrificing the improved accuracy of the new model.

regression

Dialog2API: Task-Oriented Dialogue with API Description and Example Programs

no code implementations • 20 Dec 2022 • Raphael Shu, Elman Mansimov, Tamer Alkhouli, Nikolaos Pappas, Salvatore Romeo, Arshit Gupta, Saab Mansour, Yi Zhang, Dan Roth

The conversational model interacts with the environment by generating and executing programs triggering a set of pre-defined APIs.

In-Context Learning Semantic Parsing +1

Label Semantic Aware Pre-training for Few-shot Text Classification

1 code implementation • ACL 2022 • Aaron Mueller, Jason Krone, Salvatore Romeo, Saab Mansour, Elman Mansimov, Yi Zhang, Dan Roth

Label semantic aware systems have leveraged this information for improved text classification performance during fine-tuning and prediction.

Few-Shot Text Classification Sentence +2

Measuring and Reducing Model Update Regression in Structured Prediction for NLP

no code implementations • 7 Feb 2022 • Deng Cai, Elman Mansimov, Yi-An Lai, Yixuan Su, Lei Shu, Yi Zhang

First, we measure and analyze model update regression in different model update settings.

Dependency Parsing Knowledge Distillation +4

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

2 code implementations • ACL 2022 • Yixuan Su, Lei Shu, Elman Mansimov, Arshit Gupta, Deng Cai, Yi-An Lai, Yi Zhang

Pre-trained language models have been recently shown to benefit task-oriented dialogue (TOD) systems.

Dialogue State Tracking End-To-End Dialogue Modelling +2

Semantic Parsing in Task-Oriented Dialog with Recursive Insertion-based Encoder

no code implementations • 9 Sep 2021 • Elman Mansimov, Yi Zhang

At generation time, the model constructs the semantic parse tree by recursively inserting the predicted non-terminal labels at the predicted positions until termination.

named-entity-recognition Named Entity Recognition +2

Towards End-to-End In-Image Neural Machine Translation

no code implementations • EMNLP (nlpbt) 2020 • Elman Mansimov, Mitchell Stern, Mia Chen, Orhan Firat, Jakob Uszkoreit, Puneet Jain

In this paper, we offer a preliminary investigation into the task of in-image machine translation: transforming an image containing text in one language into an image containing the same text in another language.

Machine Translation Translation

Capturing document context inside sentence-level neural machine translation models with self-training

no code implementations • CODI 2021 • Elman Mansimov, Gábor Melis, Lei Yu

Neural machine translation (NMT) has arguably achieved human-level parity when trained and evaluated at the sentence level.

Machine Translation NMT +2

A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models

1 code implementation • 29 May 2019 • Elman Mansimov, Alex Wang, Sean Welleck, Kyunghyun Cho

We investigate this problem by proposing a generalized model of sequence generation that unifies decoding in directed and undirected models.

Machine Translation Natural Language Inference +3

Molecular geometry prediction using a deep generative graph neural network

1 code implementation • 31 Mar 2019 • Elman Mansimov, Omar Mahmood, Seokho Kang, Kyunghyun Cho

Conventional conformation generation methods minimize hand-designed molecular force field energy functions that are often not well correlated with the true energy function of a molecule observed in nature.

Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement

2 code implementations • EMNLP 2018 • Jason Lee, Elman Mansimov, Kyunghyun Cho

We propose a conditional non-autoregressive neural sequence model based on iterative refinement.

Ranked #5 on Machine Translation on IWSLT2015 German-English

Caption Generation Denoising +2

Simple Nearest Neighbor Policy Method for Continuous Control Tasks

no code implementations • ICLR 2018 • Elman Mansimov, Kyunghyun Cho

As this policy does not require any optimization, it allows us to investigate the underlying difficulty of a task without being distracted by the optimization difficulty of a learning algorithm.

Continuous Control

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

8 code implementations • NeurIPS 2017 • Yuhuai Wu, Elman Mansimov, Shun Liao, Roger Grosse, Jimmy Ba

In this work, we propose to apply trust region optimization to deep reinforcement learning using a recently proposed Kronecker-factored approximation to the curvature.

Atari Games Continuous Control +2

Generating Images from Captions with Attention

2 code implementations • 9 Nov 2015 • Elman Mansimov, Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov

Motivated by the recent progress in generative models, we introduce a model that generates images from natural language descriptions.

Retrieval Text-to-Image Generation

Initialization Strategies of Spatio-Temporal Convolutional Neural Networks

no code implementations • 25 Mar 2015 • Elman Mansimov, Nitish Srivastava, Ruslan Salakhutdinov

We propose a new way of incorporating temporal information present in videos into Spatial Convolutional Neural Networks (ConvNets) trained on images, that avoids training Spatio-Temporal ConvNets from scratch.

Unsupervised Learning of Video Representations using LSTMs

10 code implementations • 16 Feb 2015 • Nitish Srivastava, Elman Mansimov, Ruslan Salakhutdinov

We further evaluate the representations by finetuning them for a supervised learning problem - human action recognition on the UCF-101 and HMDB-51 datasets.

Action Recognition Temporal Action Localization
