Search Results for author: Bhathiya Hemanthage

Found 3 papers, 0 papers with code

Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments

no code implementations • SIGDIAL (ACL) 2022 • Alessandro Suglia, Bhathiya Hemanthage, Malvina Nikandrou, George Pantazopoulos, Amit Parekh, Arash Eshghi, Claudio Greco, Ioannis Konstas, Oliver Lemon, Verena Rieser

We demonstrate EMMA, an embodied multimodal agent which has been developed for the Alexa Prize SimBot challenge.

Conditional Text Generation

Paper
Add Code

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

no code implementations • 7 Nov 2023 • Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon, Alessandro Suglia

Interactive and embodied tasks pose at least two fundamental challenges to existing Vision & Language (VL) models, including 1) grounding language in trajectories of actions and observations, and 2) referential disambiguation.

Decoder Text Generation

Paper
Add Code

SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation

no code implementations • 10 Jul 2023 • Bhathiya Hemanthage, Christian Dondrup, Phil Bartie, Oliver Lemon

SimpleMTOD is a simple language model which recasts several sub-tasks in multimodal task-oriented dialogues as sequence prediction tasks.

coreference-resolution dialog state tracking +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.