MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention

15 Oct 2020 Aman Khullar Udit Arora

This paper presents MAST, a new model for Multimodal Abstractive Text Summarization that utilizes information from all three modalities -- text, audio and video -- in a multimodal video. Prior work on multimodal abstractive text summarization only utilized information from the text and video modalities... (read more)

PDF Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Multimodal Abstractive Text Summarization How2 300h MAST ROUGE-L 43.23 # 1

Methods used in the Paper