no code implementations • IJCNLP 2019 • Xinyu Xiao, Lingfeng Wang, Bin Fan, Shinming Xiang, Chunhong Pan
To address these problems, we propose an Adaptive Semantic Guidance Network (ASGN), which instantiates the whole video semantics to different POS-aware semantics with the supervision of part of speech (POS) tag.