Search Results for author: Van-Quang Nguyen

Found 6 papers, 4 papers with code

KTVIC: A Vietnamese Image Captioning Dataset on the Life Domain

no code implementations 16 Jan 2024 Anh-Cuong Pham, Van-Quang Nguyen, Thi-Hong Vuong, Quang-Thuy Ha

Image captioning is a crucial task with applications in a wide range of domains, including healthcare and education.

Vietnamese Image Captioning

Visual Abductive Reasoning Meets Driving Hazard Prediction

1 code implementation 7 Oct 2023 Korawat Charoenpitaks, Van-Quang Nguyen, Masanori Suganuma, Masahiro Takahashi, Ryoma Niihara, Takayuki Okatani

To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created.

Anomaly Detection, Visual Abductive Reasoning

Leveraging Video Coding Knowledge for Deep Video Enhancement

no code implementations 27 Feb 2023 Thong Bach, Thuong Nguyen Canh, Van-Quang Nguyen

Recent advancements in deep learning techniques have significantly improved the quality of compressed videos.

Video Compression, Video Enhancement +1

GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features

2 code implementations 20 Jul 2022 Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

Current state-of-the-art methods for image captioning employ region-based features, as they provide object-level information that is essential to describe the content of images; they are usually extracted by an object detector such as Faster R-CNN.

Image Captioning

Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

1 code implementation 1 Jun 2021 Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It then integrates the prediction with the visual information etc., yielding the final prediction of an action and an object.

Instruction Following

Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs

1 code implementation ECCV 2020 Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It has been a primary concern in recent studies of vision and language tasks to design an effective attention mechanism dealing with interactions between the two modalities.

Visual Dialog
