Search Results for author: Van-Quang Nguyen

Found 6 papers, 4 papers with code

KTVIC: A Vietnamese Image Captioning Dataset on the Life Domain

no code implementations • 16 Jan 2024 • Anh-Cuong Pham, Van-Quang Nguyen, Thi-Hong Vuong, Quang-Thuy Ha

Image captioning is a crucial task with applications in a wide range of domains, including healthcare and education.

Visual Abductive Reasoning Meets Driving Hazard Prediction

1 code implementation • 7 Oct 2023 • Korawat Charoenpitaks, Van-Quang Nguyen, Masanori Suganuma, Masahiro Takahashi, Ryoma Niihara, Takayuki Okatani

To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created.

Anomaly Detection, Visual Abductive Reasoning

Leveraging Video Coding Knowledge for Deep Video Enhancement

no code implementations • 27 Feb 2023 • Thong Bach, Thuong Nguyen Canh, Van-Quang Nguyen

Recent advancements in deep learning techniques have significantly improved the quality of compressed videos.

Video Compression, Video Enhancement

GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features

2 code implementations • 20 Jul 2022 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

Current state-of-the-art methods for image captioning employ region-based features, as they provide object-level information that is essential to describe the content of images; they are usually extracted by an object detector such as Faster R-CNN.

Ranked #8 on Image Captioning on nocaps in-domain

Image Captioning

Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

1 code implementation • 1 Jun 2021 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It then integrates the prediction with the visual information etc., yielding the final prediction of an action and an object.

Instruction Following

Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs

1 code implementation • ECCV 2020 • Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

It has been a primary concern in recent studies of vision and language tasks to design an effective attention mechanism dealing with interactions between the two modalities.

Ranked #7 on Visual Dialog on Visual Dialog v1.0 test-std

Visual Dialog
