Search Results for author: Trung-Nghia Le

Found 29 papers, 10 papers with code

Cluster-based Video Summarization with Temporal Context Awareness

1 code implementation6 Apr 2024 Hai-Dang Huynh-Lam, Ngoc-Phuong Ho-Thi, Minh-Triet Tran, Trung-Nghia Le

In this paper, we present TAC-SUM, a novel and efficient training-free approach for video summarization that addresses the limitations of existing cluster-based models by incorporating temporal context.

Clustering Unsupervised Video Summarization

Enhancing Video Summarization with Context Awareness

1 code implementation6 Apr 2024 Hai-Dang Huynh-Lam, Ngoc-Phuong Ho-Thi, Minh-Triet Tran, Trung-Nghia Le

Despite the importance of video summarization, there is a lack of diverse and representative datasets, hindering comprehensive evaluation and benchmarking of algorithms.

Benchmarking Informativeness +1

iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer

1 code implementation13 Mar 2024 Dinh-Khoi Vo, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

With a user-friendly interface, iCONTRA enables both experienced designers and novices to effortlessly explore creative design concepts and efficiently generate thematic collections.

NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images

1 code implementation12 Dec 2023 Gia-Bao Le, Van-Tien Nguyen, Trung-Nghia Le, Minh-Triet Tran

In addressing the demands of this critical task, self-supervised learning (SSL) methods have emerged as a valuable resource, leveraging their efficiency in circumventing the need for a large number of annotations, which can be both costly and time-consuming to deploy supervised methods.

Contrastive Learning Multi-class Classification +3

Multi-Branch Network for Imagery Emotion Prediction

1 code implementation12 Dec 2023 Quoc-Bao Ninh, Hai-Chan Nguyen, Triet Huynh, Trung-Nghia Le

Experimental results on EMOTIC dataset, which contains large-scale images of people in unconstrained situations labeled with 26 discrete categories of emotions and VAD values, show that our proposed method significantly outperforms state-of-the-art methods with 28. 4% in mAP and 0. 93 in MAE.

Unveiling Camouflage: A Learnable Fourier-based Augmentation for Camouflaged Object Detection and Instance Segmentation

no code implementations29 Aug 2023 Minh-Quan Le, Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Thanh-Toan Do

Camouflaged object detection (COD) and camouflaged instance segmentation (CIS) aim to recognize and segment objects that are blended into their surroundings, respectively.

Generative Adversarial Network Instance Segmentation +3

DM-VTON: Distilled Mobile Real-time Virtual Try-On

1 code implementation26 Aug 2023 Khoi-Nguyen Nguyen-Ngoc, Thanh-Tung Phan-Nguyen, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Additionally, we propose Virtual Try-on-guided Pose for Data Synthesis to address the limited pose variation observed in training images.

Human Parsing Knowledge Distillation +1

VIDES: Virtual Interior Design via Natural Language and Visual Guidance

no code implementations26 Aug 2023 Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

The system successfully captures the essence of users' descriptions while providing flexibility for customization.

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation

1 code implementation9 Mar 2023 Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran

To overcome the disadvantage of the point estimation mechanism, we propose a novel approach, dubbed MaskDiff, which models the underlying conditional distribution of a binary mask, which is conditioned on an object region and $K-$shot information.

Few-Shot Learning Instance Segmentation +1

Multilingual Communication System with Deaf Individuals Utilizing Natural and Visual Languages

no code implementations1 Dec 2022 Tuan-Luc Huynh, Khoi-Nguyen Nguyen-Ngoc, Chi-Bien Chu, Minh-Triet Tran, Trung-Nghia Le

To bridge this language barrier, we propose a novel multilingual communication system, namely MUGCAT, to improve the communication efficiency of sign language users.

Semantic Similarity Semantic Textual Similarity

Analysis of Master Vein Attacks on Finger Vein Recognition Systems

no code implementations18 Oct 2022 Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen

The results raise the alarm about the robustness of such systems and suggest that master vein attacks should be considered an important security measure.

Finger Vein Recognition

Rethinking Adversarial Examples for Location Privacy Protection

no code implementations28 Jun 2022 Trung-Nghia Le, Ta Gu, Huy H. Nguyen, Isao Echizen

We have investigated a new application of adversarial examples, namely location privacy protection against landmark recognition systems.

Image Manipulation Landmark Recognition

Robust Deepfake On Unrestricted Media: Generation And Detection

no code implementations13 Feb 2022 Trung-Nghia Le, Huy H Nguyen, Junichi Yamagishi, Isao Echizen

Recent advances in deep learning have led to substantial improvements in deepfake generation, resulting in fake media with a more realistic appearance.

DeepFake Detection Face Swapping

Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently

no code implementations29 Dec 2021 Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen

Deep neural networks are vulnerable to adversarial examples (AEs), which have adversarial transferability: AEs generated for the source model can mislead another (target) model's predictions.

OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild

no code implementations ICCV 2021 Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

To promote these new tasks, we have created the first large-scale dataset posing a high level of challenges that is designed with face-wise rich annotations explicitly for face forgery detection and segmentation, namely OpenForensics.

Face Detection Face Swapping +1

Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation

no code implementations7 Jun 2021 Trung-Nghia Le, Tam V. Nguyen, Minh-Triet Tran

For human instance, we develop skeleton-guided segmentation in a frame along with object flow to correct and refine the result across frames.

Human-Object Interaction Detection Instance Segmentation +3

Anabranch Network for Camouflaged Object Segmentation

2 code implementations Computer Vision and Image Understanding 2019 Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, Akihiro Sugimoto

Different from existing networks for segmentation, our proposed network possesses the second branch for classification to predict the probability of containing camouflaged object(s) in an image, which is then fused into the main branch for segmentation to boost up the segmentation accuracy.

Benchmarking Camouflaged Object Segmentation +3

Fashion-Guided Adversarial Attack on Person Segmentation

1 code implementation17 Apr 2021 Marc Treu, Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen

It generates adversarial textures learned from fashion style images and then overlays them on the clothing regions in the original image to make all persons in the image invisible to person segmentation networks.

Adversarial Attack Human Instance Segmentation +2

MirrorNet: Bio-Inspired Camouflaged Object Segmentation

no code implementations Pattern Recognition Journal 2020 Jinnan Yan, Trung-Nghia Le, Khanh-Duy Nguyen, Minh-Triet Tran, Thanh-Toan Do, Tam V. Nguyen

Differently from existing networks for segmentation, our proposed network possesses two segmentation streams: the main stream and the mirror stream corresponding with the original image and its flipped image, respectively.

Camouflaged Object Segmentation Camouflage Segmentation +3

Semantic Instance Meets Salient Object: Study on Video Semantic Salient Instance Segmentation

no code implementations4 Jul 2018 Trung-Nghia Le, Akihiro Sugimoto

In addition, to tackle the task of VSSIS, we augment the DAVIS-2017 benchmark dataset by assigning semantic ground-truth for salient instance labels, obtaining SEmantic Salient Instance Video (SESIV) dataset.

Instance Segmentation Robot Navigation +3

Video Salient Object Detection Using Spatiotemporal Deep Features

no code implementations4 Aug 2017 Trung-Nghia Le, Akihiro Sugimoto

STCRF is our extension of CRF to the temporal domain and describes the relationships among neighboring regions both in a frame and over frames.

Object object-detection +4

Region-Based Multiscale Spatiotemporal Saliency for Video

no code implementations4 Aug 2017 Trung-Nghia Le, Akihiro Sugimoto

Detecting salient objects from a video requires exploiting both spatial and temporal knowledge included in the video.

Saliency Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.