no code implementations • 10 Apr 2024 • Dat Viet Thanh Nguyen, Anh Tran, Hoai Nam Vu, Cuong Pham, Minh Hoai
This network has a camera calibration module that can compute an embedding vector that represents the spatial configuration between the driver and the camera system.
no code implementations • 27 Mar 2024 • Trong-Tung Nguyen, Duc-Anh Nguyen, Anh Tran, Cuong Pham
Our work addresses limitations seen in previous approaches for object-centric editing problems, such as unrealistic results due to shape discrepancies and limited control in object replacement or insertion.
no code implementations • 24 Mar 2024 • Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai
This algorithm works by transforming a blurry input image, which is challenging to deblur, into another blurry image that is more amenable to deblurring.
1 code implementation • 9 Mar 2024 • Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do
Inspired by the benefits of the frequency domain, we propose a novel module that functions as an attention mechanism in the frequency domain.
no code implementations • 23 Feb 2024 • Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby
This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023.
1 code implementation • 28 Dec 2023 • Yifeng Huang, Duc DUy Nguyen, Lam Nguyen, Cuong Pham, Minh Hoai
To develop and evaluate our approach, we introduce a diverse and realistic dataset consisting of real-world data from 37 subjects and 50 action categories, encompassing both sensor and audio data.
no code implementations • 28 Dec 2023 • Trung Tuan Dao, Duc Hong Vu, Cuong Pham, Anh Tran
The existing facial datasets, while having plentiful images at near frontal views, lack images with extreme head poses, leading to the downgraded performance of deep learning models when dealing with profile or pitched faces.
1 code implementation • 17 Dec 2023 • Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tran, Cuong Pham, Khoi Nguyen
We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes.
Ranked #1 on 3D Open-Vocabulary Instance Segmentation on S3DIS
3D Instance Segmentation 3D Open-Vocabulary Instance Segmentation +4
no code implementations • 3 Dec 2023 • Quang Nguyen, Truong Vu, Cuong Pham, Anh Tran, Khoi Nguyen
In the ever-expanding digital landscape, safeguarding sensitive information remains paramount.
no code implementations • 1 Dec 2023 • Duc-Anh Nguyen, Cuong Pham, Nhien-An Le-Khac
Various types of sensors can be used for Human Activity Recognition (HAR), and each of them has different strengths and weaknesses.
Ranked #1 on Human Activity Recognition on PAMAP2 (Accuracy metric)
1 code implementation • 3 Sep 2023 • Ngan Dao Hoang, Dat Tran-Anh, Manh Luong, Cong Tran, Cuong Pham
In this work, our aim is to develop a framework that can effectively perform cough classification even in situations when enormous cough data is not available, while also addressing privacy concerns.
no code implementations • 3 Sep 2023 • Son Tran, Cong Tran, Anh Tran, Cuong Pham
In this paper, we push forward the state-of-the-art performance of unsupervised MOT methods by proposing UnsMOT, a novel framework that explicitly combines the appearance and motion features of objects with geometric information to provide more accurate tracking.
1 code implementation • CVPR 2023 • Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai
We consider the challenging task of training models for image-to-video deblurring, which aims to recover a sequence of sharp images corresponding to a given blurry image input.
1 code implementation • 2 Dec 2022 • Dat Viet Thanh Nguyen, Phong Tran The, Tan M. Dinh, Cuong Pham, Anh Tuan Tran
The network can synthesize various image degradation and restore the sharp image via a quality control code.
no code implementations • 27 Oct 2022 • Cuong Pham, Tuan Hoang, Thanh-Toan Do
Knowledge distillation which learns a lightweight student model by distilling knowledge from a cumbersome teacher model is an attractive approach for learning compact deep neural networks (DNNs).
1 code implementation • 29 Oct 2021 • Manh-Ha Bui, Viet-Anh Tran, Cuong Pham
To be more specific, we design new hardware which consists of an acoustic sensor to collect audio features from the nose, as well as an accelerometer and gyroscope to collect movement on the chest as a result of an individual's breathing.
no code implementations • 10 Jul 2021 • Cuong Pham, Tung Le
Premier League is known as one of the most competitive football league in the world, hence there are many goals are scored here every match.