Search Results for author: Anh Nguyen

Found 98 papers, 55 papers with code

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

no code implementations • 22 Apr 2024 • Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, ZiYi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

We introduce phi-3-mini, a 3. 8 billion parameter language model trained on 3. 3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3. 5 (e. g., phi-3-mini achieves 69% on MMLU and 8. 38 on MT-bench), despite being small enough to be deployed on a phone.

Language Modelling

Paper
Add Code

High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces

no code implementations • 20 Apr 2024 • Baoru Huang, Yida Wang, Anh Nguyen, Daniel Elson, Francisco Vasconcelos, Danail Stoyanov

In surgical oncology, screening colonoscopy plays a pivotal role in providing diagnostic assistance, such as biopsy, and facilitating surgical navigation, particularly in polyp detection.

Camera Localization Image Generation +4

Paper
Add Code

DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

no code implementations • 14 Apr 2024 • Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.

Data Augmentation Multi-Task Learning +2

Paper
Add Code

Weakly-Supervised Learning via Multi-Lateral Decoder Branching for Guidewire Segmentation in Robot-Assisted Cardiovascular Catheterization

no code implementations • 11 Apr 2024 • Olatunji Mumini Omisore, Toluwanimi Akinyemi, Anh Nguyen, Lei Wang

Thus, we offer a less expensive method for real-time tool segmentation and tracking during robot-assisted cardiac catheterization.

Segmentation Weakly-supervised Learning

Paper
Add Code

Allowing humans to interactively guide machines where to look does not always improve human-AI team's classification accuracy

1 code implementation • 8 Apr 2024 • Giang Nguyen, Mohammad Reza Taesiri, Sunnie S. Y. Kim, Anh Nguyen

We build CHM-Corr++, an interactive interface for CHM-Corr, enabling users to edit the feature importance map provided by CHM-Corr and observe updated model decisions.

Feature Importance Image Classification

Paper
Code

Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model

1 code implementation • 2 Apr 2024 • Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen

The results reveal that Samba achieved unparalleled performance on commonly used remote sensing datasets for semantic segmentation.

Segmentation Semantic Segmentation

Paper
Code

On the Effectiveness of Heterogeneous Ensemble Methods for Re-identification

no code implementations • 19 Mar 2024 • Simon Klüttermann, Jérôme Rutinowski, Anh Nguyen, Britta Grimme, Moritz Roidl, Emmanuel Müller

In this contribution, we introduce a novel ensemble method for the re-identification of industrial entities, using images of chipwood pallets and galvanized metal plates as dataset examples.

Paper
Add Code

ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

no code implementations • 18 Mar 2024 • Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le

Consequently, this compromised quality of visible features during the subsequent visible-to-amodal transition.

Amodal Instance Segmentation Semantic Segmentation

Paper
Add Code

Autonomous Catheterization with Open-source Simulator and Expert Trajectory

1 code implementation • 17 Jan 2024 • Tudor Jianu, Baoru Huang, Tuan Vo, Minh Nhat Vu, Jingxuan Kang, Hoan Nguyen, Olatunji Omisore, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen

Endovascular robots have been actively developed in both academia and industry.

Paper
Code

Leveraging Habitat Information for Fine-grained Bird Identification

no code implementations • 22 Dec 2023 • Tin Nguyen, Anh Nguyen

Training CNNs and ViTs with habitat-augmented data results in an improvement of up to +0. 83 and +0. 23 points on NABirds and CUB-200, respectively.

Image Augmentation

Paper
Add Code

WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

no code implementations • 15 Dec 2023 • Huy Le, Tung Kieu, Anh Nguyen, Ngan Le

Text-video retrieval, a prominent sub-field within the domain of multimodal information retrieval, has witnessed remarkable growth in recent years.

Information Retrieval Knowledge Distillation +3

Paper
Add Code

TinyGSM: achieving >80% on GSM8k with small language models

no code implementations • 14 Dec 2023 • Bingbin Liu, Sebastien Bubeck, Ronen Eldan, Janardhan Kulkarni, Yuanzhi Li, Anh Nguyen, Rachel Ward, Yi Zhang

Specifically for solving grade school math, the smallest model size so far required to break the 80\% barrier on the GSM8K benchmark remains to be 34B.

Ranked #58 on Arithmetic Reasoning on GSM8K

Arithmetic Reasoning GSM8K +2

Paper
Add Code

GlitchBench: Can large multimodal models detect video game glitches?

no code implementations • 8 Dec 2023 • Mohammad Reza Taesiri, Tianjun Feng, Anh Nguyen, Cor-Paul Bezemer

To address this gap, we introduce GlitchBench, a novel benchmark derived from video game quality assurance tasks, to test and evaluate the reasoning capabilities of LMMs.

Paper
Add Code

Generating Valid and Natural Adversarial Examples with Large Language Models

no code implementations • 20 Nov 2023 • Zimu Wang, Wei Wang, Qi Chen, Qiufeng Wang, Anh Nguyen

Deep learning-based natural language processing (NLP) models, particularly pre-trained language models (PLMs), have been revealed to be vulnerable to adversarial attacks.

Adversarial Attack valid

Paper
Add Code

Shape-Sensitive Loss for Catheter and Guidewire Segmentation

no code implementations • 19 Nov 2023 • Chayun Kongtongvattana, Baoru Huang, Jingxuan Kang, Hoan Nguyen, Olajide Olufemi, Anh Nguyen

By computing the cosine similarity between these feature vectors, we gain a nuanced understanding of image similarity that goes beyond the limitations of traditional overlap-based measures.

Paper
Add Code

3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images

no code implementations • 19 Nov 2023 • Tudor Jianu, Baoru Huang, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen

Endovascular navigation, essential for diagnosing and treating endovascular diseases, predominantly hinges on fluoroscopic images due to the constraints in sensory feedback.

Paper
Add Code

Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding

no code implementations • 31 Oct 2023 • Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

In the era of the Internet of Things (IoT), the retrieval of relevant medical information has become essential for efficient clinical decision-making.

Decision Making Information Retrieval +2

Paper
Add Code

Controllable Group Choreography using Contrastive Diffusion

no code implementations • 29 Oct 2023 • Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications.

Paper
Add Code

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

1 code implementation • 5 Oct 2023 • Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

Open-Fusion harnesses the power of a pre-trained vision-language foundation model (VLFM) for open-set semantic comprehension and employs the Truncated Signed Distance Function (TSDF) for swift 3D scene reconstruction.

3D Scene Reconstruction

Paper
Code

Learning to Terminate in Object Navigation

1 code implementation • 28 Sep 2023 • Yuhang Song, Anh Nguyen, Chun-Yi Lee

This paper tackles the critical challenge of object navigation in autonomous navigation systems, particularly focusing on the problem of target approach and episode termination in environments with long optimal episode length in Deep Reinforcement Learning (DRL) based methods.

Autonomous Navigation Object +2

Paper
Code

I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses

1 code implementation • 24 Sep 2023 • Trong Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le

In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification.

Language Modelling

Paper
Code

Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

1 code implementation • 18 Sep 2023 • An Dinh Vuong, Minh Nhat Vu, Hieu Le, Baoru Huang, Binh Huynh, Thieu Vo, Andreas Kugi, Anh Nguyen

Foundation models such as ChatGPT have made significant strides in robotic tasks due to their universal representation of real-world domains.

Robotic Grasping World Knowledge

Paper
Code

Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive Cancer Surgery

1 code implementation • 7 Jul 2023 • Baoru Huang, Yicheng Hu, Anh Nguyen, Stamatia Giannarou, Daniel S. Elson

In surgical oncology, it is challenging for surgeons to identify lymph nodes and completely resect cancer even with pre-operative imaging systems like PET and CT, because of the lack of reliable intraoperative visualization tools.

Paper
Code

HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation

1 code implementation • 20 Jun 2023 • An Dinh Vuong, Toan Tien Nguyen, Minh Nhat Vu, Baoru Huang, Dzung Nguyen, Huynh Thi Thanh Binh, Thieu Vo, Anh Nguyen

Visual navigation, a foundational aspect of Embodied AI (E-AI), has been significantly studied in the past few years.

Collision Avoidance Computational Efficiency +3

Paper
Code

A Client-server Deep Federated Learning for Cross-domain Surgical Image Segmentation

no code implementations • 14 Jun 2023 • Ronast Subedi, Rebati Raman Gaire, Sharib Ali, Anh Nguyen, Danail Stoyanov, Binod Bhattarai

This paper presents a solution to the cross-domain adaptation problem for 2D surgical image segmentation, explicitly considering the privacy protection of distributed datasets belonging to different centers.

Domain Adaptation Federated Learning +3

Paper
Add Code

Self-Supervised Learning for Point Clouds Data: A Survey

no code implementations • 9 May 2023 • Changyu Zeng, Wei Wang, Anh Nguyen, Yutao Yue

We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts' characteristics.

Pedestrian Detection Self-Supervised Learning

Paper
Add Code

Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching

no code implementations • 16 Apr 2023 • Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen

In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images.

Image-to-Image Translation

Paper
Add Code

Music-Driven Group Choreography

no code implementations • CVPR 2023 • Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

The proposed dataset consists of 16. 7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres.

Paper
Add Code

Style Transfer for 2D Talking Head Animation

1 code implementation • 17 Mar 2023 • Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new method to generate talking head animation with learnable style references.

Style Transfer

Paper
Code

Open-Vocabulary Affordance Detection in 3D Point Clouds

1 code implementation • 4 Mar 2023 • Toan Nguyen, Minh Nhat Vu, An Vuong, Dzung Nguyen, Thieu Vo, Ngan Le, Anh Nguyen

In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds.

Affordance Detection

Paper
Code

A Light-weight Deep Learning Model for Remote Sensing Image Classification

no code implementations • 25 Feb 2023 • Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin

In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image.

Image Classification Knowledge Distillation +1

Paper
Add Code

ViDeBERTa: A powerful pre-trained language model for Vietnamese

1 code implementation • 25 Jan 2023 • Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu

This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large, which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using DeBERTa architecture.

Language Modelling named-entity-recognition +5

Paper
Code

A Large-Scale Study of a Sleep Tracking and Improving Device with Closed-loop and Personalized Real-time Acoustic Stimulation

no code implementations • 4 Nov 2022 • Anh Nguyen, Galen Pogoncheff, Ban Xuan Dong, Nam Bui, Hoang Truong, Nhat Pham, Linh Nguyen, Hoang Huu Nguyen, Sy Duong-Quy, Sangtae Ha, Tam Vu

Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life.

Sleep Staging

Paper
Add Code

SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation

1 code implementation • 27 Oct 2022 • Zhaorui Tan, Xi Yang, Zihan Ye, Qiufeng Wang, Yuyao Yan, Anh Nguyen, Kaizhu Huang

Generating consistent and high-quality images from given texts is essential for visual-language understanding.

Paper
Code

Uncertainty-aware Label Distribution Learning for Facial Expression Recognition

1 code implementation • 21 Sep 2022 • Nhat Le, Khanh Nguyen, Quang Tran, Erman Tjiputra, Bac Le, Anh Nguyen

In this paper, we propose a new uncertainty-aware label distribution learning method to improve the robustness of deep models against uncertainty and ambiguity.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Code

Inverse Image Frequency for Long-tailed Image Recognition

1 code implementation • 11 Sep 2022 • Konstantinos Panagiotis Alexandridis, Shan Luo, Anh Nguyen, Jiankang Deng, Stefanos Zafeiriou

The long-tailed distribution is a common phenomenon in the real world.

Instance Segmentation Semantic Segmentation

Paper
Code

Self-Supervised Depth Estimation in Laparoscopic Image using 3D Geometric Consistency

3 code implementations • 17 Aug 2022 • Baoru Huang, Jian-Qing Zheng, Anh Nguyen, Chi Xu, Ioannis Gkouzionis, Kunal Vyas, David Tuch, Stamatia Giannarou, Daniel S. Elson

Depth estimation is a crucial step for image-guided intervention in robotic surgery and laparoscopic imaging system.

Depth Estimation

Paper
Code

Visual correspondence-based explanations improve AI robustness and human-AI team accuracy

1 code implementation • 26 Jul 2022 • Giang Nguyen, Mohammad Reza Taesiri, Anh Nguyen

Via a large-scale, human study on ImageNet and CUB, our correspondence-based explanations are found to be more useful to users than kNN explanations.

Image Classification

Paper
Code

Long-tailed Instance Segmentation using Gumbel Optimized Loss

1 code implementation • 22 Jul 2022 • Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo

Major advancements have been made in the field of object detection and segmentation recently.

Ranked #11 on Instance Segmentation on LVIS v1.0 val

Instance Segmentation object-detection +2

Paper
Code

CodeT: Code Generation with Generated Tests

1 code implementation • 21 Jul 2022 • Bei Chen, Fengji Zhang, Anh Nguyen, Daoguang Zan, Zeqi Lin, Jian-Guang Lou, Weizhu Chen

A natural way to evaluate the quality and correctness of a code solution is to run it against a set of test cases, but the manual creation of such test cases is often costly and time-consuming.

Ranked #1 on Code Generation on APPS (Introductory Pass@1 metric)

Code Generation

550

Paper
Code

Reducing Training Time in Cross-Silo Federated Learning using Multigraph Topology

1 code implementation • ICCV 2023 • Tuong Do, Binh X. Nguyen, Vuong Pham, Toan Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new multigraph topology for cross-silo federated learning.

Federated Learning

Paper
Code

PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search

1 code implementation • 19 Jul 2022 • Thang M. Pham, Seunghyun Yoon, Trung Bui, Anh Nguyen

While contextualized word embeddings have been a de-facto standard, learning contextualized phrase embeddings is less explored and being hindered by the lack of a human-annotated benchmark that tests machine understanding of phrase semantics given a context sentence or paragraph (instead of phrases alone).

Information Retrieval Natural Language Understanding +5

Paper
Code

How explainable are adversarially-robust CNNs?

no code implementations • 25 May 2022 • Mehdi Nourelahi, Lars Kotthoff, Peijie Chen, Anh Nguyen

Here, we perform the first, large-scale evaluation of the relations of the three criteria using 9 feature-importance methods and 12 ImageNet-trained CNNs that are of 3 training algorithms and 5 CNN architectures.

Feature Importance

Paper
Add Code

Fine-Grained Visual Classification using Self Assessment Classifier

1 code implementation • 21 May 2022 • Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets.

Ranked #4 on Fine-Grained Image Classification on Stanford Dogs

Classification Continual Learning +1

Paper
Code

Understanding Public Opinion on Using Hydroxychloroquine for COVID-19 Treatment via Social Media

2 code implementations • 1 Jan 2022 • Thuy T. Do, Du Nguyen, Anh Le, Anh Nguyen, Dong Nguyen, Nga Hoang, Uyen Le, Tuan Tran

This paper studies the reactions of social network users on the recommendation of using HCQ for COVID-19 treatment by analyzing the reaction patterns and sentiment of the tweets.

Descriptive Sentiment Analysis

Paper
Code

DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover's Distance Improves Out-Of-Distribution Face Identification

1 code implementation • CVPR 2022 • Hai Phan, Anh Nguyen

Face identification (FI) is ubiquitous and drives many high-stake decisions made by law enforcement.

Face Identification Re-Ranking

Paper
Code

Global-Local Attention for Emotion Recognition

1 code implementation • 7 Nov 2021 • Nhat Le, Khanh Nguyen, Anh Nguyen, Bac Le

Our network is designed to extract features from both facial and context regions independently, then learn them together using the attention module.

Emotion Recognition

Paper
Code

Double Trouble: How to not explain a text classifier's decisions using counterfactuals synthesized by masked language models?

1 code implementation • 22 Oct 2021 • Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen

We find two reasons why IM is not better than LOO: (1) deleting a single word from the input only marginally reduces a classifier's accuracy; and (2) a highly predictable word is always given near-zero attribution, regardless of its true importance to the classifier.

Causal Inference

Paper
Code

Deep Federated Learning for Autonomous Driving

1 code implementation • 12 Oct 2021 • Anh Nguyen, Tuong Do, Minh Tran, Binh X. Nguyen, Chien Duong, Tu Phan, Erman Tjiputra, Quang D. Tran

We design a new Federated Autonomous Driving network (FADNet) that can improve the model stability, ensure convergence, and handle imbalanced data distribution problems while is being trained with federated learning methods.

Autonomous Driving Federated Learning

Paper
Code

DeepECMP: Predicting Extracellular Matrix Proteins using Deep Learning

no code implementations • 7 Oct 2021 • Mohamed Ghafoor, Anh Nguyen

Introduction: The extracellular matrix (ECM) is a networkof proteins and carbohydrates that has a structural and bio-chemical function.

Paper
Add Code

Coarse-to-Fine Reasoning for Visual Question Answering

2 code implementations • 6 Oct 2021 • Binh X. Nguyen, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Bridging the semantic gap between image and question is an important step to improve the accuracy of the Visual Question Answering (VQA) task.

Ranked #1 on Visual Question Answering (VQA) on GQA test-dev

Question Answering Visual Question Answering

Paper
Code

Light-weight Deformable Registration using Adversarial Learning with Distilling Knowledge

1 code implementation • 4 Oct 2021 • Minh Q. Tran, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We design the student network such as it is light-weight and well suitable for deployment on a typical CPU.

Paper
Code

Attention Gate in Traffic Forecasting

no code implementations • 27 Sep 2021 • Anh Lam, Anh Nguyen, Bac Le

An attention gates filter features from the contraction path before combining with features on the expansion path, it enables our model to reduce the effect of non-traffic region features and focus more on crucial region features.

Paper
Add Code

Semi-Supervised Adversarial Discriminative Domain Adaptation

1 code implementation • 27 Sep 2021 • Thai-Vu Nguyen, Anh Nguyen, Nghia Le, Bac Le

Domain adaptation is a potential method to train a powerful deep neural network, which can handle the absence of labeled data.

Domain Adaptation Emotion Recognition

Paper
Code

Occlusion-robust Visual Markerless Bone Tracking for Computer-Assisted Orthopaedic Surgery

no code implementations • 24 Aug 2021 • Xue Hu, Anh Nguyen, Ferdinando Rodriguez y Baena

In practice, by using a high-quality commercial RGB-D camera, our proposed visual tracking method achieves an accuracy of 1-2 degress and 2-4 mm on a model knee, which meets the standard for clinical applications.

Anatomy Point Cloud Segmentation +1

Paper
Add Code

Self-Supervised Generative Adversarial Network for Depth Estimation in Laparoscopic Images

no code implementations • 9 Jul 2021 • Baoru Huang, Jianqing Zheng, Anh Nguyen, David Tuch, Kunal Vyas, Stamatia Giannarou, Daniel S. Elson

Dense depth estimation and 3D reconstruction of a surgical scene are crucial steps in computer assisted surgery.

3D Reconstruction Depth Estimation +1

Paper
Add Code

The DEformer: An Order-Agnostic Distribution Estimating Transformer

1 code implementation • ICML Workshop INNF 2021 • Michael A. Alcorn, Anh Nguyen

In this paper, we propose an alternative approach for encoding feature identities, where each feature's identity is included alongside its value in the input.

Density Estimation

Paper
Code

Inverting Adversarially Robust Networks for Image Synthesis

1 code implementation • 13 Jun 2021 • Renan A. Rojas-Gomez, Raymond A. Yeh, Minh N. Do, Anh Nguyen

Despite unconditional feature inversion being the foundation of many image synthesis applications, training an inverter demands a high computational budget, large decoding capacity and imposing conditions such as autoregressive priors.

Anomaly Detection Deep Feature Inversion +3

Paper
Code

The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

1 code implementation • NeurIPS 2021 • Giang Nguyen, Daeyoung Kim, Anh Nguyen

Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stake applications.

Paper
Code

Multiple Meta-model Quantifying for Medical Visual Question Answering

2 code implementations • 19 May 2021 • Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran, Anh Nguyen

However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized.

Ranked #5 on Medical Visual Question Answering on PathVQA

Medical Visual Question Answering Meta-Learning +3

Paper
Code

baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents

1 code implementation • NeurIPS 2021 • Michael A. Alcorn, Anh Nguyen

In many multi-agent spatiotemporal systems, agents operate under the influence of shared, unobserved variables (e. g., the play a team is executing in a game of basketball).

Ranked #1 on Trajectory Modeling on NBA SportVU

Trajectory Modeling

Paper
Code

Graph-based Person Signature for Person Re-Identifications

1 code implementation • 14 Apr 2021 • Binh X. Nguyen, Binh D. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph, namely Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person.

Ranked #48 on Person Re-Identification on DukeMTMC-reID

Attribute Multi-Task Learning +1

Paper
Code

An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

no code implementations • 5 Apr 2021 • Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network.

Acoustic Scene Classification Classification +2

Paper
Add Code

Speech Emotion Recognition using Semantic Information

1 code implementation • 4 Mar 2021 • Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller

In this paper, we propose a novel framework that can capture both the semantic and the paralinguistic information in the signal.

Speech Emotion Recognition Sound Audio and Speech Processing

Paper
Code

WaNet -- Imperceptible Warping-based Backdoor Attack

1 code implementation • 20 Feb 2021 • Anh Nguyen, Anh Tran

With the thriving of deep learning and the widespread practice of using pre-trained networks, backdoor attacks have become an increasing security threat drawing many research interests in recent years.

Backdoor Attack

Paper
Code

baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling

1 code implementation • NeurIPS 2021 • Michael A. Alcorn, Anh Nguyen

Multi-agent spatiotemporal modeling is a challenging task from both an algorithmic design and computational complexity perspective.

Paper
Code

Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?

no code implementations • Findings (ACL) 2021 • Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen

Encouraging classifiers to capture word order information improves the performance on most GLUE tasks, SQuAD 2. 0 and out-of-samples.

Natural Language Inference Natural Language Understanding +2

Paper
Add Code

Deep Learning Framework Applied for Predicting Anomaly of Respiratory Sounds

no code implementations • 26 Dec 2020 • Dat Ngo, Lam Pham, Anh Nguyen, Ben Phan, Khoa Tran, Truong Nguyen

This paper proposes a robust deep learning framework used for classifying anomaly of respiratory cycles.

Paper
Add Code

A Vietnamese Dataset for Evaluating Machine Reading Comprehension

no code implementations • COLING 2020 • Kiet Nguyen, Vu Nguyen, Anh Nguyen, Ngan Nguyen

Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for the low-resource language as Vietnamese to evaluate MRC models.

Machine Reading Comprehension Question Answering +1

Paper
Add Code

Input-Aware Dynamic Backdoor Attack

1 code implementation • NeurIPS 2020 • Anh Nguyen, Anh Tran

In recent years, neural backdoor attack has been considered to be a potential security threat to deep learning systems.

Backdoor Attack

Paper
Code

Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

no code implementations • 31 Jul 2020 • Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran

In this work, we propose a multimodal fusion approach to address the problem of autonomous navigation in complex environments such as collapsed cites, or natural caves.

Robotics

Paper
Add Code

Are there any 'object detectors' in the hidden layers of CNNs trained to identify objects or scenes?

1 code implementation • 2 Jul 2020 • Ella M. Gale, Nicholas Martin, Ryan Blything, Anh Nguyen, Jeffrey S. Bowers

We find that the different measures provide different estimates of object selectivity, with precision and CCMAS measures providing misleadingly high estimates.

General Classification Image Classification +1

Paper
Code

DeepEventMine: end-to-end neural nested event extraction from biomedical texts

1 code implementation • 17 Jun 2020 • Hai-Long Trieu, Thy Thy Tran, Khoa N A Duong, Anh Nguyen, Makoto Miwa, Sophia Ananiadou

Motivation Recent neural approaches on event extraction from text mainly focus on flat events in general domain, while there are less attempts to detect nested and overlapping events.

Ranked #1 on Event Extraction on GENIA 2013

Sentence

Paper
Code

End-to-End Real-time Catheter Segmentation with Optical Flow-Guided Warping during Endovascular Intervention

no code implementations • 16 Jun 2020 • Anh Nguyen, Dennis Kundrat, Giulio Dagnino, Wenqiang Chi, Mohamed E. M. K. Abdelaziz, Yao Guo, YingLiang Ma, Trevor M. Y. Kwok, Celia Riga, Guang-Zhong Yang

In this paper, we present FW-Net, an end-to-end and real-time deep learning framework for endovascular intervention.

Optical Flow Estimation Segmentation

Paper
Add Code

The shape and simplicity biases of adversarially robust ImageNet-trained CNNs

1 code implementation • 16 Jun 2020 • Peijie Chen, Chirag Agarwal, Anh Nguyen

Increasingly more similarities between human vision and convolutional neural networks (CNNs) have been revealed in the past few years.

Image Generation

Paper
Code

SAM: The Sensitivity of Attribution Methods to Hyperparameters

1 code implementation • CVPR 2020 • Naman Bansal, Chirag Agarwal, Anh Nguyen

Attribution methods can provide powerful insights into the reasons for a classifier's decision.

Paper
Code

A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings

no code implementations • 10 Oct 2019 • Qi Li, Long Mai, Michael A. Alcorn, Anh Nguyen

Large, pre-trained generative models have been increasingly popular and useful to both the research and wider communities.

Model Editing

Paper
Add Code

Explaining image classifiers by removing input features using generative models

1 code implementation • 9 Oct 2019 • Chirag Agarwal, Anh Nguyen

Perturbation-based explanation methods often measure the contribution of an input feature to an image classifier's outputs by heuristically removing it via e. g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.

counterfactual Object Localization

Paper
Code

Removing input features via a generative model to explain their attributions to classifier's decisions

no code implementations • 25 Sep 2019 • Chirag Agarwal, Dan Schonfeld, Anh Nguyen

Interpretability methods often measure the contribution of an input feature to an image classifier's decisions by heuristically removing it via e. g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.

counterfactual

Paper
Add Code

Understanding Neural Networks via Feature Visualization: A survey

1 code implementation • 18 Apr 2019 • Anh Nguyen, Jason Yosinski, Jeff Clune

A neuroscience method to understanding the brain is to find and study the preferred stimuli that highly activate an individual cell or groups of cells.

BIG-bench Machine Learning

Paper
Code

V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation

no code implementations • 23 Mar 2019 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

We propose V2CNet, a new deep learning framework to automatically translate the demonstration videos to commands that can be directly used in robotic applications.

Paper
Add Code

Scene Understanding for Autonomous Manipulation with Deep Learning

no code implementations • 23 Mar 2019 • Anh Nguyen

In this study, our long-term goal is to bridge the gap between computer vision and robotics by developing visual methods that can be used in real robots.

Action Understanding Affordance Detection +6

Paper
Add Code

Strike (with) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects

1 code implementation • CVPR 2019 • Michael A. Alcorn, Qi Li, Zhitao Gong, Chengfei Wang, Long Mai, Wei-Shinn Ku, Anh Nguyen

Using our framework and a self-assembled dataset of 3D objects, we investigate the vulnerability of DNNs to OoD poses of well-known objects in ImageNet.

Paper
Code

Improving Adversarial Robustness by Encouraging Discriminative Features

no code implementations • 1 Nov 2018 • Chirag Agarwal, Anh Nguyen, Dan Schonfeld

Intuitively, the center loss encourages DNNs to simultaneously learns a center for the deep features of each class, and minimize the distances between the intra-class deep features and their corresponding class centers.

Adversarial Robustness

Paper
Add Code

Selectivity metrics can overestimate the selectivity of units: a case study on AlexNet

no code implementations • 27 Sep 2018 • Ella M. Gale, Anh Nguyen, Ryan Blything, Nicholas Martin and Jeffrey S. Bowers

These findings highlight the problem with current selectivity measures and show that new measures are required in order to provide a better assessment of learned representations in NNs.

Paper
Add Code

VectorDefense: Vectorization as a Defense to Adversarial Examples

1 code implementation • 23 Apr 2018 • Vishaal Munusamy Kabilan, Brandon Morris, Anh Nguyen

Training deep neural networks on images represented as grids of pixels has brought to light an interesting phenomenon known as adversarial examples.

Paper
Code

Object Captioning and Retrieval with Natural Language

1 code implementation • 16 Mar 2018 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object.

Object Retrieval

Paper
Code

The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

no code implementations • 9 Mar 2018 • Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter, Laurent Keller, Carole Knibbe, Peter Krcah, Richard E. Lenski, Hod Lipson, Robert MacCurdy, Carlos Maestre, Risto Miikkulainen, Sara Mitri, David E. Moriarty, Jean-Baptiste Mouret, Anh Nguyen, Charles Ofria, Marc Parizeau, David Parsons, Robert T. Pennock, William F. Punch, Thomas S. Ray, Marc Schoenauer, Eric Shulte, Karl Sims, Kenneth O. Stanley, François Taddei, Danesh Tarapore, Simon Thibault, Westley Weimer, Richard Watson, Jason Yosinski

Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them.

Artificial Life

Paper
Add Code

Spatial PixelCNN: Generating Images from Patches

no code implementations • 3 Dec 2017 • Nader Akoury, Anh Nguyen

In this paper we propose Spatial PixelCNN, a conditional autoregressive model that generates images from small patches.

Paper
Add Code

Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks

no code implementations • 1 Oct 2017 • Anh Nguyen, Dimitrios Kanoulas, Luca Muratore, Darwin G. Caldwell, Nikos G. Tsagarakis

We present a new method to translate videos to commands for robotic manipulation using Deep Recurrent Neural Networks (RNN).

Translation

Paper
Add Code

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

2 code implementations • 21 Sep 2017 • Thanh-Toan Do, Anh Nguyen, Ian Reid

We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images.

Affordance Detection Object +2

119

Paper
Code

Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks

1 code implementation • 22 Aug 2017 • Anh Nguyen, Thanh-Toan Do, Darwin G. Caldwell, Nikos G. Tsagarakis

Our method first creates the event image from a list of events that occurs in a very short time interval, then a Stacked Spatial LSTM Network (SP-LSTM) is used to learn the camera pose.

Paper
Code

Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning

no code implementations • 16 Mar 2017 • Mohammed Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Ali Swanson, Meredith Palmer, Craig Packer, Jeff Clune

Having accurate, detailed, and up-to-date information about the location and behavior of animals in the wild would revolutionize our ability to study and conserve ecosystems.

Paper
Add Code

Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space

1 code implementation • CVPR 2017 • Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski

PPGNs are composed of 1) a generator network G that is capable of drawing a wide range of image types and 2) a replaceable "condition" network C that tells the generator what to draw.

Image Captioning Image Inpainting

539

Paper
Code

Synthesizing the preferred inputs for neurons in neural networks via deep generator networks

5 code implementations • NeurIPS 2016 • Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune

Understanding the inner workings of such computational brains is both fascinating basic science that is interesting in its own right - similar to why we study the human brain - and will enable researchers to further improve DNNs.

473

Paper
Code

Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

no code implementations • 11 Feb 2016 • Anh Nguyen, Jason Yosinski, Jeff Clune

Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron.

Paper
Add Code

Understanding Neural Networks Through Deep Visualization

7 code implementations • 22 Jun 2015 • Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, Hod Lipson

The first is a tool that visualizes the activations produced on each layer of a trained convnet as it processes an image or video (e. g. a live webcam stream).

Interpretable Machine Learning

3,986

Paper
Code

Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images

2 code implementations • CVPR 2015 • Anh Nguyen, Jason Yosinski, Jeff Clune

Here we show a related result: it is easy to produce images that are completely unrecognizable to humans, but that state-of-the-art DNNs believe to be recognizable objects with 99. 99% confidence (e. g. labeling with certainty that white noise static is a lion).

Evolutionary Algorithms

170

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.