Search Results for author: Anh Nguyen

Found 98 papers, 55 papers with code

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

no code implementations 22 Apr 2024 Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, ZiYi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone.

Language Modelling

High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces

no code implementations 20 Apr 2024 Baoru Huang, Yida Wang, Anh Nguyen, Daniel Elson, Francisco Vasconcelos, Danail Stoyanov

In surgical oncology, screening colonoscopy plays a pivotal role in providing diagnostic assistance, such as biopsy, and facilitating surgical navigation, particularly in polyp detection.

Camera Localization Image Generation +4

DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

no code implementations 14 Apr 2024 Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.

Data Augmentation Multi-Task Learning +2

Allowing humans to interactively guide machines where to look does not always improve human-AI team's classification accuracy

1 code implementation 8 Apr 2024 Giang Nguyen, Mohammad Reza Taesiri, Sunnie S. Y. Kim, Anh Nguyen

We build CHM-Corr++, an interactive interface for CHM-Corr, enabling users to edit the feature importance map provided by CHM-Corr and observe updated model decisions.

Feature Importance Image Classification

Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model

1 code implementation 2 Apr 2024 Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen

The results reveal that Samba achieved unparalleled performance on commonly used remote sensing datasets for semantic segmentation.

Segmentation Semantic Segmentation

On the Effectiveness of Heterogeneous Ensemble Methods for Re-identification

no code implementations 19 Mar 2024 Simon Klüttermann, Jérôme Rutinowski, Anh Nguyen, Britta Grimme, Moritz Roidl, Emmanuel Müller

In this contribution, we introduce a novel ensemble method for the re-identification of industrial entities, using images of chipwood pallets and galvanized metal plates as dataset examples.

Leveraging Habitat Information for Fine-grained Bird Identification

no code implementations 22 Dec 2023 Tin Nguyen, Anh Nguyen

Training CNNs and ViTs with habitat-augmented data results in an improvement of up to +0.83 and +0.23 points on NABirds and CUB-200, respectively.

Image Augmentation

WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

no code implementations 15 Dec 2023 Huy Le, Tung Kieu, Anh Nguyen, Ngan Le

Text-video retrieval, a prominent sub-field within the domain of multimodal information retrieval, has witnessed remarkable growth in recent years.

Information Retrieval Knowledge Distillation +3

TinyGSM: achieving >80% on GSM8k with small language models

no code implementations 14 Dec 2023 Bingbin Liu, Sebastien Bubeck, Ronen Eldan, Janardhan Kulkarni, Yuanzhi Li, Anh Nguyen, Rachel Ward, Yi Zhang

Specifically for solving grade school math, the smallest model size so far required to break the 80% barrier on the GSM8K benchmark remains 34B.

Arithmetic Reasoning GSM8K +2

GlitchBench: Can large multimodal models detect video game glitches?

no code implementations 8 Dec 2023 Mohammad Reza Taesiri, Tianjun Feng, Anh Nguyen, Cor-Paul Bezemer

To address this gap, we introduce GlitchBench, a novel benchmark derived from video game quality assurance tasks, to test and evaluate the reasoning capabilities of LMMs.

Generating Valid and Natural Adversarial Examples with Large Language Models

no code implementations 20 Nov 2023 Zimu Wang, Wei Wang, Qi Chen, Qiufeng Wang, Anh Nguyen

Deep learning-based natural language processing (NLP) models, particularly pre-trained language models (PLMs), have been revealed to be vulnerable to adversarial attacks.

Adversarial Attack valid

Shape-Sensitive Loss for Catheter and Guidewire Segmentation

no code implementations 19 Nov 2023 Chayun Kongtongvattana, Baoru Huang, Jingxuan Kang, Hoan Nguyen, Olajide Olufemi, Anh Nguyen

By computing the cosine similarity between these feature vectors, we gain a nuanced understanding of image similarity that goes beyond the limitations of traditional overlap-based measures.

3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images

no code implementations 19 Nov 2023 Tudor Jianu, Baoru Huang, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen

Endovascular navigation, essential for diagnosing and treating endovascular diseases, predominantly hinges on fluoroscopic images due to the constraints in sensory feedback.

Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding

no code implementations 31 Oct 2023 Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

In the era of the Internet of Things (IoT), the retrieval of relevant medical information has become essential for efficient clinical decision-making.

Decision Making Information Retrieval +2

Controllable Group Choreography using Contrastive Diffusion

no code implementations 29 Oct 2023 Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications.

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

1 code implementation 5 Oct 2023 Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

Open-Fusion harnesses the power of a pre-trained vision-language foundation model (VLFM) for open-set semantic comprehension and employs the Truncated Signed Distance Function (TSDF) for swift 3D scene reconstruction.

3D Scene Reconstruction

Learning to Terminate in Object Navigation

1 code implementation 28 Sep 2023 Yuhang Song, Anh Nguyen, Chun-Yi Lee

This paper tackles the critical challenge of object navigation in autonomous navigation systems, particularly focusing on the problem of target approach and episode termination in environments with long optimal episode length in Deep Reinforcement Learning (DRL) based methods.

Autonomous Navigation Object +2

I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses

1 code implementation 24 Sep 2023 Trong Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le

In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification.

Language Modelling

Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

1 code implementation 18 Sep 2023 An Dinh Vuong, Minh Nhat Vu, Hieu Le, Baoru Huang, Binh Huynh, Thieu Vo, Andreas Kugi, Anh Nguyen

Foundation models such as ChatGPT have made significant strides in robotic tasks due to their universal representation of real-world domains.

Robotic Grasping World Knowledge

Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive Cancer Surgery

1 code implementation 7 Jul 2023 Baoru Huang, Yicheng Hu, Anh Nguyen, Stamatia Giannarou, Daniel S. Elson

In surgical oncology, it is challenging for surgeons to identify lymph nodes and completely resect cancer even with pre-operative imaging systems like PET and CT, because of the lack of reliable intraoperative visualization tools.

A Client-server Deep Federated Learning for Cross-domain Surgical Image Segmentation

no code implementations 14 Jun 2023 Ronast Subedi, Rebati Raman Gaire, Sharib Ali, Anh Nguyen, Danail Stoyanov, Binod Bhattarai

This paper presents a solution to the cross-domain adaptation problem for 2D surgical image segmentation, explicitly considering the privacy protection of distributed datasets belonging to different centers.

Domain Adaptation Federated Learning +3

Self-Supervised Learning for Point Clouds Data: A Survey

no code implementations 9 May 2023 Changyu Zeng, Wei Wang, Anh Nguyen, Yutao Yue

We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts' characteristics.

Pedestrian Detection Self-Supervised Learning

Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching

no code implementations 16 Apr 2023 Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen

In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images.

Image-to-Image Translation

Music-Driven Group Choreography

no code implementations CVPR 2023 Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

The proposed dataset consists of 16.7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres.

Style Transfer for 2D Talking Head Animation

1 code implementation 17 Mar 2023 Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new method to generate talking head animation with learnable style references.

Style Transfer

Open-Vocabulary Affordance Detection in 3D Point Clouds

1 code implementation 4 Mar 2023 Toan Nguyen, Minh Nhat Vu, An Vuong, Dzung Nguyen, Thieu Vo, Ngan Le, Anh Nguyen

In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds.

Affordance Detection

A Light-weight Deep Learning Model for Remote Sensing Image Classification

no code implementations 25 Feb 2023 Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin

In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image.

Image Classification Knowledge Distillation +1

ViDeBERTa: A powerful pre-trained language model for Vietnamese

1 code implementation 25 Jan 2023 Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu

This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large, which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using DeBERTa architecture.

Language Modelling named-entity-recognition +5

A Large-Scale Study of a Sleep Tracking and Improving Device with Closed-loop and Personalized Real-time Acoustic Stimulation

no code implementations 4 Nov 2022 Anh Nguyen, Galen Pogoncheff, Ban Xuan Dong, Nam Bui, Hoang Truong, Nhat Pham, Linh Nguyen, Hoang Huu Nguyen, Sy Duong-Quy, Sangtae Ha, Tam Vu

Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life.

Sleep Staging

Uncertainty-aware Label Distribution Learning for Facial Expression Recognition

1 code implementation 21 Sep 2022 Nhat Le, Khanh Nguyen, Quang Tran, Erman Tjiputra, Bac Le, Anh Nguyen

In this paper, we propose a new uncertainty-aware label distribution learning method to improve the robustness of deep models against uncertainty and ambiguity.

Facial Expression Recognition Facial Expression Recognition (FER)

Visual correspondence-based explanations improve AI robustness and human-AI team accuracy

1 code implementation 26 Jul 2022 Giang Nguyen, Mohammad Reza Taesiri, Anh Nguyen

Via a large-scale, human study on ImageNet and CUB, our correspondence-based explanations are found to be more useful to users than kNN explanations.

Image Classification

CodeT: Code Generation with Generated Tests

1 code implementation 21 Jul 2022 Bei Chen, Fengji Zhang, Anh Nguyen, Daoguang Zan, Zeqi Lin, Jian-Guang Lou, Weizhu Chen

A natural way to evaluate the quality and correctness of a code solution is to run it against a set of test cases, but the manual creation of such test cases is often costly and time-consuming.

Ranked #1 on Code Generation on APPS (Introductory Pass@1 metric)

Code Generation

PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search

1 code implementation 19 Jul 2022 Thang M. Pham, Seunghyun Yoon, Trung Bui, Anh Nguyen

While contextualized word embeddings have been a de-facto standard, learning contextualized phrase embeddings is less explored and being hindered by the lack of a human-annotated benchmark that tests machine understanding of phrase semantics given a context sentence or paragraph (instead of phrases alone).

Information Retrieval Natural Language Understanding +5

How explainable are adversarially-robust CNNs?

no code implementations 25 May 2022 Mehdi Nourelahi, Lars Kotthoff, Peijie Chen, Anh Nguyen

Here, we perform the first, large-scale evaluation of the relations of the three criteria using 9 feature-importance methods and 12 ImageNet-trained CNNs that are of 3 training algorithms and 5 CNN architectures.

Feature Importance

Fine-Grained Visual Classification using Self Assessment Classifier

1 code implementation 21 May 2022 Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets.

Classification Continual Learning +1

Understanding Public Opinion on Using Hydroxychloroquine for COVID-19 Treatment via Social Media

2 code implementations 1 Jan 2022 Thuy T. Do, Du Nguyen, Anh Le, Anh Nguyen, Dong Nguyen, Nga Hoang, Uyen Le, Tuan Tran

This paper studies the reactions of social network users on the recommendation of using HCQ for COVID-19 treatment by analyzing the reaction patterns and sentiment of the tweets.

Descriptive Sentiment Analysis

Global-Local Attention for Emotion Recognition

1 code implementation 7 Nov 2021 Nhat Le, Khanh Nguyen, Anh Nguyen, Bac Le

Our network is designed to extract features from both facial and context regions independently, then learn them together using the attention module.

Emotion Recognition

Double Trouble: How to not explain a text classifier's decisions using counterfactuals synthesized by masked language models?

1 code implementation 22 Oct 2021 Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen

We find two reasons why IM is not better than LOO: (1) deleting a single word from the input only marginally reduces a classifier's accuracy; and (2) a highly predictable word is always given near-zero attribution, regardless of its true importance to the classifier.

Causal Inference

Deep Federated Learning for Autonomous Driving

1 code implementation 12 Oct 2021 Anh Nguyen, Tuong Do, Minh Tran, Binh X. Nguyen, Chien Duong, Tu Phan, Erman Tjiputra, Quang D. Tran

We design a new Federated Autonomous Driving network (FADNet) that can improve the model stability, ensure convergence, and handle imbalanced data distribution problems while being trained with federated learning methods.

Autonomous Driving Federated Learning

DeepECMP: Predicting Extracellular Matrix Proteins using Deep Learning

no code implementations 7 Oct 2021 Mohamed Ghafoor, Anh Nguyen

Introduction: The extracellular matrix (ECM) is a network of proteins and carbohydrates that has a structural and biochemical function.

Coarse-to-Fine Reasoning for Visual Question Answering

2 code implementations 6 Oct 2021 Binh X. Nguyen, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Bridging the semantic gap between image and question is an important step to improve the accuracy of the Visual Question Answering (VQA) task.

Question Answering Visual Question Answering

Light-weight Deformable Registration using Adversarial Learning with Distilling Knowledge

1 code implementation 4 Oct 2021 Minh Q. Tran, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We design the student network so that it is light-weight and well suited for deployment on a typical CPU.

Attention Gate in Traffic Forecasting

no code implementations 27 Sep 2021 Anh Lam, Anh Nguyen, Bac Le

An attention gate filters features from the contraction path before they are combined with features on the expansion path, which enables our model to reduce the effect of non-traffic region features and focus more on crucial region features.

Semi-Supervised Adversarial Discriminative Domain Adaptation

1 code implementation 27 Sep 2021 Thai-Vu Nguyen, Anh Nguyen, Nghia Le, Bac Le

Domain adaptation is a potential method to train a powerful deep neural network, which can handle the absence of labeled data.

Domain Adaptation Emotion Recognition

Occlusion-robust Visual Markerless Bone Tracking for Computer-Assisted Orthopaedic Surgery

no code implementations 24 Aug 2021 Xue Hu, Anh Nguyen, Ferdinando Rodriguez y Baena

In practice, by using a high-quality commercial RGB-D camera, our proposed visual tracking method achieves an accuracy of 1-2 degrees and 2-4 mm on a model knee, which meets the standard for clinical applications.

Anatomy Point Cloud Segmentation +1

The DEformer: An Order-Agnostic Distribution Estimating Transformer

1 code implementation ICML Workshop INNF 2021 Michael A. Alcorn, Anh Nguyen

In this paper, we propose an alternative approach for encoding feature identities, where each feature's identity is included alongside its value in the input.

Density Estimation

Inverting Adversarially Robust Networks for Image Synthesis

1 code implementation 13 Jun 2021 Renan A. Rojas-Gomez, Raymond A. Yeh, Minh N. Do, Anh Nguyen

Despite unconditional feature inversion being the foundation of many image synthesis applications, training an inverter demands a high computational budget, large decoding capacity and imposing conditions such as autoregressive priors.

Anomaly Detection Deep Feature Inversion +3

The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

1 code implementation NeurIPS 2021 Giang Nguyen, Daeyoung Kim, Anh Nguyen

Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stake applications.

Multiple Meta-model Quantifying for Medical Visual Question Answering

2 code implementations 19 May 2021 Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran, Anh Nguyen

However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized.

Medical Visual Question Answering Meta-Learning +3

baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents

1 code implementation NeurIPS 2021 Michael A. Alcorn, Anh Nguyen

In many multi-agent spatiotemporal systems, agents operate under the influence of shared, unobserved variables (e.g., the play a team is executing in a game of basketball).

Trajectory Modeling

Graph-based Person Signature for Person Re-Identifications

1 code implementation 14 Apr 2021 Binh X. Nguyen, Binh D. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph, namely Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person.

Attribute Multi-Task Learning +1

An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

no code implementations 5 Apr 2021 Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network.

Acoustic Scene Classification Classification +2

Speech Emotion Recognition using Semantic Information

1 code implementation 4 Mar 2021 Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller

In this paper, we propose a novel framework that can capture both the semantic and the paralinguistic information in the signal.

Speech Emotion Recognition Sound Audio and Speech Processing

WaNet -- Imperceptible Warping-based Backdoor Attack

1 code implementation 20 Feb 2021 Anh Nguyen, Anh Tran

With the thriving of deep learning and the widespread practice of using pre-trained networks, backdoor attacks have become an increasing security threat drawing many research interests in recent years.

Backdoor Attack

baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling

1 code implementation NeurIPS 2021 Michael A. Alcorn, Anh Nguyen

Multi-agent spatiotemporal modeling is a challenging task from both an algorithmic design and computational complexity perspective.

Deep Learning Framework Applied for Predicting Anomaly of Respiratory Sounds

no code implementations 26 Dec 2020 Dat Ngo, Lam Pham, Anh Nguyen, Ben Phan, Khoa Tran, Truong Nguyen

This paper proposes a robust deep learning framework used for classifying anomaly of respiratory cycles.

A Vietnamese Dataset for Evaluating Machine Reading Comprehension

no code implementations COLING 2020 Kiet Nguyen, Vu Nguyen, Anh Nguyen, Ngan Nguyen

Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for evaluating MRC models on Vietnamese, a low-resource language.

Machine Reading Comprehension Question Answering +1

Input-Aware Dynamic Backdoor Attack

1 code implementation NeurIPS 2020 Anh Nguyen, Anh Tran

In recent years, neural backdoor attack has been considered to be a potential security threat to deep learning systems.

Backdoor Attack

Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

no code implementations 31 Jul 2020 Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran

In this work, we propose a multimodal fusion approach to address the problem of autonomous navigation in complex environments such as collapsed cities or natural caves.

Robotics

Are there any 'object detectors' in the hidden layers of CNNs trained to identify objects or scenes?

1 code implementation 2 Jul 2020 Ella M. Gale, Nicholas Martin, Ryan Blything, Anh Nguyen, Jeffrey S. Bowers

We find that the different measures provide different estimates of object selectivity, with precision and CCMAS measures providing misleadingly high estimates.

General Classification Image Classification +1

DeepEventMine: end-to-end neural nested event extraction from biomedical texts

1 code implementation 17 Jun 2020 Hai-Long Trieu, Thy Thy Tran, Khoa N A Duong, Anh Nguyen, Makoto Miwa, Sophia Ananiadou

Motivation: Recent neural approaches to event extraction from text mainly focus on flat events in the general domain, while there are fewer attempts to detect nested and overlapping events.

Sentence

The shape and simplicity biases of adversarially robust ImageNet-trained CNNs

1 code implementation 16 Jun 2020 Peijie Chen, Chirag Agarwal, Anh Nguyen

Increasingly more similarities between human vision and convolutional neural networks (CNNs) have been revealed in the past few years.

Image Generation

SAM: The Sensitivity of Attribution Methods to Hyperparameters

1 code implementation CVPR 2020 Naman Bansal, Chirag Agarwal, Anh Nguyen

Attribution methods can provide powerful insights into the reasons for a classifier's decision.

A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings

no code implementations 10 Oct 2019 Qi Li, Long Mai, Michael A. Alcorn, Anh Nguyen

Large, pre-trained generative models have been increasingly popular and useful to both the research and wider communities.

Model Editing

Explaining image classifiers by removing input features using generative models

1 code implementation 9 Oct 2019 Chirag Agarwal, Anh Nguyen

Perturbation-based explanation methods often measure the contribution of an input feature to an image classifier's outputs by heuristically removing it via e.g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.

counterfactual Object Localization

Removing input features via a generative model to explain their attributions to classifier's decisions

no code implementations 25 Sep 2019 Chirag Agarwal, Dan Schonfeld, Anh Nguyen

Interpretability methods often measure the contribution of an input feature to an image classifier's decisions by heuristically removing it via e.g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.

counterfactual

Understanding Neural Networks via Feature Visualization: A survey

1 code implementation 18 Apr 2019 Anh Nguyen, Jason Yosinski, Jeff Clune

A neuroscience method for understanding the brain is to find and study the preferred stimuli that highly activate an individual cell or groups of cells.

BIG-bench Machine Learning

V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation

no code implementations 23 Mar 2019 Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

We propose V2CNet, a new deep learning framework to automatically translate the demonstration videos to commands that can be directly used in robotic applications.

Scene Understanding for Autonomous Manipulation with Deep Learning

no code implementations 23 Mar 2019 Anh Nguyen

In this study, our long-term goal is to bridge the gap between computer vision and robotics by developing visual methods that can be used in real robots.

Action Understanding Affordance Detection +6

Strike (with) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects

1 code implementation CVPR 2019 Michael A. Alcorn, Qi Li, Zhitao Gong, Chengfei Wang, Long Mai, Wei-Shinn Ku, Anh Nguyen

Using our framework and a self-assembled dataset of 3D objects, we investigate the vulnerability of DNNs to OoD poses of well-known objects in ImageNet.

Improving Adversarial Robustness by Encouraging Discriminative Features

no code implementations 1 Nov 2018 Chirag Agarwal, Anh Nguyen, Dan Schonfeld

Intuitively, the center loss encourages DNNs to simultaneously learn a center for the deep features of each class, and minimize the distances between the intra-class deep features and their corresponding class centers.

Adversarial Robustness

Selectivity metrics can overestimate the selectivity of units: a case study on AlexNet

no code implementations 27 Sep 2018 Ella M. Gale, Anh Nguyen, Ryan Blything, Nicholas Martin, Jeffrey S. Bowers

These findings highlight the problem with current selectivity measures and show that new measures are required in order to provide a better assessment of learned representations in NNs.

VectorDefense: Vectorization as a Defense to Adversarial Examples

1 code implementation 23 Apr 2018 Vishaal Munusamy Kabilan, Brandon Morris, Anh Nguyen

Training deep neural networks on images represented as grids of pixels has brought to light an interesting phenomenon known as adversarial examples.

Object Captioning and Retrieval with Natural Language

1 code implementation 16 Mar 2018 Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object.

Object Retrieval

Spatial PixelCNN: Generating Images from Patches

no code implementations 3 Dec 2017 Nader Akoury, Anh Nguyen

In this paper we propose Spatial PixelCNN, a conditional autoregressive model that generates images from small patches.

Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks

no code implementations 1 Oct 2017 Anh Nguyen, Dimitrios Kanoulas, Luca Muratore, Darwin G. Caldwell, Nikos G. Tsagarakis

We present a new method to translate videos to commands for robotic manipulation using Deep Recurrent Neural Networks (RNN).

Translation

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

2 code implementations 21 Sep 2017 Thanh-Toan Do, Anh Nguyen, Ian Reid

We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images.

Affordance Detection Object +2

Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks

1 code implementation 22 Aug 2017 Anh Nguyen, Thanh-Toan Do, Darwin G. Caldwell, Nikos G. Tsagarakis

Our method first creates the event image from a list of events that occur in a very short time interval, then a Stacked Spatial LSTM Network (SP-LSTM) is used to learn the camera pose.

Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning

no code implementations 16 Mar 2017 Mohammed Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Ali Swanson, Meredith Palmer, Craig Packer, Jeff Clune

Having accurate, detailed, and up-to-date information about the location and behavior of animals in the wild would revolutionize our ability to study and conserve ecosystems.

Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space

1 code implementation CVPR 2017 Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski

PPGNs are composed of 1) a generator network G that is capable of drawing a wide range of image types and 2) a replaceable "condition" network C that tells the generator what to draw.

Image Captioning Image Inpainting

Synthesizing the preferred inputs for neurons in neural networks via deep generator networks

5 code implementations NeurIPS 2016 Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune

Understanding the inner workings of such computational brains is both fascinating basic science that is interesting in its own right - similar to why we study the human brain - and will enable researchers to further improve DNNs.

Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

no code implementations 11 Feb 2016 Anh Nguyen, Jason Yosinski, Jeff Clune

Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron.

Understanding Neural Networks Through Deep Visualization

7 code implementations 22 Jun 2015 Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, Hod Lipson

The first is a tool that visualizes the activations produced on each layer of a trained convnet as it processes an image or video (e.g. a live webcam stream).

Interpretable Machine Learning

Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images

2 code implementations CVPR 2015 Anh Nguyen, Jason Yosinski, Jeff Clune

Here we show a related result: it is easy to produce images that are completely unrecognizable to humans, but that state-of-the-art DNNs believe to be recognizable objects with 99.99% confidence (e.g. labeling with certainty that white noise static is a lion).

Evolutionary Algorithms
