Search Results for author: Xiaoqian Jiang

Found 42 papers, 7 papers with code

GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console

1 code implementation4 Apr 2024 Anindita Nath, Savannah Mwesigwa, Yulin Dai, Xiaoqian Jiang, Zhongming Zhao

Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge.

De-identification is not always enough

no code implementations31 Jan 2024 Atiquer Rahman Sarkar, Yao-Shun Chuang, Noman Mohammed, Xiaoqian Jiang

In this work, we demonstrated that (i) de-identification of real clinical notes does not protect records against a membership inference attack, (ii) proposed a novel approach to generate synthetic clinical notes using the current state-of-the-art large language models, (iii) evaluated the performance of the synthetically generated notes in a clinical domain task, and (iv) proposed a way to mount a membership inference attack where the target model is trained with synthetic data.

De-identification Inference Attack +2

Extracting periodontitis diagnosis in clinical notes with RoBERTa and regular expression

no code implementations17 Nov 2023 Yao-Shun Chuang, Chun-Teh Lee, Ryan Brandon, Trung Duong Tran, Oluwabunmi Tokede, Muhammad F. Walji, Xiaoqian Jiang

This study aimed to utilize text processing and natural language processing (NLP) models to mine clinical notes for the diagnosis of periodontitis and to evaluate the performance of a named entity recognition (NER) model on different regular expression (RE) methods.

named-entity-recognition Named Entity Recognition +1

FERI: A Multitask-based Fairness Achieving Algorithm with Applications to Fair Organ Transplantation

no code implementations20 Oct 2023 Can Li, Dejian Lai, Xiaoqian Jiang, Kai Zhang

Liver transplantation often faces fairness challenges across subgroups defined by sensitive attributes like age group, gender, and race/ethnicity.

Fairness

DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research

1 code implementation4 Sep 2023 Yu-Neng Chuang, Guanchu Wang, Chia-Yuan Chang, Kwei-Herng Lai, Daochen Zha, Ruixiang Tang, Fan Yang, Alfredo Costilla Reyes, Kaixiong Zhou, Xiaoqian Jiang, Xia Hu

The exponential growth in scholarly publications necessitates advanced tools for efficient article retrieval, especially in interdisciplinary fields where diverse terminologies are used to describe similar research.

named-entity-recognition Named Entity Recognition +5

Sensitive Data Detection with High-Throughput Machine Learning Models in Electrical Health Records

no code implementations30 Apr 2023 Kai Zhang, Xiaoqian Jiang

Based on this novel finding, we engineered over 30 features from the metadata of the original features and used machine learning to build classification models to automatically identify PHI fields in structured Electronic Health Record (EHR) data.

De-identification

Predicting multiple sclerosis disease severity with multimodal deep neural networks

1 code implementation8 Apr 2023 Kai Zhang, John A. Lincoln, Xiaoqian Jiang, Elmer V. Bernstam, Shayan Shams

Multiple Sclerosis (MS) is a chronic disease developed in human brain and spinal cord, which can cause permanent damage or deterioration of the nerves.

Disease Prediction

Large Language Models for Healthcare Data Augmentation: An Example on Patient-Trial Matching

no code implementations24 Mar 2023 Jiayi Yuan, Ruixiang Tang, Xiaoqian Jiang, Xia Hu

The process of matching patients with suitable clinical trials is essential for advancing medical research and providing optimal care.

Data Augmentation Text Generation

SPeC: A Soft Prompt-Based Calibration on Performance Variability of Large Language Model in Clinical Notes Summarization

no code implementations23 Mar 2023 Yu-Neng Chuang, Ruixiang Tang, Xiaoqian Jiang, Xia Hu

Electronic health records (EHRs) store an extensive array of patient information, encompassing medical histories, diagnoses, treatments, and test outcomes.

Language Modelling Large Language Model

Does Synthetic Data Generation of LLMs Help Clinical Text Mining?

no code implementations8 Mar 2023 Ruixiang Tang, Xiaotian Han, Xiaoqian Jiang, Xia Hu

Our method has resulted in significant improvements in the performance of downstream tasks, improving the F1-score from 23. 37% to 63. 99% for the named entity recognition task and from 75. 86% to 83. 59% for the relation extraction task.

Code Generation named-entity-recognition +5

Fairly Predicting Graft Failure in Liver Transplant for Organ Assigning

no code implementations18 Feb 2023 Sirui Ding, Ruixiang Tang, Daochen Zha, Na Zou, Kai Zhang, Xiaoqian Jiang, Xia Hu

To tackle this problem, this work proposes a fair machine learning framework targeting graft failure prediction in liver transplant.

Fairness Knowledge Distillation

Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy Labels

no code implementations3 Nov 2022 Qiuchen Zhang, Jing Ma, Jian Lou, Li Xiong, Xiaoqian Jiang

PATE combines an ensemble of "teacher models" trained on sensitive data and transfers the knowledge to a "student" model through the noisy aggregation of teachers' votes for labeling unlabeled public data which the student model will be trained on.

Transfer Learning

MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning

no code implementations1 Aug 2022 Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Bhavani

By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks.

Mortality Prediction Multi-Task Learning +1

Scalable Causal Structure Learning: Scoping Review of Traditional and Deep Learning Algorithms and New Opportunities in Biomedicine

no code implementations15 Oct 2021 Pulakesh Upadhyaya, Kai Zhang, Can Li, Xiaoqian Jiang, Yejin Kim

Causal structure learning refers to a process of identifying causal structures from observational data, and it can have multiple applications in biomedicine and health care.

BIG-bench Machine Learning Causal Discovery +1

Federated Learning Algorithms for Generalized Mixed-effects Model (GLMM) on Horizontally Partitioned Data from Distributed Sources

no code implementations28 Sep 2021 Wentao Li, Jiayi Tong, Md. Monowar Anjum, Noman Mohammed, Yong Chen, Xiaoqian Jiang

Objectives: This paper develops two algorithms to achieve federated generalized linear mixed effect models (GLMM), and compares the developed model's outcomes with each other, as well as that from the standard R package (`lme4').

Federated Learning

Use of the Deep Learning Approach to Measure Alveolar Bone Level

no code implementations24 Sep 2021 Chun-Teh Lee, Tanjida Kabir, Jiman Nelson, Sally Sheng, Hsiu-Wan Meng, Thomas E. Van Dyke, Muhammad F. Walji, Xiaoqian Jiang, Shayan Shams

Conclusion: The proposed DL model provides reliable RBL measurements and image-based periodontal diagnosis using periapical radiographic images.

A Fast PC Algorithm with Reversed-order Pruning and A Parallelization Strategy

no code implementations10 Sep 2021 Kai Zhang, Chao Tian, Kun Zhang, Todd Johnson, Xiaoqian Jiang

The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data.

An Empirical Study of UMLS Concept Extraction from Clinical Notes using Boolean Combination Ensembles

no code implementations4 Aug 2021 Greg M. Silverman, Raymond L. Finzel, Michael V. Heinz, Jake Vasilakes, Jacob C. Solinsky, Reed McEwan, Benjamin C. Knoll, Christopher J. Tignanelli, Hongfang Liu, Hua Xu, Xiaoqian Jiang, Genevieve B. Melton, Serguei VS Pakhomov

Our objective in this study is to investigate the behavior of Boolean operators on combining annotation output from multiple Natural Language Processing (NLP) systems across multiple corpora and to assess how filtering by aggregation of Unified Medical Language System (UMLS) Metathesaurus concepts affects system performance for Named Entity Recognition (NER) of UMLS concepts.

named-entity-recognition Named Entity Recognition +1

Relational graph convolutional networks for predicting blood-brain barrier penetration of drug molecules

1 code implementation4 Jul 2021 Yan Ding, Xiaoqian Jiang, Yejin Kim

The RGCN model achieved an overall accuracy of 0. 872, an AUROC of 0. 919 and an AUPRC of 0. 838 for the testing dataset with the drug-protein interactions and the Mordred descriptors as the input.

COVID-19 Smart Chatbot Prototype for Patient Monitoring

no code implementations11 Mar 2021 Hannah Lei, Weiqi Lu, Alan Ji, Emmett Bertram, Paul Gao, Xiaoqian Jiang, Arko Barman

Many COVID-19 patients developed prolonged symptoms after the infection, including fatigue, delirium, and headache.

BIG-bench Machine Learning Chatbot

Real-time Prediction for Mechanical Ventilation in COVID-19 Patients using A Multi-task Gaussian Process Multi-objective Self-attention Network

no code implementations1 Feb 2021 Kai Zhang, Siddharth Karanth, Bela Patel, Robert Murphy, Xiaoqian Jiang

We propose a novel in-time risk trajectory predictive model to handle the irregular sampling rate in the data, which follows the dynamics of risk of performing mechanical ventilation for individual patients.

Trajectory Prediction

Drug repurposing for COVID-19 using graph neural network and harmonizing multiple evidence

no code implementations23 Sep 2020 Kanglin Hsieh, Yinyin Wang, Luyao Chen, Zhongming Zhao, Sean Savitz, Xiaoqian Jiang, Jing Tang, Yejin Kim

In summary, we demonstrated that the integration of extensive interactions, deep neural networks, and rigorous validation can facilitate the rapid identification of candidate drugs for COVID-19 treatment.

Dataset and Enhanced Model for Eligibility Criteria-to-SQL Semantic Parsing

no code implementations LREC 2020 Xiaojing Yu, Tianlong Chen, Zhengjie Yu, Huiyu Li, Yang Yang, Xiaoqian Jiang, Anxiao Jiang

Compared to existing datasets, the queries in the dataset here are derived from the eligibility criteria of clinical trials and include \textit{Order-sensitive, Counting-based, and Boolean-type} cases which are not seen before.

Semantic Parsing Text-To-SQL

Privacy-Preserving Tensor Factorization for Collaborative Health Data Analysis

no code implementations26 Aug 2019 Jing Ma, Qiuchen Zhang, Jian Lou, Joyce C. Ho, Li Xiong, Xiaoqian Jiang

We propose DPFact, a privacy-preserving collaborative tensor factorization method for computational phenotyping using EHR.

Computational Phenotyping Privacy Preserving

From Brain Imaging to Graph Analysis: a study on ADNI's patient cohort

no code implementations14 May 2019 Rui Zhang, Luca Giancardo, Danilo A. Pena, Yejin Kim, Hanghang Tong, Xiaoqian Jiang

In this paper, we studied the association between the change of structural brain volumes to the potential development of Alzheimer's disease (AD).

feature selection General Classification +1

Discriminative Sleep Patterns of Alzheimer's Disease via Tensor Factorization

no code implementations14 May 2019 Yejin Kim, Xiaoqian Jiang, Luyao Chen, Xiaojin Li, Licong Cui

Sleep change is commonly reported in Alzheimer's disease (AD) patients and their brain wave studies show decrease in dreaming and non-dreaming stages.

EEG

Combining Representation Learning with Tensor Factorization for Risk Factor Analysis - an application to Epilepsy and Alzheimer's disease

no code implementations14 May 2019 Xiaoqian Jiang, Samden Lhatoo, Guo-Qiang Zhang, Luyao Chen, Yejin Kim

Existing studies consider Alzheimer's disease (AD) a comorbidity of epilepsy, but also recognize epilepsy to occur more frequently in patients with AD than those without.

Representation Learning

Model-Protected Multi-Task Learning

1 code implementation18 Sep 2018 Jian Liang, Ziqi Liu, Jiayu Zhou, Xiaoqian Jiang, Chang-Shui Zhang, Fei Wang

Multi-task learning (MTL) refers to the paradigm of learning multiple related tasks together.

Multi-Task Learning Privacy Preserving

Federated Tensor Factorization for Computational Phenotyping

no code implementations11 Apr 2017 Yejin Kim, Jimeng Sun, Hwanjo Yu, Xiaoqian Jiang

In this paper, we developed a novel solution to enable federated tensor factorization for computational phenotyping without sharing patient-level data.

Computational Phenotyping

SAFETY: Secure gwAs in Federated Environment Through a hYbrid solution with Intel SGX and Homomorphic Encryption

1 code implementation7 Mar 2017 Md Nazmus Sadat, Md Momin Al Aziz, Noman Mohammed, Feng Chen, Shuang Wang, Xiaoqian Jiang

In this article, we present SAFETY, a hybrid framework, which can securely perform GWAS on federated genomic datasets using homomorphic encryption and recently introduced secure hardware component of Intel Software Guard Extensions (Intel SGX) to ensure high efficiency and privacy at the same time.

Cryptography and Security

Cannot find the paper you are looking for? You can Submit a new open access paper.