Search Results for author: Tianxi Cai

Found 22 papers, 7 papers with code

Contrastive Learning on Multimodal Analysis of Electronic Health Records

no code implementations22 Mar 2024 Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou

To accommodate the statistical analysis of multimodal EHR data, in this paper, we propose a novel multimodal feature embedding generative model and design a multimodal contrastive loss to obtain the multimodal EHR feature representation.

Contrastive Learning Privacy Preserving +1

Inference of Dependency Knowledge Graph for Electronic Health Records

no code implementations25 Dec 2023 Zhiwei Xu, Ziming Gan, Doudou Zhou, Shuting Shen, Junwei Lu, Tianxi Cai

The effective analysis of high-dimensional Electronic Health Record (EHR) data, with substantial potential for healthcare research, presents notable methodological challenges.

feature selection

Distributionally Robust Transfer Learning

no code implementations12 Sep 2023 Xin Xiong, Zijian Guo, Tianxi Cai

Many existing transfer learning methods rely on leveraging information from source data that closely resembles the target data.

Transfer Learning

Knowledge Graph Embedding with Electronic Health Records Data via Latent Graphical Block Model

no code implementations31 May 2023 Junwei Lu, Jin Yin, Tianxi Cai

To overcome these challenges, we propose to infer the conditional dependency structure among EHR features via a latent graphical block model (LGBM).

Knowledge Graph Embedding

Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

no code implementations28 Sep 2022 Tianxi Cai, Dong Xia, Luwan Zhang, Doudou Zhou

Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects.

Graph Learning

Multimodal Learning on Graphs for Disease Relation Extraction

1 code implementation16 Mar 2022 Yucong Lin, Keming Lu, Sheng Yu, Tianxi Cai, Marinka Zitnik

On a dataset annotated by human experts, REMAP improves text-based disease relation extraction by 10. 0% (accuracy) and 17. 2% (F1-score) by fusing disease knowledge graphs with text information.

Knowledge Graphs Relation +1

Targeting Underrepresented Populations in Precision Medicine: A Federated Transfer Learning Approach

no code implementations27 Aug 2021 Sai Li, Tianxi Cai, Rui Duan

With only a small number of communications across participating sites, the proposed method can achieve performance comparable to the pooled analysis where individual-level data are directly pooled together.

Data Integration Transfer Learning

Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices

no code implementations21 May 2021 Doudou Zhou, Tianxi Cai, Junwei Lu

Besides, we prove the statistical rate for the eigenspace of the underlying matrix, which is comparable to the rate under the independently missing assumption.

Electrical Engineering Knowledge Graphs +2

Surrogate Assisted Semi-supervised Inference for High Dimensional Risk Prediction

no code implementations4 May 2021 Jue Hou, Zijian Guo, Tianxi Cai

Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors.

Genetic Risk Prediction Imputation +2

Semi-Supervised Off Policy Reinforcement Learning

no code implementations9 Dec 2020 Aaron Sonabend-W, Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee

2) The surrogate variables we leverage in the modified SSL framework are predictive of the outcome but not informative to the optimal policy or value function.

Imputation Q-Learning +2

Efficient Estimation and Evaluation of Prediction Rules in Semi-Supervised Settings under Stratified Sampling

1 code implementation19 Oct 2020 Jessica Gronsbell, Molei Liu, Lu Tian, Tianxi Cai

In step II, we augment the initial imputations to ensure the consistency of the resulting estimators regardless of the specification of the prediction model or the imputation model.

Imputation Missing Labels +1

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

2 code implementations NeurIPS 2020 Aaron Sonabend-W, Junwei Lu, Leo A. Celi, Tianxi Cai, Peter Szolovits

However, the adoption of such policies in practice is often challenging, as they are hard to interpret within the application context, and lack measures of uncertainty for the learned policy value and its decisions.

reinforcement-learning Reinforcement Learning (RL) +2

Multi-view Banded Spectral Clustering with Application to ICD9 Clustering

1 code implementation6 Apr 2018 Luwan Zhang, Katherine Liao, Issac Kohane, Tianxi Cai

To bridge this gap, in this paper we propose a novel spectral clustering method that optimally combines multiple data sources while leveraging the prior distance knowledge.

Clustering Community Detection +1

Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data

4 code implementations4 Apr 2018 Andrew L. Beam, Benjamin Kompa, Allen Schmaltz, Inbar Fried, Griffin Weber, Nathan P. Palmer, Xu Shi, Tianxi Cai, Isaac S. Kohane

Word embeddings are a popular approach to unsupervised learning of word relationships that are widely used in natural language processing.

Word Embeddings

A Fast Divide-and-Conquer Sparse Cox Regression

2 code implementations2 Apr 2018 Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, Isaac Kohane, Tianxi Cai

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$.

Computation Applications

Surrogate Aided Unsupervised Recovery of Sparse Signals in Single Index Models for Binary Outcomes

no code implementations18 Jan 2017 Abhishek Chakrabortty, Matey Neykov, Raymond Carroll, Tianxi Cai

We consider the recovery of regression coefficients, denoted by $\boldsymbol{\beta}_0$, for a single index model (SIM) relating a binary outcome $Y$ to a set of possibly high dimensional covariates $\boldsymbol{X}$, based on a large but 'unlabeled' dataset $\mathcal{U}$, with $Y$ never observed.

Efficient and Adaptive Linear Regression in Semi-Supervised Settings

no code implementations17 Jan 2017 Abhishek Chakrabortty, Tianxi Cai

It is often of interest to investigate if and when the unlabeled data can be exploited to improve estimation of the regression parameter in the adopted linear model.

Imputation regression

L1-Regularized Least Squares for Support Recovery of High Dimensional Single Index Models with Gaussian Designs

no code implementations25 Nov 2015 Matey Neykov, Jun S. Liu, Tianxi Cai

In the present paper we analyze algorithms based on covariance screening and least squares with $L_1$ penalization (i. e. LASSO) and demonstrate that they can also enjoy optimal (up to a scalar) rescaled sample size in terms of support recovery, albeit under slightly different assumptions on $f$ and $\varepsilon$ compared to the SIR based algorithms.

Structured Matrix Completion with Applications to Genomic Data Integration

no code implementations8 Apr 2015 Tianxi Cai, T. Tony Cai, Anru Zhang

Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics and electrical engineering.

Data Integration Electrical Engineering +1

NILE: Fast Natural Language Processing for Electronic Health Records

no code implementations23 Nov 2013 Sheng Yu, Tianrun Cai, Tianxi Cai

This paper introduces the design and performance of Narrative Information Linear Extraction (NILE), a natural language processing (NLP) package for EHR analysis that we share with the medical informatics community.

named-entity-recognition Named Entity Recognition +2

Cannot find the paper you are looking for? You can Submit a new open access paper.