Search Results for author: Tianxi Cai

Found 22 papers, 7 papers with code

Contrastive Learning on Multimodal Analysis of Electronic Health Records

no code implementations • 22 Mar 2024 • Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou

To accommodate the statistical analysis of multimodal EHR data, in this paper, we propose a novel multimodal feature embedding generative model and design a multimodal contrastive loss to obtain the multimodal EHR feature representation.

Contrastive Learning Privacy Preserving +1

Paper
Add Code

Inference of Dependency Knowledge Graph for Electronic Health Records

no code implementations • 25 Dec 2023 • Zhiwei Xu, Ziming Gan, Doudou Zhou, Shuting Shen, Junwei Lu, Tianxi Cai

The effective analysis of high-dimensional Electronic Health Record (EHR) data, with substantial potential for healthcare research, presents notable methodological challenges.

feature selection

Paper
Add Code

Distributionally Robust Transfer Learning

no code implementations • 12 Sep 2023 • Xin Xiong, Zijian Guo, Tianxi Cai

Many existing transfer learning methods rely on leveraging information from source data that closely resembles the target data.

Transfer Learning

Paper
Add Code

Knowledge Graph Embedding with Electronic Health Records Data via Latent Graphical Block Model

no code implementations • 31 May 2023 • Junwei Lu, Jin Yin, Tianxi Cai

To overcome these challenges, we propose to infer the conditional dependency structure among EHR features via a latent graphical block model (LGBM).

Knowledge Graph Embedding

Paper
Add Code

LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic Health Records

1 code implementation • 19 May 2023 • Jun Wen, Jue Hou, Clara-Lea Bonzel, Yihan Zhao, Victor M. Castro, Vivian S. Gainer, Dana Weisenfeld, Tianrun Cai, Yuk-Lam Ho, Vidul A. Panickan, Lauren Costa, Chuan Hong, J. Michael Gaziano, Katherine P. Liao, Junwei Lu, Kelly Cho, Tianxi Cai

We propose a LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.

Representation Learning Unsupervised Pre-training

Paper
Code

Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

no code implementations • 28 Sep 2022 • Tianxi Cai, Dong Xia, Luwan Zhang, Doudou Zhou

Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects.

Graph Learning

Paper
Add Code

Federated Offline Reinforcement Learning

no code implementations • 11 Jun 2022 • Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai

Extensive simulations demonstrate the effectiveness of the proposed algorithm.

Offline RL Privacy Preserving +2

Paper
Add Code

Multimodal Learning on Graphs for Disease Relation Extraction

1 code implementation • 16 Mar 2022 • Yucong Lin, Keming Lu, Sheng Yu, Tianxi Cai, Marinka Zitnik

On a dataset annotated by human experts, REMAP improves text-based disease relation extraction by 10. 0% (accuracy) and 17. 2% (F1-score) by fusing disease knowledge graphs with text information.

Knowledge Graphs Relation +1

Paper
Code

Targeting Underrepresented Populations in Precision Medicine: A Federated Transfer Learning Approach

no code implementations • 27 Aug 2021 • Sai Li, Tianxi Cai, Rui Duan

With only a small number of communications across participating sites, the proposed method can achieve performance comparable to the pooled analysis where individual-level data are directly pooled together.

Data Integration Transfer Learning

Paper
Add Code

Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices

no code implementations • 21 May 2021 • Doudou Zhou, Tianxi Cai, Junwei Lu

Besides, we prove the statistical rate for the eigenspace of the underlying matrix, which is comparable to the rate under the independently missing assumption.

Electrical Engineering Knowledge Graphs +2

Paper
Add Code

Surrogate Assisted Semi-supervised Inference for High Dimensional Risk Prediction

no code implementations • 4 May 2021 • Jue Hou, Zijian Guo, Tianxi Cai

Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors.

Genetic Risk Prediction Imputation +2

Paper
Add Code

Semi-Supervised Off Policy Reinforcement Learning

no code implementations • 9 Dec 2020 • Aaron Sonabend-W, Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee

2) The surrogate variables we leverage in the modified SSL framework are predictive of the outcome but not informative to the optimal policy or value function.

Imputation Q-Learning +2

Paper
Add Code

Efficient Estimation and Evaluation of Prediction Rules in Semi-Supervised Settings under Stratified Sampling

1 code implementation • 19 Oct 2020 • Jessica Gronsbell, Molei Liu, Lu Tian, Tianxi Cai

In step II, we augment the initial imputations to ensure the consistency of the resulting estimators regardless of the specification of the prediction model or the imputation model.

Imputation Missing Labels +1

Paper
Code

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

2 code implementations • NeurIPS 2020 • Aaron Sonabend-W, Junwei Lu, Leo A. Celi, Tianxi Cai, Peter Szolovits

However, the adoption of such policies in practice is often challenging, as they are hard to interpret within the application context, and lack measures of uncertainty for the learned policy value and its decisions.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Code

Multi-view Banded Spectral Clustering with Application to ICD9 Clustering

1 code implementation • 6 Apr 2018 • Luwan Zhang, Katherine Liao, Issac Kohane, Tianxi Cai

To bridge this gap, in this paper we propose a novel spectral clustering method that optimally combines multiple data sources while leveraging the prior distance knowledge.

Clustering Community Detection +1

Paper
Code

Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data

4 code implementations • 4 Apr 2018 • Andrew L. Beam, Benjamin Kompa, Allen Schmaltz, Inbar Fried, Griffin Weber, Nathan P. Palmer, Xu Shi, Tianxi Cai, Isaac S. Kohane

Word embeddings are a popular approach to unsupervised learning of word relationships that are widely used in natural language processing.

Word Embeddings

Paper
Code

A Fast Divide-and-Conquer Sparse Cox Regression

2 code implementations • 2 Apr 2018 • Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, Isaac Kohane, Tianxi Cai

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$.

Computation Applications

Paper
Code

Surrogate Aided Unsupervised Recovery of Sparse Signals in Single Index Models for Binary Outcomes

no code implementations • 18 Jan 2017 • Abhishek Chakrabortty, Matey Neykov, Raymond Carroll, Tianxi Cai

We consider the recovery of regression coefficients, denoted by $\boldsymbol{\beta}_0$, for a single index model (SIM) relating a binary outcome $Y$ to a set of possibly high dimensional covariates $\boldsymbol{X}$, based on a large but 'unlabeled' dataset $\mathcal{U}$, with $Y$ never observed.

Paper
Add Code

Efficient and Adaptive Linear Regression in Semi-Supervised Settings

no code implementations • 17 Jan 2017 • Abhishek Chakrabortty, Tianxi Cai

It is often of interest to investigate if and when the unlabeled data can be exploited to improve estimation of the regression parameter in the adopted linear model.

Imputation regression

Paper
Add Code

L1-Regularized Least Squares for Support Recovery of High Dimensional Single Index Models with Gaussian Designs

no code implementations • 25 Nov 2015 • Matey Neykov, Jun S. Liu, Tianxi Cai

In the present paper we analyze algorithms based on covariance screening and least squares with $L_1$ penalization (i. e. LASSO) and demonstrate that they can also enjoy optimal (up to a scalar) rescaled sample size in terms of support recovery, albeit under slightly different assumptions on $f$ and $\varepsilon$ compared to the SIR based algorithms.

Paper
Add Code

Structured Matrix Completion with Applications to Genomic Data Integration

no code implementations • 8 Apr 2015 • Tianxi Cai, T. Tony Cai, Anru Zhang

Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics and electrical engineering.

Data Integration Electrical Engineering +1

Paper
Add Code

NILE: Fast Natural Language Processing for Electronic Health Records

no code implementations • 23 Nov 2013 • Sheng Yu, Tianrun Cai, Tianxi Cai

This paper introduces the design and performance of Narrative Information Linear Extraction (NILE), a natural language processing (NLP) package for EHR analysis that we share with the medical informatics community.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.