Search Results for author: Rui Xin

Found 9 papers, 8 papers with code

Language models scale reliably with over-training and on downstream tasks

1 code implementation • 13 Mar 2024 • Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G. Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt

We fit scaling laws that extrapolate in both the number of model parameters and the ratio of training tokens to parameters.

Language Modelling

Paper
Code

Optimal Sparse Survival Trees

1 code implementation • 27 Jan 2024 • Rui Zhang, Rui Xin, Margo Seltzer, Cynthia Rudin

Interpretability is crucial for doctors, hospitals, pharmaceutical companies and biotechnology corporations to analyze and make decisions for high stakes problems that involve human health.

Survival Analysis

Paper
Code

Inferior Alveolar Nerve Segmentation in CBCT images using Connectivity-Based Selective Re-training

1 code implementation • 18 Aug 2023 • Yusheng Liu, Rui Xin, Tao Yang, Lisheng Wang

Inferior Alveolar Nerve (IAN) canal detection in CBCT is an important step in many dental and maxillofacial surgery applications to prevent irreversible damage to the nerve during the procedure. The ToothFairy2023 Challenge aims to establish a 3D maxillofacial dataset consisting of all sparse labels and partial dense labels, and improve the ability of automatic IAN segmentation.

Paper
Code

Optimal Sparse Regression Trees

1 code implementation • 28 Nov 2022 • Rui Zhang, Rui Xin, Margo Seltzer, Cynthia Rudin

Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications.

Clustering regression

Paper
Code

TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization

3 code implementations • 19 Sep 2022 • Zijie J. Wang, Chudi Zhong, Rui Xin, Takuya Takagi, Zhi Chen, Duen Horng Chau, Cynthia Rudin, Margo Seltzer

Given thousands of equally accurate machine learning (ML) models, how can users choose among them?

369

Paper
Code

Exploring the Whole Rashomon Set of Sparse Decision Trees

2 code implementations • 16 Sep 2022 • Rui Xin, Chudi Zhong, Zhi Chen, Takuya Takagi, Margo Seltzer, Cynthia Rudin

We show three applications of the Rashomon set: 1) it can be used to study variable importance for the set of almost-optimal trees (as opposed to a single tree), 2) the Rashomon set for accuracy enables enumeration of the Rashomon sets for balanced accuracy and F1-score, and 3) the Rashomon set for a full dataset can be used to produce Rashomon sets constructed with only subsets of the data set.

125

Paper
Code

Materials Transformers Language Models for Generative Materials Design: a benchmark study

1 code implementation • 27 Jun 2022 • Nihang Fu, Lai Wei, Yuqi Song, Qinyang Li, Rui Xin, Sadman Sadeed Omee, Rongzhi Dong, Edirisuriya M. Dilanga Siriwardane, Jianjun Hu

We also find that the properties of the generated samples can be tailored by training the models with selected training sets such as high-bandgap materials.

Paper
Code

Multitask Learning for Citation Purpose Classification

no code implementations • NAACL (sdp) 2021 • Alex Oesterling, Angikar Ghosal, Haoyang Yu, Rui Xin, Yasa Baig, Lesia Semenova, Cynthia Rudin

The goal of the competition is to classify a citation in a scientific article based on its purpose.

Classification

Paper
Add Code

Active learning based generative design for the discovery of wide bandgap materials

2 code implementations • 28 Feb 2021 • Rui Xin, Edirisuriya M. D. Siriwardane, Yuqi Song, Yong Zhao, Steph-Yves Louis, Alireza Nasiri, Jianjun Hu

Our experiments show that while active learning itself may sample chemically infeasible candidates, these samples help to train effective screening models for filtering out materials with desired properties from the hypothetical materials created by the generative model.

Active Learning Band Gap

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.