Search Results for author: Feiyang Kang

Found 4 papers, 1 papers with code

FASTTRACK: Fast and Accurate Fact Tracing for LLMs

no code implementations22 Apr 2024 Si Chen, Feiyang Kang, Ning Yu, Ruoxi Jia

Existing approaches to fact tracing rely on assessing the similarity between each training sample and the query along a certain dimension, such as lexical similarity, gradient, or embedding space.

The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes

no code implementations14 Feb 2024 Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, Ruoxi Jia

Inspired by this, we introduce a new method for estimating the influence of training data, which requires calculating gradients for specific test samples, paired with a forward pass for each training point.

Memorization

Data Acquisition: A New Frontier in Data-centric AI

no code implementations22 Nov 2023 Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia, James Zou

As Machine Learning (ML) systems continue to grow, the demand for relevant and comprehensive datasets becomes imperative.

LAVA: Data Valuation without Pre-Specified Learning Algorithms

1 code implementation28 Apr 2023 Hoang Anh Just, Feiyang Kang, Jiachen T. Wang, Yi Zeng, Myeongseob Ko, Ming Jin, Ruoxi Jia

(1) We develop a proxy for the validation performance associated with a training set based on a non-conventional class-wise Wasserstein distance between training and validation sets.

Data Valuation

Cannot find the paper you are looking for? You can Submit a new open access paper.