Data Summarization
33 papers with code • 0 benchmarks • 2 datasets
Data Summarization is a central problem in the area of machine learning, where we want to compute a small summary of the data.
Benchmarks
These leaderboards are used to track progress in Data Summarization
Libraries
Use these libraries to find Data Summarization models and implementationsLatest papers
An Online Algorithm for Nonparametric Correlations
This paper investigates the problem of computing nonparametric correlations on the fly for streaming data.
Scalable k-Means Clustering via Lightweight Coresets
As such, they have been successfully used to scale up clustering models to massive data sets.
Sequential Quantiles via Hermite Series Density Estimation
These algorithms go beyond existing sequential quantile estimation algorithms in that they allow arbitrary quantiles (as opposed to pre-specified quantiles) to be estimated at any point in time.