Data Summarization

33 papers with code • 0 benchmarks • 2 datasets

Data summarization is a central problem in machine learning: the goal is to compute a small summary of the data.

Source: How to Solve Fair k-Center in Massive Data Models

DiffRed: Dimensionality Reduction guided by stable rank

s3-lab-iit/diffred 9 Mar 2024

We rigorously prove that DiffRed achieves a general upper bound of $O\left(\sqrt{\frac{1-p}{k_2}}\right)$ on Stress and $O\left(\frac{1-p}{\sqrt{k_2 \cdot \rho(A^{*})}}\right)$ on M1, where $p$ is the fraction of variance explained by the first $k_1$ principal components and $\rho(A^{*})$ is the stable rank of $A^{*}$.

9
09 Mar 2024
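
To make the scaling of the DiffRed bounds above concrete, here is a minimal Python sketch that evaluates the two expressions inside the $O(\cdot)$ terms. It assumes the standard definition of stable rank (squared Frobenius norm divided by squared spectral norm) and takes singular values as input. The function names and the example numbers are hypothetical, and the constants hidden by the $O(\cdot)$ notation are omitted, so the outputs only indicate how the bounds change with $p$, $k_2$, and $\rho$.

```python
import math


def stable_rank(singular_values):
    """Stable rank of a matrix, given its singular values.

    Assumes the standard definition: ||A||_F^2 / ||A||_2^2, i.e. the sum of
    squared singular values divided by the largest squared singular value.
    """
    squares = [s * s for s in singular_values]
    return sum(squares) / max(squares)


def stress_bound_term(p, k2):
    """Expression inside the O(.) Stress bound: sqrt((1 - p) / k2)."""
    return math.sqrt((1.0 - p) / k2)


def m1_bound_term(p, k2, rho):
    """Expression inside the O(.) M1 bound: (1 - p) / sqrt(k2 * rho)."""
    return (1.0 - p) / math.sqrt(k2 * rho)


# Hypothetical inputs, chosen only for illustration.
rho = stable_rank([4.0, 2.0, 2.0, 1.0])   # (16 + 4 + 4 + 1) / 16
print(rho)
print(stress_bound_term(p=0.75, k2=4))
print(m1_bound_term(p=0.75, k2=4, rho=4.0))
```

Both terms shrink as $p$ approaches 1 or as $k_2$ grows; the M1 term also shrinks as the stable rank increases.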

Time-to-Pattern: Information-Theoretic Unsupervised Learning for Scalable Time Series Summarization

alirezaghods/t2p-time-to-pattern 26 Aug 2023

Data summarization is the process of generating interpretable and representative subsets from a dataset.

0
26 Aug 2023

ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

pranonrahman/chartsumm 26 Apr 2023

Automatic chart-to-text summarization is an effective tool for visually impaired people, and it also gives users precise insights into tabular data in natural language.

10
26 Apr 2023

MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering

huggingface/transformers 19 Dec 2022

Visual language data such as plots, charts, and infographics are ubiquitous in the human world.

123,180
19 Dec 2022

Black-box Coreset Variational Inference

facebookresearch/blackbox-coresets-vi 4 Nov 2022

Recent advances in coreset methods have shown that a selection of representative datapoints can replace massive volumes of data for Bayesian inference, preserving the relevant statistical information and significantly accelerating subsequent downstream tasks.

8
04 Nov 2022

Balancing Utility and Fairness in Submodular Maximization (Technical Report)

yhwang1990/code-bsm-release 2 Nov 2022

Submodular function maximization is a fundamental combinatorial optimization problem with many applications, including data summarization, influence maximization, and recommendation.

0
02 Nov 2022
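
As a rough illustration of submodular maximization in a summarization setting, the sketch below runs the classic greedy algorithm on a set-coverage objective under a cardinality constraint. This is the textbook greedy baseline, not the utility-fairness method of the paper above; the function name and the toy input are hypothetical.

```python
def greedy_max_coverage(sets, k):
    """Pick at most k sets that greedily maximize the number of covered items.

    Coverage is a monotone submodular function, so this greedy rule carries
    the well-known (1 - 1/e) approximation guarantee.
    Returns (indices of chosen sets, set of covered items).
    """
    selected = []
    covered = set()
    for _ in range(min(k, len(sets))):
        best, best_gain = None, 0
        for i, candidate in enumerate(sets):
            if i in selected:
                continue
            # Marginal gain: items this set adds beyond what is covered.
            gain = len(candidate - covered)
            if gain > best_gain:
                best, best_gain = i, gain
        if best is None:
            break  # no remaining set adds anything new
        selected.append(best)
        covered |= sets[best]
    return selected, covered


# Toy input: each set lists the items that one candidate "represents".
candidates = [{1, 2, 3}, {3, 4}, {4, 5, 6, 7}, {1}]
chosen, covered = greedy_max_coverage(candidates, k=2)
print(chosen, sorted(covered))
```

The marginal-gain computation is where submodularity matters: a set's gain can only shrink as more items become covered.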

Streaming Algorithms for Diversity Maximization with Fairness Constraints

yhwang1990/code-fdm 30 Jul 2022

Given a set $X$ of $n$ elements, diversity maximization asks to select a subset $S$ of $k \ll n$ elements with maximum diversity, as quantified by the dissimilarities among the elements in $S$.

0
30 Jul 2022
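
The selection problem described above can be sketched with a farthest-point greedy heuristic. This is a minimal offline example that assumes diversity is measured as the minimum pairwise Euclidean distance within the subset, which is one common choice; it does not handle fairness constraints or streaming input, and the function names and sample points are hypothetical.

```python
import math


def farthest_point_greedy(points, k):
    """Select up to k point indices by repeatedly adding the farthest point.

    dist[i] tracks the distance from point i to its nearest selected point;
    each round adds the point with the largest such distance.
    """
    if k <= 0 or not points:
        return []
    selected = [0]  # arbitrary starting point
    dist = [math.dist(points[0], p) for p in points]
    while len(selected) < min(k, len(points)):
        nxt = max(range(len(points)), key=lambda i: dist[i])
        if dist[nxt] == 0.0:
            break  # every remaining point coincides with a selected one
        selected.append(nxt)
        dist = [min(d, math.dist(points[nxt], p)) for d, p in zip(dist, points)]
    return selected


def min_pairwise_distance(points, indices):
    """Diversity of a subset, measured as its smallest pairwise distance."""
    return min(
        math.dist(points[a], points[b])
        for pos, a in enumerate(indices)
        for b in indices[pos + 1:]
    )


pts = [(0, 0), (1, 0), (10, 0), (5, 0)]
subset = farthest_point_greedy(pts, k=3)
print(subset, min_pairwise_distance(pts, subset))
```

For the max-min objective in a metric space, this greedy rule is a standard 2-approximation in the unconstrained case.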

Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data

neato47/neural-numeric-to-text-generation 11 Jul 2022

We examine recurrent, convolutional, and Transformer-based encoder-decoder models to automatically generate natural language summaries from numeric temporal personal health data.

0
11 Jul 2022

Group Equality in Adaptive Submodular Maximization

j-yuan/gequality 7 Jul 2022

In this paper, we study the classic submodular maximization problem subject to a group equality constraint under both non-adaptive and adaptive settings.

0
07 Jul 2022

Submodlib: A Submodular Optimization Library

decile-team/submodlib 22 Feb 2022

Recent work has also leveraged submodular functions to propose submodular information measures, which have proven very useful for guided subset selection and guided summarization.

76
22 Feb 2022