Topic Models

210 papers with code • 6 benchmarks • 12 datasets

A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for the discovery of hidden semantic structures in a text body.

Benchmarks

Add a Result

These leaderboards are used to track progress in Topic Models

Dataset	Best Model	Compare
AG News	DeTiME	See all
20NewsGroups	vONTSS	See all
20 Newsgroups	Bayesian SMM	See all
Arxiv HEP-TH citation graph	JoSH	See all
NYT	JoSH	See all
AgNews	vONTSS	See all

Libraries

Use these libraries to find Topic Models models and implementations

mind-Lab/octis

3 papers

681

YongfeiYan/Neural-Document-Modeling

3 papers

ahoho/topics

3 papers

d2klab/tomodapi

3 papers

See all 5 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Latent Dirichlet Allocation

vrjkmr/arxiv-topic • 1 Jan 2003

Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities.

Paper
Code

Learning Topic Models - Going beyond SVD

sc782/pyJSMF-RAW • 9 Apr 2012

Topic Modeling is an approach used for automatic comprehension and classification of data in a variety of settings, and perhaps the canonical application is in uncovering thematic structure in a corpus of documents.

Paper
Code

Stochastic Variational Inference

dfm/arxiv-analysis • 29 Jun 2012

We develop stochastic variational inference, a scalable algorithm for approximating posterior distributions.

Paper
Code

A Practical Algorithm for Topic Modeling with Provable Guarantees

sc782/pyJSMF-RAW • 19 Dec 2012

Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora.

Paper
Code

Partial Membership Latent Dirichlet Allocation

TigerSense/PMLDA • 9 Nov 2015

Topic models (e. g., pLSA, LDA, SLDA) have been widely used for segmenting imagery.

Paper
Code

Partial Membership Latent Dirichlet Allocation

TigerSense/PMLDA • 28 Dec 2016

Topic models (e. g., pLSA, LDA, sLDA) have been widely used for segmenting imagery.

Paper
Code

Topic Modeling based on Keywords and Context

JohnTailor/tkm • 7 Oct 2017

Current topic models often suffer from discovering topics not matching human intuition, unnatural switching of topics within documents and high computational demands.

Paper
Code

Coherence-Aware Neural Topic Modeling

YongfeiYan/Neural-Document-Modeling • • EMNLP 2018

Topic models are evaluated based on their ability to describe documents well (i. e. low perplexity) and to produce topics that carry coherent semantic meaning.

Paper
Code

Dirichlet belief networks for topic structure learning

ethanhezhao/DirBN • NeurIPS 2018

Recently, considerable research effort has been devoted to developing deep architectures for topic models to learn topic structures.

Paper
Code

Learning document embeddings along with their uncertainties

skesiraju/BaySMM • • 20 Aug 2019

We present Bayesian subspace multinomial model (Bayesian SMM), a generative log-linear model that learns to represent documents in the form of Gaussian distributions, thereby encoding the uncertainty in its co-variance.

Paper
Code

Topic Models

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result