Topic Models

210 papers with code • 6 benchmarks • 12 datasets

A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for the discovery of hidden semantic structures in a text body.

Libraries

Use these libraries to find Topic Models models and implementations

Most implemented papers

Latent Dirichlet Allocation

vrjkmr/arxiv-topic 1 Jan 2003

Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities.

Learning Topic Models - Going beyond SVD

sc782/pyJSMF-RAW 9 Apr 2012

Topic Modeling is an approach used for automatic comprehension and classification of data in a variety of settings, and perhaps the canonical application is in uncovering thematic structure in a corpus of documents.

Stochastic Variational Inference

dfm/arxiv-analysis 29 Jun 2012

We develop stochastic variational inference, a scalable algorithm for approximating posterior distributions.

A Practical Algorithm for Topic Modeling with Provable Guarantees

sc782/pyJSMF-RAW 19 Dec 2012

Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora.

Partial Membership Latent Dirichlet Allocation

TigerSense/PMLDA 9 Nov 2015

Topic models (e. g., pLSA, LDA, SLDA) have been widely used for segmenting imagery.

Partial Membership Latent Dirichlet Allocation

TigerSense/PMLDA 28 Dec 2016

Topic models (e. g., pLSA, LDA, sLDA) have been widely used for segmenting imagery.

Topic Modeling based on Keywords and Context

JohnTailor/tkm 7 Oct 2017

Current topic models often suffer from discovering topics not matching human intuition, unnatural switching of topics within documents and high computational demands.

Coherence-Aware Neural Topic Modeling

YongfeiYan/Neural-Document-Modeling EMNLP 2018

Topic models are evaluated based on their ability to describe documents well (i. e. low perplexity) and to produce topics that carry coherent semantic meaning.

Dirichlet belief networks for topic structure learning

ethanhezhao/DirBN NeurIPS 2018

Recently, considerable research effort has been devoted to developing deep architectures for topic models to learn topic structures.

Learning document embeddings along with their uncertainties

skesiraju/BaySMM 20 Aug 2019

We present Bayesian subspace multinomial model (Bayesian SMM), a generative log-linear model that learns to represent documents in the form of Gaussian distributions, thereby encoding the uncertainty in its co-variance.