Topic Models

210 papers with code • 6 benchmarks • 12 datasets

A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for the discovery of hidden semantic structures in a text body.

Benchmarks

Add a Result

These leaderboards are used to track progress in Topic Models

Dataset	Best Model	Compare
AG News	DeTiME	See all
20NewsGroups	vONTSS	See all
20 Newsgroups	Bayesian SMM	See all
Arxiv HEP-TH citation graph	JoSH	See all
NYT	JoSH	See all
AgNews	vONTSS	See all

Libraries

Use these libraries to find Topic Models models and implementations

mind-Lab/octis

3 papers

686

YongfeiYan/Neural-Document-Modeling

3 papers

ahoho/topics

3 papers

d2klab/tomodapi

3 papers

See all 5 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

A Coefficient of Determination for Probabilistic Topic Models

TommyJones/tidylda • 20 Nov 2019

This research proposes a new (old) metric for evaluating goodness of fit in topic models, the coefficient of determination, or $R^2$.

Paper
Code

Cross-lingual Contextualized Topic Models with Zero-shot Learning

MilaNLProc/contextualized-topic-models • • EACL 2021

They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models.

Paper
Code

Top2Vec: Distributed Representations of Topics

ddangelov/Top2Vec • • 19 Aug 2020

Distributed representations of documents and words have gained popularity due to their ability to capture semantics of words and documents.

Paper
Code

Using Transformer based Ensemble Learning to classify Scientific Articles

SDPRA-2021/shared-task • • 19 Feb 2021

The first one is a RoBERTa [10] based model built over these abstracts.

Paper
Code

Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence

ahoho/topics • NeurIPS 2021

To address the standardization gap, we systematically evaluate a dominant classical model and two state-of-the-art neural models on two commonly used datasets.

Paper
Code

Contrastive Learning for Neural Topic Model

nguyentthong/CLNTM • • NeurIPS 2021

Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample.

Paper
Code

Improving Contextualized Topic Models with Negative Sampling

adhyasuman/ctmneg • • 27 Mar 2023

Topic modeling has emerged as a dominant method for exploring large document collections.

Paper
Code

A Survey on Neural Topic Models: Methods, Applications, and Challenges

bobxwu/topmost • • 27 Jan 2024

In this paper, we present a comprehensive survey on neural topic models concerning methods, applications, and challenges.

Paper
Code

Software Framework for Topic Modelling with Large Corpora

RaRe-Technologies/gensim • Workshop On New Challenges For NLP Frameworks 2010

Large corpora are ubiquitous in today’s world and memory quickly becomes the limiting factor in practical applications of the Vector Space Model (VSM).

Paper
Code

Supervised Topic Models

labixiaoK/lda • NeurIPS 2007

We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents.

Paper
Code

Topic Models

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result