Topic Models

210 papers with code • 6 benchmarks • 12 datasets

A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for the discovery of hidden semantic structures in a text body.

Libraries

Use these libraries to find Topic Models models and implementations

Most implemented papers

A Coefficient of Determination for Probabilistic Topic Models

TommyJones/tidylda 20 Nov 2019

This research proposes a new (old) metric for evaluating goodness of fit in topic models, the coefficient of determination, or $R^2$.

Cross-lingual Contextualized Topic Models with Zero-shot Learning

MilaNLProc/contextualized-topic-models EACL 2021

They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models.

Top2Vec: Distributed Representations of Topics

ddangelov/Top2Vec 19 Aug 2020

Distributed representations of documents and words have gained popularity due to their ability to capture semantics of words and documents.

Using Transformer based Ensemble Learning to classify Scientific Articles

SDPRA-2021/shared-task 19 Feb 2021

The first one is a RoBERTa [10] based model built over these abstracts.

Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence

ahoho/topics NeurIPS 2021

To address the standardization gap, we systematically evaluate a dominant classical model and two state-of-the-art neural models on two commonly used datasets.

Contrastive Learning for Neural Topic Model

nguyentthong/CLNTM NeurIPS 2021

Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample.

Improving Contextualized Topic Models with Negative Sampling

adhyasuman/ctmneg 27 Mar 2023

Topic modeling has emerged as a dominant method for exploring large document collections.

A Survey on Neural Topic Models: Methods, Applications, and Challenges

bobxwu/topmost 27 Jan 2024

In this paper, we present a comprehensive survey on neural topic models concerning methods, applications, and challenges.

Software Framework for Topic Modelling with Large Corpora

RaRe-Technologies/gensim Workshop On New Challenges For NLP Frameworks 2010

Large corpora are ubiquitous in today’s world and memory quickly becomes the limiting factor in practical applications of the Vector Space Model (VSM).

Supervised Topic Models

labixiaoK/lda NeurIPS 2007

We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents.