no code implementations • 16 Dec 2022 • Muriël de Groot, Mohammad Aliannejadi, Marcel R. Haas
We further analyze the performance of the HDBSCAN clustering algorithm utilized by BERTopic and find that it classifies a majority of the documents as outliers.