Keyword Extraction

25 papers with code • 3 benchmarks • 5 datasets

Keyword extraction is tasked with the automatic identification of terms that best describe the subject of a document (Source: Wikipedia).

Latest papers with no code

Keyword Extraction in Scientific Documents

no code yet • 5 Jul 2022

The scientific publication output grows exponentially.

Unsupervised Learning Algorithms for Keyword Extraction in an Undergraduate Thesis

no code yet • 23 Jun 2022

The amount of data managed in many academic institutions has increased in recent years, particularly in all the research work done by undergraduate students, who simply use empirical techniques for keyword selection, forgetting existing technical methods to assist their students in this process.

Born for Auto-Tagging: Faster and better with new objective functions

no code yet • 15 Jun 2022

The strength of BAT converges faster and better than other SOTA models, as its 4-layer structure achieves the best F scores at 50 epochs.

Using virtual edges to extract keywords from texts modeled as complex networks

no code yet • 4 May 2022

Detecting keywords in texts is important for many text mining applications.

Cross-view Brain Decoding

no code yet • 18 Apr 2022

Also, the decoded representations are sufficiently detailed to enable high accuracy for cross-view-translation tasks with following pairwise accuracy: IC (78. 0), IT (83. 0), KE (83. 7) and SF (74. 5).

Semantic Similarity Computing for Scientific Academic Conferences fused with domain features

no code yet • 21 Mar 2022

Aiming at the problem that the current general-purpose semantic text similarity calculation methods are difficult to use the semantic information of scientific academic conference data, a semantic similarity calculation algorithm for scientific academic conferences by fusion with domain features is proposed.

Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised?

no code yet • LREC 2022

We find that the pretrained models fine-tuned on a multilingual corpus covering languages that do not appear in the test set (i. e. in a zero-shot setting), consistently outscore unsupervised models in all six languages.

Open Domain Response Generation Guided by Retrieved Conversations

no code yet • ACL ARR January 2022

Open domain response generation is the task of creating a response givena user query in any topics/domain.

Coherence-Based Document Clustering

no code yet • 29 Sep 2021

While these algorithms differ in their modeling approach, they have in common that hyperparameter optimization is difficult and is mainly achieved by maximizing the extracted topic coherence scores via a grid search.

Explainable Point-Based Document Visualizations

no code yet • 28 Sep 2021

Popular techniques to construct data maps are t-SNE and UMAP.