no code implementations • NAACL 2019 • Tim vor der Br{\"u}ck, Marc Pouly
The prevalent way to estimate the similarity of two documents based on word embeddings is to apply the cosine similarity measure to the two centroids obtained from the embedding vectors associated with the words in each document.
no code implementations • LREC 2016 • Tim vor der Br{\"u}ck, Alex Mehler, er
We present a morphological tagger for Latin, called TTLab Latin Tagger based on Conditional Random Fields (TLT-CRF) which uses a large Latin lexicon.
no code implementations • LREC 2014 • Tim vor der Br{\"u}ck, Alex Mehler, er, Zahurul Islam
The paper describes a procedure for the automatic generation of a large full-form lexicon of English.