Getting the ##life out of living: How Adequate Are Word-Pieces for Modelling Complex Morphology?

WS 2020 · Stav Klein, Reut Tsarfaty

This work investigates the most basic units that underlie contextualized word embeddings such as BERT: the so-called word pieces. In Morphologically-Rich Languages (MRLs), which exhibit morphological fusion and non-concatenative morphology, the different units of meaning within a word may be fused or intertwined, and cannot be separated linearly...
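To make the notion of a word piece concrete, here is a minimal sketch of greedy longest-match-first segmentation, the scheme commonly used by BERT-style tokenizers. It is an illustration only, not the authors' code: the function name `wordpiece_tokenize` and the tiny `toy_vocab` are assumptions invented for this example, and a real vocabulary is learned from a corpus.

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    """Split one word into word pieces by greedy longest-match-first lookup.

    Hypothetical helper for illustration; non-initial pieces carry the
    conventional "##" continuation prefix.
    """
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, then shrink from the right.
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece covers this position: the whole word becomes unknown.
            return [unk]
        pieces.append(match)
        start = end
    return pieces


# Toy vocabulary (an assumption for this sketch, not taken from any real model).
toy_vocab = {"liv", "live", "life", "##ing", "##e", "##life"}

print(wordpiece_tokenize("living", toy_vocab))  # ['liv', '##ing']
```

Every piece returned is a contiguous substring of the surface form, read left to right. That linearity is the property the abstract calls into question for fused and non-concatenative morphology.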



