Constituency Parsing

73 papers with code • 4 benchmarks • 6 datasets

Constituency parsing aims to extract from a sentence a constituency-based parse tree that represents its syntactic structure according to a phrase structure grammar.

Example:

             Sentence (S)
                 |
   +-------------+------------+
   |                          |
 Noun (N)                Verb Phrase (VP)
   |                          |
 John                 +-------+--------+
                      |                |
                    Verb (V)         Noun (N)
                      |                |
                    sees              Bill

Recent approaches convert the parse tree into a sequence following a depth-first traversal so that sequence-to-sequence models can be applied. The linearized version of the above parse tree (with the terminal words omitted) looks as follows: (S (N) (VP (V) (N))).
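The depth-first linearization above can be sketched with a small recursive helper. The `Tree` class and `linearize` function here are illustrative, not taken from any particular parser; terminal words are dropped, matching the linearization shown in the text.

```python
class Tree:
    """A minimal constituency-tree node: a label plus child subtrees."""
    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)

def linearize(tree):
    """Bracketed, depth-first linearization of a constituency tree."""
    if not tree.children:  # pre-terminal with its word omitted, e.g. (N)
        return f"({tree.label})"
    inner = " ".join(linearize(child) for child in tree.children)
    return f"({tree.label} {inner})"

# The example tree for "John sees Bill" from above.
tree = Tree("S", [Tree("N"), Tree("VP", [Tree("V"), Tree("N")])])
print(linearize(tree))  # (S (N) (VP (V) (N)))
```

A sequence-to-sequence model can then be trained to emit this bracketed string token by token, and the tree is recovered by matching parentheses.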

Most implemented papers

Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference

nikitakit/tetra-tagging ACL 2020

We present a constituency parsing algorithm that, like a supertagger, works by assigning labels to each word in a sentence.

Rethinking Self-Attention: Towards Interpretability in Neural Parsing

KhalilMrini/LAL-Parser Findings of the Association for Computational Linguistics 2020

Finally, we find that the Label Attention heads learn relations between syntactic categories and show pathways to analyze errors.

Fast and Accurate Neural CRF Constituency Parsing

yzhangcs/crfpar IJCAI 2020

Estimating probability distribution is one of the core issues in the NLP field.

StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling

google-research/google-research ACL 2021

There are two major classes of natural language grammar -- dependency grammar, which models one-to-one correspondences between words, and constituency grammar, which models the assembly of one or several corresponding words.

Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning

epfl-dlab/gcd 23 May 2023

In this work, we demonstrate that formal grammars can describe the output space for a much wider range of tasks and argue that GCD can serve as a unified framework for structured NLP tasks in general.

Effective Self-Training for Parsing

BLLIP/bllip-parser NAACL 2006

We present a simple, but surprisingly effective, method of self-training a two-phase parser-reranker system using readily available unlabeled data.