TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	EXTRA DATA	REMOVE
Constituency Parsing	Penn Treebank	Self-training	F1 score	92.1	# 23

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/effective-self-training-for-parsing/constituency-parsing-on-penn-treebank)](https://paperswithcode.com/sota/constituency-parsing-on-penn-treebank?p=effective-self-training-for-parsing)`

Effective Self-Training for Parsing

NAACL 2006 · David McClosky, Eugene Charniak, and Mark Johnson ·

We present a simple, but surprisingly effective, method of self-training a two-phase parser-reranker system using readily available unlabeled data. We show that this type of bootstrapping is possible for parsing when the bootstrapped parses are processed by a discriminative reranker. Our improved model achieves an f-score of 92.1%, an absolute 1.1% improvement (12% error reduction) over the previous best result for Wall Street Journal parsing. Finally, we provide some analysis to better understand the phenomenon.

PDF Abstract