Ensemble Distillation for Unsupervised Constituency Parsing

3 Oct 2023  ·  Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie C. K. Cheung, Lili Mou

We investigate unsupervised constituency parsing, the task of organizing the words and phrases of a sentence into a hierarchical structure without using linguistically annotated data. We observe that existing unsupervised parsers capture different aspects of parse structures, which can be leveraged to enhance unsupervised parsing performance. To this end, we propose a notion of "tree averaging," based on which we develop a novel ensemble method for unsupervised parsing. To improve inference efficiency, we distill the ensemble knowledge into a student model; this ensemble-then-distill process is an effective way to mitigate the over-smoothing problem found in common multi-teacher distillation methods. Experiments show that our method surpasses all previous approaches, consistently demonstrating its effectiveness and robustness across different runs, ensemble components, and domain-shift conditions.
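Below is a minimal sketch of the "tree averaging" idea behind the selective-MBR ensemble, assuming the standard span-set view of constituency trees. The helper names (`span_f1`, `selective_mbr`) and the toy spans are illustrative assumptions, not the authors' implementation: each teacher's parse is encoded as the set of its constituent spans, and the ensemble outputs the candidate tree with the highest average F1 against all teachers, i.e., the minimum-Bayes-risk tree under a uniform distribution over teacher parses.

```python
# Hedged sketch of selective MBR tree averaging over constituency parses.
# A tree over an n-word sentence is represented as a set of (start, end) spans.

def span_f1(pred: set, gold: set) -> float:
    """F1 overlap between two sets of (start, end) constituent spans."""
    if not pred or not gold:
        return 0.0
    overlap = len(pred & gold)
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def selective_mbr(teacher_trees: list) -> set:
    """Return the teacher tree minimizing Bayes risk: the candidate whose
    mean span F1 against all teacher parses is highest."""
    return max(
        teacher_trees,
        key=lambda cand: sum(span_f1(cand, t) for t in teacher_trees) / len(teacher_trees),
    )

# Toy example (hypothetical): three teachers parse a 5-word sentence.
teachers = [
    {(0, 5), (0, 2), (2, 5), (3, 5)},
    {(0, 5), (0, 2), (2, 5), (2, 4)},
    {(0, 5), (1, 5), (2, 5), (3, 5)},
]
print(selective_mbr(teachers))  # picks the tree closest on average to all teachers
```

The sketch covers only the selective case, where candidates are restricted to the teachers' own outputs; the Generative MBR row in the results below refers to a variant that searches a larger space of trees for the risk minimizer.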

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Constituency Grammar Induction | Penn Treebank | Ensemble (Selective MBR) | Mean F1 (WSJ) | 66.2 | #2 |
| Constituency Grammar Induction | Penn Treebank | Ensemble (Generative MBR) | Max F1 (WSJ) | 71.9 | #1 |
| Constituency Grammar Induction | Penn Treebank | Ensemble (Generative MBR) | Mean F1 (WSJ) | 70.4 | #1 |
