TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Column Type Annotation	VizNet-Sato-Full	Watchog	Macro-F1	85.63	# 1
Columns Property Annotation	WikiTables-TURL-CPA	Watchog	Macro-F1	88.45	# 1
Column Type Annotation	WikiTables-TURL-CTA	Watchog	Macro-F1	78.72	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/watchog-a-light-weight-contrastive-learning/column-type-annotation-on-viznet-sato-full)](https://paperswithcode.com/sota/column-type-annotation-on-viznet-sato-full?p=watchog-a-light-weight-contrastive-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/watchog-a-light-weight-contrastive-learning/columns-property-annotation-on-wikitables)](https://paperswithcode.com/sota/columns-property-annotation-on-wikitables?p=watchog-a-light-weight-contrastive-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/watchog-a-light-weight-contrastive-learning/column-type-annotation-on-wikitables-turl-cta)](https://paperswithcode.com/sota/column-type-annotation-on-wikitables-turl-cta?p=watchog-a-light-weight-contrastive-learning)`

Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation

Proceedings of the ACM on Management of Data 2023 · Zhengjie Miao, Jin Wang ·

Relational Web tables provide valuable resources for numerous downstream applications, making table understanding, especially column annotation that identifies semantic types and relations of columns, a hot topic in the field of data management. Despite recent efforts to improve different tasks in table understanding by using the power of large pre-trained language models, existing methods heavily rely on large-scale and high-quality labeled instances, while they still suffer from the data sparsity problem due to the imbalanced data distribution among different classes. In this paper, we propose the Watchog framework, which employs contrastive learning techniques to learn robust representations for tables by leveraging a large-scale unlabeled table corpus with minimal overhead. Our approach enables the learned table representations to enhance fine tuning with much fewer additional labeled instances than in prior studies for downstream column annotation tasks. Besides, we further proposed optimization techniques for semi-supervised settings. Experimental results on popular benchmarking datasets illustrate the superiority of our proposed techniques in two column annotation tasks under different settings. In particular, our Watchog framework effectively alleviates the class imbalance issue caused by a long-tailed label distribution. In the semi-supervised setting, Watchog outperforms the best-known method by up to 26% and 41% in Micro and Macro F1 scores, respectively, on the task of semantic type detection.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Benchmarking

Columns Property Annotation

Column Type Annotation

Contrastive Learning

Datasets

WikiTables-TURL VizNet-Sato

Results from the Paper

Add Remove

Ranked #1 on Columns Property Annotation on WikiTables-TURL-CPA ( Macro-F1 metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Column Type Annotation	VizNet-Sato-Full	Watchog	Macro-F1	85.63	# 1	Compare
Columns Property Annotation	WikiTables-TURL-CPA	Watchog	Macro-F1	88.45	# 1	Compare
Column Type Annotation	WikiTables-TURL-CTA	Watchog	Macro-F1	78.72	# 1	Compare

Methods

Add Remove

Contrastive Learning

Edit Social Preview

Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove