no code implementations • Proceedings of the ACM on Management of Data 2023 • Zhengjie Miao, Jin Wang
Relational Web tables provide valuable resources for numerous downstream applications, making table understanding, especially column annotation that identifies semantic types and relations of columns, a hot topic in the field of data management.
Ranked #1 on Columns Property Annotation on WikiTables-TURL-CPA ( Macro-F1 metric)
1 code implementation • Proceedings of the International Conference on Management of Data (SIGMOD) 2021 • Zhengjie Miao, Yuliang Li and Xiaolan Wang
Meanwhile, the risk of creating noisy examples and the large space of hyper-parameters make DA less attractive in practice.
1 code implementation • 7 Feb 2020 • Zhengjie Miao, Yuliang Li, Xiaolan Wang, Wang-Chiew Tan
A novelty of Snippext is its clever use of a two-prong approach to achieve state-of-the-art (SOTA) performance with little labeled training data through: (1) data augmentation to automatically generate more labeled training data from existing ones, and (2) a semi-supervised learning technique to leverage the massive amount of unlabeled data in addition to the (limited amount of) labeled data.