Search Results for author: Sehyun Choi

Found 6 papers, 5 papers with code

Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers

no code implementations3 Apr 2024 Sehyun Choi

Motivated by this approach, we propose Cross-Architecture Transfer Learning (XATL), in which the weights of the shared components between LCI and self-attention-based transformers, such as layernorms, MLPs, input/output embeddings, are directly transferred to the new architecture from already pre-trained model parameters.

Language Modelling Transfer Learning

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

1 code implementation15 Nov 2023 Zhaowei Wang, Haochen Shi, Weiqi Wang, Tianqing Fang, Hongming Zhang, Sehyun Choi, Xin Liu, Yangqiu Song

Cognitive research indicates that abstraction ability is essential in human intelligence, which remains under-explored in language models.

Benchmarking

CKBP v2: An Expert-Annotated Evaluation Set for Commonsense Knowledge Base Population

1 code implementation20 Apr 2023 Tianqing Fang, Quyet V. Do, Sehyun Choi, Weiqi Wang, Yangqiu Song

Populating Commonsense Knowledge Bases (CSKB) is an important yet hard task in NLP, as it tackles knowledge from external sources with unseen events and entities.

Knowledge Base Population

Cannot find the paper you are looking for? You can Submit a new open access paper.