Language Models

PanGu-$\alpha$ is an autoregressive language model (ALM) with up to 200 billion parameters, pretrained on a large corpus of text, mostly in Chinese. The architecture of PanGu-$\alpha$ is based on the Transformer, which has been widely used as the backbone of a variety of pretrained language models such as BERT and GPT. Unlike those models, PanGu-$\alpha$ adds a query layer on top of the Transformer layers, which aims to explicitly induce the expected output.

Source: PanGu-$\alpha$: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
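
The query layer can be pictured as one extra attention layer whose queries come from a learned embedding of the position to be predicted, while keys and values come from the output of the top Transformer layer. Below is a minimal PyTorch sketch of that idea; the class name, dimensions, and the use of `nn.MultiheadAttention` are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn


class QueryLayer(nn.Module):
    """Hypothetical sketch of a PanGu-alpha-style query layer.

    Queries come from a learned positional ("query") embedding for the
    position whose token is being predicted; keys and values come from
    the hidden states produced by the top Transformer layer.
    """

    def __init__(self, d_model: int, n_heads: int, max_len: int = 1024):
        super().__init__()
        self.query_embedding = nn.Embedding(max_len, d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, d_model) from the top Transformer layer
        batch, seq_len, _ = hidden_states.shape
        positions = torch.arange(seq_len, device=hidden_states.device)
        # One query vector per position, shared across the batch
        q = self.query_embedding(positions).unsqueeze(0).expand(batch, -1, -1)
        # Causal mask: position i may only attend to positions <= i
        mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool,
                       device=hidden_states.device),
            diagonal=1,
        )
        out, _ = self.attn(q, hidden_states, hidden_states, attn_mask=mask)
        return out


# Usage with dummy hidden states (shapes are arbitrary):
layer = QueryLayer(d_model=256, n_heads=8)
h = torch.randn(2, 16, 256)
print(layer(h).shape)  # torch.Size([2, 16, 256])
```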

Components


Component     Type
Transformer   Transformers

Categories

Language Models