Unsupervised Pre-training

104 papers with code • 2 benchmarks • 7 datasets

Pre-training a neural network using unsupervised (self-supervised) auxiliary tasks on unlabeled data.

Benchmarks

Add a Result

These leaderboards are used to track progress in Unsupervised Pre-training

Trend	Dataset	Best Model	Paper	Code	Compare
	UCI measles				See all
	Measles	15RDLs			See all

Libraries

Use these libraries to find Unsupervised Pre-training models and implementations

pytorch/fairseq

2 papers

29,301

athena-team/athena

2 papers

942

Datasets

Latest papers

Most implemented Social Latest No code

Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

thuml/ContextWM • • NeurIPS 2023

To tackle this issue, we introduce Contextualized World Models (ContextWM) that explicitly separate context and dynamics modeling to overcome the complexity and diversity of in-the-wild videos and facilitate knowledge transfer between distinct scenes.

29 May 2023

Paper
Code

Rethinking Semi-supervised Learning with Language Models

zhengxiangshi/powerfulpromptft • • 22 May 2023

Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks.

22 May 2023

Paper
Code

PTGB: Pre-Train Graph Neural Networks for Brain Network Analysis

owen-yang-18/brainnn-pretrain • • 20 May 2023

The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways.

20 May 2023

Paper
Code

LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic Health Records

celehs/latte • • 19 May 2023

We propose a LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.

19 May 2023

Paper
Code

Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner

zhengxiangshi/dept • • 2 May 2023

Language models (LMs) trained on vast quantities of unlabelled data have greatly advanced the field of natural language processing (NLP).

02 May 2023

Paper
Code

PUNR: Pre-training with User Behavior Modeling for News Recommendation

ma787639046/punr • • 25 Apr 2023

Firstly, we introduce the user behavior masking pre-training task to recover the masked user behaviors based on their contextual behaviors.

25 Apr 2023

Paper
Code

Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource Languages

cnaigithub/SpeechDewarping • • 28 Mar 2023

We empirically demonstrate the effectiveness of our proposed method in low-resource language scenarios, achieving outstanding performance compared to competing methods.

28 Mar 2023

Paper
Code

MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation

mic-dkfz/multitalent • • 25 Mar 2023

Our findings offer a new direction for the medical imaging community to effectively utilize the wealth of available data for improved segmentation performance.

25 Mar 2023

Paper
Code

Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast

KangchengLiu/FAC_Foreground_Aware_Contrast • • CVPR 2023

The second is that we prevent over-discrimination between 3D segments/objects and encourage grouped foreground-to-background distinctions at the segment level with adaptive feature learning in a Siamese correspondence network, which adaptively learns feature correlations within and across point cloud views effectively.

11 Mar 2023

Paper
Code

DocILE Benchmark for Document Information Localization and Extraction

rossumai/docile • • 11 Feb 2023

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

106

11 Feb 2023

Paper
Code

Unsupervised Pre-training

Benchmarks Add a Result

Libraries

Datasets

Latest papers

Content

Benchmarks

Add a Result