Unsupervised Pre-training

104 papers with code • 2 benchmarks • 7 datasets

Unsupervised pre-training trains a neural network on unsupervised (self-supervised) auxiliary tasks using unlabeled data.

Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

thuml/ContextWM NeurIPS 2023

To tackle this issue, we introduce Contextualized World Models (ContextWM) that explicitly separate context and dynamics modeling to overcome the complexity and diversity of in-the-wild videos and facilitate knowledge transfer between distinct scenes.

46
29 May 2023

Rethinking Semi-supervised Learning with Language Models

zhengxiangshi/powerfulpromptft 22 May 2023

Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks.

69
22 May 2023

PTGB: Pre-Train Graph Neural Networks for Brain Network Analysis

owen-yang-18/brainnn-pretrain 20 May 2023

The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways.

11
20 May 2023

LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic Health Records

celehs/latte 19 May 2023

We propose a LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.

5
19 May 2023

Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner

zhengxiangshi/dept 2 May 2023

Language models (LMs) trained on vast quantities of unlabelled data have greatly advanced the field of natural language processing (NLP).

83
2 May 2023

PUNR: Pre-training with User Behavior Modeling for News Recommendation

ma787639046/punr 25 Apr 2023

Firstly, we introduce the user behavior masking pre-training task to recover the masked user behaviors based on their contextual behaviors.

3
25 Apr 2023

Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource Languages

cnaigithub/SpeechDewarping 28 Mar 2023

We empirically demonstrate the effectiveness of our proposed method in low-resource language scenarios, achieving outstanding performance compared to competing methods.

26
28 Mar 2023

MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation

mic-dkfz/multitalent 25 Mar 2023

Our findings offer a new direction for the medical imaging community to effectively utilize the wealth of available data for improved segmentation performance.

43
25 Mar 2023

Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast

KangchengLiu/FAC_Foreground_Aware_Contrast CVPR 2023

The second is that we prevent over-discrimination between 3D segments/objects and encourage grouped foreground-to-background distinctions at the segment level with adaptive feature learning in a Siamese correspondence network, which adaptively learns feature correlations within and across point cloud views effectively.

37
11 Mar 2023

DocILE Benchmark for Document Information Localization and Extraction

rossumai/docile 11 Feb 2023

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

106
11 Feb 2023