Representation Learning
3682 papers with code • 5 benchmarks • 9 datasets
Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.
Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.
Representation learning can be divided into:
- Supervised representation learning: learning representations on task A using annotated data and used to solve task B
- Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.
More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.
Here are some additional readings to go deeper on the task:
- Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
- A Few Words on Representation Learning - Thalles Silva
( Image credit: Visualizing and Understanding Convolutional Networks )
Libraries
Use these libraries to find Representation Learning models and implementationsDatasets
Subtasks
- Disentanglement
- Graph Representation Learning
- Sentence Embeddings
- Network Embedding
- Network Embedding
- Sentence Embedding
- Knowledge Graph Embeddings
- Document Embedding
- Learning Word Embeddings
- Multilingual Word Embeddings
- Learning Semantic Representations
- Feature Upsampling
- Learning Network Representations
- Sentence Embeddings For Biomedical Texts
- Part-based Representation Learning
- Learning Representation Of Multi-View Data
- Learning Representation On Graph
Latest papers
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study
Forecasting the short-term spread of an ongoing disease outbreak is a formidable challenge due to the complexity of contributing factors, some of which can be characterized through interlinked, multi-modality variables such as epidemiological time series data, viral biology, population demographics, and the intersection of public policy and human behavior.
VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection
Out-of-distribution (OOD) detection plays a crucial role in ensuring the safety and reliability of deep neural networks in various applications.
ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos
Our framework leverages both labeled and unlabelled data to robustly learn action representations in videos, combining pseudo-labeling with contrastive learning for effective learning from both types of samples.
BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model
We introduce the \textbf{B}i-Directional \textbf{S}parse \textbf{Hop}field Network (\textbf{BiSHop}), a novel end-to-end framework for deep tabular learning.
Masked Completion via Structured Diffusion with White-Box Transformers
We do this by exploiting a fundamental connection between diffusion, compression, and (masked) completion, deriving a deep transformer-like masked autoencoder architecture, called CRATE-MAE, in which the role of each layer is mathematically fully interpretable: they transform the data distribution to and from a structured representation.
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT
This is also a notable improvement over the Adapter and LoRA, which require 37-39 GB GPU memory and 350-380 seconds per epoch for training.
Universal representations for financial transactional data: embracing local, global, and external contexts
Effective processing of financial transactions is essential for banking data analysis.
ContrastCAD: Contrastive Learning-based Representation Learning for Computer-Aided Design Models
However, learning CAD models is still a challenge, because they can be represented as complex shapes with long construction sequences.
HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs
Based on the generative SSL task, we propose a hypergraph SSL method, HypeBoy.
Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning
Deep representation learning methods struggle with continual learning, suffering from both catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful units.