Trending Research

LambdaNetworks: Modeling long-range Interactions without Attention

ICLR 2021 lucidrains/lambda-networks

We present a general framework for capturing long-range interactions between an input and structured contextual information (e. g. a pixel surrounded by other pixels).

IMAGE CLASSIFICATION INSTANCE SEGMENTATION OBJECT DETECTION SCENE SEGMENTATION

824
3.69 stars / hour

AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients

15 Oct 2020juntang-zhuang/Adabelief-Optimizer

Viewing the exponential moving average (EMA) of the noisy gradient as the prediction of the gradient at the next time step, if the observed gradient greatly deviates from the prediction, we distrust the current observation and take a small step; if the observed gradient is close to the prediction, we trust it and take a large step.

IMAGE CLASSIFICATION LANGUAGE MODELLING

538
1.51 stars / hour

Proximal Policy Optimization Algorithms

20 Jul 2017lab-ml/nn

We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent.

DOTA 2 POLICY GRADIENT METHODS

168
1.44 stars / hour

Neural circuit policies enabling auditable autonomy

13 Oct 2020mlech26l/keras-ncp

A central goal of artificial intelligence in high-stakes decision-making applications is to design a single algorithm that simultaneously expresses generalizability by learning coherent representations of their world and interpretable explanations of its dynamics.

AUTONOMOUS VEHICLES DECISION MAKING

399
1.29 stars / hour

FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping

31 Dec 2019mindslab-ai/faceshifter

We propose a novel attributes encoder for extracting multi-level target face attributes, and a new generator with carefully designed Adaptive Attentional Denormalization (AAD) layers to adaptively integrate the identity and the attributes for face synthesis.

FACE GENERATION FACE SWAPPING

58
1.25 stars / hour

FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval

20 May 2020alibaba/EasyTransfer

In this paper, we address the text and image matching in cross-modal retrieval of the fashion industry.

CROSS-MODAL RETRIEVAL

241
1.00 stars / hour

fairseq

7 Sep 2019pytorch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MACHINE TRANSLATION

9,876
0.81 stars / hour

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

ACL 2019 lab-ml/labml_nn

Transformers have a potential of learning longer-term dependency, but are limited by a fixed-length context in the setting of language modeling.

LANGUAGE MODELLING

168
0.61 stars / hour

A Survey of Knowledge-Enhanced Text Generation

9 Oct 2020wyu97/KENLG-Reading

To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models.

TEXT GENERATION

125
0.60 stars / hour