Trust-PCL: An Off-Policy Trust Region Method for Continuous Control

tensorflow/models • • ICLR 2018

When evaluated on a number of continuous control tasks, Trust-PCL improves the solution quality and sample efficiency of TRPO.

Continuous Control Reinforcement Learning (RL)

76,563

Paper
Code

One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

tensorflow/models • • 11 Dec 2013

We propose a new benchmark corpus to be used for measuring progress in statistical language modeling.

Ranked #22 on Language Modelling on One Billion Word

Language Modelling

76,563

Paper
Code

MoViNets: Mobile Video Networks for Efficient Video Recognition

tensorflow/models • • CVPR 2021

We present Mobile Video Networks (MoViNets), a family of computation and memory efficient video networks that can operate on streaming video for online inference.

Ranked #3 on Action Classification on Charades

Action Classification Action Recognition +4

76,563

Paper
Code

Meta-Learning Update Rules for Unsupervised Representation Learning

tensorflow/models • • ICLR 2019

Specifically, we target semi-supervised classification performance, and we meta-learn an algorithm -- an unsupervised weight update rule -- that produces representations useful for this task.

Meta-Learning Representation Learning

76,563

Paper
Code

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

tensorflow/models • • ICLR 2019

We study the problem of representation learning in goal-conditioned hierarchical reinforcement learning.

2D Human Pose Estimation Continuous Control +4

76,563

Paper
Code

Scalable Learning of Non-Decomposable Objectives

tensorflow/models • • 16 Aug 2016

Modern retrieval systems are often driven by an underlying machine learning model.

Retrieval

76,563

Paper
Code

Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks

tensorflow/models • • NeurIPS 2016

We study the problem of synthesizing a number of likely future frames from a single input image.

76,563

Paper
Code

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

tensorflow/models • • 2 Nov 2016

The essence of the trick is to refactor each stochastic node into a differentiable function of its parameters and a random variable with fixed distribution.

Density Estimation Structured Prediction

76,563

Paper
Code

Adversarial Machine Learning at Scale

tensorflow/models • • 4 Nov 2016

Adversarial examples are malicious inputs designed to fool machine learning models.

BIG-bench Machine Learning

76,563

Paper
Code

Deep Residual Learning for Image Recognition

tensorflow/models • • CVPR 2016

Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

Ranked #1 on Image Classification on cifar100

Domain Generalization +11

76,563

Paper
Code

Search Results