Search Results

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control

tensorflow/models ICLR 2018

When evaluated on a number of continuous control tasks, Trust-PCL improves the solution quality and sample efficiency of TRPO.

Continuous Control, Reinforcement Learning (RL)

One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

tensorflow/models 11 Dec 2013

We propose a new benchmark corpus to be used for measuring progress in statistical language modeling.

Language Modelling

MoViNets: Mobile Video Networks for Efficient Video Recognition

tensorflow/models CVPR 2021

We present Mobile Video Networks (MoViNets), a family of computation and memory efficient video networks that can operate on streaming video for online inference.

Action Classification, Action Recognition, +4

Meta-Learning Update Rules for Unsupervised Representation Learning

tensorflow/models ICLR 2019

Specifically, we target semi-supervised classification performance, and we meta-learn an algorithm -- an unsupervised weight update rule -- that produces representations useful for this task.

Meta-Learning, Representation Learning

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

tensorflow/models ICLR 2019

We study the problem of representation learning in goal-conditioned hierarchical reinforcement learning.

2D Human Pose Estimation, Continuous Control, +4

Scalable Learning of Non-Decomposable Objectives

tensorflow/models 16 Aug 2016

Modern retrieval systems are often driven by an underlying machine learning model.

Retrieval

Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks

tensorflow/models NeurIPS 2016

We study the problem of synthesizing a number of likely future frames from a single input image.

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

tensorflow/models 2 Nov 2016

The essence of the reparameterization trick is to refactor each stochastic node into a differentiable function of its parameters and a random variable with fixed distribution.

Density Estimation, Structured Prediction

Adversarial Machine Learning at Scale

tensorflow/models 4 Nov 2016

Adversarial examples are malicious inputs designed to fool machine learning models.

BIG-bench Machine Learning

Deep Residual Learning for Image Recognition

tensorflow/models CVPR 2016

Deep residual nets are the foundation of our submissions to the ILSVRC & COCO 2015 competitions, where we also won 1st place on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

Domain Generalization +11