no code implementations • 28 Aug 2021 • Ruotian Luo
In Chapter 3, we focus on generating referring expressions: text descriptions of an object in an image that let a receiver infer which object is being described.
1 code implementation • 29 May 2020 • Ruotian Luo, Greg Shakhnarovich
We develop and evaluate captioning models that allow control of caption length.
no code implementations • CVPR 2020 • Haochen Wang, Ruotian Luo, Michael Maire, Greg Shakhnarovich
The core of our approach, Pixel Consensus Voting, is a framework for instance segmentation based on the Generalized Hough transform.
Ranked #36 on Panoptic Segmentation on COCO test-dev
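As a rough illustration of the Hough-style voting idea the entry names (not the paper's actual discretized voting filter; the function and toy setup below are our own): each foreground pixel casts a vote for the location of its instance center, and peaks in the vote accumulator indicate instances.

```python
import numpy as np

def accumulate_votes(offsets, mask, shape):
    """Hough-style vote accumulation (illustrative sketch only).

    offsets: (H, W, 2) array of predicted (dy, dx) from each pixel to
             its instance center
    mask:    (H, W) boolean foreground mask
    Returns an (H, W) accumulator; local maxima suggest instance centers.
    """
    H, W = shape
    acc = np.zeros((H, W), dtype=int)
    ys, xs = np.nonzero(mask)
    for y, x in zip(ys, xs):
        cy = int(round(y + offsets[y, x, 0]))
        cx = int(round(x + offsets[y, x, 1]))
        if 0 <= cy < H and 0 <= cx < W:
            acc[cy, cx] += 1  # pixel votes for its predicted center
    return acc

# Toy example: two pixels of one instance both vote for center (2, 2).
offsets = np.zeros((5, 5, 2))
mask = np.zeros((5, 5), dtype=bool)
mask[1, 1] = mask[3, 3] = True
offsets[1, 1] = (1.0, 1.0)    # pixel (1, 1) -> center (2, 2)
offsets[3, 3] = (-1.0, -1.0)  # pixel (3, 3) -> center (2, 2)
acc = accumulate_votes(offsets, mask, (5, 5))
```

Pixels that agree on a center reinforce each other, which is the "consensus" the method's name refers to.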
no code implementations • 27 Mar 2020 • Davis Gilton, Ruotian Luo, Rebecca Willett, Greg Shakhnarovich
This paper presents a framework for the analysis of changes in visual streams: ordered sequences of images, possibly separated by significant time gaps.
1 code implementation • 22 Mar 2020 • Ruotian Luo
In this work, we present a simple but improved variant of Self-Critical Sequence Training.
Ranked #24 on Image Captioning on COCO Captions
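The variant replaces standard SCST's greedy-decoding baseline with, for each sampled caption, the mean reward of the other captions sampled for the same image. A minimal NumPy sketch of that leave-one-out advantage (function name is ours, not the paper's):

```python
import numpy as np

def scst_sample_mean_advantage(rewards):
    """Advantage of each sampled caption under a sample-mean baseline.

    Standard SCST subtracts the reward of a greedy-decoded caption;
    this sketch instead subtracts, for each sample, the mean reward of
    the *other* samples drawn for the same image (leave-one-out).
    """
    rewards = np.asarray(rewards, dtype=float)
    n = len(rewards)
    # leave-one-out mean: (sum - r_i) / (n - 1)
    baseline = (rewards.sum() - rewards) / (n - 1)
    return rewards - baseline  # advantages sum to zero across samples

# Four sampled captions for one image, scored by e.g. CIDEr:
adv = scst_sample_mean_advantage([0.9, 0.5, 0.7, 0.3])
```

The advantages weight the policy-gradient update: above-average samples are reinforced, below-average ones suppressed, with no extra greedy decoding pass needed.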
2 code implementations • 27 Feb 2020 • Ruotian Luo, Gregory Shakhnarovich
We investigate the effect of different model architectures, training objectives, hyperparameter settings and decoding procedures on the diversity of automatically generated image captions.
2 code implementations • 1 Aug 2019 • Igor Vasiljevic, Nick Kolkin, Shanyi Zhang, Ruotian Luo, Haochen Wang, Falcon Z. Dai, Andrea F. Daniele, Mohammadreza Mostajabi, Steven Basart, Matthew R. Walter, Gregory Shakhnarovich
We introduce DIODE, a dataset that contains thousands of diverse high resolution color images with accurate, dense, long-range depth measurements.
1 code implementation • 19 Apr 2019 • Ruotian Luo, Ning Zhang, Bohyung Han, Linjie Yang
We present a novel problem setting in zero-shot learning: zero-shot object recognition and detection in context.
1 code implementation • CVPR 2018 • Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich
One property that remains lacking in image captions generated by contemporary methods is discriminability: being able to tell two images apart given the caption for one of them.
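The discriminability property can be stated as a simple check (a hypothetical sketch with a toy scorer, not the paper's training objective): given a caption generated for one image and a distractor image, a cross-modal scorer should prefer the right image.

```python
def is_discriminative(caption, image_a, image_b, score):
    """A caption for image_a is discriminative w.r.t. a distractor
    image_b if a caption-image scorer prefers image_a.
    `score` is any function score(caption, image) -> float."""
    return score(caption, image_a) > score(caption, image_b)

# Toy example: represent images by sets of attribute words and score a
# caption by word overlap (purely illustrative).
img_a = {"black", "cat", "sofa"}
img_b = {"white", "cat", "sofa"}
score = lambda cap, img: len(set(cap.split()) & img)

generic = "a cat on a sofa"          # fits both images equally
specific = "a black cat on a sofa"   # only fits img_a
```

Under this toy scorer, only the caption mentioning the distinguishing attribute passes the check, which is exactly the failure mode the paper targets in contemporary captioners.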
no code implementations • CVPR 2017 • Ruotian Luo, Gregory Shakhnarovich
We also use the comprehension module in a generate-and-rerank pipeline, which chooses among candidate expressions produced by a generation model according to their performance on the comprehension task.
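The generate-and-rerank pipeline can be sketched in a few lines (a minimal sketch; the function names and toy models are assumptions, not the paper's API): sample candidate expressions from a generator, then keep the one the comprehension model resolves back to the intended object most confidently.

```python
def generate_and_rerank(image, target, generator, comprehender, n=10):
    """Sample n candidate referring expressions, then return the one the
    comprehension model scores highest for the intended target object.
    comprehender(image, target, expr) -> score that expr picks out target."""
    candidates = [generator(image, target) for _ in range(n)]
    return max(candidates, key=lambda expr: comprehender(image, target, expr))

# Toy demo: a fake generator that yields fixed candidates, and a fake
# comprehension score that prefers more specific (longer) expressions.
cands = iter(["the dog", "the left dog", "the dog on the left"])
gen = lambda img, obj: next(cands)
comp = lambda img, obj, expr: len(expr)
best = generate_and_rerank(None, None, gen, comp, n=3)
```

The reranking step lets a listener model veto ambiguous expressions that a speaker model alone would happily emit.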