no code implementations • CVPR 2017 • Jingming Dong, Xiaohan Fei, Stefano Soatto
We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.
no code implementations • 28 Oct 2016 • Jingming Dong, Iuri Frosio, Jan Kautz
The non-stationary nature of image characteristics calls for adaptive processing, based on the local image content.
no code implementations • 13 Jun 2016 • Jingming Dong, Xiaohan Fei, Stefano Soatto
We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.
no code implementations • CVPR 2015 • Jingming Dong, Nikolaos Karianakis, Damek Davis, Joshua Hernandez, Jonathan Balzer, Stefano Soatto
We frame the problem of local representation of imaging data as the computation of minimal sufficient statistics that are invariant to nuisance variability induced by viewpoint and illumination.
no code implementations • CVPR 2016 • Nikolaos Karianakis, Jingming Dong, Stefano Soatto
We conduct an empirical study to test the ability of Convolutional Neural Networks (CNNs) to reduce the effects of nuisance transformations of the input data, such as location, scale and aspect ratio.
no code implementations • CVPR 2015 • Jingming Dong, Stefano Soatto
We introduce a simple modification of local image descriptors, such as SIFT, based on pooling gradient orientations across different domain sizes, in addition to spatial locations.
no code implementations • 20 Dec 2014 • Stefano Soatto, Jingming Dong, Nikolaos Karianakis
We study the structure of representations, defined as approximations of minimal sufficient statistics that are maximal invariants to nuisance factors, for visual data subject to scaling and occlusion of line-of-sight.
no code implementations • 23 Nov 2013 • Jingming Dong, Jonathan Balzer, Damek Davis, Joshua Hernandez, Stefano Soatto
We propose an extension of popular descriptors based on gradient orientation histograms (HOG, computed in a single image) to multiple views.