Search Results for author: Ian Reid

Found 155 papers, 55 papers with code

Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

no code implementations8 Apr 2024 Mahsa Ehsanpour, Ian Reid, Hamid Rezatofighi

The framework uses masked modeling to pre-train the encoder to reconstruct masked human joint trajectories, enabling it to learn generalizable and data efficient representations of motion in human crowded scenes.

Action Understanding Multi-Person Pose forecasting +1

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

no code implementations2 Apr 2024 Duy-Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi

JRDB-PanoTrack includes (1) various data involving indoor and outdoor crowded scenes, as well as comprehensive 2D and 3D synchronized data modalities; (2) high-quality 2D spatial panoptic segmentation and temporal tracking annotations, with additional 3D label projections for further spatial understanding; (3) diverse object classes for closed- and open-world recognition benchmarks, with OSPA-based metrics for evaluation.

Decision Making Panoptic Segmentation +1

PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

no code implementations14 Mar 2024 Jiajun Deng, Sha Zhang, Feras Dayoub, Wanli Ouyang, Yanyong Zhang, Ian Reid

In this work, we present PoIFusion, a simple yet effective multi-modal 3D object detection framework to fuse the information of RGB images and LiDAR point clouds at the point of interest (abbreviated as PoI).

3D Object Detection Object +1

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

no code implementations13 Mar 2024 Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu

We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by the 3D Gaussian Splatting (3DGS).

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM

1 code implementation12 Mar 2024 Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang

Human motion generation stands as a significant pursuit in generative computer vision, while achieving long-sequence and efficient motion generation remains challenging.

Symmetry-Breaking Augmentations for Ad Hoc Teamwork

no code implementations15 Feb 2024 Ravi Hammond, Dustin Craggs, Mingyu Guo, Jakob Foerster, Ian Reid

In many collaborative settings, artificial intelligence (AI) agents must be able to adapt to new teammates that use unknown or previously unobserved strategies.

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

no code implementations12 Jul 2023 Krishan Rana, Jesse Haviland, Sourav Garg, Jad Abou-Chakra, Ian Reid, Niko Suenderhauf

To ensure the scalability of our approach, we: (1) exploit the hierarchical nature of 3DSGs to allow LLMs to conduct a 'semantic search' for task-relevant subgraphs from a smaller, collapsed representation of the full graph; (2) reduce the planning horizon for the LLM by integrating a classical path planner and (3) introduce an 'iterative replanning' pipeline that refines the initial plan using feedback from a scene graph simulator, correcting infeasible actions and avoiding planning failures.

Robot Task Planning

Semantic Segmentation on 3D Point Clouds with High Density Variations

no code implementations4 Jul 2023 Ryan Faulkner, Luke Haub, Simon Ratcliffe, Ian Reid, Tat-Jun Chin

LiDAR scanning for surveying applications acquire measurements over wide areas and long distances, which produces large-scale 3D point clouds with significant local density variations.

3D Semantic Segmentation Segmentation

Assessing Domain Gap for Continual Domain Adaptation in Object Detection

1 code implementation21 Feb 2023 Anh-Dzung Doan, Bach Long Nguyen, Surabhi Gupta, Ian Reid, Markus Wagner, Tat-Jun Chin

To ensure reliable object detection in autonomous systems, the detector must be able to adapt to changes in appearance caused by environmental factors such as time of day, weather, and seasons.

Domain Adaptation object-detection +1

Predicting Topological Maps for Visual Navigation in Unexplored Environments

no code implementations23 Nov 2022 Huangying Zhan, Hamid Rezatofighi, Ian Reid

We propose a robotic learning system for autonomous exploration and navigation in unexplored environments.

Visual Navigation

ActiveRMAP: Radiance Field for Active Mapping And Planning

no code implementations23 Nov 2022 Huangying Zhan, Jiyang Zheng, Yi Xu, Ian Reid, Hamid Rezatofighi

We, for the first time, present an RGB-only active vision framework using radiance field representation for active 3D reconstruction and planning in an online manner.

3D Reconstruction

What Images are More Memorable to Machines?

1 code implementation14 Nov 2022 Junlin Han, Huangying Zhan, Jie Hong, Pengfei Fang, Hongdong Li, Lars Petersson, Ian Reid

This paper studies the problem of measuring and predicting how memorable an image is to pattern recognition machines, as a path to explore machine intelligence.

Globally Optimal Event-Based Divergence Estimation for Ventral Landing

1 code implementation27 Sep 2022 Sofia McLeod, Gabriele Meoni, Dario Izzo, Anne Mergy, Daqi Liu, Yasir Latif, Ian Reid, Tat-Jun Chin

This is achieved by estimating divergence (inverse TTC), which is the rate of radial optic flow, from the event stream generated during landing.

The Edge of Disaster: A Battle Between Autonomous Racing and Safety

no code implementations30 Jun 2022 Matthew Howe, James Bockman, Adrian Orenstein, Stefan Podgorski, Sam Bahrami, Ian Reid

Autonomous racing represents a uniquely challenging control environment where agents must act while on the limits of a vehicle's capability in order to set competitive lap times.

Model Predictive Control

CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping

1 code implementation31 May 2022 Junlin Han, Lars Petersson, Hongdong Li, Ian Reid

We present a simple method, CropMix, for the purpose of producing a rich input distribution from the original dataset distribution.

Contrastive Learning

Asynchronous Optimisation for Event-based Visual Odometry

no code implementations2 Mar 2022 Daqi Liu, Alvaro Parra, Yasir Latif, Bo Chen, Tat-Jun Chin, Ian Reid

Event cameras open up new possibilities for robotic perception due to their low latency and high dynamic range.

Event-based vision Visual Odometry

PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

1 code implementation22 Oct 2021 Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

The most competitive noisy label learning methods rely on an unsupervised classification of clean and noisy samples, where samples classified as noisy are re-labelled and "MixMatched" with the clean samples.

Image Classification with Label Noise Learning with noisy labels

Weakly Supervised Training of Monocular 3D Object Detectors Using Wide Baseline Multi-view Traffic Camera Data

1 code implementation21 Oct 2021 Matthew Howe, Ian Reid, Jamie Mackenzie

Our method achieves vehicle 7DoF pose prediction accuracy on our dataset comparable to the top performing monocular 3D object detectors on autonomous vehicle datasets.

Autonomous Vehicles Object +1

ODAM: Object Detection, Association, and Mapping using Posed RGB Video

1 code implementation ICCV 2021 Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe

Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics.

3D Object Detection Object +2

JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection

no code implementations CVPR 2022 Mahsa Ehsanpour, Fatemeh Saleh, Silvio Savarese, Ian Reid, Hamid Rezatofighi

However, learning to recognise human actions and their social interactions in an unconstrained real-world environment comprising numerous people, with potentially highly unbalanced and long-tailed distributed action labels from a stream of sensory data captured from a mobile robot platform remains a significant challenge, not least owing to the lack of a reflective large-scale dataset.

Action Detection Action Understanding +1

Unsupervised Scale-consistent Depth Learning from Video

2 code implementations25 May 2021 Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Zhichao Li, Le Zhang, Chunhua Shen, Ming-Ming Cheng, Ian Reid

We propose a monocular depth estimator SC-Depth, which requires only unlabelled videos for training and enables the scale-consistent prediction at inference time.

Monocular Depth Estimation Monocular Visual Odometry +1

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

no code implementations ICCV 2021 Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi

Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems.

Autonomous Driving Human-Object Interaction Detection

Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

1 code implementation27 Mar 2021 Tianyu Zhu, Markus Hiller, Mahsa Ehsanpour, Rongkai Ma, Tom Drummond, Ian Reid, Hamid Rezatofighi

Tracking a time-varying indefinite number of objects in a video sequence over time remains a challenge despite recent advances in the field.

Multi-Object Tracking Object +1

ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning

1 code implementation21 Mar 2021 Ragav Sachdeva, Filipe R Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

We propose a new training algorithm, ScanMix, that explores semantic clustering and semi-supervised learning (SSL) to allow superior robustness to severe label noise and competitive robustness to non-severe label noise problems, in comparison to the state of the art (SOTA) methods.

Clustering Image Classification

Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging

no code implementations CVPR 2021 Álvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid

Under mild conditions on the noise level of the measurements, rotation averaging satisfies strong duality, which enables global solutions to be obtained via semidefinite programming (SDP) relaxation.

valid

LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment

1 code implementation6 Mar 2021 Filipe R. Cordeiro, Ragav Sachdeva, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue.

Image Classification

Self-supervised Mean Teacher for Semi-supervised Chest X-ray Classification

1 code implementation5 Mar 2021 Fengbei Liu, Yu Tian, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

In this paper, we propose Self-supervised Mean Teacher for Semi-supervised (S$^2$MTS$^2$) learning that combines self-supervised mean-teacher pre-training with semi-supervised fine-tuning.

Contrastive Learning General Classification +3

DF-VO: What Should Be Learnt for Visual Odometry?

2 code implementations1 Mar 2021 Huangying Zhan, Chamara Saroj Weerasekera, Jia-Wang Bian, Ravi Garg, Ian Reid

More surprisingly, they show that the well-trained networks enable scale-consistent predictions over long videos, while the accuracy is still inferior to traditional methods because of ignoring geometric information.

Monocular Visual Odometry Optical Flow Estimation

Semantics for Robotic Mapping, Perception and Interaction: A Survey

no code implementations2 Jan 2021 Sourav Garg, Niko Sünderhauf, Feras Dayoub, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian Reid, Stephen Gould, Peter Corke, Michael Milford

In robotics and related research fields, the study of understanding is often referred to as semantics, which dictates what does the world "mean" to a robot, and is strongly tied to the question of how to represent that meaning.

Autonomous Driving Navigate

MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos

no code implementations9 Dec 2020 Kejie Li, Hamid Rezatofighi, Ian Reid

Given a new RGB frame, MOLTR firstly applies a monocular 3D detector to localise objects of interest and extract their shape codes that represent the object shapes in a learned embedding space.

Benchmarking Object +1

EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

1 code implementation11 Nov 2020 Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

In this work, we study a new variant of the noisy label problem that combines the open-set and closed-set noisy labels, and introduce a benchmark evaluation to assess the performance of training algorithms under this setup.

HM4: Hidden Markov Model with Memory Management for Visual Place Recognition

no code implementations1 Nov 2020 Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Ian Reid

However, this creates an unboundedly-growing database that poses time and memory scalability challenges for place recognition methods.

Autonomous Driving Management +1

MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking

no code implementations15 Oct 2020 Patrick Dendorfer, Aljoša Ošep, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth, Laura Leal-Taixé

We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data, and create a framework for the standardized evaluation of multiple object tracking methods.

Multiple Object Tracking Multiple People Tracking +3

How Trustworthy are Performance Evaluations for Basic Vision Tasks?

no code implementations8 Aug 2020 Tran Thien Dat Nguyen, Hamid Rezatofighi, Ba-Ngu Vo, Ba-Tuong Vo, Silvio Savarese, Ian Reid

This paper examines performance evaluation criteria for basic vision tasks involving sets of objects namely, object detection, instance-level segmentation and multi-object tracking.

Multi-Object Tracking object-detection +1

Socially and Contextually Aware Human Motion and Pose Forecasting

no code implementations14 Jul 2020 Vida Adeli, Ehsan Adeli, Ian Reid, Juan Carlos Niebles, Hamid Rezatofighi

In this paper, we propose a novel framework to tackle both tasks of human motion (or trajectory) and body skeleton pose forecasting in a unified end-to-end pipeline.

Human Dynamics Robot Navigation

Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos

no code implementations ECCV 2020 Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi

In this paper, we solve the problem of simultaneously grouping people by their social interactions, predicting their individual actions and the social activity of each social group, which we call the social task.

Group Activity Recognition

Auto-Rectify Network for Unsupervised Indoor Depth Estimation

1 code implementation4 Jun 2020 Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Tat-Jun Chin, Chunhua Shen, Ian Reid

However, excellent results have mostly been obtained in street-scene driving scenarios, and such methods often fail in other settings, particularly indoor videos taken by handheld devices.

Monocular Depth Estimation Self-Supervised Learning +1

FroDO: From Detections to 3D Objects

no code implementations11 May 2020 Kejie Li, Martin Rünz, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose and shape in a coarse-to-fine manner.

3D Reconstruction Object +2

MOT20: A benchmark for multi object tracking in crowded scenes

1 code implementation19 Mar 2020 Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking with Transformer +2

3D Gated Recurrent Fusion for Semantic Scene Completion

no code implementations17 Feb 2020 Yu Liu, Jie Li, Qingsen Yan, Xia Yuan, Chunxia Zhao, Ian Reid, Cesar Cadena

This paper tackles the problem of data fusion in the semantic scene completion (SSC) task, which can simultaneously deal with semantic labeling and scene completion.

3D Semantic Scene Completion Scene Understanding

Switchable Precision Neural Networks

no code implementations7 Feb 2020 Luis Guerra, Bohan Zhuang, Ian Reid, Tom Drummond

Instantaneous and on demand accuracy-efficiency trade-off has been recently explored in the context of neural networks slimming.

Quantization

Automatic Pruning for Quantized Neural Networks

no code implementations3 Feb 2020 Luis Guerra, Bohan Zhuang, Ian Reid, Tom Drummond

In particular, for ResNet-18 on ImageNet, we prune 26. 12% of the model size with Binarized Neural Network quantization, achieving a top-1 classification accuracy of 47. 32% in a model of 2. 47 MB and 59. 30% with a 2-bit DoReFa-Net in 4. 36 MB.

Bayesian Optimization Quantization

Learn to Predict Sets Using Feed-Forward Neural Networks

no code implementations30 Jan 2020 Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

In our formulation we define a likelihood for a set distribution represented by a) two discrete distributions defining the set cardinally and permutation variables, and b) a joint distribution over set elements with a fixed cardinality.

Multi-Label Image Classification object-detection +1

SG-VAE: Scene Grammar Variational Autoencoder to generate new indoor scenes

no code implementations ECCV 2020 Pulak Purkait, Christopher Zach, Ian Reid

Our method learns the co-occurrences, and appearance parameters such as shape and pose, for different objects categories through a grammar-based auto-encoder, resulting in a compact and accurate representation for scene layouts.

valid

NeuRoRA: Neural Robust Rotation Averaging

1 code implementation ECCV 2020 Pulak Purkait, Tat-Jun Chin, Ian Reid

Although the idea of replacing robust optimization methods by a graph-based network is demonstrated only for multiple rotation averaging, it could easily be extended to other graph-based geometric problems, for example, pose-graph optimization.

Robot Navigation

Improved Visual Localization via Graph Smoothing

no code implementations7 Nov 2019 Carlos Lassance, Yasir Latif, Ravi Garg, Vincent Gripon, Ian Reid

One solution to this problem is to learn a deep neural network to infer the pose of a query image after learning on a dataset of images with known poses.

Image Retrieval Retrieval +1

Structured Binary Neural Networks for Image Recognition

no code implementations22 Sep 2019 Bohan Zhuang, Chunhua Shen, Mingkui Tan, Peng Chen, Lingqiao Liu, Ian Reid

Experiments on both classification, semantic segmentation and object detection tasks demonstrate the superior performance of the proposed methods over various quantized networks in the literature.

object-detection Object Detection +2

Visual Odometry Revisited: What Should Be Learnt?

2 code implementations21 Sep 2019 Huangying Zhan, Chamara Saroj Weerasekera, Jia-Wang Bian, Ian Reid

In this work we present a monocular visual odometry (VO) algorithm which leverages geometry-based methods and deep learning.

Monocular Visual Odometry

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video

2 code implementations NeurIPS 2019 Jia-Wang Bian, Zhichao Li, Naiyan Wang, Huangying Zhan, Chunhua Shen, Ming-Ming Cheng, Ian Reid

To the best of our knowledge, this is the first work to show that deep networks trained using unlabelled monocular videos can predict globally scale-consistent camera trajectories over a long video sequence.

Depth And Camera Motion Monocular Depth Estimation +1

An Evaluation of Feature Matchers for Fundamental Matrix Estimation

no code implementations26 Aug 2019 Jia-Wang Bian, Yu-Huan Wu, Ji Zhao, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid

According to this, we propose three high-quality matching systems and a Coarse-to-Fine RANSAC estimator.

In defense of OSVOS

no code implementations19 Aug 2019 Yu Liu, Yutong Dai, Anh-Dzung Doan, Lingqiao Liu, Ian Reid

Through adding a common module, video loss, which we formulate with various forms of constraints (including weighted BCE loss, high-dimensional triplet loss, as well as a novel mixed instance-aware video loss), to train the parent network in the step (2), the network is then better prepared for the step (3), i. e. online fine-tuning on the target instance.

Depth Estimation Object +6

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

no code implementations10 Aug 2019 Bohan Zhuang, Jing Liu, Mingkui Tan, Lingqiao Liu, Ian Reid, Chunhua Shen

Furthermore, we propose a second progressive quantization scheme which gradually decreases the bit-width from high-precision to low-precision during training.

Knowledge Distillation Quantization

Scalable Place Recognition Under Appearance Change for Autonomous Driving

no code implementations ICCV 2019 Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid

Our experiments show that, compared to state-of-the-art techniques, our method has much greater potential for large-scale place recognition for autonomous driving.

Autonomous Driving Visual Place Recognition

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation23 Jul 2019 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Ian Reid

In this paper, a non-convex non-smooth optimization framework is proposed to achieve diverse smoothing natures where even contradictive smoothing behaviors can be achieved.

image smoothing

Seeing Behind Things: Extending Semantic Segmentation to Occluded Regions

no code implementations7 Jun 2019 Pulak Purkait, Christopher Zach, Ian Reid

In our experiments we demonstrate that a CNN trained by minimizing the proposed loss is able to predict semantic categories for visible and occluded object parts without requiring to increase the network size (compared to a standard segmentation task).

Segmentation Semantic Segmentation

Bayesian Generative Active Deep Learning

no code implementations26 Apr 2019 Toan Tran, Thanh-Toan Do, Ian Reid, Gustavo Carneiro

Deep learning models have demonstrated outstanding performance in several problems, but their training process tends to require immense amounts of computational and human resources for training and labeling, constraining the types of problems that can be tackled.

Active Learning Data Augmentation

A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning

no code implementations CVPR 2019 Thanh-Toan Do, Toan Tran, Ian Reid, Vijay Kumar, Tuan Hoang, Gustavo Carneiro

Another approach explored in the field relies on an ad-hoc linearization (in terms of N) of the triplet loss that introduces class centroids, which must be optimized using the whole training set for each mini-batch - this means that a naive implementation of this approach has run-time complexity O(N^2).

Metric Learning Retrieval

Template-Based Automatic Search of Compact Semantic Segmentation Architectures

1 code implementation4 Apr 2019 Vladimir Nekrasov, Chunhua Shen, Ian Reid

Automatic search of neural architectures for various vision and natural language tasks is becoming a prominent tool as it allows to discover high-performing structures on any dataset of interest.

General Classification Holdout Set +1

Training Quantized Neural Networks with a Full-precision Auxiliary Module

no code implementations CVPR 2020 Bohan Zhuang, Lingqiao Liu, Mingkui Tan, Chunhua Shen, Ian Reid

In this paper, we seek to tackle a challenge in training low-precision networks: the notorious difficulty in propagating gradient through a low-precision network due to the non-differentiable quantization function.

Image Classification object-detection +2

V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation

no code implementations23 Mar 2019 Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

We propose V2CNet, a new deep learning framework to automatically translate the demonstration videos to commands that can be directly used in robotic applications.

RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

no code implementations CVPR 2019 Jie Li, Yu Liu, Dong Gong, Qinfeng Shi, Xia Yuan, Chunxia Zhao, Ian Reid

RGB images differentiate from depth images as they carry more details about the color and texture information, which can be utilized as a vital complementary to depth for boosting the performance of 3D semantic scene completion (SSC).

3D Semantic Scene Completion Scene Labeling

Self-supervised Learning for Single View Depth and Surface Normal Estimation

no code implementations1 Mar 2019 Huangying Zhan, Chamara Saroj Weerasekera, Ravi Garg, Ian Reid

In this work we present a self-supervised learning framework to simultaneously train two Convolutional Neural Networks (CNNs) to predict depth and surface normals from a single image.

Depth Prediction Monocular Depth Estimation +2

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

10 code implementations CVPR 2019 Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, Silvio Savarese

By incorporating this generalized $IoU$ ($GIoU$) as a loss into the state-of-the art object detection frameworks, we show a consistent improvement on their performance using both the standard, $IoU$ based, and new, $GIoU$ based, performance measures on popular object detection benchmarks such as PASCAL VOC and MS COCO.

Object object-detection +2

Visual SLAM: Why Bundle Adjust?

no code implementations11 Feb 2019 Álvaro Parra, Tat-Jun Chin, Anders Eriksson, Ian Reid

Bundle adjustment plays a vital role in feature-based monocular SLAM.

Multi-modal Ensemble Classification for Generalized Zero Shot Learning

no code implementations15 Jan 2019 Rafael Felix, Michele Sasdelli, Ian Reid, Gustavo Carneiro

In this paper, we mitigate these issues by proposing a new GZSL method based on multi-modal training and testing processes, where the optimization explicitly promotes a balanced classification accuracy between seen and unseen classes.

Bayesian Inference Classification +2

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

no code implementations12 Jan 2019 Yu Liu, Lingqiao Liu, Hamid Rezatofighi, Thanh-Toan Do, Qinfeng Shi, Ian Reid

As the post-processing step for object detection, non-maximum suppression (GreedyNMS) is widely used in most of the detectors for many years.

object-detection Object Detection

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

no code implementations CVPR 2019 Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

In this paper, we propose to train convolutional neural networks (CNNs) with both binarized weights and activations, leading to quantized models specifically} for mobile devices with limited power capacity and computation resources.

General Classification Image Classification +2

Scalable Deep $k$-Subspace Clustering

no code implementations2 Nov 2018 Tong Zhang, Pan Ji, Mehrtash Harandi, Richard Hartley, Ian Reid

In this paper, we introduce a method that simultaneously learns an embedding space along subspaces within it to minimize a notion of reconstruction error, thus addressing the problem of subspace clustering in an end-to-end learning paradigm.

Clustering

Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells

4 code implementations CVPR 2019 Vladimir Nekrasov, Hao Chen, Chunhua Shen, Ian Reid

While most results in this domain have been achieved on image classification and language modelling problems, here we concentrate on dense per-pixel tasks, in particular, semantic image segmentation using fully convolutional networks.

Depth Prediction Image Classification +8

Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks

1 code implementation16 Oct 2018 Zhibin Liao, Tom Drummond, Ian Reid, Gustavo Carneiro

Furthermore, the proposed measurements also allow us to show that it is possible to optimise the training process with a new dynamic sampling training approach that continuously and automatically change the mini-batch size and learning rate during the training process.

General Classification Image Classification

Diagnostics in Semantic Segmentation

no code implementations27 Sep 2018 Vladimir Nekrasov, Chunhua Shen, Ian Reid

Over the past years, computer vision community has contributed to enormous progress in semantic image segmentation, a per-pixel classification task, crucial for dense scene understanding and rapidly becoming vital in lots of real-world applications, including driverless cars and medical imaging.

Image Segmentation Scene Understanding +2

Pre and Post-hoc Diagnosis and Interpretation of Malignancy from Breast DCE-MRI

no code implementations25 Sep 2018 Gabriel Maicas, Andrew P. Bradley, Jacinto C. Nascimento, Ian Reid, Gustavo Carneiro

Conversely, traditional approaches follow a pre-hoc approach that initially localises suspicious areas that are subsequently classified to establish the breast malignancy -- this approach is trained using strongly annotated data (i. e., it needs a delineation and classification of all lesions in an image).

Real-Time Monocular Object-Model Aware Sparse SLAM

no code implementations24 Sep 2018 Mehdi Hosseinzadeh, Kejie Li, Yasir Latif, Ian Reid

While sparse point-based SLAM methods provide accurate camera localization, the generated maps lack semantic information.

Camera Localization Object +3

Efficient Dense Point Cloud Object Reconstruction using Deformation Vector Fields

no code implementations ECCV 2018 Kejie Li, Trung Pham, Huangying Zhan, Ian Reid

Given a single image at an arbitrary viewpoint, a CNN predicts multiple surfaces, each in a canonical location relative to the object.

3D Object Reconstruction Object

Deep Regression Tracking with Shrinkage Loss

1 code implementation ECCV 2018 Xiankai Lu, Chao Ma, Bingbing Ni, Xiaokang Yang, Ian Reid, Ming-Hsuan Yang

Regression trackers directly learn a mapping from regularly dense samples of target objects to soft labels, which are usually generated by a Gaussian function, to estimate target positions.

regression

Training Compact Neural Networks with Binary Weights and Low Precision Activations

no code implementations8 Aug 2018 Bohan Zhuang, Chunhua Shen, Ian Reid

In this paper, we propose to train a network with binary weights and low-bitwidth activations, designed especially for mobile devices with limited power consumption.

MatchBench: An Evaluation of Feature Matchers

no code implementations7 Aug 2018 Jia-Wang Bian, Ruihan Yang, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid, WenHai Wu

This leads to a critical absence in this field that there is no standard datasets and evaluation metrics to evaluate different feature matchers fairly.

Multi-modal Cycle-consistent Generalized Zero-Shot Learning

1 code implementation ECCV 2018 Rafael Felix, B. G. Vijay Kumar, Ian Reid, Gustavo Carneiro

In generalized zero shot learning (GZSL), the set of classes are split into seen and unseen classes, where training relies on the semantic features of the seen and unseen classes and the visual representations of only the seen classes, while testing uses the visual representations of the seen and unseen classes.

General Classification Generalized Zero-Shot Learning

Model Agnostic Saliency for Weakly Supervised Lesion Detection from Breast DCE-MRI

no code implementations20 Jul 2018 Gabriel Maicas, Gerard Snaauw, Andrew P. Bradley, Ian Reid, Gustavo Carneiro

There is a heated debate on how to interpret the decisions provided by deep learning models (DLM), where the main approaches rely on the visualization of salient regions to interpret the DLM classification process.

General Classification Lesion Detection

Bayesian Semantic Instance Segmentation in Open Set World

no code implementations ECCV 2018 Trung Pham, Vijay Kumar B G, Thanh-Toan Do, Gustavo Carneiro, Ian Reid

In this paper, we present a novel open-set semantic instance segmentation approach capable of segmenting all known and unknown object classes in images, based on the output of an object detector trained on known object classes.

Instance Segmentation Object +2

Bootstrapping the Performance of Webly Supervised Semantic Segmentation

1 code implementation CVPR 2018 Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth.

Segmentation Transfer Learning +2

Training Medical Image Analysis Systems like Radiologists

no code implementations28 May 2018 Gabriel Maicas, Andrew P. Bradley, Jacinto C. Nascimento, Ian Reid, Gustavo Carneiro

This process bears no direct resemblance with radiologist training, which is based on solving a series of tasks of increasing difficulty, where each task involves the use of significantly smaller datasets than those used in machine learning.

BIG-bench Machine Learning Classification +3

Just-in-Time Reconstruction: Inpainting Sparse Maps using Single View Depth Predictors as Priors

no code implementations11 May 2018 Chamara Saroj Weerasekera, Thanuja Dharmasiri, Ravi Garg, Tom Drummond, Ian Reid

Crucially, we obtain the confidence weights that parameterize the CRF model in a data-dependent manner via Convolutional Neural Networks (CNNs) which are trained to model the conditional depth error distributions given each source of input depth map and the associated RGB image.

Depth Estimation Depth Prediction

Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks

no code implementations ICLR 2019 S. Hamid Rezatofighi, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Daniel Cremers, Laura Leal-Taixé, Ian Reid

We demonstrate the validity of this new formulation on two relevant vision problems: object detection, for which our formulation outperforms state-of-the-art detectors such as Faster R-CNN and YOLO, and a complex CAPTCHA test, where we observe that, surprisingly, our set based network acquired the ability of mimicking arithmetics without any rules being coded.

object-detection Object Detection

Object Captioning and Retrieval with Natural Language

1 code implementation16 Mar 2018 Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object.

Object Retrieval

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

1 code implementation CVPR 2018 Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, Ian Reid

Despite learning based methods showing promising results in single view depth estimation and visual odometry, most existing approaches treat the tasks in a supervised manner.

Depth And Camera Motion Depth Prediction +2

Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image

no code implementations28 Feb 2018 Thanh-Toan Do, Ming Cai, Trung Pham, Ian Reid

Detecting objects and their 6D poses from only RGB images is an important task for many robotic applications.

Benchmarking Instance Segmentation +5

Binary Constrained Deep Hashing Network for Image Retrieval without Manual Annotation

no code implementations21 Feb 2018 Thanh-Toan Do, Tuan Hoang, Dang-Khoa Le Tan, Trung Pham, Huu Le, Ngai-Man Cheung, Ian Reid

However, training deep hashing networks for the task is challenging due to the binary constraints on the hash codes, the similarity preserving property, and the requirement for a vast amount of labelled images.

Deep Hashing Image Retrieval +1

Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments

8 code implementations CVPR 2018 Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton Van Den Hengel

This is significant because a robot interpreting a natural-language navigation instruction on the basis of what it sees is carrying out a vision and language process that is similar to Visual Question Answering.

Translation Vision and Language Navigation +2

Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries

no code implementations CVPR 2018 Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

To this end we propose a unified framework, the ParalleL AttentioN (PLAN) network, to discover the object in an image that is being referred to in variable length natural expression descriptions, from short phrases query to long multi-round dialogs.

Object Object Discovery +2

Learning Deeply Supervised Good Features to Match for Dense Monocular Reconstruction

no code implementations16 Nov 2017 Chamara Saroj Weerasekera, Ravi Garg, Yasir Latif, Ian Reid

Visual SLAM (Simultaneous Localization and Mapping) methods typically rely on handcrafted visual features or raw RGB values for establishing correspondences between images.

Depth Estimation Monocular Reconstruction +1

Towards Effective Low-bitwidth Convolutional Neural Networks

2 code implementations CVPR 2018 Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations.

Quantization

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

1 code implementation ICCV 2017 Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

The proposed method still builds one classifier for one interaction (as per type (ii) above), but the classifier built is adaptive to context via weights which are context dependent.

Relationship Detection Visual Relationship Detection

Addressing Challenging Place Recognition Tasks using Generative Adversarial Networks

1 code implementation26 Sep 2017 Yasir Latif, Ravi Garg, Michael Milford, Ian Reid

In the process, meaningful feature spaces are learned for each domain, the distances in which can be used for the task of place recognition.

Robotics

SceneCut: Joint Geometric and Object Segmentation for Indoor Scenes

no code implementations21 Sep 2017 Trung Pham, Thanh-Toan Do, Niko Sünderhauf, Ian Reid

This paper presents SceneCut, a novel approach to jointly discover previously unseen objects and non-object surfaces using a single RGB-D image.

Object Semantic Segmentation

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

2 code implementations21 Sep 2017 Thanh-Toan Do, Anh Nguyen, Ian Reid

We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images.

Affordance Detection Object +2

Visual Question Answering with Memory-Augmented Networks

no code implementations CVPR 2018 Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid

In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.

Question Answering Visual Question Answering

Adaptive Low-Rank Kernel Subspace Clustering

1 code implementation17 Jul 2017 Pan Ji, Ian Reid, Ravi Garg, Hongdong Li, Mathieu Salzmann

In this paper, we present a kernel subspace clustering method that can handle non-linear models.

Clustering Image Clustering +1

Care about you: towards large-scale human-centric visual relationship detection

no code implementations28 May 2017 Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

In addressing this problem we first construct a large-scale human-centric visual relationship detection dataset (HCVRD), which provides many more types of relationship annotation (nearly 10K categories) than the previous released datasets.

Human-Object Interaction Detection Relationship Detection +1

Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation

no code implementations25 May 2017 Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid

Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.

Segmentation Weakly supervised Semantic Segmentation +1

Smart Mining for Deep Metric Learning

no code implementations ICCV 2017 Ben Harwood, Vijay Kumar B G, Gustavo Carneiro, Ian Reid, Tom Drummond

In this paper, we propose a novel deep metric learning method that combines the triplet model and the global structure of the embedding space.

Metric Learning

A Branch-and-Bound Algorithm for Checkerboard Extraction in Camera-Laser Calibration

no code implementations4 Apr 2017 Alireza Khosravian, Tat-Jun Chin, Ian Reid

We formulate the checkerboard extraction as a combinatorial optimization problem with a clear cut objective function.

Combinatorial Optimization

Towards Context-aware Interaction Recognition

no code implementations18 Mar 2017 Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

Recognizing how objects interact with each other is a crucial task in visual recognition.

Deep Learning Features at Scale for Visual Place Recognition

no code implementations18 Jan 2017 Zetao Chen, Adam Jacobson, Niko Sunderhauf, Ben Upcroft, Lingqiao Liu, Chunhua Shen, Ian Reid, Michael Milford

The success of deep learning techniques in the computer vision domain has triggered a range of initial investigations into their utility for visual place recognition, all using generic features from networks that were trained for other types of recognition tasks.

Visual Place Recognition

From Motion Blur to Motion Flow: a Deep Learning Solution for Removing Heterogeneous Motion Blur

no code implementations CVPR 2017 Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton Van Den Hengel, Qinfeng Shi

The critical observation underpinning our approach is thus that learning the motion flow instead allows the model to focus on the cause of the blur, irrespective of the image content.

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

13 code implementations CVPR 2017 Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation.

3D Absolute Human Pose Estimation Semantic Segmentation +1

Meaningful Maps With Object-Oriented Semantic Mapping

no code implementations26 Sep 2016 Niko Sünderhauf, Trung T. Pham, Yasir Latif, Michael Milford, Ian Reid

For intelligent robots to interact in meaningful ways with their environment, they must understand both the geometric and semantic properties of the scene surrounding them.

Robotics

Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification

no code implementations1 Aug 2016 ZongYuan Ge, Chris McCool, Conrad Sanderson, Peng Wang, Lingqiao Liu, Ian Reid, Peter Corke

Fine-grained classification is a relatively new field that has concentrated on using information from a single image, while ignoring the enormous potential of using video data to improve classification.

Classification General Classification

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

2 code implementations19 Jun 2016 Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, Jose Neira, Ian Reid, John J. Leonard

Simultaneous Localization and Mapping (SLAM)consists in the concurrent construction of a model of the environment (the map), and the estimation of the state of the robot moving within it.

Robotics

Joint Probabilistic Matching Using m-Best Solutions

no code implementations CVPR 2016 Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

Matching between two sets of objects is typically approached by finding the object pairs that collectively maximize the joint matching score.

Person Re-Identification

Efficient Point Process Inference for Large-Scale Object Detection

no code implementations CVPR 2016 Trung T. Pham, Seyed Hamid Rezatofighi, Ian Reid, Tat-Jun Chin

We tackle the problem of large-scale object detection in images, where the number of objects can be arbitrarily large, and can exhibit significant overlap/occlusion.

Human Detection Object +2

Online Multi-Target Tracking Using Recurrent Neural Networks

no code implementations13 Apr 2016 Anton Milan, Seyed Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler

Here, we propose for the first time, an end-to-end learning approach for online multi-target tracking.

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

2 code implementations16 Mar 2016 Ravi Garg, Vijay Kumar BG, Gustavo Carneiro, Ian Reid

In this work we propose a unsupervised framework to learn a deep convolutional neural network for single view depth predic- tion, without requiring a pre-training stage or annotated ground truth depths.

Depth Estimation

Non-linear Dimensionality Regularizer for Solving Inverse Problems

no code implementations16 Mar 2016 Ravi Garg, Anders Eriksson, Ian Reid

Additionally, we evaluate our method on the challenging problem of Non-Rigid Structure from Motion and our approach delivers promising results on CMU mocap dataset despite the presence of significant occlusions and noise.

Exploring Context with Deep Structured models for Semantic Segmentation

no code implementations10 Mar 2016 Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid

We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions.

Image Segmentation Segmentation +1

Fast Training of Triplet-based Deep Binary Embedding Networks

no code implementations CVPR 2016 Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian Reid

To solve the first stage, we design a large-scale high-order binary codes inference algorithm to reduce the high-order objective to a standard binary quadratic problem such that graph cuts can be used to efficiently infer the binary code which serve as the label of each training datum.

Image Retrieval Multi-Label Classification +1

MOT16: A Benchmark for Multi-Object Tracking

8 code implementations2 Mar 2016 Anton Milan, Laura Leal-Taixe, Ian Reid, Stefan Roth, Konrad Schindler

Recently, a new benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal of collecting existing and new data and creating a framework for the standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking +2

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions

2 code implementations CVPR 2016 Vijay Kumar B G, Gustavo Carneiro, Ian Reid

Current results from machine learning show that replacing this siamese by a triplet network can improve the classification accuracy in several problems, but this has yet to be demonstrated for local image descriptor learning.

General Classification

Hierarchical Higher-Order Regression Forest Fields: An Application to 3D Indoor Scene Labelling

no code implementations ICCV 2015 Trung T. Pham, Ian Reid, Yasir Latif, Stephen Gould

Specifically, we relax the labelling problem to a regression, and generalize the higher-order associative P n Potts model to a new family of arbitrary higher-order models based on regression forests.

regression Semantic Segmentation

Joint Probabilistic Data Association Revisited

1 code implementation ICCV 2015 Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding the m-best solutions to an integer linear program.

Deeply Learning the Messages in Message Passing Inference

no code implementations NeurIPS 2015 Guosheng Lin, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The network output dimension for message estimation is the same as the number of classes, in contrast to the network output for general CNN potential functions in CRFs, which is exponential in the order of the potentials.

Image Segmentation Semantic Segmentation +1

The k-Support Norm and Convex Envelopes of Cardinality and Rank

no code implementations CVPR 2015 Anders Eriksson, Trung Thanh Pham, Tat-Jun Chin, Ian Reid

Sparsity, or cardinality, as a tool for feature selection is extremely common in a vast number of current computer vision applications.

Computational Efficiency feature selection

Joint Tracking and Segmentation of Multiple Targets

no code implementations CVPR 2015 Anton Milan, Laura Leal-Taixe, Konrad Schindler, Ian Reid

Tracking-by-detection has proven to be the most successful strategy to address the task of tracking multiple targets in unconstrained scenarios.

Video Segmentation Video Semantic Segmentation

MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking

2 code implementations8 Apr 2015 Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, Konrad Schindler

We discuss the challenges of creating such a framework, collecting existing and new data, gathering state-of-the-art methods to be tested on the datasets, and finally creating a unified evaluation system.

3D Reconstruction Multiple Object Tracking +3

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields

1 code implementation26 Feb 2015 Fayao Liu, Chunhua Shen, Guosheng Lin, Ian Reid

Therefore, here we present a deep convolutional neural field model for estimating depths from single monocular images, aiming to jointly explore the capacity of deep CNN and continuous CRF.

Depth Estimation

Dense 3D Face Correspondence

no code implementations19 Oct 2014 Syed Zulqarnain Gilani, Ajmal Mian, Faisal Shafait, Ian Reid

A deformable model (K3DM) is constructed from the dense corresponded faces and an algorithm is proposed for morphing the K3DM to fit unseen faces.

Face Recognition

Online Unsupervised Feature Learning for Visual Tracking

no code implementations7 Oct 2013 Fayao Liu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

Feature encoding with respect to an over-complete dictionary learned by unsupervised methods, followed by spatial pyramid pooling, and linear classification, has exhibited powerful strength in various vision applications.

Dictionary Learning Visual Tracking

Dense Reconstruction Using 3D Object Shape Priors

no code implementations CVPR 2013 Amaury Dame, Victor A. Prisacariu, Carl Y. Ren, Ian Reid

More specifically, we automatically augment our SLAM system with object specific identity, together with 6D pose and additional shape degrees of freedom for the object(s) of known class in the scene, combining image data and depth information for the pose and shape recovery.

3D Reconstruction Object

Cannot find the paper you are looking for? You can Submit a new open access paper.