Search Results for author: Ian Reid

Found 155 papers, 55 papers with code

Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

no code implementations • 8 Apr 2024 • Mahsa Ehsanpour, Ian Reid, Hamid Rezatofighi

The framework uses masked modeling to pre-train the encoder to reconstruct masked human joint trajectories, enabling it to learn generalizable and data-efficient representations of motion in crowded human scenes.

Action Understanding Multi-Person Pose forecasting +1

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

no code implementations • 2 Apr 2024 • Duy-Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi

JRDB-PanoTrack includes (1) various data involving indoor and outdoor crowded scenes, as well as comprehensive 2D and 3D synchronized data modalities; (2) high-quality 2D spatial panoptic segmentation and temporal tracking annotations, with additional 3D label projections for further spatial understanding; (3) diverse object classes for closed- and open-world recognition benchmarks, with OSPA-based metrics for evaluation.

Decision Making Panoptic Segmentation +1

PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

no code implementations • 14 Mar 2024 • Jiajun Deng, Sha Zhang, Feras Dayoub, Wanli Ouyang, Yanyong Zhang, Ian Reid

In this work, we present PoIFusion, a simple yet effective multi-modal 3D object detection framework that fuses information from RGB images and LiDAR point clouds at points of interest (abbreviated as PoIs).

3D Object Detection Object +1

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

no code implementations • 13 Mar 2024 • Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu

We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by 3D Gaussian Splatting (3DGS).

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM

1 code implementation • 12 Mar 2024 • Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang

Human motion generation stands as a significant pursuit in generative computer vision, while achieving long-sequence and efficient motion generation remains challenging.

Symmetry-Breaking Augmentations for Ad Hoc Teamwork

no code implementations • 15 Feb 2024 • Ravi Hammond, Dustin Craggs, Mingyu Guo, Jakob Foerster, Ian Reid

In many collaborative settings, artificial intelligence (AI) agents must be able to adapt to new teammates that use unknown or previously unobserved strategies.

Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes

no code implementations • 29 Jan 2024 • Raphael Lafargue, Yassir Bendou, Bastien Pasdeloup, Jean-Philippe Diguet, Ian Reid, Vincent Gripon, Jack Valmadre

Fine-tuning is ineffective for few-shot learning, since the target dataset contains only a handful of examples.

Cross-Domain Few-Shot Few-Shot Image Classification

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

no code implementations • 12 Jul 2023 • Krishan Rana, Jesse Haviland, Sourav Garg, Jad Abou-Chakra, Ian Reid, Niko Suenderhauf

To ensure the scalability of our approach, we: (1) exploit the hierarchical nature of 3DSGs to allow LLMs to conduct a 'semantic search' for task-relevant subgraphs from a smaller, collapsed representation of the full graph; (2) reduce the planning horizon for the LLM by integrating a classical path planner and (3) introduce an 'iterative replanning' pipeline that refines the initial plan using feedback from a scene graph simulator, correcting infeasible actions and avoiding planning failures.

Robot Task Planning

Semantic Segmentation on 3D Point Clouds with High Density Variations

no code implementations • 4 Jul 2023 • Ryan Faulkner, Luke Haub, Simon Ratcliffe, Ian Reid, Tat-Jun Chin

LiDAR scanning for surveying applications acquires measurements over wide areas and long distances, which produces large-scale 3D point clouds with significant local density variations.

3D Semantic Segmentation Segmentation

Assessing Domain Gap for Continual Domain Adaptation in Object Detection

1 code implementation • 21 Feb 2023 • Anh-Dzung Doan, Bach Long Nguyen, Surabhi Gupta, Ian Reid, Markus Wagner, Tat-Jun Chin

To ensure reliable object detection in autonomous systems, the detector must be able to adapt to changes in appearance caused by environmental factors such as time of day, weather, and seasons.

Domain Adaptation object-detection +1

Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation

1 code implementation • ICCV 2023 • Yuyuan Liu, Choubo Ding, Yu Tian, Guansong Pang, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Semantic segmentation models classify pixels into a set of known ("in-distribution") visual classes.

Ranked #1 on Anomaly Detection on Fishyscapes (using extra training data)

Anomaly Detection Contrastive Learning +4

Predicting Topological Maps for Visual Navigation in Unexplored Environments

no code implementations • 23 Nov 2022 • Huangying Zhan, Hamid Rezatofighi, Ian Reid

We propose a robotic learning system for autonomous exploration and navigation in unexplored environments.

Visual Navigation

ActiveRMAP: Radiance Field for Active Mapping And Planning

no code implementations • 23 Nov 2022 • Huangying Zhan, Jiyang Zheng, Yi Xu, Ian Reid, Hamid Rezatofighi

We present, for the first time, an RGB-only active vision framework that uses a radiance field representation for online active 3D reconstruction and planning.

3D Reconstruction

What Images are More Memorable to Machines?

1 code implementation • 14 Nov 2022 • Junlin Han, Huangying Zhan, Jie Hong, Pengfei Fang, Hongdong Li, Lars Petersson, Ian Reid

This paper studies the problem of measuring and predicting how memorable an image is to pattern recognition machines, as a path to explore machine intelligence.

SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes

2 code implementations • 7 Nov 2022 • Libo Sun, Jia-Wang Bian, Huangying Zhan, Wei Yin, Ian Reid, Chunhua Shen

Self-supervised monocular depth estimation has shown impressive results in static scenes.

Indoor Monocular Depth Estimation Monocular Depth Estimation +1

Globally Optimal Event-Based Divergence Estimation for Ventral Landing

1 code implementation • 27 Sep 2022 • Sofia McLeod, Gabriele Meoni, Dario Izzo, Anne Mergy, Daqi Liu, Yasir Latif, Ian Reid, Tat-Jun Chin

This is achieved by estimating divergence (inverse TTC), which is the rate of radial optic flow, from the event stream generated during landing.

The Edge of Disaster: A Battle Between Autonomous Racing and Safety

no code implementations • 30 Jun 2022 • Matthew Howe, James Bockman, Adrian Orenstein, Stefan Podgorski, Sam Bahrami, Ian Reid

Autonomous racing represents a uniquely challenging control environment where agents must act while on the limits of a vehicle's capability in order to set competitive lap times.

Model Predictive Control

CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping

1 code implementation • 31 May 2022 • Junlin Han, Lars Petersson, Hongdong Li, Ian Reid

We present a simple method, CropMix, for producing a rich input distribution from the original dataset distribution.

Contrastive Learning

Asynchronous Optimisation for Event-based Visual Odometry

no code implementations • 2 Mar 2022 • Daqi Liu, Alvaro Parra, Yasir Latif, Bo Chen, Tat-Jun Chin, Ian Reid

Event cameras open up new possibilities for robotic perception due to their low latency and high dynamic range.

Event-based vision Visual Odometry

You Only Cut Once: Boosting Data Augmentation with a Single Cut

1 code implementation • 28 Jan 2022 • Junlin Han, Pengfei Fang, Weihao Li, Jie Hong, Mohammad Ali Armin, Ian Reid, Lars Petersson, Hongdong Li

We present You Only Cut Once (YOCO) for performing data augmentations.

Data Augmentation

PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

1 code implementation • 22 Oct 2021 • Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

The most competitive noisy label learning methods rely on an unsupervised classification of clean and noisy samples, where samples classified as noisy are re-labelled and "MixMatched" with the clean samples.

Ranked #1 on Image Classification with Label Noise on CIFAR-100

Image Classification with Label Noise Learning with noisy labels

Weakly Supervised Training of Monocular 3D Object Detectors Using Wide Baseline Multi-view Traffic Camera Data

1 code implementation • 21 Oct 2021 • Matthew Howe, Ian Reid, Jamie Mackenzie

Our method achieves vehicle 7DoF pose prediction accuracy on our dataset comparable to the top performing monocular 3D object detectors on autonomous vehicle datasets.

Autonomous Vehicles Object +1

Autonomy and Perception for Space Mining

no code implementations • 27 Sep 2021 • Ragav Sachdeva, Ravi Hammond, James Bockman, Alec Arthur, Brandon Smart, Dustin Craggs, Anh-Dzung Doan, Thomas Rowntree, Elijah Schutz, Adrian Orenstein, Andy Yu, Tat-Jun Chin, Ian Reid

Future Moon bases will likely be constructed using resources mined from the surface of the Moon.

ODAM: Object Detection, Association, and Mapping using Posed RGB Video

1 code implementation • ICCV 2021 • Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe

Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics.

3D Object Detection Object +2

JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection

no code implementations • CVPR 2022 • Mahsa Ehsanpour, Fatemeh Saleh, Silvio Savarese, Ian Reid, Hamid Rezatofighi

However, learning to recognise human actions and their social interactions in an unconstrained real-world environment comprising numerous people, with potentially highly unbalanced and long-tailed distributed action labels from a stream of sensory data captured from a mobile robot platform remains a significant challenge, not least owing to the lack of a reflective large-scale dataset.

Action Detection Action Understanding +1

Unsupervised Scale-consistent Depth Learning from Video

2 code implementations • 25 May 2021 • Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Zhichao Li, Le Zhang, Chunhua Shen, Ming-Ming Cheng, Ian Reid

We propose a monocular depth estimator, SC-Depth, which requires only unlabelled videos for training and enables scale-consistent prediction at inference time.

Ranked #6 on Monocular Depth Estimation on NYU-Depth V2 self-supervised

Monocular Depth Estimation Monocular Visual Odometry +1

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

no code implementations • ICCV 2021 • Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi

Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems.

Autonomous Driving Human-Object Interaction Detection

Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

1 code implementation • 27 Mar 2021 • Tianyu Zhu, Markus Hiller, Mahsa Ehsanpour, Rongkai Ma, Tom Drummond, Ian Reid, Hamid Rezatofighi

Tracking a time-varying, indefinite number of objects in a video sequence remains a challenge despite recent advances in the field.

Multi-Object Tracking Object +1

ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning

1 code implementation • 21 Mar 2021 • Ragav Sachdeva, Filipe R Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

We propose a new training algorithm, ScanMix, that explores semantic clustering and semi-supervised learning (SSL) to allow superior robustness to severe label noise and competitive robustness to non-severe label noise problems, in comparison to state-of-the-art (SOTA) methods.

Ranked #24 on Image Classification on mini WebVision 1.0

Clustering Image Classification

Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging

no code implementations • CVPR 2021 • Álvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid

Under mild conditions on the noise level of the measurements, rotation averaging satisfies strong duality, which enables global solutions to be obtained via semidefinite programming (SDP) relaxation.

valid

LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment

1 code implementation • 6 Mar 2021 • Filipe R. Cordeiro, Ragav Sachdeva, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue.

Ranked #4 on Image Classification on Food-101N

Image Classification

Self-supervised Mean Teacher for Semi-supervised Chest X-ray Classification

1 code implementation • 5 Mar 2021 • Fengbei Liu, Yu Tian, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

In this paper, we propose Self-supervised Mean Teacher for Semi-supervised (S$^2$MTS$^2$) learning that combines self-supervised mean-teacher pre-training with semi-supervised fine-tuning.

Ranked #2 on Semi-supervised Medical Image Classification on Chest X-Ray14 2% labeled

Contrastive Learning General Classification +3

DF-VO: What Should Be Learnt for Visual Odometry?

2 code implementations • 1 Mar 2021 • Huangying Zhan, Chamara Saroj Weerasekera, Jia-Wang Bian, Ravi Garg, Ian Reid

More surprisingly, they show that the well-trained networks enable scale-consistent predictions over long videos, while the accuracy is still inferior to traditional methods because geometric information is ignored.

Monocular Visual Odometry Optical Flow Estimation

Semantics for Robotic Mapping, Perception and Interaction: A Survey

no code implementations • 2 Jan 2021 • Sourav Garg, Niko Sünderhauf, Feras Dayoub, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian Reid, Stephen Gould, Peter Corke, Michael Milford

In robotics and related research fields, the study of understanding is often referred to as semantics, which dictates what the world "means" to a robot, and is strongly tied to the question of how to represent that meaning.

Autonomous Driving Navigate

MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos

no code implementations • 9 Dec 2020 • Kejie Li, Hamid Rezatofighi, Ian Reid

Given a new RGB frame, MOLTR firstly applies a monocular 3D detector to localise objects of interest and extract their shape codes that represent the object shapes in a learned embedding space.

Benchmarking Object +1

EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

1 code implementation • 11 Nov 2020 • Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

In this work, we study a new variant of the noisy label problem that combines the open-set and closed-set noisy labels, and introduce a benchmark evaluation to assess the performance of training algorithms under this setup.

HM4: Hidden Markov Model with Memory Management for Visual Place Recognition

no code implementations • 1 Nov 2020 • Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Ian Reid

However, this creates an unboundedly-growing database that poses time and memory scalability challenges for place recognition methods.

Autonomous Driving Management +1

MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking

no code implementations • 15 Oct 2020 • Patrick Dendorfer, Aljoša Ošep, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth, Laura Leal-Taixé

We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data, and create a framework for the standardized evaluation of multiple object tracking methods.

Multiple Object Tracking Multiple People Tracking +3

How Trustworthy are Performance Evaluations for Basic Vision Tasks?

no code implementations • 8 Aug 2020 • Tran Thien Dat Nguyen, Hamid Rezatofighi, Ba-Ngu Vo, Ba-Tuong Vo, Silvio Savarese, Ian Reid

This paper examines performance evaluation criteria for basic vision tasks involving sets of objects, namely object detection, instance-level segmentation, and multi-object tracking.

Multi-Object Tracking object-detection +1

Socially and Contextually Aware Human Motion and Pose Forecasting

no code implementations • 14 Jul 2020 • Vida Adeli, Ehsan Adeli, Ian Reid, Juan Carlos Niebles, Hamid Rezatofighi

In this paper, we propose a novel framework to tackle both tasks of human motion (or trajectory) and body skeleton pose forecasting in a unified end-to-end pipeline.

Human Dynamics Robot Navigation

Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos

no code implementations • ECCV 2020 • Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi

In this paper, we solve the problem of simultaneously grouping people by their social interactions, predicting their individual actions and the social activity of each social group, which we call the social task.

Group Activity Recognition

Auto-Rectify Network for Unsupervised Indoor Depth Estimation

1 code implementation • 4 Jun 2020 • Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Tat-Jun Chin, Chunhua Shen, Ian Reid

However, excellent results have mostly been obtained in street-scene driving scenarios, and such methods often fail in other settings, particularly indoor videos taken by handheld devices.

Ranked #62 on Monocular Depth Estimation on NYU-Depth V2

Monocular Depth Estimation Self-Supervised Learning +1

FroDO: From Detections to 3D Objects

no code implementations • 11 May 2020 • Kejie Li, Martin Rünz, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose and shape in a coarse-to-fine manner.

3D Reconstruction Object +2

MOT20: A benchmark for multi object tracking in crowded scenes

1 code implementation • 19 Mar 2020 • Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal of establishing a standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking with Transformer +2

3D Gated Recurrent Fusion for Semantic Scene Completion

no code implementations • 17 Feb 2020 • Yu Liu, Jie Li, Qingsen Yan, Xia Yuan, Chunxia Zhao, Ian Reid, Cesar Cadena

This paper tackles the problem of data fusion in the semantic scene completion (SSC) task, which can simultaneously deal with semantic labeling and scene completion.

Ranked #14 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Scene Understanding

Hyperspectral Classification Based on 3D Asymmetric Inception Network with Data Fusion Transfer Learning

1 code implementation • 11 Feb 2020 • Haokui Zhang, Yu Liu, Bei Fang, Ying Li, Lingqiao Liu, Ian Reid

Hyperspectral image (HSI) classification has been improved by convolutional neural networks (CNNs) in recent years.

General Classification Transfer Learning

Switchable Precision Neural Networks

no code implementations • 7 Feb 2020 • Luis Guerra, Bohan Zhuang, Ian Reid, Tom Drummond

Instantaneous, on-demand accuracy-efficiency trade-offs have recently been explored in the context of neural network slimming.

Quantization

Automatic Pruning for Quantized Neural Networks

no code implementations • 3 Feb 2020 • Luis Guerra, Bohan Zhuang, Ian Reid, Tom Drummond

In particular, for ResNet-18 on ImageNet, we prune 26.12% of the model size with Binarized Neural Network quantization, achieving a top-1 classification accuracy of 47.32% in a model of 2.47 MB and 59.30% with a 2-bit DoReFa-Net in 4.36 MB.

Bayesian Optimization Quantization

Learn to Predict Sets Using Feed-Forward Neural Networks

no code implementations • 30 Jan 2020 • Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

In our formulation we define a likelihood for a set distribution represented by a) two discrete distributions defining the set cardinality and permutation variables, and b) a joint distribution over set elements with a fixed cardinality.

Multi-Label Image Classification object-detection +1

Depth Based Semantic Scene Completion with Position Importance Aware Loss

1 code implementation • 29 Jan 2020 • Yu Liu, Jie Li, Xia Yuan, Chunxia Zhao, Roland Siegwart, Ian Reid, Cesar Cadena

We propose PALNet, a novel hybrid network for SSC based on single depth.

3D Semantic Segmentation Position

SG-VAE: Scene Grammar Variational Autoencoder to generate new indoor scenes

no code implementations • ECCV 2020 • Pulak Purkait, Christopher Zach, Ian Reid

Our method learns the co-occurrences, and appearance parameters such as shape and pose, for different object categories through a grammar-based auto-encoder, resulting in a compact and accurate representation for scene layouts.

valid

NeuRoRA: Neural Robust Rotation Averaging

1 code implementation • ECCV 2020 • Pulak Purkait, Tat-Jun Chin, Ian Reid

Although the idea of replacing robust optimization methods by a graph-based network is demonstrated only for multiple rotation averaging, it could easily be extended to other graph-based geometric problems, for example, pose-graph optimization.

Robot Navigation

Improved Visual Localization via Graph Smoothing

no code implementations • 7 Nov 2019 • Carlos Lassance, Yasir Latif, Ravi Garg, Vincent Gripon, Ian Reid

One solution to this problem is to learn a deep neural network to infer the pose of a query image after learning on a dataset of images with known poses.

Image Retrieval Retrieval +1

Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation

no code implementations • 28 Sep 2019 • Yu Liu, Lingqiao Liu, Haokui Zhang, Hamid Rezatofighi, Ian Reid

This paper tackles the problem of video object segmentation.

Meta-Learning Object +4

Structured Binary Neural Networks for Image Recognition

no code implementations • 22 Sep 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Peng Chen, Lingqiao Liu, Ian Reid

Experiments on classification, semantic segmentation, and object detection tasks demonstrate the superior performance of the proposed methods over various quantized networks in the literature.

object-detection Object Detection +2

Visual Odometry Revisited: What Should Be Learnt?

2 code implementations • 21 Sep 2019 • Huangying Zhan, Chamara Saroj Weerasekera, Jia-Wang Bian, Ian Reid

In this work we present a monocular visual odometry (VO) algorithm which leverages geometry-based methods and deep learning.

Monocular Visual Odometry

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video

2 code implementations • NeurIPS 2019 • Jia-Wang Bian, Zhichao Li, Naiyan Wang, Huangying Zhan, Chunhua Shen, Ming-Ming Cheng, Ian Reid

To the best of our knowledge, this is the first work to show that deep networks trained using unlabelled monocular videos can predict globally scale-consistent camera trajectories over a long video sequence.

Ranked #61 on Monocular Depth Estimation on KITTI Eigen split

Depth And Camera Motion Monocular Depth Estimation +1

An Evaluation of Feature Matchers for Fundamental Matrix Estimation

no code implementations • 26 Aug 2019 • Jia-Wang Bian, Yu-Huan Wu, Ji Zhao, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid

According to this, we propose three high-quality matching systems and a Coarse-to-Fine RANSAC estimator.

In defense of OSVOS

no code implementations • 19 Aug 2019 • Yu Liu, Yutong Dai, Anh-Dzung Doan, Lingqiao Liu, Ian Reid

By adding a common module, video loss, which we formulate with various forms of constraints (including weighted BCE loss, high-dimensional triplet loss, as well as a novel mixed instance-aware video loss), to train the parent network in step (2), the network is then better prepared for step (3), i.e., online fine-tuning on the target instance.

Depth Estimation Object +6

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

no code implementations • 10 Aug 2019 • Bohan Zhuang, Jing Liu, Mingkui Tan, Lingqiao Liu, Ian Reid, Chunhua Shen

Furthermore, we propose a second progressive quantization scheme which gradually decreases the bit-width from high-precision to low-precision during training.

Knowledge Distillation Quantization

Scalable Place Recognition Under Appearance Change for Autonomous Driving

no code implementations • ICCV 2019 • Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid

Our experiments show that, compared to state-of-the-art techniques, our method has much greater potential for large-scale place recognition for autonomous driving.

Autonomous Driving Visual Place Recognition

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation • 23 Jul 2019 • Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Ian Reid

In this paper, a non-convex non-smooth optimization framework is proposed to achieve diverse smoothing natures where even contradictory smoothing behaviors can be achieved.

image smoothing

Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks

no code implementations • NeurIPS 2019 • Vineet Kosaraju, Amir Sadeghian, Roberto Martín-Martín, Ian Reid, S. Hamid Rezatofighi, Silvio Savarese

This problem is compounded by the presence of social interactions between humans and their physical interactions with the scene.

Ranked #17 on Trajectory Prediction on ETH/UCY

Autonomous Vehicles Generative Adversarial Network +2

CVPR19 Tracking and Detection Challenge: How crowded can it get?

no code implementations • 10 Jun 2019 • Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixe

Standardized benchmarks are crucial for the majority of computer vision applications.

Multiple Object Tracking Multiple People Tracking +1

Seeing Behind Things: Extending Semantic Segmentation to Occluded Regions

no code implementations • 7 Jun 2019 • Pulak Purkait, Christopher Zach, Ian Reid

In our experiments we demonstrate that a CNN trained by minimizing the proposed loss is able to predict semantic categories for visible and occluded object parts without requiring an increase in network size (compared to a standard segmentation task).

Segmentation Semantic Segmentation

Bayesian Generative Active Deep Learning

no code implementations • 26 Apr 2019 • Toan Tran, Thanh-Toan Do, Ian Reid, Gustavo Carneiro

Deep learning models have demonstrated outstanding performance in several problems, but their training process tends to require immense amounts of computational and human resources for training and labeling, constraining the types of problems that can be tackled.

Active Learning Data Augmentation

Attention-guided Network for Ghost-free High Dynamic Range Imaging

5 code implementations • CVPR 2019 • Qingsen Yan, Dong Gong, Qinfeng Shi, Anton Van Den Hengel, Chunhua Shen, Ian Reid, Yanning Zhang

Ghosting artifacts caused by moving objects or misalignments are a key challenge in high dynamic range (HDR) imaging for dynamic scenes.

Optical Flow Estimation Vocal Bursts Intensity Prediction

A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning

no code implementations • CVPR 2019 • Thanh-Toan Do, Toan Tran, Ian Reid, Vijay Kumar, Tuan Hoang, Gustavo Carneiro

Another approach explored in the field relies on an ad-hoc linearization (in terms of N) of the triplet loss that introduces class centroids, which must be optimized using the whole training set for each mini-batch; this means that a naive implementation of this approach has run-time complexity O(N^2).

Metric Learning Retrieval

Architecture Search of Dynamic Cells for Semantic Video Segmentation

no code implementations • 4 Apr 2019 • Vladimir Nekrasov, Hao Chen, Chunhua Shen, Ian Reid

In semantic video segmentation the goal is to acquire consistent dense semantic labelling across image frames.

Neural Architecture Search Optical Flow Estimation +3

Template-Based Automatic Search of Compact Semantic Segmentation Architectures

1 code implementation • 4 Apr 2019 • Vladimir Nekrasov, Chunhua Shen, Ian Reid

Automatic search of neural architectures for various vision and natural language tasks is becoming a prominent tool as it allows the discovery of high-performing structures on any dataset of interest.

Ranked #13 on Semantic Segmentation on CamVid

General Classification Holdout Set +1

Training Quantized Neural Networks with a Full-precision Auxiliary Module

no code implementations • CVPR 2020 • Bohan Zhuang, Lingqiao Liu, Mingkui Tan, Chunhua Shen, Ian Reid

In this paper, we seek to tackle a challenge in training low-precision networks: the notorious difficulty in propagating gradient through a low-precision network due to the non-differentiable quantization function.

Image Classification object-detection +2

V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation

no code implementations • 23 Mar 2019 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

We propose V2CNet, a new deep learning framework to automatically translate the demonstration videos to commands that can be directly used in robotic applications.

RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

no code implementations • CVPR 2019 • Jie Li, Yu Liu, Dong Gong, Qinfeng Shi, Xia Yuan, Chunxia Zhao, Ian Reid

RGB images differ from depth images in that they carry more detail about color and texture, which can be used as a vital complement to depth for boosting the performance of 3D semantic scene completion (SSC).

Ranked #19 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Scene Labeling

Paper
Add Code

Self-supervised Learning for Single View Depth and Surface Normal Estimation

no code implementations • 1 Mar 2019 • Huangying Zhan, Chamara Saroj Weerasekera, Ravi Garg, Ian Reid

In this work we present a self-supervised learning framework to simultaneously train two Convolutional Neural Networks (CNNs) to predict depth and surface normals from a single image.

Ranked #62 on Monocular Depth Estimation on KITTI Eigen split

Depth Prediction Monocular Depth Estimation +2

Paper
Add Code

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

10 code implementations • CVPR 2019 • Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, Silvio Savarese

By incorporating this generalized $IoU$ ($GIoU$) as a loss into state-of-the-art object detection frameworks, we show a consistent improvement in their performance using both the standard, $IoU$-based, and new, $GIoU$-based, performance measures on popular object detection benchmarks such as PASCAL VOC and MS COCO.

Object object-detection +2

12,041

Paper
Code
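
The generalized IoU named in the entry above can be illustrated with a minimal sketch. It uses the standard definition, $GIoU = IoU - |C \setminus (A \cup B)| / |C|$, where $C$ is the smallest axis-aligned box enclosing both boxes. The function names `giou` and `giou_loss`, the `(x1, y1, x2, y2)` box layout, and the assumption that both boxes have positive area are choices made for this illustration and are not taken from the paper's released code.

```python
def giou(box_a, box_b):
    """Generalized IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Assumes x2 > x1 and y2 > y1 for both boxes (positive area).
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)

    # Intersection rectangle; width/height clamp to zero when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = area_a + area_b - inter

    # Smallest enclosing box C of the two inputs.
    enc_w = max(ax2, bx2) - min(ax1, bx1)
    enc_h = max(ay2, by2) - min(ay1, by1)
    area_c = enc_w * enc_h

    iou = inter / union
    # Penalise the part of C that is covered by neither box.
    return iou - (area_c - union) / area_c


def giou_loss(box_a, box_b):
    """Loss form used for regression: 1 - GIoU, which lies in [0, 2]."""
    return 1.0 - giou(box_a, box_b)
```

Unlike plain IoU, which is zero for every pair of non-overlapping boxes, this quantity keeps decreasing as disjoint boxes move further apart, so the loss still distinguishes near misses from far misses.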

Visual SLAM: Why Bundle Adjust?

no code implementations • 11 Feb 2019 • Álvaro Parra, Tat-Jun Chin, Anders Eriksson, Ian Reid

Bundle adjustment plays a vital role in feature-based monocular SLAM.

Paper
Add Code

Multi-modal Ensemble Classification for Generalized Zero Shot Learning

no code implementations • 15 Jan 2019 • Rafael Felix, Michele Sasdelli, Ian Reid, Gustavo Carneiro

In this paper, we mitigate these issues by proposing a new GZSL method based on multi-modal training and testing processes, where the optimization explicitly promotes a balanced classification accuracy between seen and unseen classes.

Bayesian Inference Classification +2

Paper
Add Code

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

no code implementations • 12 Jan 2019 • Yu Liu, Lingqiao Liu, Hamid Rezatofighi, Thanh-Toan Do, Qinfeng Shi, Ian Reid

As the post-processing step for object detection, non-maximum suppression (GreedyNMS) has been widely used in most detectors for many years.

object-detection Object Detection

Paper
Add Code

Single-view Object Shape Reconstruction Using Deep Shape Prior and Silhouette

no code implementations • 29 Nov 2018 • Kejie Li, Ravi Garg, Ming Cai, Ian Reid

3D shape reconstruction from a single image is a highly ill-posed problem.

3D Reconstruction 3D Shape Reconstruction +1

Paper
Add Code

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

no code implementations • CVPR 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

In this paper, we propose to train convolutional neural networks (CNNs) with both binarized weights and activations, leading to quantized models specifically for mobile devices with limited power capacity and computation resources.

General Classification Image Classification +2

Paper
Add Code

Visual Localization Under Appearance Change: Filtering Approaches

3 code implementations • 20 Nov 2018 • Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Shin-Fang Ch'ng, Thanh-Toan Do, Ian Reid

Our approaches rely on local features with an encoding technique to represent an image as a single vector.

Visual Localization Visual Place Recognition

Paper
Code

Scalable Deep $k$-Subspace Clustering

no code implementations • 2 Nov 2018 • Tong Zhang, Pan Ji, Mehrtash Harandi, Richard Hartley, Ian Reid

In this paper, we introduce a method that simultaneously learns an embedding space along subspaces within it to minimize a notion of reconstruction error, thus addressing the problem of subspace clustering in an end-to-end learning paradigm.

Clustering

Paper
Add Code

Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells

4 code implementations • CVPR 2019 • Vladimir Nekrasov, Hao Chen, Chunhua Shen, Ian Reid

While most results in this domain have been achieved on image classification and language modelling problems, here we concentrate on dense per-pixel tasks, in particular, semantic image segmentation using fully convolutional networks.

Ranked #13 on Semantic Segmentation on PASCAL VOC 2012 val

Depth Prediction Image Classification +8

333

Paper
Code

Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks

1 code implementation • 16 Oct 2018 • Zhibin Liao, Tom Drummond, Ian Reid, Gustavo Carneiro

Furthermore, the proposed measurements also allow us to show that it is possible to optimise the training process with a new dynamic sampling training approach that continuously and automatically changes the mini-batch size and learning rate during the training process.

General Classification Image Classification

Paper
Code

Light-Weight RefineNet for Real-Time Semantic Segmentation

2 code implementations • 8 Oct 2018 • Vladimir Nekrasov, Chunhua Shen, Ian Reid

We consider an important task of effective and efficient semantic image segmentation.

Ranked #2 on Real-Time Semantic Segmentation on NYU Depth v2

Image Segmentation Real-Time Semantic Segmentation +1

726

Paper
Code

Diagnostics in Semantic Segmentation

no code implementations • 27 Sep 2018 • Vladimir Nekrasov, Chunhua Shen, Ian Reid

Over the past years, the computer vision community has contributed to enormous progress in semantic image segmentation, a per-pixel classification task, crucial for dense scene understanding and rapidly becoming vital in many real-world applications, including driverless cars and medical imaging.

Image Segmentation Scene Understanding +2

Paper
Add Code

Pre and Post-hoc Diagnosis and Interpretation of Malignancy from Breast DCE-MRI

no code implementations • 25 Sep 2018 • Gabriel Maicas, Andrew P. Bradley, Jacinto C. Nascimento, Ian Reid, Gustavo Carneiro

Conversely, traditional approaches follow a pre-hoc approach that initially localises suspicious areas that are subsequently classified to establish the breast malignancy -- this approach is trained using strongly annotated data (i.e., it needs a delineation and classification of all lesions in an image).

Paper
Add Code

Real-Time Monocular Object-Model Aware Sparse SLAM

no code implementations • 24 Sep 2018 • Mehdi Hosseinzadeh, Kejie Li, Yasir Latif, Ian Reid

While sparse point-based SLAM methods provide accurate camera localization, the generated maps lack semantic information.

Camera Localization Object +3

Paper
Add Code

Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations

4 code implementations • 13 Sep 2018 • Vladimir Nekrasov, Thanuja Dharmasiri, Andrew Spek, Tom Drummond, Chunhua Shen, Ian Reid

Deployment of deep learning models in robotics as sensory information extractors can be a daunting task to handle, even using generic GPU cards.

Ranked #6 on Real-Time Semantic Segmentation on NYU Depth v2

Knowledge Distillation Monocular Depth Estimation +3

196

Paper
Code

Efficient Dense Point Cloud Object Reconstruction using Deformation Vector Fields

no code implementations • ECCV 2018 • Kejie Li, Trung Pham, Huangying Zhan, Ian Reid

Given a single image at an arbitrary viewpoint, a CNN predicts multiple surfaces, each in a canonical location relative to the object.

3D Object Reconstruction Object

Paper
Add Code

Deep Regression Tracking with Shrinkage Loss

1 code implementation • ECCV 2018 • Xiankai Lu, Chao Ma, Bingbing Ni, Xiaokang Yang, Ian Reid, Ming-Hsuan Yang

Regression trackers directly learn a mapping from regularly dense samples of target objects to soft labels, which are usually generated by a Gaussian function, to estimate target positions.

regression

Paper
Code

Training Compact Neural Networks with Binary Weights and Low Precision Activations

no code implementations • 8 Aug 2018 • Bohan Zhuang, Chunhua Shen, Ian Reid

In this paper, we propose to train a network with binary weights and low-bitwidth activations, designed especially for mobile devices with limited power consumption.

Paper
Add Code

MatchBench: An Evaluation of Feature Matchers

no code implementations • 7 Aug 2018 • Jia-Wang Bian, Ruihan Yang, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid, WenHai Wu

This leads to a critical gap in the field: there are no standard datasets or evaluation metrics for evaluating different feature matchers fairly.

Paper
Add Code

Multi-modal Cycle-consistent Generalized Zero-Shot Learning

1 code implementation • ECCV 2018 • Rafael Felix, B. G. Vijay Kumar, Ian Reid, Gustavo Carneiro

In generalized zero shot learning (GZSL), the set of classes is split into seen and unseen classes, where training relies on the semantic features of the seen and unseen classes and the visual representations of only the seen classes, while testing uses the visual representations of the seen and unseen classes.

Ranked #5 on Generalized Zero-Shot Learning on SUN Attribute

General Classification Generalized Zero-Shot Learning

Paper
Code

Model Agnostic Saliency for Weakly Supervised Lesion Detection from Breast DCE-MRI

no code implementations • 20 Jul 2018 • Gabriel Maicas, Gerard Snaauw, Andrew P. Bradley, Ian Reid, Gustavo Carneiro

There is a heated debate on how to interpret the decisions provided by deep learning models (DLM), where the main approaches rely on the visualization of salient regions to interpret the DLM classification process.

General Classification Lesion Detection

Paper
Add Code

Bayesian Semantic Instance Segmentation in Open Set World

no code implementations • ECCV 2018 • Trung Pham, Vijay Kumar B G, Thanh-Toan Do, Gustavo Carneiro, Ian Reid

In this paper, we present a novel open-set semantic instance segmentation approach capable of segmenting all known and unknown object classes in images, based on the output of an object detector trained on known object classes.

Instance Segmentation Object +2

Paper
Add Code

Bootstrapping the Performance of Webly Supervised Semantic Segmentation

1 code implementation • CVPR 2018 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth.

Segmentation Transfer Learning +2

Paper
Code

Training Medical Image Analysis Systems like Radiologists

no code implementations • 28 May 2018 • Gabriel Maicas, Andrew P. Bradley, Jacinto C. Nascimento, Ian Reid, Gustavo Carneiro

This process bears no direct resemblance to radiologist training, which is based on solving a series of tasks of increasing difficulty, where each task involves the use of significantly smaller datasets than those used in machine learning.

BIG-bench Machine Learning Classification +3

Paper
Add Code

Just-in-Time Reconstruction: Inpainting Sparse Maps using Single View Depth Predictors as Priors

no code implementations • 11 May 2018 • Chamara Saroj Weerasekera, Thanuja Dharmasiri, Ravi Garg, Tom Drummond, Ian Reid

Crucially, we obtain the confidence weights that parameterize the CRF model in a data-dependent manner via Convolutional Neural Networks (CNNs) which are trained to model the conditional depth error distributions given each source of input depth map and the associated RGB image.

Depth Estimation Depth Prediction

Paper
Add Code

Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks

no code implementations • ICLR 2019 • S. Hamid Rezatofighi, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Daniel Cremers, Laura Leal-Taixé, Ian Reid

We demonstrate the validity of this new formulation on two relevant vision problems: object detection, for which our formulation outperforms state-of-the-art detectors such as Faster R-CNN and YOLO, and a complex CAPTCHA test, where we observe that, surprisingly, our set-based network acquired the ability to mimic arithmetic without any rules being coded.

object-detection Object Detection

Paper
Add Code

Structure Aware SLAM using Quadrics and Planes

no code implementations • 24 Apr 2018 • Mehdi Hosseinzadeh, Yasir Latif, Trung Pham, Niko Suenderhauf, Ian Reid

Simultaneous Localization And Mapping (SLAM) is a fundamental problem in mobile robotics.

Camera Localization Object +3

Paper
Add Code

Object Captioning and Retrieval with Natural Language

1 code implementation • 16 Mar 2018 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis

The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object.

Object Retrieval

Paper
Code

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

1 code implementation • CVPR 2018 • Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, Ian Reid

Despite learning based methods showing promising results in single view depth estimation and visual odometry, most existing approaches treat the tasks in a supervised manner.

Depth And Camera Motion Depth Prediction +2

345

Paper
Code

Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image

no code implementations • 28 Feb 2018 • Thanh-Toan Do, Ming Cai, Trung Pham, Ian Reid

Detecting objects and their 6D poses from only RGB images is an important task for many robotic applications.

Benchmarking Instance Segmentation +5

Paper
Add Code

Binary Constrained Deep Hashing Network for Image Retrieval without Manual Annotation

no code implementations • 21 Feb 2018 • Thanh-Toan Do, Tuan Hoang, Dang-Khoa Le Tan, Trung Pham, Huu Le, Ngai-Man Cheung, Ian Reid

However, training deep hashing networks for the task is challenging due to the binary constraints on the hash codes, the similarity preserving property, and the requirement for a vast amount of labelled images.

Deep Hashing Image Retrieval +1

Paper
Add Code

Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning

no code implementations • CVPR 2018 • Qi Wu, Peng Wang, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The Visual Dialogue task requires an agent to engage in a conversation about an image with a human.

Ranked #4 on Visual Dialog on VisDial v0.9 val

Question Answering Visual Dialog +1

Paper
Add Code

Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments

8 code implementations • CVPR 2018 • Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton Van Den Hengel

This is significant because a robot interpreting a natural-language navigation instruction on the basis of what it sees is carrying out a vision and language process that is similar to Visual Question Answering.

Ranked #10 on Visual Navigation on R2R

Translation Vision and Language Navigation +2

454

Paper
Code

Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries

no code implementations • CVPR 2018 • Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

To this end we propose a unified framework, the ParalleL AttentioN (PLAN) network, to discover the object in an image that is being referred to in variable-length natural expression descriptions, from short phrase queries to long multi-round dialogs.

Object Object Discovery +2

Paper
Add Code

Learning Deeply Supervised Good Features to Match for Dense Monocular Reconstruction

no code implementations • 16 Nov 2017 • Chamara Saroj Weerasekera, Ravi Garg, Yasir Latif, Ian Reid

Visual SLAM (Simultaneous Localization and Mapping) methods typically rely on handcrafted visual features or raw RGB values for establishing correspondences between images.

Depth Estimation Monocular Reconstruction +1

Paper
Add Code

Towards Effective Low-bitwidth Convolutional Neural Networks

2 code implementations • CVPR 2018 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations.

Quantization

Paper
Code

A Bayesian Data Augmentation Approach for Learning Deep Models

1 code implementation • NeurIPS 2017 • Toan Tran, Trung Pham, Gustavo Carneiro, Lyle Palmer, Ian Reid

Data augmentation is an essential part of the training process applied to deep learning models.

Data Augmentation General Classification +1

Paper
Code

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

1 code implementation • ICCV 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

The proposed method still builds one classifier for one interaction (as per type (ii) above), but the classifier built is adaptive to context via weights which are context dependent.

Relationship Detection Visual Relationship Detection

Paper
Code

Addressing Challenging Place Recognition Tasks using Generative Adversarial Networks

1 code implementation • 26 Sep 2017 • Yasir Latif, Ravi Garg, Michael Milford, Ian Reid

In the process, meaningful feature spaces are learned for each domain, the distances in which can be used for the task of place recognition.

Robotics

Paper
Code

SceneCut: Joint Geometric and Object Segmentation for Indoor Scenes

no code implementations • 21 Sep 2017 • Trung Pham, Thanh-Toan Do, Niko Sünderhauf, Ian Reid

This paper presents SceneCut, a novel approach to jointly discover previously unseen objects and non-object surfaces using a single RGB-D image.

Object Semantic Segmentation

Paper
Add Code

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

2 code implementations • 21 Sep 2017 • Thanh-Toan Do, Anh Nguyen, Ian Reid

We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images.

Affordance Detection Object +2

119

Paper
Code

Joint Learning of Set Cardinality and State Distribution

no code implementations • 13 Sep 2017 • S. Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Anthony Dick, Ian Reid

We present a novel approach for learning to predict sets using deep learning.

Multi-Label Image Classification valid

Paper
Add Code

Deep Subspace Clustering Networks

3 code implementations • NeurIPS 2017 • Pan Ji, Tong Zhang, Hongdong Li, Mathieu Salzmann, Ian Reid

We present a novel deep neural network architecture for unsupervised subspace clustering.

Ranked #3 on Image Clustering on Extended Yale-B

Clustering Image Clustering

203

Paper
Code

Visual Question Answering with Memory-Augmented Networks

no code implementations • CVPR 2018 • Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid

In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.

Question Answering Visual Question Answering

Paper
Add Code

"Maximizing rigidity" revisited: a convex programming approach for generic 3D shape reconstruction from multiple perspective views

no code implementations • ICCV 2017 • Pan Ji, Hongdong Li, Yuchao Dai, Ian Reid

Rigid structure-from-motion (RSfM) and non-rigid structure-from-motion (NRSfM) have long been treated in the literature as separate (different) problems.

3D Reconstruction 3D Shape Reconstruction

Paper
Add Code

Adaptive Low-Rank Kernel Subspace Clustering

1 code implementation • 17 Jul 2017 • Pan Ji, Ian Reid, Ravi Garg, Hongdong Li, Mathieu Salzmann

In this paper, we present a kernel subspace clustering method that can handle non-linear models.

Clustering Image Clustering +1

Paper
Code

Care about you: towards large-scale human-centric visual relationship detection

no code implementations • 28 May 2017 • Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

In addressing this problem we first construct a large-scale human-centric visual relationship detection dataset (HCVRD), which provides many more types of relationship annotation (nearly 10K categories) than previously released datasets.

Human-Object Interaction Detection Relationship Detection +1

Paper
Add Code

Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation

no code implementations • 25 May 2017 • Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid

Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Add Code

Tracking the Trackers: An Analysis of the State of the Art in Multiple Object Tracking

no code implementations • 10 Apr 2017 • Laura Leal-Taixé, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth

Standardized benchmarks are crucial for the majority of computer vision applications.

Multiple Object Tracking Multiple People Tracking +1

Paper
Add Code

Smart Mining for Deep Metric Learning

no code implementations • ICCV 2017 • Ben Harwood, Vijay Kumar B G, Gustavo Carneiro, Ian Reid, Tom Drummond

In this paper, we propose a novel deep metric learning method that combines the triplet model and the global structure of the embedding space.

Metric Learning

Paper
Add Code

A Branch-and-Bound Algorithm for Checkerboard Extraction in Camera-Laser Calibration

no code implementations • 4 Apr 2017 • Alireza Khosravian, Tat-Jun Chin, Ian Reid

We formulate the checkerboard extraction as a combinatorial optimization problem with a clear cut objective function.

Combinatorial Optimization

Paper
Add Code

Towards Context-aware Interaction Recognition

no code implementations • 18 Mar 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

Recognizing how objects interact with each other is a crucial task in visual recognition.

Paper
Add Code

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation

no code implementations • 25 Jan 2017 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

Semantic image segmentation is a fundamental task in image understanding.

Image Segmentation Segmentation +1

Paper
Add Code

Deep Learning Features at Scale for Visual Place Recognition

no code implementations • 18 Jan 2017 • Zetao Chen, Adam Jacobson, Niko Sunderhauf, Ben Upcroft, Lingqiao Liu, Chunhua Shen, Ian Reid, Michael Milford

The success of deep learning techniques in the computer vision domain has triggered a range of initial investigations into their utility for visual place recognition, all using generic features from networks that were trained for other types of recognition tasks.

Visual Place Recognition

Paper
Add Code

From Motion Blur to Motion Flow: a Deep Learning Solution for Removing Heterogeneous Motion Blur

no code implementations • CVPR 2017 • Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton Van Den Hengel, Qinfeng Shi

The critical observation underpinning our approach is thus that learning the motion flow instead allows the model to focus on the cause of the blur, irrespective of the image content.

Paper
Add Code

Attend in groups: a weakly-supervised deep learning framework for learning from web data

no code implementations • CVPR 2017 • Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition.

Paper
Add Code

DeepSetNet: Predicting Sets with Deep Neural Networks

no code implementations • ICCV 2017 • S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid

This paper addresses the task of set prediction using deep learning.

Image Classification Object Counting +3

Paper
Add Code

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

13 code implementations • CVPR 2017 • Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation.

Ranked #13 on Semantic Segmentation on Trans10K

3D Absolute Human Pose Estimation Semantic Segmentation +1

585

Paper
Code

Meaningful Maps With Object-Oriented Semantic Mapping

no code implementations • 26 Sep 2016 • Niko Sünderhauf, Trung T. Pham, Yasir Latif, Michael Milford, Ian Reid

For intelligent robots to interact in meaningful ways with their environment, they must understand both the geometric and semantic properties of the scene surrounding them.

Robotics

Paper
Add Code

Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification

no code implementations • 1 Aug 2016 • ZongYuan Ge, Chris McCool, Conrad Sanderson, Peng Wang, Lingqiao Liu, Ian Reid, Peter Corke

Fine-grained classification is a relatively new field that has concentrated on using information from a single image, while ignoring the enormous potential of using video data to improve classification.

Classification General Classification

Paper
Add Code

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

2 code implementations • 19 Jun 2016 • Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, Jose Neira, Ian Reid, John J. Leonard

Simultaneous Localization and Mapping (SLAM) consists in the concurrent construction of a model of the environment (the map), and the estimation of the state of the robot moving within it.

Robotics

Paper
Code

Joint Probabilistic Matching Using m-Best Solutions

no code implementations • CVPR 2016 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

Matching between two sets of objects is typically approached by finding the object pairs that collectively maximize the joint matching score.

Person Re-Identification

Paper
Add Code

Efficient Point Process Inference for Large-Scale Object Detection

no code implementations • CVPR 2016 • Trung T. Pham, Seyed Hamid Rezatofighi, Ian Reid, Tat-Jun Chin

We tackle the problem of large-scale object detection in images, where the number of objects can be arbitrarily large, and can exhibit significant overlap/occlusion.

Human Detection Object +2

Paper
Add Code

Online Multi-Target Tracking Using Recurrent Neural Networks

no code implementations • 13 Apr 2016 • Anton Milan, Seyed Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler

Here, we propose for the first time, an end-to-end learning approach for online multi-target tracking.

Paper
Add Code

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

2 code implementations • 16 Mar 2016 • Ravi Garg, Vijay Kumar BG, Gustavo Carneiro, Ian Reid

In this work we propose an unsupervised framework to learn a deep convolutional neural network for single view depth prediction, without requiring a pre-training stage or annotated ground truth depths.

Depth Estimation

236

Paper
Code

Non-linear Dimensionality Regularizer for Solving Inverse Problems

no code implementations • 16 Mar 2016 • Ravi Garg, Anders Eriksson, Ian Reid

Additionally, we evaluate our method on the challenging problem of Non-Rigid Structure from Motion and our approach delivers promising results on CMU mocap dataset despite the presence of significant occlusions and noise.

Paper
Add Code

Exploring Context with Deep Structured models for Semantic Segmentation

no code implementations • 10 Mar 2016 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid

We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions.

Image Segmentation Segmentation +1

Paper
Add Code

Fast Training of Triplet-based Deep Binary Embedding Networks

no code implementations • CVPR 2016 • Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian Reid

To solve the first stage, we design a large-scale high-order binary code inference algorithm that reduces the high-order objective to a standard binary quadratic problem, such that graph cuts can be used to efficiently infer the binary code which serves as the label of each training datum.

Image Retrieval Multi-Label Classification +1

Paper
Add Code

MOT16: A Benchmark for Multi-Object Tracking

8 code implementations • 2 Mar 2016 • Anton Milan, Laura Leal-Taixe, Ian Reid, Stefan Roth, Konrad Schindler

Recently, a new benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal of collecting existing and new data and creating a framework for the standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking +2

12,041

Paper
Code

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions

2 code implementations • CVPR 2016 • Vijay Kumar B G, Gustavo Carneiro, Ian Reid

Current results from machine learning show that replacing this siamese by a triplet network can improve the classification accuracy in several problems, but this has yet to be demonstrated for local image descriptor learning.

General Classification

198

Paper
Code

Hierarchical Higher-Order Regression Forest Fields: An Application to 3D Indoor Scene Labelling

no code implementations • ICCV 2015 • Trung T. Pham, Ian Reid, Yasir Latif, Stephen Gould

Specifically, we relax the labelling problem to a regression, and generalize the higher-order associative $P^n$ Potts model to a new family of arbitrary higher-order models based on regression forests.

regression Semantic Segmentation

Paper
Add Code

Joint Probabilistic Data Association Revisited

1 code implementation • ICCV 2015 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding the m-best solutions to an integer linear program.

204

Paper
Code

Deeply Learning the Messages in Message Passing Inference

no code implementations • NeurIPS 2015 • Guosheng Lin, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The network output dimension for message estimation is the same as the number of classes, in contrast to the network output for general CNN potential functions in CRFs, which is exponential in the order of the potentials.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

The k-Support Norm and Convex Envelopes of Cardinality and Rank

no code implementations • CVPR 2015 • Anders Eriksson, Trung Thanh Pham, Tat-Jun Chin, Ian Reid

Sparsity, or cardinality, as a tool for feature selection is extremely common in a vast number of current computer vision applications.

Computational Efficiency feature selection

Paper
Add Code

Joint Tracking and Segmentation of Multiple Targets

no code implementations • CVPR 2015 • Anton Milan, Laura Leal-Taixe, Konrad Schindler, Ian Reid

Tracking-by-detection has proven to be the most successful strategy to address the task of tracking multiple targets in unconstrained scenarios.

Video Segmentation Video Semantic Segmentation

Paper
Add Code

MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking

2 code implementations • 8 Apr 2015 • Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, Konrad Schindler

We discuss the challenges of creating such a framework, collecting existing and new data, gathering state-of-the-art methods to be tested on the datasets, and finally creating a unified evaluation system.

3D Reconstruction Multiple Object Tracking +3

Paper
Code

Efficient piecewise training of deep structured models for semantic segmentation

no code implementations • CVPR 2016 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid

Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs).

Ranked #54 on Semantic Segmentation on PASCAL Context

Image Segmentation Segmentation +1

Paper
Add Code

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields

1 code implementation • 26 Feb 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin, Ian Reid

Therefore, here we present a deep convolutional neural field model for estimating depths from single monocular images, aiming to jointly explore the capacity of deep CNN and continuous CRF.

Depth Estimation

Paper
Code

Dense 3D Face Correspondence

no code implementations • 19 Oct 2014 • Syed Zulqarnain Gilani, Ajmal Mian, Faisal Shafait, Ian Reid

A deformable model (K3DM) is constructed from the dense corresponded faces and an algorithm is proposed for morphing the K3DM to fit unseen faces.

Face Recognition

Paper
Add Code

Online Unsupervised Feature Learning for Visual Tracking

no code implementations • 7 Oct 2013 • Fayao Liu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

Feature encoding with respect to an over-complete dictionary learned by unsupervised methods, followed by spatial pyramid pooling, and linear classification, has exhibited powerful strength in various vision applications.

Dictionary Learning Visual Tracking

Paper
Add Code

Dense Reconstruction Using 3D Object Shape Priors

no code implementations • CVPR 2013 • Amaury Dame, Victor A. Prisacariu, Carl Y. Ren, Ian Reid

More specifically, we automatically augment our SLAM system with object specific identity, together with 6D pose and additional shape degrees of freedom for the object(s) of known class in the scene, combining image data and depth information for the pose and shape recovery.

3D Reconstruction Object

Paper
Add Code
