Search Results for author: Michael Ying Yang

Found 66 papers, 28 papers with code

Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation

no code implementations • 19 Mar 2024 • Yao Wei, Martin Renqiang Min, George Vosselman, Li Erran Li, Michael Ying Yang

Recent progress has been made in shape generation with powerful generative models, such as diffusion models, which increase shape fidelity.

3D Shape Generation Language Modelling +2

Robust Shape Fitting for 3D Scene Abstraction

1 code implementation • 15 Mar 2024 • Florian Kluger, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

A RANSAC estimator guided by a neural network fits these primitives to a depth map.

Depth Estimation Scene Parsing

Convincing Rationales for Visual Question Answering Reasoning

1 code implementation • 6 Feb 2024 • Kun Li, George Vosselman, Michael Ying Yang

Visual Question Answering (VQA) is a challenging task of predicting the answer to a question about the content of an image.

Question Answering Visual Question Answering

Transformer-based Multimodal Change Detection with Multitask Consistency Constraints

1 code implementation • 13 Oct 2023 • BiYuan Liu, HuaiXin Chen, Kun Li, Michael Ying Yang

We observe that the current change detection methods struggle with the multitask conflicts between semantic and height change detection tasks.

Change Detection Earth Observation

BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models

no code implementations • 31 Aug 2023 • Yao Wei, George Vosselman, Michael Ying Yang

3D building generation with low data acquisition costs, such as single image-to-3D, is becoming increasingly important.

Denoising Image to 3D

Interactive Image Segmentation with Cross-Modality Vision Transformers

1 code implementation • 5 Jul 2023 • Kun Li, George Vosselman, Michael Ying Yang

Interactive image segmentation aims to segment the target from the background with manual guidance, taking as input multimodal data such as images, clicks, scribbles, and bounding boxes.

Image Segmentation Interactive Segmentation +2

Learning Similarity between Scene Graphs and Images with Transformers

no code implementations • 2 Apr 2023 • Yuren Cong, Wentong Liao, Bodo Rosenhahn, Michael Ying Yang

Scene graph generation is conventionally evaluated by (mean) Recall@K, which measures the ratio of correctly predicted triplets that appear in the ground truth.

Contrastive Learning Graph Generation +3

LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints

1 code implementation • 27 Feb 2023 • Mengmeng Liu, Hao Cheng, Lin Chen, Hellward Broszio, Jiangtao Li, Runjiang Zhao, Monika Sester, Michael Ying Yang

Trajectory prediction for autonomous driving must continuously reason about the motion stochasticity of road agents and comply with scene constraints.

Autonomous Driving Trajectory Prediction

Generating Evidential BEV Maps in Continuous Driving Space

1 code implementation • 6 Feb 2023 • Yunshuang Yuan, Hao Cheng, Michael Ying Yang, Monika Sester

Safety is critical for autonomous driving, and one aspect of improving safety is to accurately capture the uncertainties of the perception system, especially knowing the unknown.

Autonomous Driving object-detection +2

HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images

no code implementations • 23 Jan 2023 • Kun Li, George Vosselman, Michael Ying Yang

Visual question answering (VQA) is an important and challenging multimodal task in computer vision.

Attribute Question Answering +2

Attribute-Centric Compositional Text-to-Image Generation

no code implementations • 4 Jan 2023 • Yuren Cong, Martin Renqiang Min, Li Erran Li, Bodo Rosenhahn, Michael Ying Yang

We further propose an attribute-centric contrastive loss to avoid overfitting to overrepresented attribute compositions.

Attribute Fairness +1

SSGVS: Semantic Scene Graph-to-Video Synthesis

no code implementations • 11 Nov 2022 • Yuren Cong, Jinhui Yi, Bodo Rosenhahn, Michael Ying Yang

A semantic scene graph-to-video synthesis framework (SSGVS), based on the pre-trained VSG encoder, VQ-VAE, and auto-regressive Transformer, is proposed to synthesize a video given an initial scene image and a non-fixed number of semantic scene graphs.

Image Generation

Flow-based GAN for 3D Point Cloud Generation from a Single Image

1 code implementation • 8 Oct 2022 • Yao Wei, George Vosselman, Michael Ying Yang

Generating a 3D point cloud from a single 2D image is of great importance for 3D scene understanding applications.

Point Cloud Generation Scene Understanding

GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction Model

1 code implementation • 16 Sep 2022 • Hao Cheng, Mengmeng Liu, Lin Chen, Hellward Broszio, Monika Sester, Michael Ying Yang

This paper proposes an attention-based graph model, named GATraj, which achieves a good balance of prediction accuracy and inference speed.

Autonomous Driving Robot Navigation +1

RelTR: Relation Transformer for Scene Graph Generation

1 code implementation • 27 Jan 2022 • Yuren Cong, Michael Ying Yang, Bodo Rosenhahn

Different objects in the same scene are more or less related to each other, but only a limited number of these relationships are noteworthy.

Graph Generation Object +4

LUAI Challenge 2021 on Learning to Understand Aerial Images

1 code implementation • 30 Aug 2021 • Gui-Song Xia, Jian Ding, Ming Qian, Nan Xue, Jiaming Han, Xiang Bai, Michael Ying Yang, Shengyang Li, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, Qiang Zhou, Chao-hui Yu, Kaixuan Hu, Yingjia Bu, Wenming Tan, Zhe Yang, Wei Li, Shang Liu, Jiaxuan Zhao, Tianzhi Ma, Zi-han Gao, Lingqi Wang, Yi Zuo, Licheng Jiao, Chang Meng, Hao Wang, Jiahao Wang, Yiming Hui, Zhuojun Dong, Jie Zhang, Qianyue Bao, Zixiao Zhang, Fang Liu

This report summarizes the results of the Learning to Understand Aerial Images (LUAI) 2021 challenge held at ICCV 2021, which focuses on object detection and semantic segmentation in aerial images.

Object object-detection +4

Disentangled Lifespan Face Synthesis

no code implementations • ICCV 2021 • Sen He, Wentong Liao, Michael Ying Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang

The generated face image given a target age code is expected to be age-sensitive, as reflected by bio-plausible transformations of shape and texture, while being identity-preserving.

Face Generation

Spatial-Temporal Transformer for Dynamic Scene Graph Generation

1 code implementation • ICCV 2021 • Yuren Cong, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn, Michael Ying Yang

Compared to the task of scene graph generation from images, it is more challenging because of the dynamic relationships between objects and the temporal dependencies between frames allowing for a richer semantic interpretation.

Scene Graph Generation Video Understanding +1

Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images

1 code implementation • CVPR 2021 • Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

A RANSAC estimator guided by a neural network fits these primitives to 3D features, such as a depth map.

Text to Image Generation with Semantic-Spatial Aware GAN

1 code implementation • CVPR 2022 • Kai Hu, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn

Text-to-image synthesis (T2I) aims to generate photo-realistic images which are semantically consistent with the text descriptions.

Sentence Sentence Embedding +2

Context-Aware Layout to Image Generation with Enhanced Object Appearance

1 code implementation • CVPR 2021 • Sen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang

We argue that these are caused by the lack of context-aware object and stuff feature encoding in their generators, and location-sensitive appearance representation in their discriminators.

Ranked #1 on Layout-to-Image Generation on COCO-Stuff 128x128

Layout-to-Image Generation Object

Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery

1 code implementation • 5 Feb 2021 • Ye Lyu, George Vosselman, Gui-Song Xia, Michael Ying Yang

Semantic segmentation for aerial platforms has been one of the fundamental scene understanding tasks for earth observation.

Earth Observation Scene Understanding +2

Self-supervised monocular depth estimation from oblique UAV videos

1 code implementation • 19 Dec 2020 • Logambal Madhuanand, Francesco Nex, Michael Ying Yang

Monocular video frames are used for training the deep learning model which learns depth and pose information jointly through two different networks, one each for depth and pose.

3D Reconstruction Image Generation +4

LGENet: Local and Global Encoder Network for Semantic Segmentation of Airborne Laser Scanning Point Clouds

no code implementations • 18 Dec 2020 • Yaping Lin, George Vosselman, Yanpeng Cao, Michael Ying Yang

Interpretation of Airborne Laser Scanning (ALS) point clouds is a critical procedure for producing various geo-information products like 3D city models, digital terrain models and land use maps.

Semantic Segmentation

Boosting Image Super-Resolution Via Fusion of Complementary Information Captured by Multi-Modal Sensors

no code implementations • 7 Dec 2020 • Fan Wang, Jiangxin Yang, Yanlong Cao, Yanpeng Cao, Michael Ying Yang

Image Super-Resolution (SR) provides a promising technique to enhance the image quality of low-resolution optical sensors, facilitating better-performing target detection and autonomous navigation in a wide range of robotics applications.

3D Reconstruction Autonomous Navigation +1

Real-time Semantic Segmentation with Context Aggregation Network

no code implementations • 2 Nov 2020 • Michael Ying Yang, Saumya Kumaar, Ye Lyu, Francesco Nex

With the increasing demand for autonomous systems, pixelwise semantic segmentation for visual scene understanding needs to be not only accurate but also efficient for potential real-time applications.

Real-Time Semantic Segmentation Scene Understanding +1

Exploring Dynamic Context for Multi-path Trajectory Prediction

2 code implementations • 30 Oct 2020 • Hao Cheng, Wentong Liao, Xuejiao Tang, Michael Ying Yang, Monika Sester, Bodo Rosenhahn

In our framework, first, the spatial context between agents is explored by using self-attention architectures.

Trajectory Forecasting

On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID

1 code implementation • 22 Jun 2020 • Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li

After reviewing existing benchmark datasets in the research community of RS image interpretation, this article discusses the problem of how to efficiently prepare a suitable benchmark dataset for RS image interpretation.

General Classification Image Classification +1

AMENet: Attentive Maps Encoder Network for Trajectory Prediction

1 code implementation • 15 Jun 2020 • Hao Cheng, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn, Monika Sester

Trajectory prediction is critical for applications of planning safe future movements and remains challenging even for the next few seconds in urban mixed traffic.

Trajectory Prediction

LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery

no code implementations • 28 May 2020 • Wentong Liao, Xiang Chen, Jingfeng Yang, Stefan Roth, Michael Goesele, Michael Ying Yang, Bodo Rosenhahn

This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation.

object-detection Object Detection +1

Plug & Play Convolutional Regression Tracker for Video Object Detection

2 code implementations • 2 Mar 2020 • Ye Lyu, Michael Ying Yang, George Vosselman, Gui-Song Xia

As the tracker reuses the features from the detector, it is a very lightweight addition to the detection network.

Object object-detection +2

MCENET: Multi-Context Encoder Network for Homogeneous Agent Trajectory Prediction in Mixed Traffic

1 code implementation • 14 Feb 2020 • Hao Cheng, Wentong Liao, Michael Ying Yang, Monika Sester, Bodo Rosenhahn

At inference time, we combine the past context and motion information of the target agent with samplings of the latent variables to predict multiple realistic trajectories in the future.

Autonomous Driving Intent Detection +1

NODIS: Neural Ordinary Differential Scene Understanding

1 code implementation • ECCV 2020 • Yuren Cong, Hanno Ackermann, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn

Detected objects, their labels and the discovered relations can be used to construct a scene graph which provides an abstract semantic interpretation of an image.

Ranked #8 on Scene Graph Generation on Visual Genome

Graph Generation Relationship Detection +2

CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

3 code implementations • CVPR 2020 • Florian Kluger, Eric Brachmann, Hanno Ackermann, Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

We present a robust estimator for fitting multiple parametric models of the same form to noisy measurements.

Homography Estimation Self-Supervised Learning

Deep Neural Network for Fast and Accurate Single Image Super-Resolution via Channel-Attention-based Fusion of Orientation-aware Features

no code implementations • 9 Dec 2019 • Du Chen, Zewei He, Yanpeng Cao, Jiangxin Yang, Yanlong Cao, Michael Ying Yang, Siliang Tang, Yueting Zhuang

Firstly, we proposed a novel Orientation-Aware feature extraction and fusion Module (OAM), which contains a mixture of 1D and 2D convolutional kernels (i.e., 5 x 1, 1 x 5, and 3 x 3) for extracting orientation-aware features.

Computational Efficiency Image Super-Resolution

LIP: Learning Instance Propagation for Video Object Segmentation

no code implementations • 30 Sep 2019 • Ye Lyu, George Vosselman, Gui-Song Xia, Michael Ying Yang

In recent years, the task of segmenting foreground objects from background in a video, i.e., video object segmentation (VOS), has received considerable attention.

Data Augmentation Instance Segmentation +5

Temporally Consistent Horizon Lines

1 code implementation • 23 Jul 2019 • Florian Kluger, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

The horizon line is an important geometric feature for many image processing and scene understanding tasks in computer vision.

Ranked #1 on Horizon Line Estimation on KITTI Horizon

3D Reconstruction Autonomous Vehicles +2

A deep-learning-based approach for fast and robust steel surface defects classification

no code implementations • Elsevier journal 2019 • Guizhong Fu, Peize Sun, Wenbin Zhu, Jiangxin Yang, Yanlong Cao, Michael Ying Yang, Yanpeng Cao

Automatic visual recognition of steel surface defects provides critical functionality to facilitate quality control of steel strip production.

General Classification

Unsupervised Domain Adaptation for Multispectral Pedestrian Detection

no code implementations • 7 Apr 2019 • Dayan Guan, Xing Luo, Yanpeng Cao, Jiangxin Yang, Yanlong Cao, George Vosselman, Michael Ying Yang

In this paper, we propose a novel unsupervised domain adaptation framework for multispectral pedestrian detection, by iteratively generating pseudo annotations and updating the parameters of our designed multispectral pedestrian detector on target domain.

Autonomous Driving Pedestrian Detection +1

Target-Tailored Source-Transformation for Scene Graph Generation

no code implementations • 3 Apr 2019 • Wentong Liao, Cuiling Lan, Wen-Jun Zeng, Michael Ying Yang, Bodo Rosenhahn

We further explore more powerful representations by integrating language prior with the visual context in the transformation for the scene graph generation.

graph construction Graph Generation +6

Robust object extraction from remote sensing data

no code implementations • 3 Apr 2019 • Sophie Crommelinck, Mila Koeva, Michael Ying Yang, George Vosselman

The delineation approach to which the evaluation framework is applied was previously introduced and is substantially improved in this study.

Object

Box-level Segmentation Supervised Deep Neural Networks for Accurate and Real-time Multispectral Pedestrian Detection

no code implementations • 14 Feb 2019 • Yanpeng Cao, Dayan Guan, Yulun Wu, Jiangxin Yang, Yanlong Cao, Michael Ying Yang

Effective fusion of complementary information captured by multi-modal sensors (visible and infrared cameras) enables robust pedestrian detection under various surveillance situations (e.g., daytime and nighttime).

Autonomous Driving Computational Efficiency +1

Security Event Recognition for Visual Surveillance

no code implementations • 26 Oct 2018 • Michael Ying Yang, Wentong Liao, Chun Yang, Yanpeng Cao, Bodo Rosenhahn

The experimental results show that the proposed approach outperforms the state-of-the-art methods and is effective in recognizing complex security events.

UAVid: A Semantic Segmentation Dataset for UAV Imagery

3 code implementations • 24 Oct 2018 • Ye Lyu, George Vosselman, Gui-Song Xia, Alper Yilmaz, Michael Ying Yang

There already exist several semantic segmentation datasets for comparison among semantic segmentation methods in complex urban scenes, such as the Cityscapes and CamVid datasets, where the side views of the objects are captured with a camera mounted on the driving car.

4k Autonomous Driving +5

Patch-based Evaluation of Dense Image Matching Quality

no code implementations • 25 Jul 2018 • Zhenchao Zhang, Markus Gerke, George Vosselman, Michael Ying Yang

Due to the high cost of laser scanning, we want to explore the potential of using point clouds derived by dense image matching (DIM), as effective alternatives to laser scanning data.

Change Detection between Multimodal Remote Sensing Data Using Siamese CNN

1 code implementation • 25 Jul 2018 • Zhenchao Zhang, George Vosselman, Markus Gerke, Devis Tuia, Michael Ying Yang

Detecting topographic changes in the urban environment has always been an important task for urban planning and monitoring.

Change Detection

Fusion of Multispectral Data Through Illumination-aware Deep Neural Networks for Pedestrian Detection

no code implementations • 27 Feb 2018 • Dayan Guan, Yanpeng Cao, Jun Liang, Yanlong Cao, Michael Ying Yang

Moreover, we utilized illumination information together with multispectral data to generate more accurate semantic segmentations, which are used to boost pedestrian detection accuracy.

Autonomous Driving Multi-Task Learning +2

Video Event Recognition and Anomaly Detection by Combining Gaussian Process and Hierarchical Dirichlet Process Models

no code implementations • 9 Feb 2018 • Michael Ying Yang, Wentong Liao, Yanpeng Cao, Bodo Rosenhahn

In our framework, three levels of video events are connected by Hierarchical Dirichlet Process (HDP) model: low-level visual features, simple atomic activities, and multi-agent interactions.

Anomaly Detection General Classification

Triplet-based Deep Similarity Learning for Person Re-Identification

1 code implementation • 9 Feb 2018 • Wentong Liao, Michael Ying Yang, Ni Zhan, Bodo Rosenhahn

Moreover, we trained the model jointly on six different datasets, which differs from the common practice of training one model on a single dataset and testing it on the same one.

Person Re-Identification

Temporally Object-based Video Co-Segmentation

no code implementations • 9 Feb 2018 • Michael Ying Yang, Matthias Reso, Jun Tang, Wentong Liao, Bodo Rosenhahn

Therefore, we formulate a graphical model to select a proposal stream for each object in which the pairwise potentials consist of the appearance dissimilarity between different streams in the same video and also the similarity between the streams in different videos.

Object Segmentation

Unsupervised Deep Domain Adaptation for Pedestrian Detection

no code implementations • 9 Feb 2018 • Lihang Liu, Weiyao Lin, Lisheng Wu, Yong Yu, Michael Ying Yang

This paper addresses the problem of unsupervised domain adaptation on the task of pedestrian detection in crowded scenes.

Pedestrian Detection Unsupervised Domain Adaptation

Slice Sampling Particle Belief Propagation

no code implementations • 9 Feb 2018 • Oliver Mueller, Michael Ying Yang, Bodo Rosenhahn

We propose to avoid dependence on a proposal distribution by introducing a slice sampling based PBP algorithm.

Image Denoising

Vehicle Detection in Aerial Images

no code implementations • 22 Jan 2018 • Michael Ying Yang, Wentong Liao, Xinbo Li, Bodo Rosenhahn

Also, the focal loss function is used in place of the conventional cross-entropy loss function in both the region proposal network and the final classifier.

object-detection Object Detection

Natural Language Guided Visual Relationship Detection

no code implementations • 16 Nov 2017 • Wentong Liao, Lin Shuai, Bodo Rosenhahn, Michael Ying Yang

Most of the existing works treat this task as a pure visual classification task: each type of relationship or phrase is classified as a relation category based on the extracted visual features.

Relationship Detection Visual Relationship Detection

Object Recognition from very few Training Examples for Enhancing Bicycle Maps

no code implementations • 18 Sep 2017 • Christoph Reinders, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

These algorithms are usually trained on large datasets consisting of thousands or millions of labeled training examples.

Object Recognition Transfer Learning

Towards Automated Cadastral Boundary Delineation from UAV Data

no code implementations • 6 Sep 2017 • Sophie Crommelinck, Michael Ying Yang, Mila Koeva, Markus Gerke, Rohan Bennett, George Vosselman

This study proposes (i) a workflow that automatically extracts candidate cadastral boundaries from UAV orthoimages and (ii) a tool for their semi-automatic processing to delineate final cadastral boundaries.

Contour Detection Superpixels

Deep Learning for Vanishing Point Detection Using an Inverse Gnomonic Projection

2 code implementations • 8 Jul 2017 • Florian Kluger, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

We present a novel approach for vanishing point detection from uncalibrated monocular images.

Ranked #3 on Horizon Line Estimation on York Urban Dataset

Camera Calibration Horizon Line Estimation

Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

no code implementations • 26 Feb 2017 • Omid Hosseini Jafari, Oliver Groth, Alexander Kirillov, Michael Ying Yang, Carsten Rother

Towards this end, we propose a Convolutional Neural Network (CNN) architecture that fuses the state-of-the-art results for depth estimation and semantic labeling.

Depth Estimation Depth Prediction +1

Motion Segmentation via Global and Local Sparse Subspace Optimization

no code implementations • 24 Jan 2017 • Michael Ying Yang, Hanno Ackermann, Weiyao Lin, Sitong Feng, Bodo Rosenhahn

In this paper, we propose a new framework for segmenting feature-based moving objects under an affine subspace model.

Clustering Motion Segmentation +1

Can Ground Truth Label Propagation from Video help Semantic Segmentation?

no code implementations • 3 Oct 2016 • Siva Karthik Mustikovela, Michael Ying Yang, Carsten Rother

For state-of-the-art semantic segmentation, training convolutional neural networks (CNNs) requires dense pixelwise ground truth (GT) labeling, which is expensive and involves extensive human effort.

Semantic Segmentation Video Segmentation +1

Real-Time RGB-D based Template Matching Pedestrian Detection

no code implementations • 3 Oct 2016 • Omid Hosseini Jafari, Michael Ying Yang

We show that our method outperforms the state-of-the-art approaches.

Pedestrian Detection Template Matching

On Support Relations and Semantic Scene Graphs

no code implementations • 19 Sep 2016 • Michael Ying Yang, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn

In contrast to previous methods for extracting support relations, the proposed approach generates more accurate results, and does not require a pixel-wise semantic labeling of the scene.

Scene Understanding

Unbiased Sparse Subspace Clustering By Selective Pursuit

no code implementations • 16 Sep 2016 • Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

If these unknown subspaces are well-separated, this algorithm is guaranteed to succeed.

Clustering Motion Segmentation +1

Uncertainty-Driven 6D Pose Estimation of Objects and Scenes From a Single RGB Image

no code implementations • CVPR 2016 • Eric Brachmann, Frank Michel, Alexander Krull, Michael Ying Yang, Stefan Gumhold, Carsten Rother

In recent years, the task of estimating the 6D pose of object instances and complete scenes, i.e., camera localization, from a single input image has received considerable attention.

6D Pose Estimation 6D Pose Estimation using RGB +2

Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images

no code implementations • ICCV 2015 • Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother

This is done by describing the posterior density of a particular object pose with a convolutional neural network (CNN) that compares an observed and rendered image.

6D Pose Estimation 6D Pose Estimation using RGB +1

Automatic 3D Liver Segmentation Using Sparse Representation of Global and Local Image Information via Level Set Formulation

no code implementations • 6 Aug 2015 • Saif Dawood Salman Al-Shaikhli, Michael Ying Yang, Bodo Rosenhahn

A sparse representation of both global (region-based) and local (voxel-wise) image information is embedded in a level set formulation to form a new cost function.

Liver Segmentation Segmentation
