no code implementations • 24 Mar 2024 • Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Lixin Cui, Edwin R. Hancock
Therefore, we propose a novel prior semantic guided image fusion method based on the dual-modality strategy, improving the performance of IVF in ITS.
no code implementations • 27 Feb 2024 • George Eskandar, Chongzhe Zhang, Abhishek Kaushik, Karim Guirguis, Mohamed Sayed, Bin Yang
3D Object Detectors (3D-OD) are crucial for understanding the environment in many robotic tasks, especially autonomous driving.
no code implementations • 11 Feb 2024 • Yan Lin, Jilin Hu, Shengnan Guo, Bin Yang, Christian S. Jensen, Youfang Lin, Huaiyu Wan
Experiments involving three representative trajectory-related tasks on two real-world trajectory datasets provide insight into the intended properties and performance of GTM and offer evidence that GTM is capable of meeting its objectives.
3 code implementations • 5 Feb 2024 • Ming Jin, Yifan Zhang, Wei Chen, Kexin Zhang, Yuxuan Liang, Bin Yang, Jindong Wang, Shirui Pan, Qingsong Wen
Time series analysis is essential for comprehending the complexities inherent in various real-world systems and applications.
1 code implementation • 4 Feb 2024 • Peng Chen, Yingying Zhang, Yunyao Cheng, Yang Shu, Yihang Wang, Qingsong Wen, Bin Yang, Chenjuan Guo
Multi-scale division divides the time series into different temporal resolutions using patches of various sizes.
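The multi-scale division described above can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `split_into_patches` and `multi_scale_division`, the use of non-overlapping patches, and the choice to drop an incomplete trailing patch are all assumptions made for illustration.

```python
def split_into_patches(series, patch_size):
    """Split a 1-D sequence into consecutive, non-overlapping patches.

    Assumption: trailing values that do not fill a whole patch are dropped.
    """
    if patch_size <= 0:
        raise ValueError("patch_size must be positive")
    n_full = len(series) // patch_size
    return [series[i * patch_size:(i + 1) * patch_size] for i in range(n_full)]


def multi_scale_division(series, patch_sizes):
    """Return one patched view of the series per patch size.

    Smaller patch sizes give a finer temporal resolution (more, shorter
    patches); larger ones give a coarser resolution.
    """
    return {size: split_into_patches(series, size) for size in patch_sizes}


# Hypothetical toy input: 12 time steps viewed at three resolutions.
series = list(range(12))
views = multi_scale_division(series, [2, 3, 4])
for size, patches in views.items():
    print(size, len(patches), patches[0])
```

Each entry of `views` holds the same series at a different temporal resolution, which is the property the abstract refers to; how those views are then processed is outside the scope of this sketch.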
1 code implementation • 31 Jan 2024 • Pascal Schlachter, Bin Yang
In real-world applications, there is often a domain shift from training to test data.
no code implementations • 2 Jan 2024 • Mario Döbler, Florian Marencke, Robert A. Marsden, Bin Yang
In real-world scenarios, test data streams are not always independent and identically distributed (i.i.d.).
1 code implementation • 12 Dec 2023 • Jiawei Sun, Bin Yang, Nektarios Koukourakis, Jochen Guck, Juergen W. Czarske
The performance of the proposed cell rotation tomography approach is validated through the three-dimensional reconstruction of cell phantoms and HL60 human cancer cells.
1 code implementation • 11 Dec 2023 • Bin Yang, Patrick Pfreundschuh, Roland Siegwart, Marco Hutter, Peyman Moghadam, Vaishakh Patil
In this paper, we propose TULIP, a new method to reconstruct high-resolution LiDAR point clouds from low-resolution LiDAR input.
1 code implementation • 30 Nov 2023 • Martin Wimpff, Mario Döbler, Bin Yang
Providing a promising pathway to link the human brain with external devices, Brain-Computer Interfaces (BCIs) have seen notable advancements in decoding capabilities, primarily driven by increasingly sophisticated techniques, especially deep learning.
no code implementations • CVPR 2023 • Lunjun Zhang, Anqi Joyce Yang, Yuwen Xiong, Sergio Casas, Bin Yang, Mengye Ren, Raquel Urtasun
In this paper, we study the problem of unsupervised object detection from 3D point clouds in self-driving scenes.
no code implementations • 1 Nov 2023 • Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Edwin R. Hancock
Then, GCNs are applied to the concatenated intra-modal NLss features of infrared and visible images, which can explore the cross-domain inter-modal NLss to reconstruct the fused image.
Graph Representation Learning • Infrared And Visible Image Fusion
2 code implementations • 17 Oct 2023 • Martin Wimpff, Leonardo Gizzi, Jan Zerfowski, Bin Yang
The objective of this study is to investigate the application of various channel attention mechanisms within the domain of brain-computer interface (BCI) for motor imagery decoding.
no code implementations • 6 Oct 2023 • Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An
One challenge in large-scale online recommendation systems is the constant and complicated changes in users' behavior patterns, such as interaction rates and retention tendencies.
no code implementations • 19 Jul 2023 • Sean Bin Yang, Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen
Next, we propose a relational reasoning framework to enable faster training of more robust sparse path encoders.
no code implementations • 6 Jul 2023 • Yan Lin, Huaiyu Wan, Jilin Hu, Shengnan Guo, Bin Yang, Youfang Lin, Christian S. Jensen
Given an origin (O), a destination (D), and a departure time (T), an Origin-Destination (OD) travel time oracle (ODT-Oracle) returns an estimate of the time it takes to travel from O to D when departing at T. ODT-Oracles serve important purposes in map-based services.
no code implementations • 29 Jun 2023 • Rinor Cakaj, Jens Mehnert, Bin Yang
However, we show experimentally that, despite the approximate additive penalty of BN, feature maps in deep neural networks (DNNs) tend to explode at the beginning of the network and that feature maps of DNNs contain large values during the whole training.
no code implementations • 29 Jun 2023 • Rinor Cakaj, Jens Mehnert, Bin Yang
Large weights in deep neural networks are a sign of a more complex network that is overfitted to the training data.
no code implementations • 23 Jun 2023 • George Eskandar, Shuai Zhang, Mohamed Abdelsamad, Mark Youssef, Diandian Guo, Bin Yang
Data efficiency, or the ability to generalize from a few labeled data, remains a major challenge in deep learning.
no code implementations • 8 Jun 2023 • Haomin Yu, Yanru Song, Jilin Hu, Chenjuan Guo, Bin Yang
To overcome these challenges, we propose the crystal-specific pre-training framework for learning crystal representations with self-supervision.
1 code implementation • 1 Jun 2023 • Robert A. Marsden, Mario Döbler, Bin Yang
To tackle the problem of universal TTA, we identify and highlight several challenges a self-training based method has to deal with: 1) model bias and the occurrence of trivial solutions when performing entropy minimization on varying sequence lengths with and without multiple domain shifts, 2) loss of generalization which exacerbates the adaptation to multiple domain shifts and the occurrence of catastrophic forgetting, and 3) performance degradation due to shifts in class prior.
1 code implementation • 16 May 2023 • George Eskandar, Diandian Guo, Karim Guirguis, Bin Yang
Second, in contrast to previous works which employ one discriminator that overfits the target domain semantic distribution, we employ a discriminator for the whole image and multiscale discriminators on the image patches.
1 code implementation • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022 • George Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang
Semantic Image Synthesis (SIS) is a subclass of image-to-image translation where a semantic layout is used to generate a photorealistic image.
Ranked #11 on Image-to-Image Translation on ADE20K Labels-to-Photos
Multimodal Unsupervised Image-To-Image Translation • Translation • +1
no code implementations • 16 May 2023 • George Eskandar, Youssef Farag, Tarun Yenamandra, Daniel Cremers, Karim Guirguis, Bin Yang
Moreover, we employ an unsupervised latent exploration algorithm in the $\mathcal{S}$-space of the generator and show that it is more efficient than the conventional $\mathcal{W}^{+}$-space in controlling the image content.
no code implementations • CVPR 2023 • Karim Guirguis, Johannes Meier, George Eskandar, Matthias Kayser, Bin Yang, Juergen Beyerer
Our contribution is three-fold: (1) we design a standalone lightweight generator with (2) class-wise heads (3) to generate and replay diverse instance-level base features to the RoI head while finetuning on the novel data.
Data-free Knowledge Distillation • Few-Shot Object Detection • +2
1 code implementation • 24 Feb 2023 • David Campos, Miao Zhang, Bin Yang, Tung Kieu, Chenjuan Guo, Christian S. Jensen
First, we propose adaptive ensemble distillation that assigns adaptive weights to different base models such that their varying classification capabilities contribute purposefully to the training of the lightweight model.
no code implementations • 3 Feb 2023 • Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai
In this paper, we choose reinforcement learning methods to optimize the retention as they are designed to maximize the long-term performance.
no code implementations • 11 Jan 2023 • Zhihua Liu, Bin Yang, Yan Shen, Xuejun Ni, Huiyu Zhou
In this paper, we propose a long-short diffeomorphic motion network, which is a multi-task framework with a learnable deformation prior to search for the plausible deformation of landmark.
1 code implementation • ICCV 2023 • Bin Yang, Jun Chen, Mang Ye
The grand unified representation lies in two aspects: 1) GUR adopts a bottom-up domain learning strategy with a cross-memory association embedding module to explore the information of hierarchical domains, i.e., intra-camera, inter-camera, and inter-modality domains, learning a unified and robust representation against hierarchical discrepancy.
no code implementations • 20 Dec 2022 • Yunyao Cheng, Chenjuan Guo, KaiXuan Chen, Kai Zhao, Bin Yang, Jiandong Xie, Christian S. Jensen, Feiteng Huang, Kai Zheng
To capture the temporal and multivariate correlations among subsequences, we design a pattern discovery model that constructs correlations via diverse pattern functions.
1 code implementation • CVPR 2023 • Yansong Tang, Jinpeng Liu, Aoyang Liu, Bin Yang, Wenxun Dai, Yongming Rao, Jiwen Lu, Jie Zhou, Xiu Li
With its continuously growing popularity around the world, fitness activity analytics has become an emerging research topic in computer vision.
no code implementations • 8 Dec 2022 • Xinle Wu, Dalin Zhang, Miao Zhang, Chenjuan Guo, Shuai Zhao, Yi Zhang, Huai Wang, Bin Yang
We then propose a resource-aware search strategy to explore the search space to find the best PINN model under different resource constraints.
no code implementations • 29 Nov 2022 • Xinle Wu, Dalin Zhang, Miao Zhang, Chenjuan Guo, Bin Yang, Christian S. Jensen
To overcome these limitations, we propose SEARCH, a joint, scalable framework, to automatically devise effective CTS forecasting models.
1 code implementation • CVPR 2023 • Mario Döbler, Robert A. Marsden, Bin Yang
We demonstrate the effectiveness of our proposed method 'robust mean teacher' (RMT) on the continual and gradual corruption benchmarks CIFAR10C, CIFAR100C, and ImageNet-C. We further consider ImageNet-R and propose a new continual DomainNet-126 benchmark.
1 code implementation • 22 Nov 2022 • Mohamed Amine ben Salem, Karim Said Barsim, Bin Yang
In the causal direction, such variations are expected to have no impact on the effect generation mechanism.
1 code implementation • 16 Nov 2022 • Marc Fischer, Alexander Bartler, Bin Yang
As such, fine-tuning a model to a downstream task in a parameter-efficient but effective way, e.g., for a new set of classes in the case of semantic segmentation, is of increasing importance.
no code implementations • 21 Oct 2022 • Phillip Czech, Markus Braun, Ulrich Kreßel, Bin Yang
This paper presents a novel approach to pedestrian trajectory prediction for on-board camera systems, which utilizes behavioral features of pedestrians that can be inferred from visual observations.
1 code implementation • ACM MM 2022 • Bin Yang, Mang Ye, Jun Chen, Zesen Wu
Visible infrared person re-identification (VI-ReID) aims at searching out the corresponding infrared (visible) images from a gallery set captured by other spectrum cameras.
no code implementations • 11 Oct 2022 • Karim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang, Juergen Beyerer
We observe that the large gap in performance between two-stage and one-stage FSODs is mainly due to the weak discriminability of the latter, which is explained by a small post-fusion receptive field and a small number of foreground samples in the loss function.
Ranked #13 on Few-Shot Object Detection on MS-COCO (10-shot)
no code implementations • 30 Sep 2022 • Yiwen Liao, Raphaël Latty, Bin Yang
Post-silicon validation is one of the most critical processes in modern semiconductor manufacturing.
no code implementations • 25 Sep 2022 • Yiwen Liao, Jochen Rivoir, Raphaël Latty, Bin Yang
However, most existing feature selection approaches, especially deep-learning-based ones, often focus only on the features with high importance scores and neglect, during training, both the features with lower importance scores and the order of important candidate features.
2 code implementations • 22 Sep 2022 • Sherif Abdulatif, Ruizhe Cao, Bin Yang
Convolution-augmented transformers (Conformers) have recently been proposed for various speech-domain applications, such as automatic speech recognition (ASR) and speech separation, as they can capture both local and global dependencies.
Ranked #1 on Audio Super-Resolution on VCTK Multi-Speaker
no code implementations • 10 Sep 2022 • Yan Zhao, Liwei Deng, Xuanhao Chen, Chenjuan Guo, Bin Yang, Tung Kieu, Feiteng Huang, Torben Bach Pedersen, Kai Zheng, Christian S. Jensen
The continued digitization of societal processes translates into a proliferation of time series data that cover applications such as fraud detection, intrusion detection, and energy management, where anomaly detection is often essential to enable reliability and safety.
no code implementations • 22 Aug 2022 • Dalin Zhang, KaiXuan Chen, Yan Zhao, Bin Yang, Lina Yao, Christian S. Jensen
A key challenge is that while the application of deep models often incurs substantial memory and computational costs, edge devices typically offer only very limited storage and computational capabilities that may vary substantially across devices.
1 code implementation • 16 Aug 2022 • Robert A. Marsden, Mario Döbler, Bin Yang
In this work, we address two problems that exist when applying self-training in the setting of test-time adaptation.
no code implementations • 12 Aug 2022 • Robert A. Marsden, Felix Wiewel, Mario Döbler, Yang Yang, Bin Yang
In this work, we focus on UDA and additionally address the case of adapting not only to a single domain, but to a sequence of target domains.
no code implementations • 1 Jul 2022 • Yiwen Liao, Tianjie Ge, Raphaël Latty, Bin Yang
Intelligent test requires efficient and effective analysis of high-dimensional data at a large scale.
no code implementations • 1 Jul 2022 • Yiwen Liao, Bin Yang, Raphaël Latty, Jochen Rivoir
In this sense, more efficient tuning requires identifying the most critical tuning knobs and process parameters in terms of a given figure-of-merit for a Device Under Test (DUT).
1 code implementation • 23 Jun 2022 • Shufang Xie, Rui Yan, Peng Han, Yingce Xia, Lijun Wu, Chenjuan Guo, Bin Yang, Tao Qin
We observe that the same intermediate molecules are visited many times in the searching process, and they are usually independently treated in previous tree-based methods (e.g., AND-OR tree search, Monte Carlo tree search).
Ranked #2 on Multi-step retrosynthesis on USPTO-190
no code implementations • 22 Jun 2022 • Bin Yang, Thomas Carette, Masanobu Jimbo, Shinya Maruyama
Federated Learning (FL) allows a number of agents to participate in training a global machine learning model without disclosing locally stored data.
no code implementations • 6 Jun 2022 • Bin Yang, Mengxi Wu, Winfried Teizer
We apply a deep learning neural network architecture to the two-level system in quantum optics to solve the time-dependent Schrödinger equation.
no code implementations • 18 May 2022 • Wei Li, Bin Yang, Junsheng Qiao
In this paper, the depiction of $(O, G)$-granular variable precision fuzzy rough sets ($(O, G)$-GVPFRSs for short) is first given based on overlap and grouping functions.
1 code implementation • 18 May 2022 • Alexander Bartler, Florian Bender, Felix Wiewel, Bin Yang
Nowadays, deep neural networks outperform humans in many tasks.
no code implementations • 13 May 2022 • Wei Li, Bin Yang, Junsheng Qiao
In this paper, we mainly construct three types of $L$-fuzzy $\beta$-covering-based rough set models and study the axiom sets, matrix representations and interdependency of these three pairs of $L$-fuzzy $\beta$-covering-based rough approximation operators.
no code implementations • 13 May 2022 • Gongao Qi, Bin Yang, Wei Li
To further generalize the FRS theory to more complicated data environments, in this paper we first propose four types of fuzzy neighborhood operators based on fuzzy covering by overlap functions and their implicators.
no code implementations • 28 Apr 2022 • Razvan-Gabriel Cirstea, Chenjuan Guo, Bin Yang, Tung Kieu, Xuanyi Dong, Shirui Pan
(i) Linear complexity: we introduce a novel patch attention with linear complexity.
no code implementations • 11 Apr 2022 • Karim Guirguis, George Eskandar, Matthias Kayser, Bin Yang, Juergen Beyerer
First, we leverage a meta-training paradigm, where we learn the domain shift on the base classes, then transfer the domain knowledge to the novel classes.
no code implementations • 7 Apr 2022 • Tung Kieu, Bin Yang, Chenjuan Guo, Christian S. Jensen, Yan Zhao, Feiteng Huang, Kai Zheng
This is an extended version of "Robust and Explainable Autoencoders for Unsupervised Time Series Outlier Detection", to appear in IEEE ICDE 2022.
1 code implementation • 30 Mar 2022 • Sean Bin Yang, Chenjuan Guo, Jilin Hu, Bin Yang, Jian Tang, Christian S. Jensen
In this setting, it is essential to learn generic temporal path representations (TPRs) that consider spatial and temporal correlations simultaneously and that can be used in different applications, i.e., downstream tasks.
1 code implementation • 29 Mar 2022 • Razvan-Gabriel Cirstea, Bin Yang, Chenjuan Guo, Tung Kieu, Shirui Pan
Such spatio-temporal agnostic models employ a shared parameter space irrespective of the time series locations and the time periods and they assume that the temporal patterns are similar across locations and do not evolve across time, which may not always hold, thus leading to sub-optimal results.
1 code implementation • 28 Mar 2022 • Ruizhe Cao, Sherif Abdulatif, Bin Yang
The estimation of magnitude and complex spectrogram is decoupled in the decoder stage and then jointly incorporated to reconstruct the enhanced speech.
Automatic Speech Recognition • Automatic Speech Recognition (ASR) • +3
no code implementations • 7 Mar 2022 • George Eskandar, Robert A. Marsden, Pavithran Pandiyan, Mario Döbler, Karim Guirguis, Bin Yang
Integrating different representations from complementary sensing modalities is crucial for robust scene interpretation in autonomous driving.
1 code implementation • 17 Feb 2022 • Ming Jin, Yu Zheng, Yuan-Fang Li, Siheng Chen, Bin Yang, Shirui Pan
Multivariate time series forecasting has long received significant attention in real-world applications, such as energy consumption and traffic prediction.
no code implementations • 8 Feb 2022 • George Eskandar, Sanjeev Sudarsan, Karim Guirguis, Janaranjani Palaniswamy, Bharath Somashekar, Bin Yang
Lidar sensors are costly yet critical for understanding the 3D environment in autonomous driving.
no code implementations • 9 Jan 2022 • Bin Yang, Shuang Li, Jinglang Feng, Massimiliano Vasile
The intelligent initial guess generator is a deep neural network that is trained to correct the initial velocity vector coming from the solution of the unperturbed Lambert problem.
no code implementations • 21 Dec 2021 • Xinle Wu, Dalin Zhang, Chenjuan Guo, Chaoyang He, Bin Yang, Christian S. Jensen
Specifically, we design both a micro and a macro search space to model possible architectures of ST-blocks and the connections among heterogeneous ST-blocks, and we provide a search strategy that is able to jointly explore the search spaces to identify optimal forecasting models.
no code implementations • CVPR 2022 • Miao Zhang, Jilin Hu, Steven Su, Shirui Pan, Xiaojun Chang, Bin Yang, Gholamreza Haffari
Differentiable Architecture Search (DARTS) has received massive attention in recent years, mainly because it significantly reduces the computational cost through weight sharing and continuous relaxation.
no code implementations • 22 Nov 2021 • David Campos, Tung Kieu, Chenjuan Guo, Feiteng Huang, Kai Zheng, Bin Yang, Christian S. Jensen
To improve accuracy, the ensemble employs multiple basic outlier detection models built on convolutional sequence-to-sequence autoencoders that can capture temporal dependencies in time series.
no code implementations • 29 Sep 2021 • Ming Jin, Yuan-Fang Li, Yu Zheng, Bin Yang, Shirui Pan
Spatiotemporal representation learning on multivariate time series has received tremendous attention in forecasting traffic and energy data.
1 code implementation • 29 Sep 2021 • George Eskandar, Mohamed Abdelsamad, Karim Armanious, Bin Yang
Semantic Image Synthesis (SIS) is a subclass of image-to-image translation where a photorealistic image is synthesized from a segmentation mask.
no code implementations • 27 Sep 2021 • Kanil Patel, William Beluch, Kilian Rambach, Michael Pfeiffer, Bin Yang
The focus of this article is to learn deep radar spectra classifiers which offer robust real-time uncertainty estimates using label smoothing during training.
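Label smoothing, as mentioned above, can be sketched in a few lines. This is the standard uniform variant, shown only as an illustration; the abstract does not say which variant or smoothing strength the article uses, so the `epsilon` value and the helper names `smooth_labels` and `cross_entropy` are assumptions.

```python
import math


def smooth_labels(class_index, num_classes, epsilon=0.1):
    """Build a smoothed target distribution for one training example.

    Every class receives epsilon / num_classes, and the true class
    additionally receives the remaining 1 - epsilon.
    """
    if not 0.0 <= epsilon < 1.0:
        raise ValueError("epsilon must lie in [0, 1)")
    uniform = epsilon / num_classes
    target = [uniform] * num_classes
    target[class_index] += 1.0 - epsilon
    return target


def cross_entropy(target, probs):
    """Cross-entropy between a target distribution and predicted probabilities."""
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0.0)


# Hypothetical 4-class example: the true class is index 1.
target = smooth_labels(class_index=1, num_classes=4, epsilon=0.2)
confident = [0.001, 0.997, 0.001, 0.001]
moderate = [0.05, 0.85, 0.05, 0.05]
print(cross_entropy(target, confident), cross_entropy(target, moderate))
```

Because the smoothed target never places all mass on one class, the loss is minimized by predictions that match the smoothed distribution rather than by fully saturated ones, which is why the technique is commonly associated with less overconfident classifiers.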
no code implementations • 6 Sep 2021 • Nico Reick, Felix Wiewel, Alexander Bartler, Bin Yang
One major drawback of state-of-the-art artificial intelligence is its lack of explainability.
1 code implementation • 22 Jul 2021 • YuFei Wang, Yiqing Shen, Meng Yuan, Jing Xu, Bin Yang, Chi Liu, Wenjia Cai, Weijing Cheng, Wei Wang
The large-scale OCTA dataset is available at https://doi.org/10.5281/zenodo.5111975, https://doi.org/10.5281/zenodo.5111972.
1 code implementation • 19 Jul 2021 • Thomas Küstner, Jiazhen Pan, Haikun Qi, Gastao Cruz, Christopher Gilliam, Thierry Blu, Bin Yang, Sergios Gatidis, René Botnar, Claudia Prieto
Physiological motion, such as cardiac and respiratory motion, during Magnetic Resonance (MR) image acquisition can cause image artifacts.
1 code implementation • 17 Jun 2021 • Sean Bin Yang, Chenjuan Guo, Jilin Hu, Jian Tang, Bin Yang
In the global view, PIM distinguishes the representations of the input paths from those of the negative paths.
no code implementations • 1 Jun 2021 • Kanil Patel, William Beluch, Kilian Rambach, Adriana-Eliza Cozma, Michael Pfeiffer, Bin Yang
Deep learning (DL) has recently attracted increasing interest as a means of improving object type classification for automotive radar. In addition to high accuracy, it is crucial for decision making in autonomous vehicles to evaluate the reliability of the predictions; however, decisions of DL networks are non-transparent.
no code implementations • 5 May 2021 • Robert A. Marsden, Alexander Bartler, Mario Döbler, Bin Yang
To avoid the costly annotation of training data for unseen domains, unsupervised domain adaptation (UDA) attempts to provide efficient knowledge transfer from a labeled source domain to an unlabeled target domain.
Ranked #20 on Synthetic-to-Real Translation on SYNTHIA-to-Cityscapes
no code implementations • 29 Apr 2021 • Bin Liu, Yaxu Wang, Guangzu Zhao, Bin Yang, Ruirui Wang, Dexiang Huang, Bin Xiang
Therefore, this paper proposes an intelligent decision method for the main control parameters of the TBM based on the multi-objective optimization of excavation efficiency and cost.
1 code implementation • 30 Mar 2021 • Alexander Bartler, Andre Bühler, Felix Wiewel, Mario Döbler, Bin Yang
By minimizing the self-supervised loss, we learn task-specific model parameters for different tasks.
no code implementations • 19 Mar 2021 • Razvan-Gabriel Cirstea, Chenjuan Guo, Bin Yang
For example, speed sensors are deployed in different locations in a road network, where the speed of a specific location across time is captured by the corresponding sensor as a time series, resulting in multiple speed time series from different locations, which are often correlated.
no code implementations • 15 Mar 2021 • Karim Armanious, Sherif Abdulatif, Wenbin Shi, Tobias Hepp, Sergios Gatidis, Bin Yang
We apply the proposed methodology on a brain MRI dataset containing healthy individuals as well as Alzheimer's patients.
no code implementations • 8 Mar 2021 • Yiwen Liao, Alexander Bartler, Bin Yang
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
no code implementations • 26 Feb 2021 • Chenxi Zhou, Bin Yang, Wenliang Fan, Wei Li
(3) The detection of neural disease was shown to benefit from the thermodynamic model, implying the immense potential of thermodynamics in auxiliary diagnosis.
no code implementations • 19 Feb 2021 • George Eskandar, Alexander Braun, Martin Meinke, Karim Armanious, Bin Yang
Our algorithm is able to address the limitations of previous video prediction frameworks when dealing with sparse data by spatially inpainting the depth maps in the upcoming frames.
1 code implementation • 19 Feb 2021 • Felix Wiewel, Bin Yang
While many recently proposed methods for continual learning use some training examples for rehearsal, their performance strongly depends on the number of stored examples.
1 code implementation • CVPR 2019 • Wenyuan Zeng, Wenjie Luo, Simon Suo, Abbas Sadat, Bin Yang, Sergio Casas, Raquel Urtasun
In this paper, we propose a neural motion planner (NMP) for learning to drive autonomously in complex urban scenarios that include traffic-light handling, yielding, and interactions with multiple road-users.
no code implementations • 17 Jan 2021 • Bin Yang, Min Bai, Ming Liang, Wenyuan Zeng, Raquel Urtasun
The key idea is to decompose the 4D object label into two parts: the object size in 3D that's fixed through time for rigid objects, and the motion path describing the evolution of the object's pose through time.
1 code implementation • 17 Jan 2021 • Yan Wang, Bin Yang, Rui Hu, Ming Liang, Raquel Urtasun
In this paper we propose a model that unifies these two tasks and performs them in the same metric space.
no code implementations • 16 Jan 2021 • Abbas Sadat, Sean Segal, Sergio Casas, James Tu, Bin Yang, Raquel Urtasun, Ersin Yumer
Our experiments on a wide range of tasks and models show that the proposed curation pipeline is able to select datasets that lead to better generalization and higher performance.
no code implementations • ICCV 2019 • Yun Chen, Bin Yang, Ming Liang, Raquel Urtasun
In this paper, we tackle the problem of depth completion from RGBD data.
no code implementations • CVPR 2018 • Wenjie Luo, Bin Yang, Raquel Urtasun
In this paper we propose a novel deep neural network that is able to jointly reason about 3D detection, tracking and motion forecasting given data captured by a 3D sensor.
no code implementations • CVPR 2019 • Ming Liang, Bin Yang, Yun Chen, Rui Hu, Raquel Urtasun
In this paper we propose to exploit multiple related tasks for accurate multi-sensor 3D object detection.
Ranked #13 on 3D Object Detection on KITTI Cars Easy
no code implementations • 21 Dec 2020 • Bin Yang, Ming Liang, Raquel Urtasun
In this paper we show that High-Definition (HD) maps provide strong priors that can boost the performance and robustness of modern 3D object detectors.
no code implementations • ECCV 2018 • Ming Liang, Bin Yang, Shenlong Wang, Raquel Urtasun
In this paper, we propose a novel 3D object detector that can exploit both LIDAR and cameras to perform very accurate localization.
no code implementations • 16 Nov 2020 • Ze Yang, Siva Manivasagam, Ming Liang, Bin Yang, Wei-Chiu Ma, Raquel Urtasun
We then incorporate the reconstructed pedestrian assets bank in a realistic LiDAR simulation system by performing motion retargeting, and show that the simulated LiDAR data can be used to significantly reduce the amount of annotated real-world data required for visual perception tasks.
no code implementations • 2 Nov 2020 • Bob Wei, Mengye Ren, Wenyuan Zeng, Ming Liang, Bin Yang, Raquel Urtasun
In this paper, we propose an end-to-end self-driving network featuring a sparse attention module that learns to automatically attend to important regions of the input.
no code implementations • 26 Oct 2020 • Yiwen Liao, Raphaël Latty, Bin Yang
Feature selection is generally used as one of the most important preprocessing techniques in machine learning, as it helps to reduce the dimensionality of data and assists researchers and practitioners in understanding data.
no code implementations • 20 Oct 2020 • Sherif Abdulatif, Karim Armanious, Jayasankar T. Sajeev, Karim Guirguis, Bin Yang
Recent years have seen a surge in the number of available frameworks for speech enhancement (SE) and recognition.
1 code implementation • 22 Sep 2020 • Karim Armanious, Sherif Abdulatif, Wenbin Shi, Shashank Salian, Thomas Küstner, Daniel Weiskopf, Tobias Hepp, Sergios Gatidis, Bin Yang
Thus, a whole-body assessment of the BA does not reflect the deviations of aging behavior between organs.
3 code implementations • ECCV 2020 • Tsun-Hsuan Wang, Sivabalan Manivasagam, Ming Liang, Bin Yang, Wenyuan Zeng, James Tu, Raquel Urtasun
In this paper, we explore the use of vehicle-to-vehicle (V2V) communication to improve the perception and motion forecasting performance of self-driving vehicles.
Ranked #1 on 3D Object Detection on OPV2V
no code implementations • 13 Aug 2020 • Lingyun Luke Li, Bin Yang, Ming Liang, Wenyuan Zeng, Mengye Ren, Sean Segal, Raquel Urtasun
We show that our approach can outperform the state-of-the-art on both datasets.
no code implementations • ECCV 2020 • Kelvin Wong, Qiang Zhang, Ming Liang, Bin Yang, Renjie Liao, Abbas Sadat, Raquel Urtasun
We present a novel method for testing the safety of self-driving vehicles in simulation.
no code implementations • ECCV 2020 • Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun
In this paper, we propose the Deep Structured self-Driving Network (DSDNet), which performs object detection, motion prediction, and motion planning with a single neural network.
1 code implementation • 5 Aug 2020 • Thomas Küstner, Tobias Hepp, Marc Fischer, Martin Schwartz, Andreas Fritsche, Hans-Ulrich Häring, Konstantin Nikolaou, Fabian Bamberg, Bin Yang, Fritz Schick, Sergios Gatidis, Jürgen Machann
Methods: Quantification and localization of different adipose tissue compartments from whole-body MR images are of high interest for examining metabolic conditions.
1 code implementation • 1 Aug 2020 • Jing Shi, Zhiheng Li, Haitian Zheng, Yihang Xu, Tianyou Xiao, Weitao Tan, Xiaoning Guo, Sizhe Li, Bin Yang, Zhexin Xu, Ruitao Lin, Zhongkai Shangguan, Yue Zhao, Jingwen Wang, Rohan Sharma, Surya Iyer, Ajinkya Deshmukh, Raunak Mahalik, Srishti Singh, Jayant G Rohra, Yi-Peng Zhang, Tongyu Yang, Xuan Wen, Ethan Fahnestock, Bryce Ikeda, Ian Lawson, Alan Finkelstein, Kehao Guo, Richard Magnotti, Andrew Sexton, Jeet Ketan Thaker, Yiyang Su, Chenliang Xu
This technical report summarizes and compiles submissions from the Actor-Action video classification challenge, held as the final project of the CSC 249/449 Machine Vision course (Spring 2020) at the University of Rochester.
1 code implementation • 29 Jul 2020 • Thomas Buhl Andersen, Rógvi Eliasen, Mikkel Jarlund, Bin Yang
We contribute to the advancement of this field by making accessible a benchmark dataset collected using a commercially available sensor setup from 20 persons covering 18 unique gestures, in the hope of allowing further comparison of results as well as easier entry into this field of research.
no code implementations • ECCV 2020 • Bin Yang, Runsheng Guo, Ming Liang, Sergio Casas, Raquel Urtasun
We tackle the problem of exploiting Radar for perception in the context of self-driving as Radar provides complementary information to other sensors such as LiDAR or cameras in the form of Doppler velocity.
1 code implementation • ECCV 2020 • Ming Liang, Bin Yang, Rui Hu, Yun Chen, Renjie Liao, Song Feng, Raquel Urtasun
We propose a motion forecasting model that exploits a novel structured map representation as well as actor-map interactions.
1 code implementation • ICLR 2021 • Kanil Patel, William Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang
The goal of this paper is to resolve the identified issues of HB in order to provide calibrated confidence estimates using only a small holdout calibration dataset for bin optimization while preserving multi-class ranking accuracy.
no code implementations • CVPR 2020 • Sivabalan Manivasagam, Shenlong Wang, Kelvin Wong, Wenyuan Zeng, Mikita Sazanovich, Shuhan Tan, Bin Yang, Wei-Chiu Ma, Raquel Urtasun
We first utilize ray casting over the 3D scene and then use a deep neural network to produce deviations from the physics-based simulation, producing realistic LiDAR point clouds.
no code implementations • CVPR 2020 • Ming Liang, Bin Yang, Wenyuan Zeng, Yun Chen, Rui Hu, Sergio Casas, Raquel Urtasun
We tackle the problem of joint perception and motion forecasting in the context of self-driving vehicles.
no code implementations • CVPR 2020 • James Tu, Mengye Ren, Siva Manivasagam, Ming Liang, Bin Yang, Richard Du, Frank Cheng, Raquel Urtasun
Modern autonomous driving systems rely heavily on deep learning models to process point cloud sensory data; meanwhile, deep models have been shown to be susceptible to adversarial attacks with visually imperceptible perturbations.
2 code implementations • 3 Mar 2020 • Karim Guirguis, Christoph Schorn, Andre Guntoro, Sherif Abdulatif, Bin Yang
The understanding of the surrounding environment plays a critical role in autonomous robotic systems, such as self-driving cars.
no code implementations • 26 Feb 2020 • Jilin Hu, Jianbing Shen, Bin Yang, Ling Shao
Graph convolutional neural networks (GCNs) have recently demonstrated promising results on graph-based semi-supervised classification, but little work has been done to explore their theoretical properties.
no code implementations • 16 Dec 2019 • Kanil Patel, William Beluch, Dan Zhang, Michael Pfeiffer, Bin Yang
Uncertainty estimates help to identify ambiguous, novel, or anomalous inputs, but the reliable quantification of uncertainty has proven to be challenging for modern deep networks.
no code implementations • 21 Oct 2019 • Karim Armanious, Vijeth Kumar, Sherif Abdulatif, Tobias Hepp, Sergios Gatidis, Bin Yang
Local deformations in medical modalities are common phenomena due to a multitude of factors such as metallic implants or limited fields of view in magnetic resonance imaging (MRI).
no code implementations • 21 Oct 2019 • Sherif Abdulatif, Karim Armanious, Karim Guirguis, Jayasankar T. Sajeev, Bin Yang
Automatic speech recognition (ASR) systems are of vital importance nowadays in commonplace tasks such as speech-to-text processing and language translation.
no code implementations • 14 Oct 2019 • Karim Armanious, Sherif Abdulatif, Anish Rao Bhaktharaguttu, Thomas Küstner, Tobias Hepp, Sergios Gatidis, Bin Yang
Individuals age differently depending on a multitude of different factors such as lifestyle, medical history and genetics.
no code implementations • 12 Oct 2019 • Karim Armanious, Aastha Tanwar, Sherif Abdulatif, Thomas Küstner, Sergios Gatidis, Bin Yang
Motion is one of the main sources of artifacts in magnetic resonance (MR) images.
1 code implementation • 9 Aug 2019 • Björn Barz, Kai Schröter, Moritz Münch, Bin Yang, Andrea Unger, Doris Dransch, Joachim Denzler
The analysis of natural disasters such as floods in a timely manner often suffers from limited data due to a coarse distribution of sensors or sensor failures.
no code implementations • 9 Jul 2019 • Sean Bin Yang, Bin Yang
The objective function is designed to consider errors on both ranking scores and spatial properties, making the framework a multi-task learning framework.
no code implementations • 6 Jun 2019 • Felix Wiewel, Bin Yang
Artificial neural networks (ANNs) suffer from catastrophic forgetting when trained on a sequence of tasks.
no code implementations • 8 May 2019 • Bin Yang, Lin Yang, Xiaochun Li, Wenhan Zhang, Hua Zhou, Yequn Zhang, Yongxiong Ren, Yinbo Shi
Image retrieval utilizes image descriptors to retrieve the most similar images to a given query image.
no code implementations • ICLR 2019 • Alexander Bartler, Felix Wiewel, Bin Yang, Lukas Mauch
In this paper, we propose an easy method to train VAEs with binary or categorically valued latent representations.
no code implementations • 12 Mar 2019 • Patrick Schlachter, Yiwen Liao, Bin Yang
This makes it possible to model the abnormal classes using atypical normal samples.
no code implementations • 8 Mar 2019 • Karim Armanious, Chenming Jiang, Sherif Abdulatif, Thomas Küstner, Sergios Gatidis, Bin Yang
The proposed framework utilizes new non-adversarial cycle losses which direct the framework to minimize the textural and perceptual discrepancies in the translated images.
no code implementations • 4 Mar 2019 • Karim Armanious, Sherif Abdulatif, Fady Aziz, Urs Schneider, Bin Yang
Radar is of vital importance in many fields, such as autonomous driving, safety and surveillance applications.
2 code implementations • CVPR 2018 • Bin Yang, Wenjie Luo, Raquel Urtasun
Existing approaches are, however, expensive in computation due to high dimensionality of point clouds.
Ranked #8 on Birds Eye View Object Detection on KITTI Cars Hard
no code implementations • 4 Feb 2019 • Patrick Schlachter, Yiwen Liao, Bin Yang
In one-class classification, only samples of one normal class are available for training.
no code implementations • 10 Jan 2019 • Patrick Schlachter, Bin Yang
Active learning methods play an important role in reducing the effort of manual labeling in the field of machine learning.
no code implementations • 20 Dec 2018 • Patrick Schlachter, Yiwen Liao, Bin Yang
This paper proposes a novel generic one-class feature learning method based on intra-class splitting.
no code implementations • 16 Dec 2018 • Xiaogang Cheng, Bin Yang, Kaige Tan, Erik Isaksson, Liren Li, Anders Hedman, Thomas Olofsson, Hai-Bo Li
Due to the challenges of intra- and inter-individual differences and skin subtleness variations, there has so far been no satisfactory solution for thermal comfort measurement.
no code implementations • 17 Nov 2018 • Sherif Abdulatif, Fady Aziz, Karim Armanious, Bernhard Kleiner, Bin Yang, Urs Schneider
In our proposed experimental setup, a treadmill is used to collect $\mu$-D signatures of 22 subjects with different genders and body characteristics.
no code implementations • 13 Nov 2018 • Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen, Lu Chen
Origin-destination (OD) matrices are often used in urban planning, where a city is partitioned into regions and an element (i, j) in an OD matrix records the cost (e.g., travel time, fuel consumption, or travel speed) from region i to region j.
no code implementations • 12 Nov 2018 • Sherif Abdulatif, Karim Armanious, Fady Aziz, Urs Schneider, Bin Yang
Two sets of experiments were collected on 22 subjects walking on a treadmill at an intermediate velocity using a 25 GHz CW radar.
no code implementations • 23 Oct 2018 • Lukas Mauch, Bin Yang
Deep neural networks (DNNs) are powerful models for many pattern recognition tasks, yet their high computational complexity and memory requirements limit them to applications on high-performance computing platforms.
no code implementations • 15 Oct 2018 • Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang
Numerous factors could lead to partial deteriorations of medical images.
no code implementations • 17 Sep 2018 • Karim Armanious, Sergios Gatidis, Konstantin Nikolaou, Bin Yang, Thomas Küstner
Motion artifacts are a primary source of magnetic resonance (MR) image quality deterioration with strong repercussions on diagnostic performance.
no code implementations • 29 Aug 2018 • Razvan-Gabriel Cirstea, Darius-Valer Micu, Gabriel-Marcel Muresan, Chenjuan Guo, Bin Yang
To enable accurate forecasting on such correlated time series, this paper proposes two models that combine convolutional neural networks (CNNs) and recurrent neural networks (RNNs).
no code implementations • 25 Jun 2018 • Thomas Küstner, Sergios Gatidis, Annika Liebgott, Martin Schwartz, Lukas Mauch, Petros Martirosian, Holger Schmidt, Nina F. Schwenzer, Konstantin Nikolaou, Fabian Bamberg, Bin Yang, Fritz Schick
Therefore, assessing or ensuring sufficient image quality in an automated manner is of high interest.
no code implementations • 17 Jun 2018 • Karim Armanious, Chenming Jiang, Marc Fischer, Thomas Küstner, Konstantin Nikolaou, Sergios Gatidis, Bin Yang
Image-to-image translation is considered a new frontier in the field of medical image analysis, with numerous potential applications.
9 code implementations • ICML 2018 • Mengye Ren, Wenyuan Zeng, Bin Yang, Raquel Urtasun
Deep neural networks have been shown to be very powerful modeling tools for many supervised learning tasks involving complex input patterns.
5 code implementations • 6 Mar 2018 • Changsong Yu, Karim Said Barsim, Qiuqiang Kong, Bin Yang
The objective of audio classification is to predict the presence or absence of audio events in an audio clip.
no code implementations • 22 Feb 2018 • Chenjuan Guo, Bin Yang, Jilin Hu, Christian S. Jensen
In the second step, we exploit the above graph-like structure to achieve a comprehensive trajectory-based routing solution.
no code implementations • 20 Feb 2018 • Karim Said Barsim, Lukas Mauch, Bin Yang
The problem of identifying end-use electrical appliances from their individual consumption profiles, known as the appliance identification problem, is a primary stage in both Non-Intrusive Load Monitoring (NILM) and automated plug-wise metering.
no code implementations • 5 Feb 2018 • Karim Said Barsim, Bin Yang
Recently, and with the growing development of big energy datasets, data-driven learning techniques began to represent a potential solution to the energy disaggregation problem outperforming engineered and hand-crafted models.
no code implementations • 2 Feb 2018 • Karim Said Barsim, Lirong Yang, Bin Yang
In this paper, we propose a multi-generator extension to the adversarial training framework, in which the objective of each generator is to represent a unique component of a target mixture distribution.
2 code implementations • CVPR 2018 • Mengye Ren, Andrei Pokrovsky, Bin Yang, Raquel Urtasun
Conventional deep convolutional neural networks (CNNs) apply convolution operators uniformly in space across all feature maps for hundreds of layers - this incurs a high computational cost for real-time applications.
no code implementations • ICCV 2017 • Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun
In this paper we introduce the TorontoCity benchmark, which covers the full greater Toronto area (GTA) with 712.5 $km^2$ of land, 8439 $km$ of road and around 400,000 buildings.
1 code implementation • 8 Oct 2016 • Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang
The effectiveness of GBD-Net is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO.
1 code implementation • CVPR 2016 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories.
1 code implementation • 9 Apr 2016 • Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, Wanli Ouyang
Temporal and contextual information of videos is not fully investigated and utilized.
1 code implementation • ICCV 2015 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
With the combination of CNN features and boosting forest, CCF benefits from the richer capacity in feature representation compared with channel features, as well as lower cost in computation and storage compared with end-to-end CNN methods.
no code implementations • 15 Jul 2014 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
Face detection has drawn much attention in recent decades since the seminal work by Viola and Jones.
Ranked #37 on Face Detection on WIDER Face (Medium)
no code implementations • 5 Nov 2013 • Bin Yang, William Zhu
Second, we study the connectivity of matroids by means of relation-based rough sets, and present some conditions under which a general matroid is connected.
no code implementations • 2 Nov 2013 • Bin Yang, Hong Zhao, William Zhu
First, we investigate some properties of the definable sets with respect to a covering.
no code implementations • 2 Aug 2013 • Bin Yang, Manohar Kaul, Christian S. Jensen
This paper formulates and addresses the problem of annotating all edges in a road network with travel cost based weights from a set of trips in the network that cover only a small fraction of the edges, each with an associated ground-truth travel cost.