1 code implementation • 28 Nov 2023 • Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen
To our knowledge, this is the first work that deploys an effective combination scheme of regression (denoising) and classification (ASR) objectives to derive a general pre-processor applicable to various unseen ASR systems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 28 Nov 2023 • Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao
The performance of speaker verification (SV) models may drop dramatically in noisy environments.
2 code implementations • 18 Oct 2023 • Zong-Wei Hong, Yu-Chen Lin, Hsuan-Tung Liu, Yi-Ren Yeh, Chu-Song Chen
Although face anti-spoofing (FAS) methods have achieved remarkable performance on specific domains or attack types, few studies have focused on the simultaneous presence of domain changes and unknown attacks, which is closer to real application scenarios.
1 code implementation • ICCV 2023 • Yu-Hsing Hsieh, Guan-Sheng Chen, Shun-Xian Cai, Ting-Yun Wei, Huei-Fang Yang, Chu-Song Chen
To our knowledge, this is the first work on weakly-supervised continual learning for instance segmentation of images.
1 code implementation • 4 Aug 2022 • Yao-Chih Lee, Kuan-Wei Tseng, Guan-Sheng Chen, Chu-Song Chen
It can improve the robustness of learning-based methods with flow-guided keyframes and well-established depth prior.
no code implementations • 18 Jun 2022 • Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao
NASTAR uses a feedback mechanism to simulate adaptive training data via a noise extractor and a retrieval model.
1 code implementation • CVPR 2022 • Timmy S. T. Wan, Jun-Cheng Chen, Tzer-Yi Wu, Chu-Song Chen
In visual search, the gallery set could be incrementally growing and added to the database in practice.
no code implementations • ICCV 2021 • Wen-Cheng Chen, Min-Chun Hu, Chu-Song Chen
The STR mechanism treats the spatial transformation as the message passing process, and the relation between the view poses and the routing weights is modeled by an end-to-end trainable neural network.
1 code implementation • 22 Jun 2021 • Hau Chu, Jia-Hong Lee, Yao-Chih Lee, Ching-Hsien Hsu, Jia-Da Li, Chu-Song Chen
This paper introduces an approach for multi-human 3D pose estimation and tracking based on calibrated multi-view.
Ranked #6 on 3D Multi-Person Pose Estimation on Campus
no code implementations • CVPR 2021 • Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung
We take advantage of the recent self-supervised framework on jointly learning depth and camera ego-motion estimation on raw videos.
1 code implementation • 22 May 2021 • Chih-Ting Liu, Jun-Cheng Chen, Chu-Song Chen, Shao-Yi Chien
Besides, we discover the errors not only for the identity labels of tracklets but also for the evaluation protocol for the test data of MARS.
1 code implementation • 7 Dec 2020 • Cheng-Hao Tu, Cheng-En Wu, Chu-Song Chen
Although CondConv is effective for the performance enhancement of a deep model, it is currently applied to individual tasks only.
Ranked #1 on Continual Learning on Flowers (Fine-grained 6 Tasks)
1 code implementation • 15 Sep 2020 • Ashesh, Chu-Song Chen, Hsuan-Tien Lin
Technically, the gaze information can be inferred from two different magnification levels: face orientation and eye orientation.
no code implementations • 15 Oct 2019 • Cheng-En Wu, Yi-Ming Chan, Chien-Hung Chen, Wen-Cheng Chen, Chu-Song Chen
It is hard to detect on-road objects under various lighting conditions.
1 code implementation • NeurIPS 2019 • Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, Chu-Song Chen
First, it can avoid forgetting (i. e., learn new tasks while remembering all previous tasks).
2 code implementations • Proceedings of the 2019 on International Conference on Multimedia Retrieval 2019 • Steven C. Y. Hung, Jia-Hong Lee, Timmy S. T. Wan, Chein-Hung Chen, Yi-Ming Chan, Chu-Song Chen
Simultaneously running multiple modules is a key requirement for a smart multimedia system for facial applications including face recognition, facial expression understanding, and gender identification.
Ranked #1 on Gender Prediction on FotW Gender (using extra training data)
no code implementations • 25 Nov 2018 • Shih-Yao Lin, Yen-Yu Lin, Chu-Song Chen, Yi-Ping Hung
This paper aims at recognizing partially observed human actions in videos.
1 code implementation • 26 Oct 2018 • Hsin-Rung Chou, Jia-Hong Lee, Yi-Ming Chan, Chu-Song Chen
Many face recognition systems boost the performance using deep learning models, but only a few researches go into the mechanisms for dealing with online registration.
Ranked #1 on Face Recognition on LFW (Online Open Set) (using extra training data)
1 code implementation • 6 Jun 2018 • Jia-Hong Lee, Yi-Ming Chan, Ting-Yen Chen, Chu-Song Chen
Automatic age and gender classification based on unconstrained images has become essential techniques on mobile devices.
Ranked #9 on Age And Gender Classification on Adience Gender
1 code implementation • 14 May 2018 • Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen
We propose a novel method to merge convolutional neural-nets for the inference stage.
no code implementations • 11 May 2018 • Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, Chu-Song Chen
In this paper, we present an object detection method that tackles the stingray detection problem based on aerial images.
no code implementations • ICCV 2017 • Kuang-Yu Chang, Kung-Hung Lu, Chu-Song Chen
Although aesthetic quality assessment has generated a great deal of interest in the last decade, most studies focus on providing a quality rating of good or bad for an image.
no code implementations • CVPR 2016 • Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie zhou
In this paper, we propose a new unsupervised deep learning approach called DeepBit to learn compact binary descriptor for efficient visual object matching.
1 code implementation • 1 Jul 2015 • Huei-Fang Yang, Kevin Lin, Chu-Song Chen
SSDH is simple and can be realized by a slight enhancement of an existing deep architecture for classification; yet it is effective and outperforms other hashing approaches on several benchmarks and large datasets.
no code implementations • 19 Jun 2015 • Kuan-Wen Chen, Chun-Hsin Wang, Xiao Wei, Qiao Liang, Ming-Hsuan Yang, Chu-Song Chen, Yi-Ping Hung
Augmented reality (AR) displays become more and more popular recently, because of its high intuitiveness for humans and high-quality head-mounted display have rapidly developed.
no code implementations • 9 Dec 2014 • Yao-Hsiang Yang, Lu-Hung Chen, Chieh-Chih Wang, Chu-Song Chen
We propose a Bayesian framework of Gaussian process in order to extend Fisher's discriminant to classify functional data such as spectra and images.