Search Results for author: Chu-Song Chen

Found 26 papers, 15 papers with code

D4AM: A General Denoising Framework for Downstream Acoustic Models

1 code implementation • 28 Nov 2023 • Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen

To our knowledge, this is the first work that deploys an effective combination scheme of regression (denoising) and classification (ASR) objectives to derive a general pre-processor applicable to various unseen ASR systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Code

LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models

no code implementations • 28 Nov 2023 • Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao

The performance of speaker verification (SV) models may drop dramatically in noisy environments.

Denoising Speaker Verification +1

Paper
Add Code

Domain-Generalized Face Anti-Spoofing with Unknown Attacks

2 code implementations • 18 Oct 2023 • Zong-Wei Hong, Yu-Chen Lin, Hsuan-Tung Liu, Yi-Ren Yeh, Chu-Song Chen

Although face anti-spoofing (FAS) methods have achieved remarkable performance on specific domains or attack types, few studies have focused on the simultaneous presence of domain changes and unknown attacks, which is closer to real application scenarios.

Domain Generalization Face Anti-Spoofing

204

Paper
Code

Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision

1 code implementation • ICCV 2023 • Yu-Hsing Hsieh, Guan-Sheng Chen, Shun-Xian Cai, Ting-Yun Wei, Huei-Fang Yang, Chu-Song Chen

To our knowledge, this is the first work on weakly-supervised continual learning for instance segmentation of images.

Continual Learning Incremental Learning +5

Paper
Code

Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training

1 code implementation • 4 Aug 2022 • Yao-Chih Lee, Kuan-Wei Tseng, Guan-Sheng Chen, Chu-Song Chen

It can improve the robustness of learning-based methods with flow-guided keyframes and well-established depth prior.

Optical Flow Estimation Pose Estimation

Paper
Code

NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

no code implementations • 18 Jun 2022 • Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao

NASTAR uses a feedback mechanism to simulate adaptive training data via a noise extractor and a retrieval model.

Retrieval Speech Enhancement

Paper
Add Code

Continual Learning for Visual Search with Backward Consistent Feature Embedding

1 code implementation • CVPR 2022 • Timmy S. T. Wan, Jun-Cheng Chen, Tzer-Yi Wu, Chu-Song Chen

In visual search, the gallery set could be incrementally growing and added to the database in practice.

Continual Learning

Paper
Code

STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing

no code implementations • ICCV 2021 • Wen-Cheng Chen, Min-Chun Hu, Chu-Song Chen

The STR mechanism treats the spatial transformation as the message passing process, and the relation between the view poses and the routing weights is modeled by an end-to-end trainable neural network.

Paper
Add Code

Part-Aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking

1 code implementation • 22 Jun 2021 • Hau Chu, Jia-Hong Lee, Yao-Chih Lee, Ching-Hsien Hsu, Jia-Da Li, Chu-Song Chen

This paper introduces an approach for multi-human 3D pose estimation and tracking based on calibrated multi-view.

Ranked #6 on 3D Multi-Person Pose Estimation on Campus

3D Human Pose Estimation 3D Human Pose Tracking +3

Paper
Code

3D Video Stabilization With Depth Estimation by CNN-Based Optimization

no code implementations • CVPR 2021 • Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung

We take advantage of the recent self-supervised framework on jointly learning depth and camera ego-motion estimation on raw videos.

3D Reconstruction Depth And Camera Motion +2

Paper
Add Code

Video-based Person Re-identification without Bells and Whistles

1 code implementation • 22 May 2021 • Chih-Ting Liu, Jun-Cheng Chen, Chu-Song Chen, Shao-Yi Chien

Besides, we discover the errors not only for the identity labels of tracklets but also for the evaluation protocol for the test data of MARS.

Video-Based Person Re-Identification

Paper
Code

EXTENDING CONDITIONAL CONVOLUTION STRUCTURES FOR ENHANCING MULTITASKING CONTINUAL LEARNING

1 code implementation • 7 Dec 2020 • Cheng-Hao Tu, Cheng-En Wu, Chu-Song Chen

Although CondConv is effective for the performance enhancement of a deep model, it is currently applied to individual tasks only.

Ranked #1 on Continual Learning on Flowers (Fine-grained 6 Tasks)

Computational Efficiency Continual Learning +1

Paper
Code

360-Degree Gaze Estimation in the Wild Using Multiple Zoom Scales

1 code implementation • 15 Sep 2020 • Ashesh, Chu-Song Chen, Hsuan-Tien Lin

Technically, the gaze information can be inferred from two different magnification levels: face orientation and eye orientation.

Gaze Estimation

Paper
Code

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

no code implementations • 15 Oct 2019 • Cheng-En Wu, Yi-Ming Chan, Chien-Hung Chen, Wen-Cheng Chen, Chu-Song Chen

It is hard to detect on-road objects under various lighting conditions.

Paper
Add Code

Compacting, Picking and Growing for Unforgetting Continual Learning

1 code implementation • NeurIPS 2019 • Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, Chu-Song Chen

First, it can avoid forgetting (i. e., learn new tasks while remembering all previous tasks).

Ranked #1 on Continual Learning on Stanford Cars (Fine-grained 6 Tasks)

Age And Gender Classification Continual Learning +4

116

Paper
Code

Increasingly Packing Multiple Facial-Informatics Modules in A Unified Deep-Learning Model via Lifelong Learning

2 code implementations • Proceedings of the 2019 on International Conference on Multimedia Retrieval 2019 • Steven C. Y. Hung, Jia-Hong Lee, Timmy S. T. Wan, Chein-Hung Chen, Yi-Ming Chan, Chu-Song Chen

Simultaneously running multiple modules is a key requirement for a smart multimedia system for facial applications including face recognition, facial expression understanding, and gender identification.

Ranked #1 on Gender Prediction on FotW Gender (using extra training data)

Age And Gender Classification Continual Learning +4

116

Paper
Code

Learning Conditional Random Fields with Augmented Observations for Partially Observed Action Recognition

no code implementations • 25 Nov 2018 • Shih-Yao Lin, Yen-Yu Lin, Chu-Song Chen, Yi-Ping Hung

This paper aims at recognizing partially observed human actions in videos.

Action Recognition Temporal Action Localization

Paper
Add Code

Data-specific Adaptive Threshold for Face Recognition and Authentication

1 code implementation • 26 Oct 2018 • Hsin-Rung Chou, Jia-Hong Lee, Yi-Ming Chan, Chu-Song Chen

Many face recognition systems boost the performance using deep learning models, but only a few researches go into the mechanisms for dealing with online registration.

Ranked #1 on Face Recognition on LFW (Online Open Set) (using extra training data)

Face Recognition

Paper
Code

Joint Estimation of Age and Gender from Unconstrained Face Images using Lightweight Multi-task CNN for Mobile Applications

1 code implementation • 6 Jun 2018 • Jia-Hong Lee, Yi-Ming Chan, Ting-Yen Chen, Chu-Song Chen

Automatic age and gender classification based on unconstrained images has become essential techniques on mobile devices.

Ranked #9 on Age And Gender Classification on Adience Gender

Age And Gender Classification Classification +2

Paper
Code

Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

1 code implementation • 14 May 2018 • Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen

We propose a novel method to merge convolutional neural-nets for the inference stage.

Paper
Code

Stingray Detection of Aerial Images Using Augmented Training Images Generated by A Conditional Generative Model

no code implementations • 11 May 2018 • Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, Chu-Song Chen

In this paper, we present an object detection method that tackles the stingray detection problem based on aerial images.

Data Augmentation Image Classification +3

Paper
Add Code

Aesthetic Critiques Generation for Photos

no code implementations • ICCV 2017 • Kuang-Yu Chang, Kung-Hung Lu, Chu-Song Chen

Although aesthetic quality assessment has generated a great deal of interest in the last decade, most studies focus on providing a quality rating of good or bad for an image.

Image Captioning

Paper
Add Code

Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks

no code implementations • CVPR 2016 • Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie zhou

In this paper, we propose a new unsupervised deep learning approach called DeepBit to learn compact binary descriptor for efficient visual object matching.

Image Retrieval Object +3

Paper
Add Code

Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks

1 code implementation • 1 Jul 2015 • Huei-Fang Yang, Kevin Lin, Chu-Song Chen

SSDH is simple and can be realized by a slight enhancement of an existing deep architecture for classification; yet it is effective and outperforms other hashing approaches on several benchmarks and large datasets.

Attribute Classification +3

205

Paper
Code

To Know Where We Are: Vision-Based Positioning in Outdoor Environments

no code implementations • 19 Jun 2015 • Kuan-Wen Chen, Chun-Hsin Wang, Xiao Wei, Qiao Liang, Ming-Hsuan Yang, Chu-Song Chen, Yi-Ping Hung

Augmented reality (AR) displays become more and more popular recently, because of its high intuitiveness for humans and high-quality head-mounted display have rapidly developed.

Image Registration Model Compression

Paper
Add Code

Bayesian Fisher's Discriminant for Functional Data

no code implementations • 9 Dec 2014 • Yao-Hsiang Yang, Lu-Hung Chen, Chieh-Chih Wang, Chu-Song Chen

We propose a Bayesian framework of Gaussian process in order to extend Fisher's discriminant to classify functional data such as spectra and images.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.