Search Results for author: Ruili Wang

Found 16 papers, 7 papers with code

PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition

no code implementations • 13 Dec 2023 • Chengxi Lei, Satwinder Singh, Feng Hou, Xiaoyun Jia, Ruili Wang

Most of the current speech data augmentation methods operate on either the raw waveform or the amplitude spectrum of speech.

Automatic Speech Recognition Data Augmentation +2

Paper
Add Code

Survey on deep learning in multimodal medical imaging for cancer detection

no code implementations • 4 Dec 2023 • Yan Tian, Zhaocheng Xu, Yujun Ma, Weiping Ding, Ruili Wang, Zhihong Gao, Guohua Cheng, Linyang He, Xuran Zhao

Finally, we discuss the current scope of work and provide directions for the future development of multimodal cancer detection.

object-detection Object Detection

Paper
Add Code

Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization

1 code implementation • 13 Sep 2023 • Zhenguang Liu, Xinyang Yu, Ruili Wang, Shuai Ye, Zhe Ma, Jianfeng Dong, Sifeng He, Feng Qian, Xiaobo Zhang, Roger Zimmermann, Lei Yang

We theoretically analyzed the mutual information between the label and the disentangled features, arriving at a loss that maximizes the extraction of task-relevant information from the original feature.

Disentanglement

Paper
Code

Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition

1 code implementation • 23 Aug 2023 • Yujun Ma, Benjia Zhou, Ruili Wang, Pichao Wang

RGB-D action and gesture recognition remain an interesting topic in human-centered scene understanding, primarily due to the multiple granularities and large variation in human motion.

Gesture Recognition Scene Understanding

Paper
Code

A Novel Self-training Approach for Low-resource Speech Recognition

no code implementations • 10 Aug 2023 • Satwinder Singh, Feng Hou, Ruili Wang

In this paper, we propose a self-training approach for automatic speech recognition (ASR) for low-resource settings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

How to Design Translation Prompts for ChatGPT: An Empirical Study

no code implementations • 5 Apr 2023 • Yuan Gao, Ruili Wang, Feng Hou

Machine translation relies heavily on the abilities of language understanding and generation.

Machine Translation Natural Language Understanding +2

Paper
Add Code

Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection

1 code implementation • CVPR 2023 • Yi Wang, Ruili Wang, Xin Fan, Tianzhu Wang, Xiangjian He

A multi-level hybrid loss is firstly designed to guide the network to learn pixel-level, region-level, and object-level features.

Decoder object-detection +2

Paper
Code

Improved Meta Learning for Low Resource Speech Recognition

no code implementations • 11 May 2022 • Satwinder Singh, Ruili Wang, Feng Hou

We propose a new meta learning based framework for low resource speech recognition that improves the previous model agnostic meta learning (MAML) approach.

Meta-Learning speech-recognition +1

Paper
Add Code

3D Human Motion Prediction: A Survey

no code implementations • 3 Mar 2022 • Kedi Lyu, Haipeng Chen, Zhenguang Liu, Beiqi Zhang, Ruili Wang

3D human motion prediction, predicting future poses from a given sequence, is an issue of great significance and challenge in computer vision and machine intelligence, which can help machines in understanding human behaviors.

Human motion prediction motion prediction

Paper
Add Code

Improving Entity Linking through Semantic Reinforced Entity Embeddings

1 code implementation • ACL 2020 • Feng Hou, Ruili Wang, Jun He, Yi Zhou

We propose a simple yet effective method, FGS2EE, to inject fine-grained semantic information into entity embeddings to reduce the distinctiveness and facilitate the learning of contextual commonality.

Entity Embeddings Entity Linking +1

Paper
Code

TIPCB: A Simple but Effective Part-based Convolutional Baseline for Text-based Person Search

1 code implementation • 25 May 2021 • Yuhao Chen, Guoqing Zhang, Yujiang Lu, zhenxing Wang, yuhui Zheng, Ruili Wang

Text-based person search is a sub-task in the field of image retrieval, which aims to retrieve target person images according to a given textual description.

Ranked #11 on Text based Person Retrieval on CUHK-PEDES

Image Retrieval Person Search +3

Paper
Code

DEEPF0: End-To-End Fundamental Frequency Estimation for Music and Speech Signals

no code implementations • 11 Feb 2021 • Satwinder Singh, Ruili Wang, Yuanhang Qiu

We propose a novel pitch estimation technique called DeepF0, which leverages the available annotated data to directly learns from the raw audio in a data-driven manner.

Information Retrieval Music Information Retrieval +1

Paper
Add Code

Image Synthesis with Adversarial Networks: a Comprehensive Survey and Case Studies

1 code implementation • 26 Dec 2020 • Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Huiyu Zhou, Ruili Wang, M. Emre Celebi, Jie Yang

However, there is a lack of comprehensive review in this field, especially lack of a collection of GANs loss-variant, evaluation metrics, remedies for diverse image generation, and stable training.

Image-to-Image Translation Translation

123

Paper
Code

Road Segmentation for Remote Sensing Images using Adversarial Spatial Pyramid Networks

1 code implementation • 10 Aug 2020 • Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Ruili Wang, Jie Yang

We also propose a feature pyramid network that improves the performance of the proposed model by extracting effective features from all the layers of the network for describing different scales objects.

Domain Adaptation Image Generation +1

Paper
Code

A novel Deep Structure U-Net for Sea-Land Segmentation in Remote Sensing Images

no code implementations • 17 Mar 2020 • Pourya Shamsolmoali, Masoumeh Zareapoor, Ruili Wang, Huiyu Zhou, Jie Yang

This paper presents a novel deep neural network structure for pixel-wise sea-land segmentation, a Residual Dense U-Net (RDU-Net), in complex and high-density remote sensing images.

Segmentation

Paper
Add Code

KDSL: a Knowledge-Driven Supervised Learning Framework for Word Sense Disambiguation

no code implementations • 28 Aug 2018 • Shi Yin, Yi Zhou, Chenguang Li, Shangfei Wang, Jianmin Ji, Xiaoping Chen, Ruili Wang

We propose KDSL, a new word sense disambiguation (WSD) framework that utilizes knowledge to automatically generate sense-labeled data for supervised learning.

Word Sense Disambiguation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.