Search Results for author: Bilei Zhu

Found 11 papers, 5 papers with code

ByteComposer: a Human-like Melody Composition Method based on Language Model Agent

no code implementations • 24 Feb 2024 • Xia Liang, Xingjian Du, Jiaju Lin, Pei Zou, Yuan Wan, Bilei Zhu

Large Language Models (LLM) have shown encouraging progress in multimodal understanding and generation tasks.

Language Modelling Music Generation

Paper
Add Code

Joint Music and Language Attention Models for Zero-shot Music Tagging

no code implementations • 16 Oct 2023 • Xingjian Du, Zhesong Yu, Jiaju Lin, Bilei Zhu, Qiuqiang Kong

However, previous music tagging research primarily focuses on close-set music tagging tasks which can not be generalized to new tags.

Audio Tagging Decoder +1

Paper
Add Code

ByteCover3: Accurate Cover Song Identification on Short Queries

no code implementations • 21 Mar 2023 • Xingjian Du, Zijie Wang, Xia Liang, Huidong Liang, Bilei Zhu, Zejun Ma

Deep learning based methods have become a paradigm for cover song identification (CSI) in recent years, where the ByteCover systems have achieved state-of-the-art results on all the mainstream datasets of CSI.

Cover song identification Retrieval

Paper
Add Code

Graph Contrastive Learning with Implicit Augmentations

1 code implementation • 7 Nov 2022 • Huidong Liang, Xingjian Du, Bilei Zhu, Zejun Ma, Ke Chen, Junbin Gao

Existing graph contrastive learning methods rely on augmentation techniques based on random perturbations (e. g., randomly adding or dropping edges and nodes).

Contrastive Learning Graph Classification +1

3

Paper
Code

BYTECOVER2: TOWARDS DIMENSIONALITY REDUCTION OF LATENT EMBEDDING FOR EFFICIENT COVER SONG IDENTIFICATION

no code implementations • ICASSP 2022 • Xingjian Du, Ke Chen, Zijie Wang, Bilei Zhu, Zejun Ma

Convolutional neural network (CNN)-based methods have dominated the recent research of cover song identification (CSI).

Ranked #1 on Cover song identification on SHS100K-TEST

Cover song identification Dimensionality Reduction

Paper
Add Code

HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection

1 code implementation • 2 Feb 2022 • Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov

To combat these problems, we introduce HTS-AT: an audio transformer with a hierarchical structure to reduce the model size and training time.

Ranked #4 on Sound Event Detection on DESED

Audio Classification Event Detection +3

317

Paper
Code

Zero-shot Audio Source Separation through Query-based Learningfrom Weakly-labeled Data

no code implementations • AAAI 2021 • Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov

Our approach uses a single model for source separation of multiple sound types, and relies solely on weakly-labeled data for training.

Audio Source Separation Event Detection +2

Paper
Add Code

Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data

1 code implementation • 15 Dec 2021 • Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov

Our approach uses a single model for source separation of multiple sound types, and relies solely on weakly-labeled data for training.

Ranked #1 on Audio Source Separation on AudioSet

Audio Source Separation Audio Tagging +3

168

Paper
Code

ByteCover: Cover Song Identification via Multi-Loss Training

1 code implementation • 27 Oct 2020 • Xingjian Du, Zhesong Yu, Bilei Zhu, Xiaoou Chen, Zejun Ma

We present in this paper ByteCover, which is a new feature learning method for cover song identification (CSI).

Ranked #2 on Cover song identification on Da-TACOS

Cover song identification

21

Paper
Code

Rule-embedded network for audio-visual voice activity detection in live musical video streams

1 code implementation • 27 Oct 2020 • Yuanbo Hou, Yi Deng, Bilei Zhu, Zejun Ma, Dick Botteldooren

Detecting anchor's voice in live musical streams is an important preprocessing for music and speech signal processing.

Sound Multimedia Audio and Speech Processing

11

Paper
Code

Contrastive Unsupervised Learning for Audio Fingerprinting

no code implementations • 26 Oct 2020 • Zhesong Yu, Xingjian Du, Bilei Zhu, Zejun Ma

The rise of video-sharing platforms has attracted more and more people to shoot videos and upload them to the Internet.

Contrastive Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.