Search Results for author: Long Ma

Found 41 papers, 14 papers with code

Fast Peer Adaptation with Context-aware Exploration

no code implementations4 Feb 2024 Long Ma, Yuanfei Wang, Fangwei Zhong, Song-Chun Zhu, Yizhou Wang

To do so, it is crucial for the agent to efficiently probe and identify the peer's strategy, as this is the prerequisite for carrying out the best response in adaptation.

DeCoF: Generated Video Detection via Frame Consistency

no code implementations3 Feb 2024 Long Ma, Jiajia Zhang, Hongping Deng, Ningyu Zhang, Yong Liao, Haiyang Yu

The escalating quality of video generated by advanced video generation methods leads to new security challenges in society, which makes generated video detection an urgent research priority.

Video Generation

From Text to Pixels: A Context-Aware Semantic Synergy Solution for Infrared and Visible Image Fusion

no code implementations31 Dec 2023 Xingyuan Li, Yang Zou, JinYuan Liu, Zhiying Jiang, Long Ma, Xin Fan, Risheng Liu

With the rapid progression of deep learning technologies, multi-modality image fusion has become increasingly prevalent in object detection tasks.

Bilevel Optimization Infrared And Visible Image Fusion +2

Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction

no code implementations2 Sep 2023 Gehui Li, JinYuan Liu, Long Ma, Zhiying Jiang, Xin Fan, Risheng Liu

To overcome these limitations, we propose a Macro-Micro-Hierarchical transformer, which consists of a macro attention to capture long-range dependencies, a micro attention to extract local features, and a hierarchical structure for coarse-to-fine correction.

Face Recognition Semantic Segmentation

PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation

3 code implementations8 Aug 2023 Zhu Liu, JinYuan Liu, Benzhuang Zhang, Long Ma, Xin Fan, Risheng Liu

We first conduct systematic analyses about the components of image fusion, investigating the correlation with segmentation robustness under adversarial perturbations.

Infrared And Visible Image Fusion Segmentation +2

Bilevel Generative Learning for Low-Light Vision

1 code implementation7 Aug 2023 Yingchi Liu, Zhu Liu, Long Ma, JinYuan Liu, Xin Fan, Zhongxuan Luo, Risheng Liu

In this study, we propose a generic low-light vision solution by introducing a generative block to convert data from the RAW to the RGB domain.

Bilevel Optimization

Bilevel Fast Scene Adaptation for Low-Light Image Enhancement

1 code implementation2 Jun 2023 Long Ma, Dian Jin, Nan An, JinYuan Liu, Xin Fan, Risheng Liu

A bilevel learning framework is constructed to endow the scene-irrelevant generality of the encoder towards diverse scenes (i. e., freezing the encoder in the adaptation and testing phases).

Denoising Hyperparameter Optimization +1

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

no code implementations21 May 2023 Shubo Lv, Xiong Wang, Sining Sun, Long Ma, Lei Xie

Real-world complex acoustic environments especially the ones with a low signal-to-noise ratio (SNR) will bring tremendous challenges to a keyword spotting (KWS) system.

Denoising Multi-Task Learning +4

Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond

2 code implementations11 May 2023 Zhu Liu, JinYuan Liu, Guanyao Wu, Long Ma, Xin Fan, Risheng Liu

Recently, multi-modality scene perception tasks, e. g., image fusion and scene understanding, have attracted widespread attention for intelligent vision systems.

Scene Understanding

PAI at SemEval-2023 Task 2: A Universal System for Named Entity Recognition with External Entity Information

1 code implementation10 May 2023 Long Ma, Kai Lu, Tianbo Che, Hailong Huang, Weiguo Gao, Xuan Li

The MultiCoNER II task aims to detect complex, ambiguous, and fine-grained named entities in low-context situations and noisy scenarios like the presence of spelling mistakes and typos for multiple languages.

named-entity-recognition Named Entity Recognition +3

Reporting delays: a widely neglected impact factor in COVID-19 forecasts

no code implementations24 Apr 2023 Long Ma, Piet Van Mieghem, Maksim Kitsak

Motivated by the desire to enhance epidemic forecasts, we develop a statistical framework to detect, uncover, and remove reporting delays in the infectious, recovered, and deceased epidemic time series.

Common Sense Reasoning

DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model

no code implementations16 Mar 2023 Yanzhe Fu, Yueteng Kang, Songjun Cao, Long Ma

In this work, we propose a two-stage knowledge distillation method to solve these two problems: the first step is to make the big and non-streaming teacher model smaller, and the second step is to make it streaming.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer

no code implementations17 Jan 2023 Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, Lei Xie

In this paper, we propose an efficient approach to obtain a high quality contextual list for a unified streaming/non-streaming based E2E model.

Practical Exposure Correction: Great Truths Are Always Simple

no code implementations29 Dec 2022 Long Ma, Tianjiao Ma, Xinwei Xue, Xin Fan, Zhongxuan Luo, Risheng Liu

Improving the visual quality of the given degraded observation by correcting exposure level is a fundamental task in the computer vision community.

Semantic-aware Texture-Structure Feature Collaboration for Underwater Image Enhancement

1 code implementation19 Nov 2022 Di Wang, Long Ma, Risheng Liu, Xin Fan

To address the above limitations, we develop an efficient and compact enhancement network in collaboration with a high-level semantic-aware pretrained model, aiming to exploit its hierarchical feature representation as an auxiliary for the low-level underwater image enhancement.

Image Enhancement object-detection +2

Adjacent Slice Feature Guided 2.5D Network for Pulmonary Nodule Segmentation

no code implementations19 Nov 2022 Xinwei Xue, Gaoyu Wang, Long Ma, Qi Jia, Yi Wang

In this paper, we design an adjacent slice feature fusion model to introduce information from adjacent slices.

Segmentation

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

no code implementations3 Jul 2022 Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Then, during the training of the conversational ASR system, the extractor will be frozen to extract the textual representation of preceding speech, while such representation is used as context fed to the ASR decoder through attention mechanism.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Toward Fast, Flexible, and Robust Low-Light Image Enhancement

1 code implementation CVPR 2022 Long Ma, Tengyu Ma, Risheng Liu, Xin Fan, Zhongxuan Luo

Existing low-light image enhancement techniques are mostly not only difficult to deal with both visual quality and computational efficiency but also commonly invalid in unknown complex scenarios.

Computational Efficiency Face Detection +2

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

1 code implementation22 Feb 2022 Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2. 0 models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement

1 code implementation9 Dec 2021 Long Ma, Risheng Liu, Jiaao Zhang, Xin Fan, Zhongxuan Luo

Further, by sharing an encoder for these two components, we obtain a more lightweight version (SLiteCSDNet for short).

Low-Light Image Enhancement

Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision

1 code implementation9 Dec 2021 Risheng Liu, Long Ma, Tengyu Ma, Xin Fan, Zhongxuan Luo

To partially address above issues, we establish Retinex-inspired Unrolling with Architecture Search (RUAS), a general learning framework, which not only can address low-light enhancement task, but also has the flexibility to handle other more challenging downstream vision applications.

Rolling Shutter Correction

Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

no code implementations15 Sep 2021 Keqi Deng, Songjun Cao, Long Ma

For the former task, a standard deviation constraint loss (SDC-loss) based end-to-end (E2E) architecture is proposed to identify accents under the same language.

Accented Speech Recognition Automatic Speech Recognition +3

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

no code implementations7 Jul 2021 Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

Secondly, a group of geo-specific language models (Geo-LMs) are integrated into our speech recognition system to improve recognition accuracy of long tail and homophone POI.

speech-recognition Speech Recognition

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

no code implementations1 May 2020 Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

Experiments on AISHELL-1 data show that the proposed model, along with the training strategies, improve the character error rate (CER) of MoChA from 8. 96% to 7. 68% on test set.

speech-recognition Speech Recognition

Underexposed Image Correction via Hybrid Priors Navigated Deep Propagation

no code implementations17 Jul 2019 Risheng Liu, Long Ma, Yuxi Zhang, Xin Fan, Zhongxuan Luo

Plenty of experimental results of underexposed image correction demonstrate that our proposed method performs favorably against the state-of-the-art methods on both subjective and objective assessments.

Face Detection Single Image Haze Removal

Task-Oriented Convex Bilevel Optimization with Latent Feasibility

no code implementations6 Jul 2019 Risheng Liu, Long Ma, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang

This paper firstly proposes a convex bilevel optimization paradigm to formulate and optimize popular learning and vision problems in real-world scenarios.

Bilevel Optimization

A Bridging Framework for Model Optimization and Deep Propagation

no code implementations NeurIPS 2018 Risheng Liu, Shichao Cheng, Xiaokun Liu, Long Ma, Xin Fan, Zhongxuan Luo

Different from these existing network based iterations, which often lack theoretical investigations, we provide strict convergence analysis for PODM in the challenging nonconvex and nonsmooth scenarios.

Model Optimization

Task Embedded Coordinate Update: A Realizable Framework for Multivariate Non-convex Optimization

no code implementations5 Nov 2018 Yiyang Wang, Risheng Liu, Long Ma, Xiaoliang Song

Integrating both numerical algorithms and advanced techniques together, TECU is proposed in a unified framework for solving a class of non-convex problems.

Learning Converged Propagations with Deep Prior Ensemble for Image Enhancement

1 code implementation9 Oct 2018 Risheng Liu, Long Ma, Yiyang Wang, Lei Zhang

Enhancing visual qualities of images plays very important roles in various vision and learning applications.

Image Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.