Search Results for author: Long Ma

Found 41 papers, 14 papers with code

PAI at SemEval-2022 Task 11: Name Entity Recognition with Contextualized Entity Representations and Robust Loss Functions

1 code implementation • SemEval (NAACL) 2022 • Long Ma, Xiaorong Jian, Xuan Li

This paper describes our system used in the SemEval-2022 Task 11 Multilingual Complex Named Entity Recognition, achieving 3rd for track 1 on the leaderboard.

Binary Classification named-entity-recognition +2

Paper
Code

Seeing Text in the Dark: Algorithm and Benchmark

no code implementations • 13 Apr 2024 • Chengpei Xu, Hao Fu, Long Ma, Wenjing Jia, Chengqi Zhang, Feng Xia, Xiaoyu Ai, Binghao Li, Wenjie Zhang

Localizing text in low-light environments is challenging due to visual degradations.

Low-Light Image Enhancement

Paper
Add Code

Fast Peer Adaptation with Context-aware Exploration

no code implementations • 4 Feb 2024 • Long Ma, Yuanfei Wang, Fangwei Zhong, Song-Chun Zhu, Yizhou Wang

To do so, it is crucial for the agent to efficiently probe and identify the peer's strategy, as this is the prerequisite for carrying out the best response in adaptation.

Paper
Add Code

DeCoF: Generated Video Detection via Frame Consistency

no code implementations • 3 Feb 2024 • Long Ma, Jiajia Zhang, Hongping Deng, Ningyu Zhang, Yong Liao, Haiyang Yu

The escalating quality of video generated by advanced video generation methods leads to new security challenges in society, which makes generated video detection an urgent research priority.

Video Generation

Paper
Add Code

From Text to Pixels: A Context-Aware Semantic Synergy Solution for Infrared and Visible Image Fusion

no code implementations • 31 Dec 2023 • Xingyuan Li, Yang Zou, JinYuan Liu, Zhiying Jiang, Long Ma, Xin Fan, Risheng Liu

With the rapid progression of deep learning technologies, multi-modality image fusion has become increasingly prevalent in object detection tasks.

Bilevel Optimization Infrared And Visible Image Fusion +2

Paper
Add Code

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing

no code implementations • 18 Nov 2023 • Haoran Li, Long Ma, Yong Liao, Lechao Cheng, Yanbin Hao, Pengyuan Zhou

First, we segment the objects and the background in a multi-object image.

Attribute Object +1

Paper
Add Code

Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation

no code implementations • 7 Sep 2023 • Xiaohan Cui, Long Ma, Tengyu Ma, JinYuan Liu, Xin Fan, Risheng Liu

In this work, we try to arouse the potential of enhancer + detector.

object-detection Object Detection

Paper
Add Code

Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction

no code implementations • 2 Sep 2023 • Gehui Li, JinYuan Liu, Long Ma, Zhiying Jiang, Xin Fan, Risheng Liu

To overcome these limitations, we propose a Macro-Micro-Hierarchical transformer, which consists of a macro attention to capture long-range dependencies, a micro attention to extract local features, and a hierarchical structure for coarse-to-fine correction.

Face Recognition Semantic Segmentation

Paper
Add Code

Improving Misaligned Multi-modality Image Fusion with One-stage Progressive Dense Registration

no code implementations • 22 Aug 2023 • Di Wang, JinYuan Liu, Long Ma, Risheng Liu, Xin Fan

Both stages directly estimate the respective target deformation fields.

Paper
Add Code

PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation

3 code implementations • 8 Aug 2023 • Zhu Liu, JinYuan Liu, Benzhuang Zhang, Long Ma, Xin Fan, Risheng Liu

We first conduct systematic analyses about the components of image fusion, investigating the correlation with segmentation robustness under adversarial perturbations.

Ranked #16 on Thermal Image Segmentation on MFN Dataset

Infrared And Visible Image Fusion Segmentation +2

Paper
Code

Bilevel Generative Learning for Low-Light Vision

1 code implementation • 7 Aug 2023 • Yingchi Liu, Zhu Liu, Long Ma, JinYuan Liu, Xin Fan, Zhongxuan Luo, Risheng Liu

In this study, we propose a generic low-light vision solution by introducing a generative block to convert data from the RAW to the RGB domain.

Bilevel Optimization

Paper
Code

Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation

2 code implementations • ICCV 2023 • JinYuan Liu, Zhu Liu, Guanyao Wu, Long Ma, Risheng Liu, Wei Zhong, Zhongxuan Luo, Xin Fan

Multi-modality image fusion and segmentation play a vital role in autonomous driving and robotic operation.

Ranked #3 on Semantic Segmentation on FMB Dataset

Autonomous Driving Segmentation +2

Paper
Code

Bilevel Fast Scene Adaptation for Low-Light Image Enhancement

1 code implementation • 2 Jun 2023 • Long Ma, Dian Jin, Nan An, JinYuan Liu, Xin Fan, Risheng Liu

A bilevel learning framework is constructed to endow the scene-irrelevant generality of the encoder towards diverse scenes (i. e., freezing the encoder in the adaptation and testing phases).

Denoising Hyperparameter Optimization +1

Paper
Code

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

no code implementations • 21 May 2023 • Shubo Lv, Xiong Wang, Sining Sun, Long Ma, Lei Xie

Real-world complex acoustic environments especially the ones with a low signal-to-noise ratio (SNR) will bring tremendous challenges to a keyword spotting (KWS) system.

Denoising Multi-Task Learning +4

Paper
Add Code

MonoTDP: Twin Depth Perception for Monocular 3D Object Detection in Adverse Scenes

no code implementations • 18 May 2023 • Xingyuan Li, JinYuan Liu, Yixin Lei, Long Ma, Xin Fan, Risheng Liu

3D object detection plays a crucial role in numerous intelligent vision systems.

Monocular 3D Object Detection Object +1

Paper
Add Code

Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond

2 code implementations • 11 May 2023 • Zhu Liu, JinYuan Liu, Guanyao Wu, Long Ma, Xin Fan, Risheng Liu

Recently, multi-modality scene perception tasks, e. g., image fusion and scene understanding, have attracted widespread attention for intelligent vision systems.

Scene Understanding

Paper
Code

PAI at SemEval-2023 Task 2: A Universal System for Named Entity Recognition with External Entity Information

1 code implementation • 10 May 2023 • Long Ma, Kai Lu, Tianbo Che, Hailong Huang, Weiguo Gao, Xuan Li

The MultiCoNER II task aims to detect complex, ambiguous, and fine-grained named entities in low-context situations and noisy scenarios like the presence of spelling mistakes and typos for multiple languages.

named-entity-recognition Named Entity Recognition +3

Paper
Code

Reporting delays: a widely neglected impact factor in COVID-19 forecasts

no code implementations • 24 Apr 2023 • Long Ma, Piet Van Mieghem, Maksim Kitsak

Motivated by the desire to enhance epidemic forecasts, we develop a statistical framework to detect, uncover, and remove reporting delays in the infectious, recovered, and deceased epidemic time series.

Common Sense Reasoning

Paper
Add Code

DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model

no code implementations • 16 Mar 2023 • Yanzhe Fu, Yueteng Kang, Songjun Cao, Long Ma

In this work, we propose a two-stage knowledge distillation method to solve these two problems: the first step is to make the big and non-streaming teacher model smaller, and the second step is to make it streaming.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer

no code implementations • 17 Jan 2023 • Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, Lei Xie

In this paper, we propose an efficient approach to obtain a high quality contextual list for a unified streaming/non-streaming based E2E model.

Paper
Add Code

Practical Exposure Correction: Great Truths Are Always Simple

no code implementations • 29 Dec 2022 • Long Ma, Tianjiao Ma, Xinwei Xue, Xin Fan, Zhongxuan Luo, Risheng Liu

Improving the visual quality of the given degraded observation by correcting exposure level is a fundamental task in the computer vision community.

Paper
Add Code

Semantic-aware Texture-Structure Feature Collaboration for Underwater Image Enhancement

1 code implementation • 19 Nov 2022 • Di Wang, Long Ma, Risheng Liu, Xin Fan

To address the above limitations, we develop an efficient and compact enhancement network in collaboration with a high-level semantic-aware pretrained model, aiming to exploit its hierarchical feature representation as an auxiliary for the low-level underwater image enhancement.

Image Enhancement object-detection +2

Paper
Code

Adjacent Slice Feature Guided 2.5D Network for Pulmonary Nodule Segmentation

no code implementations • 19 Nov 2022 • Xinwei Xue, Gaoyu Wang, Long Ma, Qi Jia, Yi Wang

In this paper, we design an adjacent slice feature fusion model to introduce information from adjacent slices.

Segmentation

Paper
Add Code

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

no code implementations • 3 Jul 2022 • Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Then, during the training of the conversational ASR system, the extractor will be frozen to extract the textual representation of preceding speech, while such representation is used as context fed to the ASR decoder through attention mechanism.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Toward Fast, Flexible, and Robust Low-Light Image Enhancement

1 code implementation • CVPR 2022 • Long Ma, Tengyu Ma, Risheng Liu, Xin Fan, Zhongxuan Luo

Existing low-light image enhancement techniques are mostly not only difficult to deal with both visual quality and computational efficiency but also commonly invalid in unknown complex scenarios.

Computational Efficiency Face Detection +2

397

Paper
Code

A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

no code implementations • 9 Mar 2022 • Yike Zhang, Xiaobing Feng, Yi Liu, Songjun Cao, Long Ma

Automatic speech recognition (ASR) systems used on smart phones or vehicles are usually required to process speech queries from very different domains.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

1 code implementation • 22 Feb 2022 • Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2. 0 models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

Conversational Speech Recognition By Learning Conversation-level Characteristics

no code implementations • 16 Feb 2022 • Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Conversational automatic speech recognition (ASR) is a task to recognize conversational speech including multiple speakers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

no code implementations • 14 Dec 2021 • Keqi Deng, Songjun Cao, Yike Zhang, Long Ma

In our framework, the encoder is initialized with a pretrained AM (wav2vec2. 0).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement

1 code implementation • 9 Dec 2021 • Long Ma, Risheng Liu, Jiaao Zhang, Xin Fan, Zhongxuan Luo

Further, by sharing an encoder for these two components, we obtain a more lightweight version (SLiteCSDNet for short).

Low-Light Image Enhancement

Paper
Code

Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision

1 code implementation • 9 Dec 2021 • Risheng Liu, Long Ma, Tengyu Ma, Xin Fan, Zhongxuan Luo

To partially address above issues, we establish Retinex-inspired Unrolling with Architecture Search (RUAS), a general learning framework, which not only can address low-light enhancement task, but also has the flexibility to handle other more challenging downstream vision applications.

Rolling Shutter Correction

Paper
Code

Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning

no code implementations • 15 Sep 2021 • Songjun Cao, Yueteng Kang, Yanzhe Fu, Xiaoshuo Xu, Sining Sun, Yike Zhang, Long Ma

Under such a framework, the neural network is usually pre-trained with massive unlabeled data and then fine-tuned with limited labeled data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

no code implementations • 15 Sep 2021 • Keqi Deng, Songjun Cao, Long Ma

For the former task, a standard deviation constraint loss (SDC-loss) based end-to-end (E2E) architecture is proposed to identify accents under the same language.

Accented Speech Recognition Automatic Speech Recognition +3

Paper
Add Code

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

no code implementations • 7 Jul 2021 • Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

Secondly, a group of geo-specific language models (Geo-LMs) are integrated into our speech recognition system to improve recognition accuracy of long tail and homophone POI.

speech-recognition Speech Recognition

Paper
Add Code

Retinex-inspired Unrolling with Cooperative Prior Architecture Search for Low-light Image Enhancement

1 code implementation • CVPR 2021 • Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo

Low-light image enhancement plays very important roles in low-level vision field.

Low-Light Image Enhancement Rolling Shutter Correction

Paper
Code

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

no code implementations • 1 May 2020 • Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

Experiments on AISHELL-1 data show that the proposed model, along with the training strategies, improve the character error rate (CER) of MoChA from 8. 96% to 7. 68% on test set.

speech-recognition Speech Recognition

Paper
Add Code

Underexposed Image Correction via Hybrid Priors Navigated Deep Propagation

no code implementations • 17 Jul 2019 • Risheng Liu, Long Ma, Yuxi Zhang, Xin Fan, Zhongxuan Luo

Plenty of experimental results of underexposed image correction demonstrate that our proposed method performs favorably against the state-of-the-art methods on both subjective and objective assessments.

Face Detection Single Image Haze Removal

Paper
Add Code

Task-Oriented Convex Bilevel Optimization with Latent Feasibility

no code implementations • 6 Jul 2019 • Risheng Liu, Long Ma, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang

This paper firstly proposes a convex bilevel optimization paradigm to formulate and optimize popular learning and vision problems in real-world scenarios.

Bilevel Optimization

Paper
Add Code

A Bridging Framework for Model Optimization and Deep Propagation

no code implementations • NeurIPS 2018 • Risheng Liu, Shichao Cheng, Xiaokun Liu, Long Ma, Xin Fan, Zhongxuan Luo

Different from these existing network based iterations, which often lack theoretical investigations, we provide strict convergence analysis for PODM in the challenging nonconvex and nonsmooth scenarios.

Model Optimization

Paper
Add Code

Task Embedded Coordinate Update: A Realizable Framework for Multivariate Non-convex Optimization

no code implementations • 5 Nov 2018 • Yiyang Wang, Risheng Liu, Long Ma, Xiaoliang Song

Integrating both numerical algorithms and advanced techniques together, TECU is proposed in a unified framework for solving a class of non-convex problems.

Paper
Add Code

Learning Converged Propagations with Deep Prior Ensemble for Image Enhancement

1 code implementation • 9 Oct 2018 • Risheng Liu, Long Ma, Yiyang Wang, Lei Zhang

Enhancing visual qualities of images plays very important roles in various vision and learning applications.

Image Enhancement

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.