Search Results for author: Yinjie Lei

Found 33 papers, 13 papers with code

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

no code implementations24 Oct 2023 Yinjie Lei, Zixuan Wang, Feng Chen, Guoqing Wang, Peng Wang, Yang Yang

Multi-modal 3D scene understanding has gained considerable attention due to its wide applications in many areas, such as autonomous driving and human-computer interaction.

Autonomous Driving Scene Understanding

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

1 code implementation6 Sep 2023 Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen

Moreover, we argue that object localization and description generation require different levels of scene understanding, which could be challenging for a shared set of queries to capture.

3D dense captioning Caption Generation +4

Unsupervised Domain Adaptation via Domain-Adaptive Diffusion

no code implementations26 Aug 2023 Duo Peng, Qiuhong Ke, Yinjie Lei, Jun Liu

Unsupervised Domain Adaptation (UDA) is quite challenging due to the large distribution discrepancy between the source domain and the target domain.

Unsupervised Domain Adaptation

Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches

1 code implementation ICCV 2023 Xin Lin, Chao Ren, Xiao Liu, Jie Huang, Yinjie Lei

Although unsupervised approaches based on generative adversarial networks offer a promising solution for denoising without paired datasets, they are difficult in surpassing the performance limitations of conventional GAN-based unsupervised frameworks without significantly modifying existing structures or increasing the computational complexity of denoisers.

Image Denoising

End-to-End 3D Dense Captioning with Vote2Cap-DETR

1 code implementation CVPR 2023 Sijin Chen, Hongyuan Zhu, Xin Chen, Yinjie Lei, Tao Chen, Gang Yu

Compared with prior arts, our framework has several appealing advantages: 1) Without resorting to numerous hand-crafted components, our method is based on a full transformer encoder-decoder architecture with a learnable vote query driven object decoder, and a caption decoder that produces the dense captions in a set-prediction manner.

3D dense captioning Dense Captioning +1

Zero-Shot Point Cloud Segmentation by Semantic-Visual Aware Synthesis

1 code implementation ICCV 2023 Yuwei Yang, Munawar Hayat, Zhao Jin, Hongyuan Zhu, Yinjie Lei

Given only the class-level semantic information for unseen objects, we strive to enhance the correspondence, alignment and consistency between the visual and semantic spaces, to synthesise diverse, generic and transferable visual features.

Point Cloud Segmentation Segmentation +2

Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation

1 code implementation CVPR 2023 Yuwei Yang, Munawar Hayat, Zhao Jin, Chao Ren, Yinjie Lei

Despite the significant recent progress made on 3D point cloud semantic segmentation, the current methods require training data for all classes at once, and are not suitable for real-life scenarios where new categories are being continuously discovered.

Class-Incremental Semantic Segmentation

Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training

1 code implementation CVPR 2023 Zhao Jin, Munawar Hayat, Yuwei Yang, Yulan Guo, Yinjie Lei

The current approaches for 3D visual reasoning are task-specific, and lack pre-training methods to learn generic representations that can transfer across various tasks.

3D dense captioning Dense Captioning +3

Pretrained Language Encoders are Natural Tagging Frameworks for Aspect Sentiment Triplet Extraction

no code implementations20 Aug 2022 Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen, Yongqi Tong

Existing works usually formulate the span detection as a 1D token tagging problem, and model the sentiment recognition with a 2D tagging matrix of token pairs.

Aspect Sentiment Triplet Extraction Inductive Bias

Semantic-Aware Domain Generalized Segmentation

1 code implementation CVPR 2022 Duo Peng, Yinjie Lei, Munawar Hayat, Yulan Guo, Wen Li

In this paper, we address domain generalized semantic segmentation, where a segmentation model is trained to be domain-invariant without using any target domain data.

Domain Generalization Segmentation +1

Box2Seg: Learning Semantics of 3D Point Clouds with Box-Level Supervision

no code implementations9 Jan 2022 Yan Liu, Qingyong Hu, Yinjie Lei, Kai Xu, Jonathan Li, Yulan Guo

In this paper, we introduce a neural architecture, termed Box2Seg, to learn point-level semantics of 3D point clouds with bounding box-level supervision.

Semantic Segmentation

Overcome Anterograde Forgetting with Cycled Memory Networks

no code implementations4 Dec 2021 Jian Peng, Dingqi Ye, Bo Tang, Yinjie Lei, Yu Liu, Haifeng Li

This work proposes a general framework named Cycled Memory Networks (CMN) to address the anterograde forgetting in neural networks for lifelong learning.

Transfer Learning

Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling

no code implementations3 Dec 2021 Lianjie Jia, Chenyang Yu, Xiehao Ye, Tianyu Yan, Yinjie Lei, Pingping Zhang

To generate high-quality pseudo-labels and mitigate the impact of clustering errors, we propose a novel clustering relationship modeling framework for unsupervised person Re-ID.

Clustering Contrastive Learning +1

Reviewing continual learning from the perspective of human-level intelligence

no code implementations23 Nov 2021 Yifan Chang, Wenbo Li, Jian Peng, Bo Tang, Yu Kang, Yinjie Lei, Yuanmiao Gui, Qing Zhu, Yu Liu, Haifeng Li

Different from previous reviews that mainly focus on the catastrophic forgetting phenomenon in CL, this paper surveys CL from a more macroscopic perspective based on the Stability Versus Plasticity mechanism.

Continual Learning

Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation

no code implementations5 Aug 2021 Duo Peng, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Jun Liu

In this work, we propose two simple yet effective texture randomization mechanisms, Global Texture Randomization (GTR) and Local Texture Randomization (LTR), for Domain Generalization based SRSS.

Domain Generalization Segmentation +1

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation15 Jul 2021 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Michael Ng

The effectiveness and superior performance of our approach are validated through comprehensive experiments in a range of applications.

image smoothing

Hierarchical Paired Channel Fusion Network for Street Scene Change Detection

no code implementations19 Oct 2020 Yinjie Lei, Duo Peng, Pingping Zhang, Qiuhong Ke, Haifeng Li

Based on the MPFL strategy, our framework achieves a novel approach to adapt to the scale and location diversities of the scene change regions.

Change Detection Scene Change Detection

Contextualize Knowledge Bases with Transformer for End-to-end Task-Oriented Dialogue Systems

no code implementations EMNLP 2021 Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen

Incorporating knowledge bases (KB) into end-to-end task-oriented dialogue systems is challenging, since it requires to properly represent the entity of KB, which is associated with its KB context and dialogue context.

Response Generation Task-Oriented Dialogue Systems

Semantic Context Encoding for Accurate 3D Point Cloud Segmentation

no code implementations IEEE 2020 Hao liu, Y ulan Guo, Y anni Ma, Yinjie Lei, and Gongjian Wen

In this paper, we propose a simple yet effective Point Context Encoding (PointCE) module to capture semantic contexts of a point cloud and adaptively highlight intermediate feature maps.

Image Segmentation Point Cloud Segmentation +2

Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks

no code implementations ECCV 2020 Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei

Most existing crowd counting systems rely on the availability of the object location annotation which can be expensive to obtain.

Crowd Counting

Global context reasoning for semantic segmentation of 3d point clouds

no code implementations IEEE 2020 Y anni Ma, Y ulan Guo, Hao liu, Yinjie Lei

In this paper, we propose a Point Global Context Reasoning (PointGCR) module to capture global contextual information along the channel dimension.

Segmentation Semantic Segmentation

Towards Using Count-level Weak Supervision for Crowd Counting

no code implementations29 Feb 2020 Yinjie Lei, Yan Liu, Pingping Zhang, Lingqiao Liu

Most existing crowd counting methods require object location-level annotation, i. e., placing a dot at the center of an object.

Crowd Counting

Overcoming Long-term Catastrophic Forgetting through Adversarial Neural Pruning and Synaptic Consolidation

1 code implementation19 Dec 2019 Jian Peng, Bo Tang, Hao Jiang, Zhuo Li, Yinjie Lei, Tao Lin, Haifeng Li

It is due to two facts: first, as the model learns more tasks, the intersection of the low-error parameter subspace satisfying for these tasks becomes smaller or even does not exist; second, when the model learns a new task, the cumulative error keeps increasing as the model tries to protect the parameter configuration of previous tasks from interference.

Image Classification

Improving Distant Supervised Relation Extraction by Dynamic Neural Network

no code implementations15 Nov 2019 Yanjie Gou, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Xi Peng

To account for this style shift, the model should adjust its parameters in accordance with entity types.

Relation Relation Extraction

Deep Multiphase Level Set for Scene Parsing

no code implementations8 Oct 2019 Pingping Zhang, Wei Liu, Yinjie Lei, Hongyu Wang, Huchuan Lu

The proposed method consists of three modules, i. e., recurrent FCNs, adaptive multiphase level set, and deeply supervised learning.

Image Segmentation Scene Parsing +1

Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion

no code implementations ICCV 2019 Pingping Zhang, Wei Liu, Yinjie Lei, Huchuan Lu, Xiaoyun Yang

To address these issues, in this work we propose a novel deep learning framework, named Cascaded Context Pyramid Network (CCPNet), to jointly infer the occupancy and semantic labels of a volumetric 3D scene from a single depth image.

Ranked #5 on 3D Semantic Scene Completion on NYUv2 (using extra training data)

3D Semantic Scene Completion

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation23 Jul 2019 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Ian Reid

In this paper, a non-convex non-smooth optimization framework is proposed to achieve diverse smoothing natures where even contradictive smoothing behaviors can be achieved.

image smoothing

Mask-aware networks for crowd counting

no code implementations18 Dec 2018 Shengqin Jiang, Xiaobo Lu, Yinjie Lei, Lingqiao Liu

Our rationale is that the mask prediction could be better modeled as a binary segmentation problem and the difficulty of estimating the density could be reduced if the mask is known.

Crowd Counting Object

Non-rigid Object Tracking via Deep Multi-scale Spatial-temporal Discriminative Saliency Maps

no code implementations22 Feb 2018 Pingping Zhang, Wei Liu, Dong Wang, Yinjie Lei, Hongyu Wang, Chunhua Shen, Huchuan Lu

Extensive experiments demonstrate that the proposed algorithm achieves competitive performance in both saliency detection and visual tracking, especially outperforming other related trackers on the non-rigid object tracking datasets.

Object Object Tracking +2

Cannot find the paper you are looking for? You can Submit a new open access paper.