Search Results for author: Xiaoyan Sun

Found 29 papers, 8 papers with code

Scene Adaptive Sparse Transformer for Event-based Object Detection

1 code implementation2 Apr 2024 Yansong Peng, Hebei Li, Yueyi Zhang, Xiaoyan Sun, Feng Wu

However, they display inadequate sparsity and adaptability when applied to event-based object detection, since these approaches cannot balance the fine granularity of token-level sparsification and the efficiency of window-based Transformers, leading to reduced performance and efficiency.

Object object-detection +1

Event-assisted Low-Light Video Object Segmentation

no code implementations2 Apr 2024 Hebei Li, Jin Wang, Jiahui Yuan, Yue Li, Wenming Weng, Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

In the realm of video object segmentation (VOS), the challenge of operating under low-light conditions persists, resulting in notably degraded image quality and compromised accuracy when comparing query and memory frames for similarity computation.

Object Semantic Segmentation +2

Graph Relation Distillation for Efficient Biomedical Instance Segmentation

2 code implementations12 Jan 2024 Xiaoyu Liu, Yueyi Zhang, Zhiwei Xiong, Wei Huang, Bo Hu, Xiaoyan Sun, Feng Wu

IGD constructs a graph representing instance features and relations, transferring these two types of knowledge by enforcing instance graph consistency.

Instance Segmentation Knowledge Distillation +2

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

no code implementations28 Nov 2023 Yuqing Wen, Yucheng Zhao, Yingfei Liu, Fan Jia, Yanhui Wang, Chong Luo, Chi Zhang, Tiancai Wang, Xiaoyan Sun, Xiangyu Zhang

This work notably propels the field of autonomous driving by effectively augmenting the training dataset used for advanced BEV perception techniques.

Autonomous Driving Video Generation

Deep Multi-Threshold Spiking-UNet for Image Processing

1 code implementation20 Jul 2023 Hebei Li, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

Furthermore, we adopt a flow-based training method to fine-tune the converted models, reducing time steps while preserving performance.

Denoising Image Segmentation +1

Image Captioning with Multi-Context Synthetic Data

no code implementations29 May 2023 Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

This potential can be harnessed to create synthetic image-text pairs for training captioning models.

Image Captioning Language Modelling +2

Semantics-Preserving Sketch Embedding for Face Generation

no code implementations23 Nov 2022 Binxin Yang, Xuejin Chen, Chaoqun Wang, Chi Zhang, Zihan Chen, Xiaoyan Sun

With a semantic feature matching loss for effective semantic supervision, our sketch embedding precisely conveys the semantics in the input sketches to the synthesized images.

Face Generation Image-to-Image Translation

Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation

no code implementations CVPR 2022 Guanting Dong, Yueyi Zhang, HanLin Li, Xiaoyan Sun, Zhiwei Xiong

Previous LiDAR scene flow estimation methods, especially recurrent neural networks, usually suffer from structure distortion in challenging cases, such as sparse reflection and motion occlusions.

Autonomous Driving Scene Flow Estimation

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

no code implementations NeurIPS 2021 Chaoqun Wang, Shaobo Min, Xuejin Chen, Xiaoyan Sun, Houqiang Li

This enables DPPN to produce visual representations with accurate attribute localization ability, which benefits the semantic-visual alignment and representation transferability.

Attribute Generalized Zero-Shot Learning

Adaptive Domain-Specific Normalization for Generalizable Person Re-Identification

no code implementations7 May 2021 Jiawei Liu, Zhipeng Huang, Kecheng Zheng, Dong Liu, Xiaoyan Sun, Zheng-Jun Zha

It describes unseen target domain as a combination of the known source ones, and explicitly learns domain-specific representation with target distribution to improve the model's generalization by a meta-learning pipeline.

Generalizable Person Re-identification Meta-Learning

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

no code implementations5 Apr 2021 Chaoqun Wang, Xuejin Chen, Shaobo Min, Xiaoyan Sun, Houqiang Li

First, DCEN leverages task labels to cluster representations of the same semantic category by cross-modal contrastive learning and exploring semantic-visual complementarity.

Contrastive Learning Generalized Zero-Shot Learning

VAE^2: Preventing Posterior Collapse of Variational Video Predictions in the Wild

no code implementations28 Jan 2021 Yizhou Zhou, Chong Luo, Xiaoyan Sun, Zheng-Jun Zha, Wenjun Zeng

We believe that VAE$^2$ is also applicable to other stochastic sequence prediction problems where training data are lack of stochasticity.

Video Prediction

Generating Comprehensive Data with Protocol Fuzzing for Applying Deep Learning to Detect Network Attacks

no code implementations23 Dec 2020 Qingtian Zou, Anoop Singhal, Xiaoyan Sun, Peng Liu

Network attacks have become a major security concern for organizations worldwide and have also drawn attention in the academics.

Cryptography and Security

Spatiotemporal Fusion in 3D CNNs: A Probabilistic View

no code implementations CVPR 2020 Yizhou Zhou, Xiaoyan Sun, Chong Luo, Zheng-Jun Zha, Wen-Jun Zeng

Based on the probability space, we further generate new fusion strategies which achieve the state-of-the-art performance on four well-known action recognition datasets.

Action Recognition In Videos Temporal Action Localization

Posterior-Guided Neural Architecture Search

1 code implementation23 Jun 2019 Yizhou Zhou, Xiaoyan Sun, Chong Luo, Zheng-Jun Zha, Wen-Jun Zeng

Accordingly, a hybrid network representation is presented which enables us to leverage the Variational Dropout so that the approximation of the posterior distribution becomes fully gradient-based and highly efficient.

Image Classification Neural Architecture Search

Quality-Gated Convolutional LSTM for Enhancing Compressed Video

1 code implementation11 Mar 2019 Ren Yang, Xiaoyan Sun, Mai Xu, Wen-Jun Zeng

The past decade has witnessed great success in applying deep learning to enhance the quality of compressed video.

Temporal-Spatial Mapping for Action Recognition

no code implementations11 Sep 2018 Xiaolin Song, Cuiling Lan, Wen-Jun Zeng, Junliang Xing, Jingyu Yang, Xiaoyan Sun

We propose a video level 2D feature representation by transforming the convolutional features of all frames to a 2D feature map, referred to as VideoMap.

Action Recognition Image Classification +3

MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition

no code implementations CVPR 2018 Yizhou Zhou, Xiaoyan Sun, Zheng-Jun Zha, Wen-Jun Zeng

Recent attempts use 3D convolutional neural networks (CNNs) to explore spatio-temporal information for human action recognition.

Action Recognition Temporal Action Localization

Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph

no code implementations13 Jun 2016 Jiaying Liu, Wenhan Yang, Xiaoyan Sun, Wen-Jun Zeng

With the rapid development of social network and multimedia technology, customized image and video stylization has been widely used for various social-media applications.

Style Transfer Superpixels

TenSR: Multi-Dimensional Tensor Sparse Representation

no code implementations CVPR 2016 Na Qi, Yunhui Shi, Xiaoyan Sun, Bao-Cai Yin

In this paper, we propose a new sparse model TenSR based on tensor for MD data representation along with the corresponding MD sparse coding and MD dictionary learning algorithms.

Dictionary Learning

MARLow: A Joint Multiplanar Autoregressive and Low-Rank Approach for Image Completion

no code implementations3 May 2016 Mading Li, Jiaying Liu, Zhiwei Xiong, Xiaoyan Sun, Zongming Guo

In this paper, we propose a novel multiplanar autoregressive (AR) model to exploit the correlation in cross-dimensional planes of a similar patch group collected in an image, which has long been neglected by previous AR models.

Separable Kernel for Image Deblurring

no code implementations CVPR 2014 Lu Fang, Haifeng Liu, Feng Wu, Xiaoyan Sun, Houqiang Li

In this paper, we deal with the image deblurring problem in a completely new perspective by proposing separable kernel to represent the inherent properties of the camera and scene system.

Deblurring Image Deblurring

CID: Combined Image Denoising in Spatial and Frequency Domains Using Web Images

no code implementations CVPR 2014 Huanjing Yue, Xiaoyan Sun, Jingyu Yang, Feng Wu

Second, to handle heavy noise, we further propose using the denoising image to improve image registration of the retrieved Web images, 3D cube building, and the estimation of filtering parameters in the frequency domain.

Image Denoising Image Registration

Cannot find the paper you are looking for? You can Submit a new open access paper.