Search Results for author: Xiaoyan Sun

Found 29 papers, 8 papers with code

Scene Adaptive Sparse Transformer for Event-based Object Detection

1 code implementation • 2 Apr 2024 • Yansong Peng, Hebei Li, Yueyi Zhang, Xiaoyan Sun, Feng Wu

However, they display inadequate sparsity and adaptability when applied to event-based object detection, since these approaches cannot balance the fine granularity of token-level sparsification and the efficiency of window-based Transformers, leading to reduced performance and efficiency.

Object object-detection +1

Event-assisted Low-Light Video Object Segmentation

no code implementations • 2 Apr 2024 • Hebei Li, Jin Wang, Jiahui Yuan, Yue Li, Wenming Weng, Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

In the realm of video object segmentation (VOS), the challenge of operating under low-light conditions persists, resulting in notably degraded image quality and compromised accuracy when comparing query and memory frames for similarity computation.

Object Semantic Segmentation +2

Graph Relation Distillation for Efficient Biomedical Instance Segmentation

2 code implementations • 12 Jan 2024 • Xiaoyu Liu, Yueyi Zhang, Zhiwei Xiong, Wei Huang, Bo Hu, Xiaoyan Sun, Feng Wu

IGD constructs a graph representing instance features and relations, transferring these two types of knowledge by enforcing instance graph consistency.

Instance Segmentation Knowledge Distillation +2

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

no code implementations • 30 Nov 2023 • Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Chuanxin Tang, Xiaoyan Sun, Chong Luo, Baining Guo

We present MicroCinema, a straightforward yet effective framework for high-quality and coherent text-to-video generation.

Text-to-Image Generation Text-to-Video Generation +1

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

no code implementations • 28 Nov 2023 • Yuqing Wen, Yucheng Zhao, Yingfei Liu, Fan Jia, Yanhui Wang, Chong Luo, Chi Zhang, Tiancai Wang, Xiaoyan Sun, Xiangyu Zhang

This work notably propels the field of autonomous driving by effectively augmenting the training dataset used for advanced BEV perception techniques.

Autonomous Driving Video Generation

GET: Group Event Transformer for Event-Based Vision

1 code implementation • ICCV 2023 • Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun, Feng Wu

Event cameras are a type of novel neuromorphic sensor that has been gaining increasing attention.

Event-based vision object-detection +1

EGVD: Event-Guided Video Deraining

1 code implementation • 29 Sep 2023 • Yueyi Zhang, Jin Wang, Wenming Weng, Xiaoyan Sun, Zhiwei Xiong

In this paper, we approach video deraining by employing an event camera.

Motion Detection Rain Removal

Deep Multi-Threshold Spiking-UNet for Image Processing

1 code implementation • 20 Jul 2023 • Hebei Li, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

Furthermore, we adopt a flow-based training method to fine-tune the converted models, reducing time steps while preserving performance.

Denoising Image Segmentation +1

Image Captioning with Multi-Context Synthetic Data

no code implementations • 29 May 2023 • Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

This potential can be harnessed to create synthetic image-text pairs for training captioning models.

Image Captioning Language Modelling +2

Paint by Example: Exemplar-based Image Editing with Diffusion Models

2 code implementations • CVPR 2023 • Binxin Yang, Shuyang Gu, Bo Zhang, Ting Zhang, Xuejin Chen, Xiaoyan Sun, Dong Chen, Fang Wen

Language-guided image editing has achieved great success recently.

Image Generation Image Manipulation

Semantics-Preserving Sketch Embedding for Face Generation

no code implementations • 23 Nov 2022 • Binxin Yang, Xuejin Chen, Chaoqun Wang, Chi Zhang, Zihan Chen, Xiaoyan Sun

With a semantic feature matching loss for effective semantic supervision, our sketch embedding precisely conveys the semantics in the input sketches to the synthesized images.

Face Generation Image-to-Image Translation

Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation

no code implementations • CVPR 2022 • Guanting Dong, Yueyi Zhang, HanLin Li, Xiaoyan Sun, Zhiwei Xiong

Previous LiDAR scene flow estimation methods, especially recurrent neural networks, usually suffer from structure distortion in challenging cases, such as sparse reflection and motion occlusions.

Autonomous Driving Scene Flow Estimation

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

no code implementations • NeurIPS 2021 • Chaoqun Wang, Shaobo Min, Xuejin Chen, Xiaoyan Sun, Houqiang Li

This enables DPPN to produce visual representations with accurate attribute localization ability, which benefits the semantic-visual alignment and representation transferability.

Attribute Generalized Zero-Shot Learning

Adaptive Domain-Specific Normalization for Generalizable Person Re-Identification

no code implementations • 7 May 2021 • Jiawei Liu, Zhipeng Huang, Kecheng Zheng, Dong Liu, Xiaoyan Sun, Zheng-Jun Zha

It describes an unseen target domain as a combination of the known source domains, and explicitly learns a domain-specific representation with the target distribution to improve the model's generalization through a meta-learning pipeline.

Generalizable Person Re-identification Meta-Learning

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

no code implementations • 5 Apr 2021 • Chaoqun Wang, Xuejin Chen, Shaobo Min, Xiaoyan Sun, Houqiang Li

First, DCEN leverages task labels to cluster representations of the same semantic category by cross-modal contrastive learning and exploring semantic-visual complementarity.

Contrastive Learning Generalized Zero-Shot Learning

VAE^2: Preventing Posterior Collapse of Variational Video Predictions in the Wild

no code implementations • 28 Jan 2021 • Yizhou Zhou, Chong Luo, Xiaoyan Sun, Zheng-Jun Zha, Wenjun Zeng

We believe that VAE$^2$ is also applicable to other stochastic sequence prediction problems where the training data lack stochasticity.

Video Prediction

Generating Comprehensive Data with Protocol Fuzzing for Applying Deep Learning to Detect Network Attacks

no code implementations • 23 Dec 2020 • Qingtian Zou, Anoop Singhal, Xiaoyan Sun, Peng Liu

Network attacks have become a major security concern for organizations worldwide and have also drawn attention in academia.

Cryptography and Security

Spatiotemporal Fusion in 3D CNNs: A Probabilistic View

no code implementations • CVPR 2020 • Yizhou Zhou, Xiaoyan Sun, Chong Luo, Zheng-Jun Zha, Wen-Jun Zeng

Based on the probability space, we further generate new fusion strategies which achieve the state-of-the-art performance on four well-known action recognition datasets.

Action Recognition In Videos Temporal Action Localization

Tracking by Instance Detection: A Meta-Learning Approach

no code implementations • CVPR 2020 • Guangting Wang, Chong Luo, Xiaoyan Sun, Zhiwei Xiong, Wen-Jun Zeng

We propose a principled three-step approach to build a high-performance tracker.

Domain Adaptation Meta-Learning +2

Posterior-Guided Neural Architecture Search

1 code implementation • 23 Jun 2019 • Yizhou Zhou, Xiaoyan Sun, Chong Luo, Zheng-Jun Zha, Wen-Jun Zeng

Accordingly, a hybrid network representation is presented which enables us to leverage the Variational Dropout so that the approximation of the posterior distribution becomes fully gradient-based and highly efficient.

Image Classification Neural Architecture Search

Communication-Efficient Federated Deep Learning with Asynchronous Model Update and Temporally Weighted Aggregation

no code implementations • 18 Mar 2019 • Yang Chen, Xiaoyan Sun, Yaochu Jin

The proposed algorithm is empirically evaluated on two datasets with different deep neural networks.

Federated Learning

Quality-Gated Convolutional LSTM for Enhancing Compressed Video

1 code implementation • 11 Mar 2019 • Ren Yang, Xiaoyan Sun, Mai Xu, Wen-Jun Zeng

The past decade has witnessed great success in applying deep learning to enhance the quality of compressed video.

Temporal-Spatial Mapping for Action Recognition

no code implementations • 11 Sep 2018 • Xiaolin Song, Cuiling Lan, Wen-Jun Zeng, Junliang Xing, Jingyu Yang, Xiaoyan Sun

We propose a video level 2D feature representation by transforming the convolutional features of all frames to a 2D feature map, referred to as VideoMap.

Ranked #51 on Action Recognition on UCF101

Action Recognition Image Classification +3

MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition

no code implementations • CVPR 2018 • Yizhou Zhou, Xiaoyan Sun, Zheng-Jun Zha, Wen-Jun Zeng

Recent attempts use 3D convolutional neural networks (CNNs) to explore spatio-temporal information for human action recognition.

Action Recognition Temporal Action Localization

Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph

no code implementations • 13 Jun 2016 • Jiaying Liu, Wenhan Yang, Xiaoyan Sun, Wen-Jun Zeng

With the rapid development of social network and multimedia technology, customized image and video stylization has been widely used for various social-media applications.

Style Transfer Superpixels

TenSR: Multi-Dimensional Tensor Sparse Representation

no code implementations • CVPR 2016 • Na Qi, Yunhui Shi, Xiaoyan Sun, Bao-Cai Yin

In this paper, we propose a new sparse model TenSR based on tensor for MD data representation along with the corresponding MD sparse coding and MD dictionary learning algorithms.

Dictionary Learning

MARLow: A Joint Multiplanar Autoregressive and Low-Rank Approach for Image Completion

no code implementations • 3 May 2016 • Mading Li, Jiaying Liu, Zhiwei Xiong, Xiaoyan Sun, Zongming Guo

In this paper, we propose a novel multiplanar autoregressive (AR) model to exploit the correlation in cross-dimensional planes of a similar patch group collected in an image, which has long been neglected by previous AR models.

Separable Kernel for Image Deblurring

no code implementations • CVPR 2014 • Lu Fang, Haifeng Liu, Feng Wu, Xiaoyan Sun, Houqiang Li

In this paper, we approach the image deblurring problem from a completely new perspective by proposing a separable kernel to represent the inherent properties of the camera and scene system.

Deblurring Image Deblurring

CID: Combined Image Denoising in Spatial and Frequency Domains Using Web Images

no code implementations • CVPR 2014 • Huanjing Yue, Xiaoyan Sun, Jingyu Yang, Feng Wu

Second, to handle heavy noise, we further propose using the denoising image to improve image registration of the retrieved Web images, 3D cube building, and the estimation of filtering parameters in the frequency domain.

Image Denoising Image Registration
