1 code implementation • 28 Mar 2024 • Chenyang Liu, Keyan Chen, Haotian Zhang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
The Change-Agent integrates a multi-level change interpretation (MCI) model as the eyes and a large language model (LLM) as the brain.
no code implementations • 17 Jan 2024 • Zili Liu, Hao Chen, Wenyuan Li, Keyan Chen, Zipeng Qi, Chenyang Liu, Zhengxia Zou, Zhenwei Shi
This paper is the first to consider the impact of label noise on the detection of clouds and snow in remote sensing images.
no code implementations • 23 Dec 2023 • Chenyang Liu, Keyan Chen, Zipeng Qi, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
The existing methods for Remote Sensing Image Change Captioning (RSICC) perform well in simple scenes but exhibit poorer performance in complex scenes.
1 code implementation • 30 Nov 2023 • Zipeng Qi, Guoxi Huang, Zebin Huang, Qin Guo, Jinwen Chen, Junyu Han, Jian Wang, Gang Zhang, Lufei Liu, Errui Ding, Jingdong Wang
The LRDiff framework constructs an image-rendering process with multiple layers, each of which applies the vision guidance to instructively estimate the denoising direction for a single object.
no code implementations • 14 Sep 2023 • Zipeng Qi, xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang
Generating realistic talking faces is a complex and widely discussed task with numerous applications.
no code implementations • 15 Mar 2023 • Zipeng Qi, Hao Chen, Chenyang Liu, Zhenwei Shi, Zhengxia Zou
In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene based on multi-view images.
1 code implementation • 1 Mar 2023 • Chenyang Liu, Jiajun Yang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
To sufficiently utilize the extracted multi-scale features for captioning, we propose a scale-aware reinforcement (SR) module and combine it with the Transformer decoding layer to progressively utilize the features from different PDP layers.
5 code implementations • 27 Feb 2021 • Hao Chen, Zipeng Qi, Zhenwei Shi
To achieve this, we express the bitemporal image into a few tokens, and use a transformer encoder to model contexts in the compact token-based space-time.
Building change detection for remote sensing images Change Detection