no code implementations • 25 Apr 2024 • Ao Xiang, Jingyu Zhang, Qin Yang, Liyang Wang, Yu Cheng
With the development and widespread application of digital image processing technology, image splicing has become a common method of image manipulation, raising numerous security and legal issues.
no code implementations • 19 Apr 2024 • Danqing Ma, Meng Wang, Ao Xiang, Zongqing Qi, Qin Yang
This study proposes a multi-modal fusion framework Multitrans based on the Transformer architecture and self-attention mechanism.
no code implementations • 10 Apr 2024 • Jingyu Zhang, Ao Xiang, Yu Cheng, Qin Yang, Liyang Wang
With the rapid advancement of artificial intelligence technology, AI-enabled image recognition has emerged as a potent tool for addressing challenges in traditional environmental monitoring.
no code implementations • 13 Mar 2024 • Ao Xiang, Zongqing Qi, Han Wang, Qin Yang, Danqing Ma
This paper introduces a new multi-modal model based on the Transformer architecture and tensor product fusion strategy, combining BERT's text vectors and ViT's image vectors to classify students' psychological conditions, with an accuracy of 93. 65%.
no code implementations • 13 Mar 2024 • Zongqing Qi, Danqing Ma, Jingyu Xu, Ao Xiang, Hedi Qu
In recent years, there have been frequent incidents of foreign objects intruding into railway and Airport runways.
no code implementations • 22 Jan 2018 • Boonchoo Thapana, Ao Xiang, He Qing
Furthermore, we adopt an efficient union-find algorithm to maintain the clustering information in order to reduce redundancies in the merging.