1 code implementation • 14 May 2024 • Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, Zheng Fang, Weiyan Wang, Jinbao Xue, Yangyu Tao, Jianchen Zhu, Kai Liu, Sihuan Lin, Yifu Sun, Yun Li, Dongdong Wang, Mingtao Chen, Zhichao Hu, Xiao Xiao, Yan Chen, Yuhong Liu, Wei Liu, Di Wang, Yong Yang, Jie Jiang, Qinglin Lu
For fine-grained language understanding, we train a Multimodal Large Language Model to refine the captions of the images.
1 code implementation • 5 Mar 2024 • Jiwen Zhang, Jihao Wu, Yihua Teng, Minghui Liao, Nuo Xu, Xiao Xiao, Zhongyu Wei, Duyu Tang
To address this, this work presents Chain-of-Action-Thought (dubbed CoAT), which takes the description of the previous actions, the current screen, and more importantly the action thinking of what actions should be performed and the outcomes led by the chosen action.
no code implementations • 11 Nov 2023 • Xiaochen Wang, Xiao Xiao, Ruhan Zhang, Xuan Zhang, Taesik Na, Tejaswi Tenneti, Haixun Wang, Fenglong Ma
Efficient and accurate product relevance assessment is critical for user experiences and business success.
1 code implementation • 8 Jul 2023 • Rao Fu, Cheng Wen, Qian Li, Xiao Xiao, Pierre Alliez
This paper proposes BPNet, a novel end-to-end deep learning framework to learn B\'ezier primitive segmentation on 3D point clouds.
no code implementations • 20 Feb 2023 • Jing Xu, Shuo Wang, Na Ying, Xiao Xiao, Jiang Zhang, Yun Cheng, Zhiling Jin, Gangfeng Zhang
Previous GCNs-based methods usually require providing spatial correlation graph structure of observation sites in advance.
no code implementations • 12 Sep 2022 • Yuqing Xie, Taesik Na, Xiao Xiao, Saurav Manchanda, Young Rao, Zhihong Xu, Guanghua Shu, Esther Vasiete, Tejaswi Tenneti, Haixun Wang
To train the model efficiently on noisy data, we propose a self-adversarial learning method and a cascade training method.
1 code implementation • 23 Apr 2022 • Wei Shao, Zhiling Jin, Shuo Wang, Yufan Kang, Xiao Xiao, Hamid Menouar, Zhaofeng Zhang, Junshan Zhang, Flora Salim
To address these issues, we construct new graph models to represent the contextual information of each node and the long-term spatio-temporal data dependency structure.
no code implementations • 12 Jul 2021 • Sumudu Herath, Xiao Xiao, Fehmi Cirak
The trained GPR model encodes the nonlinearities and anisotropies present in the microscale and serves as a material model for the membrane response of the macroscale shell.
1 code implementation • 17 May 2021 • Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer
Unlike the previous OCR agnostic preprocessing techniques, the proposed approach approximates the gradient of a particular OCR engine to train a preprocessor module.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 5 Jun 2020 • Fan Yang, Xiao Xiao
Blur detection is the separation of blurred and clear regions of an image, which is an important and challenging task in computer vision.