no code implementations • 9 Feb 2024 • Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, QiXing Huang, Li Erran Li
By combining natural language understanding, generation capabilities, and breadth of knowledge of large language models with image perception, recent large vision language models (LVLMs) have shown unprecedented visual reasoning capabilities.
no code implementations • 5 Jun 2023 • Siming Yan, Chen Song, Youkang Kong, QiXing Huang
Different from the popular practice of predicting 2D features first and then obtaining 3D features through dimensionality lifting, our approach directly uses a 3D network for feature extraction.
no code implementations • 14 Apr 2023 • Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang
Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.
1 code implementation • ICCV 2023 • Siming Yan, Zhenpei Yang, Haoxiang Li, Chen Song, Li Guan, Hao Kang, Gang Hua, QiXing Huang
The most popular and accessible 3D representation, i. e., point clouds, involves discrete samples of the underlying continuous 3D surface.
Ranked #5 on 3D Point Cloud Linear Classification on ModelNet40 (using extra training data)
3D Point Cloud Classification 3D Point Cloud Linear Classification +3
1 code implementation • ICCV 2021 • Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, QiXing Huang
This task is challenging because 3D scenes exhibit diverse patterns, ranging from continuous ones, such as object sizes and the relative poses between pairs of shapes, to discrete patterns, such as occurrence and co-occurrence of objects with symmetrical relationships.
1 code implementation • ICCV 2021 • Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, QiXing Huang
This paper introduces HPNet, a novel deep-learning approach for segmenting a 3D shape represented as a point cloud into primitive patches.
1 code implementation • CVPR 2020 • Zhenpei Yang, Siming Yan, Qi-Xing Huang
In this paper, we introduce a novel RGB-D based relative pose estimation approach that is suitable for small-overlapping or non-overlapping scans and can output multiple relative poses.
no code implementations • 22 Dec 2019 • Siming Yan, Xuyang Fang, Bowen Xiao, Harold Rockwell, Yimeng Zhang, Tai Sing Lee
The abundant recurrent horizontal and feedback connections in the primate visual cortex are thought to play an important role in bringing global and semantic contextual information to early visual areas during perceptual inference, helping to resolve local ambiguity and fill in missing details.
no code implementations • 20 Feb 2018 • Siming Yan, Feng Shi, Yu-Hua Chen, Damini Dey, Sang-Eun Lee, Hyuk-Jae Chang, Debiao Li, Yibin Xie
Coronary calcium causes beam hardening and blooming artifacts on cardiac computed tomography angiography (CTA) images, which lead to overestimation of lumen stenosis and reduction of diagnostic specificity.
no code implementations • IJCNLP 2017 • Liunian Li, Xiaojun Wan, Jin-Ge Yao, Siming Yan
In this work we study the challenging task of automatically constructing essays for Chinese college entrance examination where the topic is specified in advance.