Search Results for author: Siming Yan

Found 10 papers, 4 papers with code

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

no code implementations9 Feb 2024 Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, QiXing Huang, Li Erran Li

By combining natural language understanding, generation capabilities, and breadth of knowledge of large language models with image perception, recent large vision language models (LVLMs) have shown unprecedented visual reasoning capabilities.

Hallucination Natural Language Understanding +2

Multi-View Representation is What You Need for Point-Cloud Pre-Training

no code implementations5 Jun 2023 Siming Yan, Chen Song, Youkang Kong, QiXing Huang

Different from the popular practice of predicting 2D features first and then obtaining 3D features through dimensionality lifting, our approach directly uses a 3D network for feature extraction.

3D Object Detection 3D Shape Classification +4

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

no code implementations14 Apr 2023 Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

1 code implementation ICCV 2021 Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, QiXing Huang

This task is challenging because 3D scenes exhibit diverse patterns, ranging from continuous ones, such as object sizes and the relative poses between pairs of shapes, to discrete patterns, such as occurrence and co-occurrence of objects with symmetrical relationships.

Attribute

HPNet: Deep Primitive Segmentation Using Hybrid Representations

1 code implementation ICCV 2021 Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, QiXing Huang

This paper introduces HPNet, a novel deep-learning approach for segmenting a 3D shape represented as a point cloud into primitive patches.

Clustering Segmentation

Extreme Relative Pose Network under Hybrid Representations

1 code implementation CVPR 2020 Zhenpei Yang, Siming Yan, Qi-Xing Huang

In this paper, we introduce a novel RGB-D based relative pose estimation approach that is suitable for small-overlapping or non-overlapping scans and can output multiple relative poses.

Pose Estimation Translation

Recurrent Feedback Improves Feedforward Representations in Deep Neural Networks

no code implementations22 Dec 2019 Siming Yan, Xuyang Fang, Bowen Xiao, Harold Rockwell, Yimeng Zhang, Tai Sing Lee

The abundant recurrent horizontal and feedback connections in the primate visual cortex are thought to play an important role in bringing global and semantic contextual information to early visual areas during perceptual inference, helping to resolve local ambiguity and fill in missing details.

Calcium Removal From Cardiac CT Images Using Deep Convolutional Neural Network

no code implementations20 Feb 2018 Siming Yan, Feng Shi, Yu-Hua Chen, Damini Dey, Sang-Eun Lee, Hyuk-Jae Chang, Debiao Li, Yibin Xie

Coronary calcium causes beam hardening and blooming artifacts on cardiac computed tomography angiography (CTA) images, which lead to overestimation of lumen stenosis and reduction of diagnostic specificity.

BIG-bench Machine Learning Specificity

Leveraging Diverse Lexical Chains to Construct Essays for Chinese College Entrance Examination

no code implementations IJCNLP 2017 Liunian Li, Xiaojun Wan, Jin-Ge Yao, Siming Yan

In this work we study the challenging task of automatically constructing essays for Chinese college entrance examination where the topic is specified in advance.

Sentence

Cannot find the paper you are looking for? You can Submit a new open access paper.