Search Results for author: Yibo Liu

Found 13 papers, 7 papers with code

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

2 code implementations • 27 Nov 2023 • Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning.

Complex Query Answering Logical Reasoning +1

7,177

Paper
Code

Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge

no code implementations • 28 Sep 2023 • Zheyuan Yang, Yibo Liu, Guile Wu, Tongtong Cao, Yuan Ren, Yang Liu, Bingbing Liu

To resolve this problem, we study learning effective NeRFs and SDFs representations with 3D Generative Adversarial Networks (GANs) for 3D object generation.

Decoder Object

Paper
Add Code

MV-DeepSDF: Implicit Modeling with Multi-Sweep Point Clouds for 3D Vehicle Reconstruction in Autonomous Driving

no code implementations • ICCV 2023 • Yibo Liu, Kelly Zhu, Guile Wu, Yuan Ren, Bingbing Liu, Yang Liu, Jinjun Shan

This set-level latent code is an expression of the optimal 3D shape in the implicit space, and can be subsequently decoded to a continuous SDF of the vehicle.

3D Reconstruction Autonomous Driving

Paper
Add Code

Evolutionary Multitasking with Solution Space Cutting for Point Cloud Registration

no code implementations • 12 Dec 2022 • Wu Yue, Peiran Gong, Maoguo Gong, Hangqi Ding, Zedong Tang, Yibo Liu, Wenping Ma, Qiguang Miao

However, most evolving registration methods cannot tackle the local optimum well and they have rarely investigated the success ratio, which implies the probability of not falling into local optima and is closely related to the practicality of the algorithm.

Point Cloud Registration Transfer Learning

Paper
Add Code

Occlusion-Resistant LiDAR Fiducial Marker Detection

1 code implementation • 2 Sep 2022 • Yibo Liu, Jinjun Shan, Hunter Schofield

The LiDAR fiducial marker, akin to the well-known AprilTag used in camera applications, serves as a convenient resource to impart artificial features to the LiDAR sensor, facilitating robotics applications.

Paper
Code

Endowing Language Models with Multimodal Knowledge Graph Representations

1 code implementation • 27 Jun 2022 • Ningyuan Huang, Yash R. Deshpande, Yibo Liu, Houda Alberts, Kyunghyun Cho, Clara Vania, Iacer Calixto

We use the recently released VisualSem KG as our external knowledge repository, which covers a subset of Wikipedia and WordNet entities, and compare a mix of tuple-based and graph-based algorithms to learn entity and relation representations that are grounded on the KG multimodal information.

Multilingual Named Entity Recognition named-entity-recognition +2

Paper
Code

Multi-view Point Cloud Registration based on Evolutionary Multitasking with Bi-Channel Knowledge Sharing Mechanism

no code implementations • 6 May 2022 • Yue Wu, Yibo Liu, Maoguo Gong, Peiran Gong, Hao Li, Zedong Tang, Qiguang Miao, Wenping Ma

The modeling of multi-view point cloud registration as multi-task optimization are twofold.

3D Reconstruction Point Cloud Registration

Paper
Add Code

Intensity Image-based LiDAR Fiducial Marker System

1 code implementation • 3 Mar 2022 • Yibo Liu, Hunter Schofield, Jinjun Shan

In this paper, an Intensity Image-based LiDAR Fiducial Marker (IILFM) system is developed.

Paper
Code

Autonomous Vision-based UAV Landing with Collision Avoidance using Deep Learning

no code implementations • 17 Sep 2021 • Tianpei Liao, Amal Haridevan, Yibo Liu, Jinjun Shan

There is a risk of collision when multiple UAVs land simultaneously without communication on the same platform.

Collision Avoidance

Paper
Add Code

Application of Ghost-DeblurGAN to Fiducial Marker Detection

1 code implementation • 8 Sep 2021 • Yibo Liu, Amaldev Haridevan, Hunter Schofield, Jinjun Shan

Feature extraction or localization based on the fiducial marker could fail due to motion blur in real-world robotic applications.

Ranked #43 on Deblurring on GoPro

Deblurring Generative Adversarial Network +1

Paper
Code

Table2Charts: Recommending Charts by Learning Shared Table Representations

1 code implementation • 24 Aug 2020 • Mengyu Zhou, Qingtao Li, Xinyi He, Yuejiang Li, Yibo Liu, Wei Ji, Shi Han, Yining Chen, Daxin Jiang, Dongmei Zhang

It is common for people to create different types of charts to explore a multi-dimensional dataset (table).

Q-Learning Recommendation Systems

Paper
Code

VisualSem: A High-quality Knowledge Graph for Vision and Language

1 code implementation • EMNLP (MRL) 2021 • Houda Alberts, Teresa Huang, Yash Deshpande, Yibo Liu, Kyunghyun Cho, Clara Vania, Iacer Calixto

We also release a neural multi-modal retrieval model that can use images or sentences as inputs and retrieves entities in the KG.

Data Augmentation Natural Language Understanding +2

Paper
Code

2D-CTC for Scene Text Recognition

no code implementations • 23 Jul 2019 • Zhaoyi Wan, Fengming Xie, Yibo Liu, Xiang Bai, Cong Yao

Scene text recognition has been an important, active research topic in computer vision for years.

Decoder Scene Text Recognition +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.