Search Results for author: Min Shi

Found 27 papers, 16 papers with code

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

1 code implementation • 2 May 2024 • Shihao Wang, Zhiding Yu, Xiaohui Jiang, Shiyi Lan, Min Shi, Nadine Chang, Jan Kautz, Ying Li, Jose M. Alvarez

We further propose OmniDrive-nuScenes, a new visual question-answering dataset challenging the true 3D situational awareness of a model with comprehensive visual question-answering (VQA) tasks, including scene description, traffic regulation, 3D grounding, counterfactual reasoning, decision making and planning.

Autonomous Driving counterfactual +4

Paper
Code

Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

1 code implementation • 26 Apr 2024 • Tianqi Liu, Xinyi Ye, Min Shi, Zihao Huang, Zhiyu Pan, Zhan Peng, Zhiguo Cao

We incorporate the above ACA, SVA, and CAF into a coarse-to-fine framework, termed Geometry-aware Reconstruction and Fusion-refined Rendering (GeFu).

Paper
Code

FairCLIP: Harnessing Fairness in Vision-Language Learning

1 code implementation • 29 Mar 2024 • Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang

Fairness is a critical concern in deep learning, especially in healthcare, where these models influence diagnoses and treatment decisions.

Fairness

Paper
Code

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

1 code implementation • 4 Mar 2024 • Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Shuai Yang, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

Specifically, it is powered by a text-conditioned tri-plane latent diffusion model, which quickly generates coarse 3D samples for fast prototyping.

3D Generation Text to 3D +1

558

Paper
Code

FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

1 code implementation • 3 Nov 2023 • Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang

Existing medical fairness datasets are all for classification tasks, and no fairness datasets are available for medical segmentation, while medical segmentation is an equally important clinical task as classifications, which can provide detailed spatial information on organ abnormalities ready to be assessed by clinicians.

Fairness Image Segmentation +3

Paper
Code

FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

no code implementations • 3 Oct 2023 • Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

To address this research gap, we conduct the first comprehensive study on the fairness of 3D medical imaging models across multiple protected attributes.

Fairness

Paper
Add Code

When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo

1 code implementation • ICCV 2023 • Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao

This constraint reduces the 2D search space into the epipolar line in stereo matching.

Ranked #3 on 3D Reconstruction on DTU

3D Reconstruction Descriptive +2

Paper
Code

EANet: Expert Attention Network for Online Trajectory Prediction

no code implementations • 11 Sep 2023 • Pengfei Yao, Tianlu Mao, Min Shi, Jingkai Sun, Zhaoqi Wang

We introduce expert attention, which adjusts the weights of different depths of network layers, avoiding the model updated slowly due to gradient problem and enabling fast learning of new scenario's knowledge to restore prediction accuracy.

Autonomous Driving Trajectory Prediction

Paper
Add Code

Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

no code implementations • ICCV 2023 • Yan Luo, Min Shi, Yu Tian, Tobias Elze, Mengyu Wang

This is the largest glaucoma detection dataset with 3D OCT imaging data and the first glaucoma progression forecasting dataset that is publicly available.

Fairness

Paper
Add Code

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

1 code implementation • 3 Aug 2023 • Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao

We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world.

Question Answering Retrieval +1

380

Paper
Code

Neural Video Depth Stabilizer

3 code implementations • ICCV 2023 • Yiran Wang, Min Shi, Jiaqi Li, Zihao Huang, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin

Video depth estimation aims to infer temporally consistent depth.

Ranked #16 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Monocular Depth Estimation

763

Paper
Code

Harvard Glaucoma Fairness: A Retinal Nerve Disease Dataset for Fairness Learning and Fair Identity Normalization

1 code implementation • 15 Jun 2023 • Yan Luo, Yu Tian, Min Shi, Louis R. Pasquale, Lucy Q. Shen, Nazlee Zebardast, Tobias Elze, Mengyu Wang

To address this gap, we introduce Harvard Glaucoma Fairness (Harvard-GF), a retinal nerve disease dataset with both 2D and 3D imaging data and balanced racial groups for glaucoma detection.

Fairness Feature Importance

Paper
Code

Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation

1 code implementation • CVPR 2023 • Min Shi, Zihao Huang, Xianzheng Ma, Xiaowei Hu, Zhiguo Cao

To calibrate the inaccurate matching results, we introduce a two-stage framework, where matched keypoints from the first stage are viewed as similarity-aware position proposals.

Ranked #3 on 2D Pose Estimation on MP-100

Category-Agnostic Pose Estimation Decoder +1

Paper
Code

Demystify Transformers & Convolutions in Modern Image Deep Networks

1 code implementation • 10 Nov 2022 • Xiaowei Hu, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie zhou, Xiaogang Wang, Yu Qiao, Jifeng Dai

Our experiments on various tasks and an analysis of inductive bias show a significant performance boost due to advanced network-level and block-level designs, but performance differences persist among different STMs.

Image Deep Networks Spatial Token Mixer

Paper
Code

Artifact-Tolerant Clustering-Guided Contrastive Embedding Learning for Ophthalmic Images

1 code implementation • 2 Sep 2022 • Min Shi, Anagha Lokhande, Mojtaba S. Fazli, Vishal Sharma, Yu Tian, Yan Luo, Louis R. Pasquale, Tobias Elze, Michael V. Boland, Nazlee Zebardast, David S. Friedman, Lucy Q. Shen, Mengyu Wang

Ophthalmic images and derivatives such as the retinal nerve fiber layer (RNFL) thickness map are crucial for detecting and monitoring ophthalmic diseases (e. g., glaucoma).

Clustering Contrastive Learning +1

Paper
Code

Design What You Desire: Icon Generation from Orthogonal Application and Theme Labels

1 code implementation • 31 Jul 2022 • Yinpeng Chen, Zhiyu Pan, Min Shi, Hao Lu, Zhiguo Cao, Weicai Zhong

Generative adversarial networks (GANs) have been trained to be professional artists able to create stunning artworks such as face generation and image style transfer.

Disentanglement Face Generation +1

Paper
Code

3D Instances as 1D Kernels

1 code implementation • 15 Jul 2022 • Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong

The idea of instance kernel is inspired by recent success of dynamic convolutions in 2D/3D instance segmentation.

Ranked #2 on 3D Instance Segmentation on S3DIS (mCov metric)

3D Instance Segmentation Semantic Segmentation

Paper
Code

Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting

1 code implementation • CVPR 2022 • Min Shi, Hao Lu, Chen Feng, Chengxin Liu, Zhiguo Cao

In this work, we propose a similarity-aware CAC framework that jointly learns representation and similarity metric.

Ranked #4 on Object Counting on CARPK

Object Counting

Paper
Code

ST-PCNN: Spatio-Temporal Physics-Coupled Neural Networks for Dynamics Forecasting

no code implementations • 12 Aug 2021 • Yu Huang, James Li, Min Shi, Hanqi Zhuang, Xingquan Zhu, Laurent Chérubin, James VanZwieten, Yufei Tang

A spatio-temporal physics-coupled neural network (ST-PCNN) model is proposed to achieve three goals: (1) learning the underlying physics parameters, (2) transition of local information between spatio-temporal regions, and (3) forecasting future values for the dynamical system.

Paper
Add Code

Physics-Coupled Spatio-Temporal Active Learning for Dynamical Systems

no code implementations • 11 Aug 2021 • Yu Huang, Yufei Tang, Xingquan Zhu, Min Shi, Ali Muhamed Ali, Hanqi Zhuang, Laurent Cherubin

To tackle these challenges, we advocate a spatio-temporal physics-coupled neural networks (ST-PCNN) model to learn the underlying physics of the dynamical system and further couple the learned physics to assist the learning of the recurring dynamics.

Active Learning Spatio-Temporal Forecasting

Paper
Add Code

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Grigory Malivenko, David Plowman, Samarth Shukla, Radu Timofte, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Yiran Wang, Xingyi Li, Min Shi, Ke Xian, Zhiguo Cao, Jin-Hua Du, Pei-Lin Wu, Chao Ge, Jiaoyang Yao, Fangwen Tu, Bo Li, Jung Eun Yoo, Kwanggyoon Seo, Jialei Xu, Zhenyu Li, Xianming Liu, Junjun Jiang, Wei-Chi Chen, Shayan Joya, Huanhuan Fan, Zhaobing Kang, Ang Li, Tianpeng Feng, Yang Liu, Chuannan Sheng, Jian Yin, Fausto T. Benavide

While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference.

Depth Estimation

Paper
Add Code

Deep Attributed Network Representation Learning via Attribute Enhanced Neighborhood

no code implementations • 12 Apr 2021 • Cong Li, Min Shi, Bo Qu, Xiang Li

In this paper, we propose a deep attributed network representation learning via attribute enhanced neighborhood (DANRL-ANE) model to improve the robustness and effectiveness of node representations.

Attribute Decoder +3

Paper
Add Code

Evolutionary Architecture Search for Graph Neural Networks

1 code implementation • 21 Sep 2020 • Min Shi, David A. Wilson, Xingquan Zhu, Yu Huang, Yuan Zhuang, Jianxun Liu, Yufei Tang

In particular, Neural Architecture Search (NAS) has seen significant attention throughout the AutoML research community, and has pushed forward the state-of-the-art in a number of neural models to address grid-like data such as texts and images.

Neural Architecture Search Representation Learning

Paper
Code

Deep Line Art Video Colorization with a Few References

no code implementations • 24 Mar 2020 • Min Shi, Jia-Qi Zhang, Shu-Yu Chen, Lin Gao, Yu-Kun Lai, Fang-Lue Zhang

The color transform network takes the target line art images as well as the line art and color images of one or more reference images as input, and generates corresponding target color images.

Colorization

Paper
Add Code

Feature-Attention Graph Convolutional Networks for Noise Resilient Learning

no code implementations • 26 Dec 2019 • Min Shi, Yufei Tang, Xingquan Zhu, Jianxun Liu

By using spectral-based graph convolution aggregation process, each node is allowed to concentrate more on the most determining neighborhood features aligned with the corresponding learning task.

Feature Importance

Paper
Add Code

Multi-Label Graph Convolutional Network Representation Learning

no code implementations • 26 Dec 2019 • Min Shi, Yufei Tang, Xingquan Zhu, Jianxun Liu

The multi-label network nodes not only have multiple labels for each node, such labels are often highly correlated making existing methods ineffective or fail to handle such correlation for node representation learning.

Ranked #30 on Multi-Label Classification on MS-COCO

Multi-Label Classification Node Classification +1

Paper
Add Code

DOT: Gene-set analysis by combining decorrelated association statistics

no code implementations • 5 Jun 2019 • Olga A Vsevolozhskaya, Min Shi, Fengjiao Hu, Dmitri V Zaykin

Historically, the majority of statistical association methods have been designed assuming availability of SNP-level information.

Genomics Applications

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.