Search Results for author: Jianing Qiu

Found 20 papers, 9 papers with code

One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models

1 code implementation • 4 Mar 2024 • Lin Li, Haoyan Guan, Jianing Qiu, Michael Spratling

This work studies the adversarial robustness of VLMs from the novel perspective of the text prompt instead of the extensively studied model weights (frozen in this work).

Adversarial Attack, Adversarial Robustness

A Survey of Reasoning with Foundation Models

1 code implementation • 17 Dec 2023 • Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-world settings such as negotiation, medical diagnosis, and criminal investigation.

Medical Diagnosis

Dietary Assessment with Multimodal ChatGPT: A Systematic Analysis

no code implementations • 14 Dec 2023 • Frank P.-W. Lo, Jianing Qiu, Zeyu Wang, Junhong Chen, Bo Xiao, Wu Yuan, Stamatia Giannarou, Gary Frost, Benny Lo

Although artificial intelligence (AI)-based solutions have been devised to automate the dietary assessment process, these prior AI methodologies encounter challenges in their ability to generalize across a diverse range of food types, dietary behaviors, and cultural contexts.

Image Captioning, Scene Understanding

Aria-NeRF: Multimodal Egocentric View Synthesis

no code implementations • 11 Nov 2023 • Jiankai Sun, Jianing Qiu, Chuanyang Zheng, John Tucker, Javier Yu, Mac Schwager

The construction of a NeRF-like model from an egocentric image sequence plays a pivotal role in understanding human behavior and holds diverse applications within the realms of VR/AR.

VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

no code implementations • 8 Oct 2023 • Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv, Hui Miao, Li Guo, Shujun Zhang, Cheng Pei, Xiaojuan Fan, Jianqin Lei, Ting Wei, Junguo Duan, Chun Liu, Xiaobo Xia, Siqi Xiong, Junhong Li, Benny Lo, Yih Chung Tham, Tien Yin Wong, Ningli Wang, Wu Yuan

To be commensurate with this capacity, in addition to the real data used for pre-training, we also generated and leveraged synthetic ophthalmic imaging data.

Disease Prediction, Representation Learning

CauDR: A Causality-inspired Domain Generalization Framework for Fundus-based Diabetic Retinopathy Grading

no code implementations • 27 Sep 2023 • Hao Wei, Peilun Shi, Juzheng Miao, Minqing Zhang, Guitao Bai, Jianing Qiu, Furui Liu, Wu Yuan

Building on this, a causality-inspired diabetic retinopathy grading framework named CauDR was developed to eliminate spurious correlations and achieve more generalizable DR diagnostics.

Diabetic Retinopathy Grading, Domain Generalization

AROID: Improving Adversarial Robustness through Online Instance-wise Data Augmentation

no code implementations • 12 Jun 2023 • Lin Li, Jianing Qiu, Michael Spratling

This allows our method to efficiently explore a large search space for a more effective DA policy and evolve the policy as training progresses.

Adversarial Robustness, Data Augmentation

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation

1 code implementation • 25 Apr 2023 • Peilun Shi, Jianing Qiu, Sai Mu Dalike Abaxi, Hao Wei, Frank P.-W. Lo, Wu Yuan

In this paper, we examine the recent Segment Anything Model (SAM) on medical images, and report both quantitative and qualitative zero-shot segmentation results on nine medical image segmentation benchmarks, covering various imaging modalities, such as optical coherence tomography (OCT), magnetic resonance imaging (MRI), and computed tomography (CT), as well as different applications including dermatology, ophthalmology, and radiology.

Computed Tomography (CT), Image Segmentation, +4

Large AI Models in Health Informatics: Applications, Challenges, and the Future

1 code implementation • 21 Mar 2023 • Jianing Qiu, Lin Li, Jiankai Sun, Jiachuan Peng, Peilun Shi, Ruiyang Zhang, Yinzhao Dong, Kyle Lam, Frank P.-W. Lo, Bo Xiao, Wu Yuan, Ningli Wang, Dong Xu, Benny Lo

Large AI models, or foundation models, are models recently emerging with massive scales both parameter-wise and data-wise, the magnitudes of which can reach beyond billions.

Decision Making, Drug Discovery, +1

EVEN: An Event-Based Framework for Monocular Depth Estimation at Adverse Night Conditions

no code implementations • 8 Feb 2023 • Peilun Shi, Jiachuan Peng, Jianing Qiu, Xinwei Ju, Frank Po Wen Lo, Benny Lo

Comprehensive experiments have been conducted, and the impact of different adverse weather combinations on the performance of the framework has also been investigated.

Autonomous Driving, Monocular Depth Estimation

MenuAI: Restaurant Food Recommendation System via a Transformer-based Deep Learning Model

no code implementations • 15 Oct 2022 • Xinwei Ju, Frank Po Wen Lo, Jianing Qiu, Peilun Shi, Jiachuan Peng, Benny Lo

The promising results, with accuracy ranging from 77.2% to 99.5%, have demonstrated the great potential of the LTR model in addressing food recommendation problems.

Food Recommendation, Learning-To-Rank, +2

Clustering Egocentric Images in Passive Dietary Monitoring with Self-Supervised Learning

no code implementations • 25 Aug 2022 • Jiachuan Peng, Peilun Shi, Jianing Qiu, Xinwei Ju, Frank P.-W. Lo, Xiao Gu, Wenyan Jia, Tom Baranowski, Matilda Steiner-Asiedu, Alex K. Anderson, Megan A McCrory, Edward Sazonov, Mingui Sun, Gary Frost, Benny Lo

By clustering images into separate events, annotators and dietitians can examine and analyze the data more efficiently and facilitate the subsequent dietary assessment processes.

Clustering, Self-Supervised Learning

Tackling Long-Tailed Category Distribution Under Domain Shifts

1 code implementation • 20 Jul 2022 • Xiao Gu, Yao Guo, Zeju Li, Jianing Qiu, Qi Dou, Yuxuan Liu, Benny Lo, Guang-Zhong Yang

Two new datasets were proposed for this problem, named AWA2-LTS and ImageNet-LTS.

Domain Generalization, Meta-Learning, +2

Mining Discriminative Food Regions for Accurate Food Recognition

1 code implementation • 8 Jul 2022 • Jianing Qiu, Frank P.-W. Lo, Yingnan Sun, Siyao Wang, Benny Lo

Taking inspiration from Adversarial Erasing, a strategy that progressively discovers discriminative object regions for weakly supervised semantic segmentation, we propose a novel network architecture in which a primary network maintains the base accuracy of classifying an input image, an auxiliary network adversarially mines discriminative food regions, and a region network classifies the resulting mined regions.

Food Recognition, Weakly Supervised Semantic Segmentation, +1

Egocentric Human Trajectory Forecasting with a Wearable Camera and Multi-Modal Fusion

1 code implementation • 1 Nov 2021 • Jianing Qiu, Lipeng Chen, Xiao Gu, Frank P.-W. Lo, Ya-Yen Tsai, Jiankai Sun, Jiaqi Liu, Benny Lo

To this end, a novel egocentric human trajectory forecasting dataset was constructed, containing real trajectories of people navigating in crowded spaces wearing a camera, as well as extracted rich contextual data.

Trajectory Forecasting

Occlusion-Invariant Rotation-Equivariant Semi-Supervised Depth Based Cross-View Gait Pose Estimation

2 code implementations • 3 Sep 2021 • Xiao Gu, Jianxin Yang, Hanxiao Zhang, Jianing Qiu, Frank Po Wen Lo, Yao Guo, Guang-Zhong Yang, Benny Lo

It can generalize well on the real-world data from all the other unseen views.

Pose Estimation

TransAction: ICL-SJTU Submission to EPIC-Kitchens Action Anticipation Challenge 2021

1 code implementation • 28 Jul 2021 • Xiao Gu, Jianing Qiu, Yao Guo, Benny Lo, Guang-Zhong Yang

In this report, the technical details of our submission to the EPIC-Kitchens Action Anticipation Challenge 2021 are given.

Action Anticipation

Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring

no code implementations • 1 Jul 2021 • Jianing Qiu, Frank P.-W. Lo, Xiao Gu, Modou L. Jobarteh, Wenyan Jia, Tom Baranowski, Matilda Steiner-Asiedu, Alex K. Anderson, Megan A McCrory, Edward Sazonov, Mingui Sun, Gary Frost, Benny Lo

In this paper, we propose a privacy-preserved secure solution (i.e., egocentric image captioning) for dietary assessment with passive monitoring, which unifies food recognition, volume estimation, and scene understanding.

Food Recognition, Image Captioning, +1

An Intelligent Passive Food Intake Assessment System with Egocentric Cameras

no code implementations • 7 May 2021 • Frank Po Wen Lo, Modou L Jobarteh, Yingnan Sun, Jianing Qiu, Shuo Jiang, Gary Frost, Benny Lo

Malnutrition is a major public health concern in low-and-middle-income countries (LMICs).

Semantic Segmentation

Indoor Future Person Localization from an Egocentric Wearable Camera

no code implementations • 6 Mar 2021 • Jianing Qiu, Frank P.-W. Lo, Xiao Gu, Yingnan Sun, Shuo Jiang, Benny Lo

Accurate prediction of future person location and movement trajectory from an egocentric wearable camera can benefit a wide range of applications, such as assisting visually impaired people in navigation, and the development of mobility assistance for people with disability.
