Search Results for author: Jinlin Wu

Found 16 papers, 7 papers with code

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons

no code implementations • 14 May 2024 • Zhen Chen, Xingjian Luo, Jinlin Wu, Danny T. M. Chan, Zhen Lei, Jinqiao Wang, Sebastien Ourselin, Hongbin Liu

In this work, by leveraging advanced multimodal large language models (MLLMs), we propose a Versatile Surgery Assistant (VS-Assistant) that can accurately understand the surgeon's intention and complete a series of surgical understanding tasks, e.g., surgical scene analysis, surgical instrument detection, and segmentation on demand.

A Survey on Personalized Content Synthesis with Diffusion Models

no code implementations • 9 May 2024 • Xulu Zhang, Xiao-Yong Wei, WengYu Zhang, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li

This paper offers a comprehensive survey of personalized content synthesis (PCS), with a particular focus on diffusion models.

Face Generation, Text-to-Image Generation

Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning

no code implementations • 1 May 2024 • Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Lei, Zhen Chen, Hongbin Liu

Ultrasound robots are increasingly used in medical diagnostics and early disease screening.

Language Modelling, Large Language Model, +2

Transitive Vision-Language Prompt Learning for Domain Generalization

no code implementations • 29 Apr 2024 • Liyuan Wang, Yan Jin, Zhen Chen, Jinlin Wu, Mengke Li, Yang Lu, Hanzi Wang

Vision-language pre-training has enabled deep models to make a huge step forward in generalizing across unseen domains.

Domain Generalization

Generative Active Learning for Image Synthesis Personalization

1 code implementation • 22 Mar 2024 • Xulu Zhang, WengYu Zhang, Xiao-Yong Wei, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li

The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept.

Active Learning, Image Generation

BronchoTrack: Airway Lumen Tracking for Branch-Level Bronchoscopic Localization

no code implementations • 20 Feb 2024 • Qingyao Tian, Huai Liao, Xinyan Huang, Bingyu Yang, Jinlin Wu, Jian Chen, Lujie Li, Hongbin Liu

Localizing the bronchoscope in real time is essential for ensuring intervention quality.

Multi-Object Tracking

Compositional Inversion for Stable Diffusion Models

1 code implementation • 13 Dec 2023 • Xulu Zhang, Xiao-Yong Wei, Jinlin Wu, Tianyi Zhang, Zhaoxiang Zhang, Zhen Lei, Qing Li

It stems from the fact that during inversion, the irrelevant semantics in the user images are also encoded, forcing the inverted concepts to occupy locations far from the core distribution in the embedding space.

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives

no code implementations • 7 Dec 2023 • Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

Learning scene graphs from natural language descriptions has proven to be a cheap and promising scheme for Scene Graph Generation (SGG).

Graph Generation, Scene Graph Generation, +1

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

no code implementations • 18 Nov 2023 • Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

For the more challenging settings of relation-involved open vocabulary SGG, the proposed approach integrates relation-aware pre-training utilizing image-caption data and retains visual-concept alignment through knowledge distillation.

Concept Alignment, Graph Generation, +6

FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data

1 code implementation • 17 Nov 2023 • Pietro Melzi, Ruben Tolosana, Ruben Vera-Rodriguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Ivan DeAndres-Tame, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Weisong Zhao, Xiangyu Zhu, Zheyu Yan, Xiao-Yu Zhang, Jinlin Wu, Zhen Lei, Suvidha Tripathi, Mahak Kothari, Md Haider Zama, Debayan Deb, Bernardo Biesseck, Pedro Vidal, Roger Granada, Guilherme Fickel, Gustavo Führ, David Menotti, Alexander Unnervik, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Parsa Rahimi, Sébastien Marcel, Ioannis Sarridis, Christos Koutlis, Georgia Baltsou, Symeon Papadopoulos, Christos Diou, Nicolò Di Domenico, Guido Borghi, Lorenzo Pellegrini, Enrique Mas-Candela, Ángela Sánchez-Pérez, Andrea Atzori, Fadi Boutros, Naser Damer, Gianni Fenu, Mirko Marras

Despite the widespread adoption of face recognition technology around the world, and its remarkable performance on current benchmarks, there are still several challenges that must be covered in more detail.

Face Recognition

PWISeg: Point-based Weakly-supervised Instance Segmentation for Surgical Instruments

1 code implementation • 16 Nov 2023 • Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu

To address this issue, we propose a novel yet effective weakly-supervised surgical instrument instance segmentation approach, named Point-based Weakly-supervised Instance Segmentation (PWISeg).

Instance Segmentation, Segmentation, +4

SurgPLAN: Surgical Phase Localization Network for Phase Recognition

no code implementations • 16 Nov 2023 • Xingjian Luo, You Pang, Zhen Chen, Jinlin Wu, Zongmin Zhang, Zhen Lei, Hongbin Liu

To address these two challenges, we propose a Surgical Phase LocAlization Network, named SurgPLAN, to facilitate a more accurate and stable surgical phase recognition with the principle of temporal detection.

Surgical phase recognition

WS-YOLO: Weakly Supervised Yolo Network for Surgical Tool Localization in Endoscopic Videos

1 code implementation • 23 Sep 2023 • Rongfeng Wei, Jinlin Wu, You Pang, Zhen Chen

Being able to automatically detect and track surgical instruments in endoscopic video recordings would allow for many useful applications that could transform different aspects of surgery.

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

no code implementations • ICCV 2023 • Benzhi Wang, Yang Yang, Jinlin Wu, Guo-Jun Qi, Zhen Lei

On the other hand, the similarity of cross-scale images is often smaller than that of images with the same scale for a person, which will increase the difficulty of matching.

Person Search