Search Results for author: Jin Gao

Found 27 papers, 14 papers with code

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

1 code implementation 18 Apr 2024 Jin Gao, Shubo Lin, Shaoru Wang, Yutong Kou, Zeming Li, Liang Li, Congxuan Zhang, Xiaoqin Zhang, Yizheng Wang, Weiming Hu

In this paper, we ask whether the fine-tuning performance of extremely simple, small-scale ViTs can also benefit from this pre-training paradigm, which has been studied far less than the well-established lightweight architecture design methodology that introduces sophisticated components.

Contrastive Learning Image Classification +2

BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues

no code implementations 11 Mar 2024 Fudong Ge, Yiwei Zhang, Shuhan Shen, Yue Wang, Weiming Hu, Jin Gao

To tackle the above issues, we design a new BEV-enhanced VPR framework, namely BEV2PR, which can generate a composite descriptor with both visual cues and spatial awareness solely based on a single camera.

Visual Place Recognition

Multi-Generative Agent Collective Decision-Making in Urban Planning: A Case Study for Kendall Square Renovation

no code implementations 17 Feb 2024 Jin Gao, Hanyong Xu, Luc Dao

In this study, we develop a multiple-generative agent system to simulate community decision-making for the redevelopment of Kendall Square's Volpe building.

Decision Making

Data-Centric Foundation Models in Computational Healthcare: A Survey

1 code implementation 4 Jan 2024 Yunkun Zhang, Jin Gao, Zheling Tan, Lingfeng Zhou, Kexin Ding, Mu Zhou, Shaoting Zhang, Dequan Wang

The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare.

Ethics

Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking

1 code implementation 18 Dec 2023 Shihao Feng, Pengpeng Liang, Jin Gao, Erkang Cheng

Instead of performing correlation of the two branches at just one point in the network, in this paper, we present a multi-correlation Siamese Transformer network that has multiple stages and carries out feature correlation at the end of each stage based on sparse pillars.

3D Single Object Tracking Autonomous Driving +2

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

no code implementations 6 Nov 2023 Yanqin Jiang, Li Zhang, Jin Gao, Weimin Hu, Yao Yao

This is achieved by leveraging the object-level 3D-aware image diffusion model as the primary supervision signal for training Dynamic Neural Radiance Fields (DyNeRF).

3D Generation Camera Calibration +3

AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban Planning via Consensus-based Multi-Agent Reinforcement Learning

no code implementations 25 Oct 2023 Kejiang Qian, Lingjun Mao, Xin Liang, Yimin Ding, Jin Gao, Xinran Wei, Ziyi Guo, Jiajie Li

By integrating Multi-Agent Reinforcement Learning, our framework ensures that participatory urban planning decisions are more dynamic and adaptive to evolving community needs and provides a robust platform for automating complex real-world urban planning processes.

Decision Making Multi-agent Reinforcement Learning +1

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking

1 code implementation NeurIPS 2023 Yutong Kou, Jin Gao, Bing Li, Gang Wang, Weiming Hu, Yizheng Wang, Liang Li

To this end, we non-uniformly resize the cropped image to have a smaller input size while the resolution of the area where the target is more likely to appear is higher and vice versa.

Visual Tracking

Enhancing Model Performance in Multilingual Information Retrieval with Comprehensive Data Engineering Techniques

no code implementations 14 Feb 2023 Qi Zhang, Zijian Yang, Yilun Huang, Ze Chen, Zijian Cai, Kangxu Wang, Jiewen Zheng, Jiarong He, Jin Gao

In this paper, we present our solution to the Multilingual Information Retrieval Across a Continuum of Languages (MIRACL) challenge of WSDM CUP 2023 (https://project-miracl.github.io/).

Data Augmentation Information Retrieval +1

Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption

no code implementations CVPR 2023 Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang

We update the target data instead, and project all test inputs toward the source domain with a generative diffusion model.

Test-time Adaptation

Multi-Frames Temporal Abnormal Clues Learning Method for Face Anti-Spoofing

no code implementations 8 Aug 2022 Heng Cong, Rongyu Zhang, Jiarong He, Jin Gao

Face anti-spoofing research is widely used in face recognition and has received more attention from industry and academia.

Face Anti-Spoofing Face Recognition

A Semantic Alignment System for Multilingual Query-Product Retrieval

no code implementations 5 Aug 2022 Qi Zhang, Zijian Yang, Yilun Huang, Ze Chen, Zijian Cai, Kangxu Wang, Jiewen Zheng, Jiarong He, Jin Gao

Our models are all trained with cross-entropy loss to first classify the query-product pairs into the four ESCI categories, and then we use a weighted sum of the four class probabilities to get the score for ranking.

Data Augmentation Retrieval +1

DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning

no code implementations 13 Jul 2022 Shaoru Wang, Zeming Li, Jin Gao, Liang Li, Weiming Hu

However, when facing various resource budgets in real-world applications, pretraining multiple networks of various sizes one by one imposes a huge computational burden.

Knowledge Distillation Self-Supervised Learning

Back to the Source: Diffusion-Driven Test-Time Adaptation

1 code implementation 7 Jul 2022 Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang

We instead update the target data, by projecting all test inputs toward the source domain with a generative diffusion model.

Test-time Adaptation

Narrowing the Gap: Improved Detector Training with Noisy Location Annotations

1 code implementation 12 Jun 2022 Shaoru Wang, Jin Gao, Bing Li, Weiming Hu

Experiments for both synthesized and real-world scenarios consistently demonstrate the effectiveness of our approach, e.g., our method increases the degraded performance of the FCOS detector from 33.6% AP to 35.6% AP on COCO.

Object Detection

A Closer Look at Self-Supervised Lightweight Vision Transformers

2 code implementations 28 May 2022 Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu

We also point out some defects of such pre-training, e.g., failing to benefit from large-scale pre-training data and showing inferior performance on data-insufficient downstream tasks.

Contrastive Learning Image Classification +1

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

2 code implementations CVPR 2020 Jin Gao, Yan Lu, Xiaojuan Qi, Yutong Kou, Bing Li, Liang Li, Shan Yu, Weiming Hu

In this paper, we propose a simple yet effective recursive least-squares estimator-aided online learning approach for few-shot online adaptation without requiring offline training.

Continual Learning One-Shot Learning +1

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking

no code implementations 6 May 2021 Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu

In this paper, we show the existence of universal perturbations that can enable the targeted attack, e.g., forcing a tracker to follow the ground-truth trajectory with specified offsets, to be video-agnostic and free from inference in a network.

Visual Tracking

Graspness Discovery in Clutters for Fast and Accurate Grasp Detection

1 code implementation ICCV 2021 Chenxi Wang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu

To quickly detect graspness in practice, we develop a neural network named graspness model to approximate the searching process.

Robotic Grasping

Visual Tracking via Spatially Aligned Correlation Filters Network

no code implementations ECCV 2018 Mengdan Zhang, Qiang Wang, Junliang Xing, Jin Gao, Peixi Peng, Weiming Hu, Steve Maybank

Correlation filters based trackers rely on a periodic assumption of the search sample to efficiently distinguish the target from the background.

Visual Tracking

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking

2 code implementations CVPR 2018 Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank

The RASNet model reformulates the correlation filter within a Siamese tracking framework, and introduces different kinds of attention mechanisms to adapt the model without updating the model online.

Object Tracking Representation Learning +1

DCFNet: Discriminant Correlation Filters Network for Visual Tracking

5 code implementations 13 Apr 2017 Qiang Wang, Jin Gao, Junliang Xing, Mengdan Zhang, Weiming Hu

In this work, we present an end-to-end lightweight network architecture, namely DCFNet, to learn the convolutional features and perform the correlation tracking process simultaneously.

Object Tracking Visual Tracking
