Search Results for author: Jin Gao

Found 27 papers, 14 papers with code

OPDAI at SemEval-2022 Task 11: A hybrid approach for Chinese NER using outside Wikipedia knowledge

no code implementations • SemEval (NAACL) 2022 • Ze Chen, Kangxu Wang, Jiewen Zheng, Zijian Cai, Jiarong He, Jin Gao

This article describes the OPDAI submission to SemEval-2022 Task 11 on Chinese complex NER.

Paper
Add Code

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

1 code implementation • 18 Apr 2024 • Jin Gao, Shubo Lin, Shaoru Wang, Yutong Kou, Zeming Li, Liang Li, Congxuan Zhang, Xiaoqin Zhang, Yizheng Wang, Weiming Hu

In this paper, we question if the extremely simple ViTs' fine-tuning performance with a small-scale architecture can also benefit from this pre-training paradigm, which is considerably less studied yet in contrast to the well-established lightweight architecture design methodology with sophisticated components introduced.

Contrastive Learning Image Classification +2

Paper
Code

BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues

no code implementations • 11 Mar 2024 • Fudong Ge, Yiwei Zhang, Shuhan Shen, Yue Wang, Weiming Hu, Jin Gao

To tackle the above issues, we design a new BEV-enhanced VPR framework, nemely BEV2PR, which can generate a composite descriptor with both visual cues and spatial awareness solely based on a single camera.

Visual Place Recognition

Paper
Add Code

Multi-Generative Agent Collective Decision-Making in Urban Planning: A Case Study for Kendall Square Renovation

no code implementations • 17 Feb 2024 • Jin Gao, Hanyong Xu, Luc Dao

In this study, we develop a multiple-generative agent system to simulate community decision-making for the redevelopment of Kendall Square's Volpe building.

Decision Making

Paper
Add Code

Data-Centric Foundation Models in Computational Healthcare: A Survey

1 code implementation • 4 Jan 2024 • Yunkun Zhang, Jin Gao, Zheling Tan, Lingfeng Zhou, Kexin Ding, Mu Zhou, Shaoting Zhang, Dequan Wang

The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare.

Ethics

Paper
Code

Multi-Correlation Siamese Transformer Network with Dense Connection for 3D Single Object Tracking

1 code implementation • 18 Dec 2023 • Shihao Feng, Pengpeng Liang, Jin Gao, Erkang Cheng

Instead of performing correlation of the two branches at just one point in the network, in this paper, we present a multi-correlation Siamese Transformer network that has multiple stages and carries out feature correlation at the end of each stage based on sparse pillars.

3D Single Object Tracking Autonomous Driving +2

Paper
Code

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

no code implementations • 6 Nov 2023 • Yanqin Jiang, Li Zhang, Jin Gao, Weimin Hu, Yao Yao

This is achieved by leveraging the object-level 3D-aware image diffusion model as the primary supervision signal for training Dynamic Neural Radiance Fields (DyNeRF).

3D Generation Camera Calibration +3

Paper
Add Code

AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban Planning via Consensus-based Multi-Agent Reinforcement Learning

no code implementations • 25 Oct 2023 • Kejiang Qian, Lingjun Mao, Xin Liang, Yimin Ding, Jin Gao, Xinran Wei, Ziyi Guo, Jiajie Li

By integrating Multi-Agent Reinforcement Learning, our framework ensures that participatory urban planning decisions are more dynamic and adaptive to evolving community needs and provides a robust platform for automating complex real-world urban planning processes.

Decision Making Multi-agent Reinforcement Learning +1

Paper
Add Code

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking

1 code implementation • NeurIPS 2023 • Yutong Kou, Jin Gao, Bing Li, Gang Wang, Weiming Hu, Yizheng Wang, Liang Li

To this end, we non-uniformly resize the cropped image to have a smaller input size while the resolution of the area where the target is more likely to appear is higher and vice versa.

Visual Tracking

Paper
Code

Text-guided Foundation Model Adaptation for Pathological Image Classification

2 code implementations • 27 Jul 2023 • Yunkun Zhang, Jin Gao, Mu Zhou, Xiaosong Wang, Yu Qiao, Shaoting Zhang, Dequan Wang

In this paper, we propose to Connect Image and Text Embeddings (CITE) to enhance pathological image classification.

Classification Image Classification +1

Paper
Code

Enhancing Model Performance in Multilingual Information Retrieval with Comprehensive Data Engineering Techniques

no code implementations • 14 Feb 2023 • Qi Zhang, Zijian Yang, Yilun Huang, Ze Chen, Zijian Cai, Kangxu Wang, Jiewen Zheng, Jiarong He, Jin Gao

In this paper, we present our solution to the Multilingual Information Retrieval Across a Continuum of Languages (MIRACL) challenge of WSDM CUP 2023\footnote{https://project-miracl. github. io/}.

Data Augmentation Information Retrieval +1

Paper
Add Code

Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption

no code implementations • CVPR 2023 • Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang

We update the target data instead, and project all test inputs toward the source domain with a generative diffusion model.

Test-time Adaptation

Paper
Add Code

Multi-Frames Temporal Abnormal Clues Learning Method for Face Anti-Spoofing

no code implementations • 8 Aug 2022 • Heng Cong, Rongyu Zhang, Jiarong He, Jin Gao

Face anti-spoofing researches are widely used in face recognition and has received more attention from industry and academics.

Face Anti-Spoofing Face Recognition

Paper
Add Code

Image Quality Assessment with Gradient Siamese Network

no code implementations • 8 Aug 2022 • Heng Cong, Lingzhi Fu, Rongyu Zhang, Yusheng Zhang, Hao Wang, Jiarong He, Jin Gao

In this work, we introduce Gradient Siamese Network (GSN) for image quality assessment.

Image Quality Assessment

Paper
Add Code

A Semantic Alignment System for Multilingual Query-Product Retrieval

no code implementations • 5 Aug 2022 • Qi Zhang, Zijian Yang, Yilun Huang, Ze Chen, Zijian Cai, Kangxu Wang, Jiewen Zheng, Jiarong He, Jin Gao

Our models are all trained with cross-entropy loss to classify the query-product pairs into ESCI 4 categories at first, and then we use weighted sum with the 4-class probabilities to get the score for ranking.

Data Augmentation Retrieval +1

Paper
Add Code

DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning

no code implementations • 13 Jul 2022 • Shaoru Wang, Zeming Li, Jin Gao, Liang Li, Weiming Hu

However, when facing various resource budgets in real-world applications, it costs a huge computation burden to pretrain multiple networks of various sizes one by one.

Knowledge Distillation Self-Supervised Learning

Paper
Add Code

Back to the Source: Diffusion-Driven Test-Time Adaptation

1 code implementation • 7 Jul 2022 • Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang

We instead update the target data, by projecting all test inputs toward the source domain with a generative diffusion model.

Test-time Adaptation

Paper
Code

PolarFormer: Multi-camera 3D Object Detection with Polar Transformer

1 code implementation • 30 Jun 2022 • Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang

3D object detection in autonomous driving aims to reason "what" and "where" the objects of interest present in a 3D world.

Ranked #2 on Robust Camera Only 3D Object Detection on nuScenes-C

3D Object Detection Autonomous Driving +5

153

Paper
Code

Narrowing the Gap: Improved Detector Training with Noisy Location Annotations

1 code implementation • 12 Jun 2022 • Shaoru Wang, Jin Gao, Bing Li, Weiming Hu

Experiments for both synthesized and real-world scenarios consistently demonstrate the effectiveness of our approach, e. g., our method increases the degraded performance of the FCOS detector from 33. 6% AP to 35. 6% AP on COCO.

object-detection Object Detection

Paper
Code

A Closer Look at Self-Supervised Lightweight Vision Transformers

2 code implementations • 28 May 2022 • Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu

We also point out some defects of such pre-training, e. g., failing to benefit from large-scale pre-training data and showing inferior performance on data-insufficient downstream tasks.

Contrastive Learning Image Classification +1

Paper
Code

Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation

1 code implementation • CVPR 2022 • Zongyang Ma, Guan Luo, Jin Gao, Liang Li, Yuxin Chen, Shaoru Wang, Congxuan Zhang, Weiming Hu

Open-vocabulary object detection aims to detect novel object categories beyond the training set.

Ranked #26 on Open Vocabulary Object Detection on MSCOCO

Knowledge Distillation Language Modelling +3

Paper
Code

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

2 code implementations • CVPR 2020 • Jin Gao, Yan Lu, Xiaojuan Qi, Yutong Kou, Bing Li, Liang Li, Shan Yu, Weiming Hu

In this paper, we propose a simple yet effective recursive least-squares estimator-aided online learning approach for few-shot online adaptation without requiring offline training.

Continual Learning One-Shot Learning +1

Paper
Code

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking

no code implementations • 6 May 2021 • Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu

In this paper, we show the existence of universal perturbations that can enable the targeted attack, e. g., forcing a tracker to follow the ground-truth trajectory with specified offsets, to be video-agnostic and free from inference in a network.

Visual Tracking

Paper
Add Code

Graspness Discovery in Clutters for Fast and Accurate Grasp Detection

1 code implementation • ICCV 2021 • Chenxi Wang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu

To quickly detect graspness in practice, we develop a neural network named graspness model to approximate the searching process.

Ranked #3 on Robotic Grasping on GraspNet-1Billion

Robotic Grasping

101

Paper
Code

Visual Tracking via Spatially Aligned Correlation Filters Network

no code implementations • ECCV 2018 • Mengdan Zhang, Qiang Wang, Junliang Xing, Jin Gao, Peixi Peng, Weiming Hu, Steve Maybank

Correlation filters based trackers rely on a periodic assumption of the search sample to efficiently distinguish the target from the background.

Visual Tracking

Paper
Add Code

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking

2 code implementations • CVPR 2018 • Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank

The RASNet model reformulates the correlation filter within a Siamese tracking framework, and introduces different kinds of the attention mechanisms to adapt the model without updating the model online.

Ranked #3 on Visual Object Tracking on OTB-2013

Object Tracking Representation Learning +1

Paper
Code

DCFNet: Discriminant Correlation Filters Network for Visual Tracking

5 code implementations • 13 Apr 2017 • Qiang Wang, Jin Gao, Junliang Xing, Mengdan Zhang, Weiming Hu

In this work, we present an end-to-end lightweight network architecture, namely DCFNet, to learn the convolutional features and perform the correlation tracking process simultaneously.

Object Tracking Visual Tracking

214

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.