Search Results for author: Haojie Li

Found 30 papers, 5 papers with code

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

no code implementations18 Dec 2023 Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Moreover, to facilitate disentangled representation learning, we introduce four well-designed constraints: an auxiliary style classifier, an auxiliary inverse classifier, a content contrastive loss, and a pair of latent cycle losses, which can effectively contribute to the construction of the identity-related style space and semantic-related content space.

Disentanglement

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection

no code implementations ICCV 2023 Xinzhu Ma, Yongtao Wang, Yinmin Zhang, Zhiyi Xia, Yuan Meng, Zhihui Wang, Haojie Li, Wanli Ouyang

In this work, we build a modular-designed codebase, formulate strong training recipes, design an error diagnosis toolbox, and discuss current methods for image-based 3D object detection.

3D Object Detection Object +1

Visual Tuning

no code implementations10 May 2023 Bruce X. B. Yu, Jianlong Chang, Haixin Wang, Lingbo Liu, Shijie Wang, Zhiyu Wang, Junfan Lin, Lingxi Xie, Haojie Li, Zhouchen Lin, Qi Tian, Chang Wen Chen

With the surprising development of pre-trained visual foundation models, visual tuning jumped out of the standard modus operandi that fine-tunes the whole pre-trained model or just the fully connected layer.

Hyperuniform disordered parametric loudspeaker array

no code implementations3 Jan 2023 Kun Tang, Yuqi Wang, Shaobo Wang, Da Gao, Haojie Li, Xindong Liang, Patrick Sebbah, Yibin Li, Jin Zhang, Junhui Shi

A steerable parametric loudspeaker array is known for its directivity and narrow beam width.

Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator

no code implementations CVPR 2023 Shijie Wang, Jianlong Chang, Haojie Li, Zhihui Wang, Wanli Ouyang, Qi Tian

PLEor could leverage pre-trained CLIP model to infer the discrepancies encompassing both pre-defined and unknown subcategories, called category-specific discrepancies, and transfer them to the backbone network trained in the close-set scenarios.

Knowledge Distillation Retrieval +1

Anti-Delay Kalman Filter Fusion Algorithm for Vehicle-borne Sensor Network with Finite-Time Convergence

no code implementations20 Sep 2022 Hang Yu, Keren Dai, Haojie Li, Yao Zou, Xiang Ma, Shaojie Ma, He Zhang

Intelligent vehicles in autonomous driving and obstacle avoidance, the precise relative state of vehicles put forward a higher demand.

Autonomous Driving

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers

no code implementations31 Aug 2022 Zengyuan Guo, Yuechen Yu, Pengyuan Lv, Chengquan Zhang, Haojie Li, Zhihui Wang, Kun Yao, Jingtuo Liu, Jingdong Wang

The Vertex-based Merging Module is capable of aggregating local contextual information between adjacent basic grids, providing the ability to merge basic girds that belong to the same spanning cell accurately.

Table Recognition

Semantic decomposition Network with Contrastive and Structural Constraints for Dental Plaque Segmentation

no code implementations12 Aug 2022 Jian Shi, Baoli Sun, Xinchen Ye, Zhihui Wang, Xiaolong Luo, Jin Liu, Heli Gao, Haojie Li

Therefore, we propose a semantic decomposition network (SDNet) that introduces two single-task branches to separately address the segmentation of teeth and dental plaque and designs additional constraints to learn category-specific features for each branch, thus facilitating the semantic decomposition and improving the performance of dental plaque segmentation.

Segmentation

Fine-grained Retrieval Prompt Tuning

no code implementations29 Jul 2022 Shijie Wang, Jianlong Chang, Zhihui Wang, Haojie Li, Wanli Ouyang, Qi Tian

In this paper, we develop Fine-grained Retrieval Prompt Tuning (FRPT), which steers a frozen pre-trained model to perform the fine-grained retrieval task from the perspectives of sample prompting and feature adaptation.

Retrieval

Cascading Residual Graph Convolutional Network for Multi-Behavior Recommendation

1 code implementation26 May 2022 Mingshi Yan, Zhiyong Cheng, Chen Gao, Jing Sun, Fan Liu, Fuming Sun, Haojie Li

In particular, we design a cascading residual graph convolutional network structure, which enables our model to learn user preferences by continuously refining user embeddings across different types of behaviors.

Multi-Task Learning

An Underwater Image Semantic Segmentation Method Focusing on Boundaries and a Real Underwater Scene Semantic Segmentation Dataset

2 code implementations26 Aug 2021 Zhiwei Ma, Haojie Li, Zhihui Wang, Dan Yu, Tianyi Wang, Yingshuang Gu, Xin Fan, Zhongxuan Luo

Based on this dataset, we propose a semi-supervised underwater semantic segmentation network focusing on the boundaries(US-Net: Underwater Segmentation Network).

Boundary Detection Instance Segmentation +7

A Dataset And Benchmark Of Underwater Object Detection For Robot Picking

no code implementations10 Jun 2021 Chongwei Liu, Haojie Li, Shuchang Wang, Ming Zhu, Dong Wang, Xin Fan, Zhihui Wang

Towards these challenges we introduce a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark, based on the collection and re-annotation of all relevant datasets.

object-detection Object Detection

Delving into Localization Errors for Monocular 3D Object Detection

1 code implementation CVPR 2021 Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang

Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving, while accurate 3D object detection from this kind of data is very challenging.

3D Object Detection From Monocular Images Autonomous Driving +3

Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution

no code implementations CVPR 2021 Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu

First, we design a cross-task distillation scheme that encourages DSR and DE networks to learn from each other in a teacher-student role-exchanging fashion.

Depth Estimation Super-Resolution +1

A Unified Joint Maximum Mean Discrepancy for Domain Adaptation

no code implementations25 Jan 2021 Wei Wang, Baopu Li, Shuhui Yang, Jing Sun, Zhengming Ding, Junyang Chen, Xiao Dong, Zhihui Wang, Haojie Li

From the revealed unified JMMD, we illustrate that JMMD degrades the feature-label dependence (discriminability) that benefits to classification, and it is sensitive to the label distribution shift when the label kernel is the weighted class conditional one.

Domain Adaptation

Direct Depth Learning Network for Stereo Matching

no code implementations10 Dec 2020 Hong Zhang, Haojie Li, Shenglun Chen, Tiantian Yan, Zhihui Wang, Guo Lu, Wanli Ouyang

To make the Adaptive-Grained Depth Refinement stage robust to the coarse depth and adaptive to the depth range of the points, the Granularity Uncertainty is introduced to Adaptive-Grained Depth Refinement stage.

Autonomous Driving Depth Estimation +1

Full Matching on Low Resolution for Disparity Estimation

no code implementations10 Dec 2020 Hong Zhang, Shenglun Chen, Zhihui Wang, Haojie Li, Wanli Ouyang

To this end, we first propose to decompose the full matching task into multiple stages of the cost aggregation module.

Disparity Estimation

SIRI: Spatial Relation Induced Network For Spatial Description Resolution

no code implementations NeurIPS 2020 Peiyao Wang, Weixin Luo, Yanyu Xu, Haojie Li, Shugong Xu, Jianyu Yang, Shenghua Gao

Spatial Description Resolution, as a language-guided localization task, is proposed for target location in a panoramic street view, given corresponding language descriptions.

Relation

Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

no code implementations12 Oct 2020 Shijie Wang, Zhihui Wang, Haojie Li, Wanli Ouyang

Existing deep learning based weakly supervised fine-grained image recognition (WFGIR) methods usually pick out the discriminative regions from the high-level feature (HLF) maps directly.

Attribute Fine-Grained Image Recognition

Rethink Maximum Mean Discrepancy for Domain Adaptation

no code implementations1 Jul 2020 Wei Wang, Haojie Li, Zhengming Ding, Zhihui Wang

On the other hand, we design two different strategies to boost the feature discriminability: 1) we directly impose a trade-off parameter on the implicit intra-class distance in MMD to regulate its change; 2) we impose the similar weights revealed in MMD on inter-class distance and maximize it, then a balanced factor could be introduced to quantitatively leverage the relative importance between the feature transferability and its discriminability.

Domain Adaptation

Sparsely-Labeled Source Assisted Domain Adaptation

no code implementations8 May 2020 Wei Wang, Zhihui Wang, Yuankai Xiang, Jing Sun, Haojie Li, Fuming Sun, Zhengming Ding

However, there are usually a large number of unlabeled data but only a few labeled data in the source domain, and how to transfer knowledge from this sparsely-labeled source domain to the target domain is still a challenge, which greatly limits their application in the wild.

Clustering Domain Adaptation

A New Dataset, Poisson GAN and AquaNet for Underwater Object Grabbing

no code implementations3 Mar 2020 Chongwei Liu, Zhihui Wang, Shijie Wang, Tao Tang, Yulong Tao, Caifei Yang, Haojie Li, Xing Liu, Xin Fan

We also propose a novel Poisson-blending Generative Adversarial Network (Poisson GAN) and an efficient object detection network (AquaNet) to address two common issues within related datasets: the class-imbalance problem and the problem of mass small object, respectively.

4k Generative Adversarial Network +2

Importance Filtered Cross-Domain Adaptation

no code implementations24 Dec 2019 Wei Wang, Haojie Li, Zhihui Wang, Jing Sun, Zhengming Ding, Fuming Sun

Firstly, an importance filtered mechanism is devised to generate filtered soft labels to mitigate negative transfer desirably.

Domain Adaptation Object Recognition

Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving

no code implementations27 Mar 2019 Xinzhu Ma, Zhihui Wang, Haojie Li, Peng-Bo Zhang, Xin Fan, Wanli Ouyang

To this end, we first leverage a stand-alone module to transform the input data from 2D image plane to 3D point clouds space for a better input representation, then we perform the 3D detection using PointNet backbone net to obtain objects 3D locations, dimensions and orientations.

3D Reconstruction Autonomous Driving +2

User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks

2 code implementations9 Aug 2018 Yuanzheng Ci, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo

Scribble colors based line art colorization is a challenging computer vision problem since neither greyscale values nor semantic information is presented in line arts, and the lack of authentic illustration-line art training pairs also increases difficulty of model generalization.

Benchmarking Line Art Colorization

A Single Shot Text Detector with Scale-adaptive Anchors

no code implementations5 Jul 2018 Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo

Currently, most top-performing text detection networks tend to employ fixed-size anchor boxes to guide the search for text instances.

Computational Efficiency Text Detection

Sequential Dual Deep Learning with Shape and Texture Features for Sketch Recognition

no code implementations9 Aug 2017 Qi Jia, Meiyu Yu, Xin Fan, Haojie Li

We develop dual deep networks with memorable gated recurrent units (GRUs), and sequentially feed these two types of features into the dual networks, respectively.

Sketch Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.