Search Results for author: Dong Wang

Found 270 papers, 100 papers with code

A Copy-Augmented Generative Model for Open-Domain Question Answering

no code implementations • ACL 2022 • Shuang Liu, Dong Wang, Xiaoguang Li, Minghui Huang, Meizhen Ding

Open-domain question answering is a challenging task with a wide variety of practical applications.

Natural Questions Open-Domain Question Answering +1

Paper
Add Code

Other Tokens Matter: Exploring Global and Local Features of Vision Transformers for Object Re-Identification

no code implementations • 23 Apr 2024 • Yingquan Wang, Pingping Zhang, Dong Wang, Huchuan Lu

In this work, we first explore the influence of global and local features of ViT and then further propose a novel Global-Local Transformer (GLTrans) for high-performance object Re-ID.

Object

Paper
Add Code

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding

5 code implementations • 11 Apr 2024 • Yiwen Tang, Jiaming Liu, Dong Wang, Zhigang Wang, Shanghang Zhang, Bin Zhao, Xuelong Li

The adapter incorporates prior spatial knowledge from the source modality to guide the local feature aggregation of 3D tokens, compelling the semantic adaption of any-modality transformers.

Paper
Code

Open-Vocabulary Federated Learning with Multimodal Prototyping

1 code implementation • 1 Apr 2024 • Huimin Zeng, Zhenrui Yue, Dong Wang

A new user could come up with queries that involve data from unseen classes, and such open-vocabulary queries would directly defect such FL systems.

Federated Learning

Paper
Code

Exploring Dynamic Transformer for Efficient Object Tracking

no code implementations • 26 Mar 2024 • Jiawen Zhu, Xin Chen, Haiwen Diao, Shuai Li, Jun-Yan He, Chenyang Li, Bin Luo, Dong Wang, Huchuan Lu

For instance, DyTrack obtains 64. 9% AUC on LaSOT with a speed of 256 fps.

Object Visual Object Tracking

Paper
Add Code

HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation

no code implementations • 25 Mar 2024 • Linglin Jing, Yiming Ding, Yunpeng Gao, Zhigang Wang, Xu Yan, Dong Wang, Gerald Schaefer, Hui Fang, Bin Zhao, Xuelong Li

In this paper, we propose a novel hybrid pseudo-labeling framework for unsupervised event-based semantic segmentation, HPL-ESS, to alleviate the influence of noisy pseudo labels.

Image Reconstruction Segmentation +2

Paper
Add Code

Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation

no code implementations • 22 Mar 2024 • Zhenrui Yue, Huimin Zeng, Yimeng Lu, Lanyu Shang, Yang Zhang, Dong Wang

The proliferation of online misinformation has posed significant threats to public interest.

Misinformation Response Generation +2

Paper
Add Code

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

1 code implementation • 18 Mar 2024 • Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Dong Wang, Huchuan Lu, You He

Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset.

Continual Learning Incremental Learning +1

Paper
Code

Federated Recommendation via Hybrid Retrieval Augmented Generation

1 code implementation • 7 Mar 2024 • Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang

To this end, we propose GPT-FedRec, a federated recommendation framework leveraging ChatGPT and a novel hybrid Retrieval Augmented Generation (RAG) mechanism.

Hallucination Privacy Preserving +2

Paper
Code

Reinforcement Learning with Elastic Time Steps

no code implementations • 22 Feb 2024 • Dong Wang, Giovanni Beltrame

Given the discrete nature of RL algorithms, they are oblivious to the effects of the choice of control rate: finding the correct control rate can be difficult and mistakes often result in excessive use of computing resources or even lack of convergence.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Adversarial Data Augmentation for Robust Speaker Verification

no code implementations • 5 Feb 2024 • Zhenyu Zhou, Junhui Chen, Namin Wang, Lantian Li, Dong Wang

This adversarial learning empowers the network to generate speaker embeddings that can deceive the augmentation classifier, making the learned speaker embeddings more robust in the face of augmentation variations.

Data Augmentation Speaker Verification

Paper
Add Code

Off-Policy Primal-Dual Safe Reinforcement Learning

2 code implementations • 26 Jan 2024 • Zifan Wu, Bo Tang, Qian Lin, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang

Results on benchmark tasks show that our method not only achieves an asymptotic performance comparable to state-of-the-art on-policy methods while using much fewer samples, but also significantly reduces constraint violation during training.

reinforcement-learning Safe Reinforcement Learning

842

Paper
Code

Deployable Reinforcement Learning with Variable Control Rate

1 code implementation • 17 Jan 2024 • Dong Wang, Giovanni Beltrame

Unfortunately, the system should be controlled at the highest, worst-case frequency to ensure stability, which can demand significant computational and energy resources and hinder the deployability of the controller on onboard hardware.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

no code implementations • 29 Dec 2023 • Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day.

Data Augmentation

Paper
Add Code

RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

no code implementations • 27 Dec 2023 • Jiahong Zhou, Shunhui Mao, Guoliang Yang, Bo Tang, Qianlong Xie, Lebin Lin, Xingxing Wang, Dong Wang

The existing studies focus on dynamically allocating CRs in queue truncation scenarios (i. e., allocating the size of candidates), and formulate the CR allocation problem as an optimization problem with constraints.

Model Selection Recommendation Systems +1

Paper
Add Code

X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer

no code implementations • 12 Dec 2023 • Linglin Jing, Ying Xue, Xu Yan, Chaoda Zheng, Dong Wang, Ruimao Zhang, Zhigang Wang, Hui Fang, Bin Zhao, Zhen Li

The field of 4D point cloud understanding is rapidly developing with the goal of analyzing dynamic 3D point cloud sequences.

Action Recognition Action Segmentation +6

Paper
Add Code

Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning

no code implementations • 12 Dec 2023 • Wei Geng, Baidi Xiao, Rongpeng Li, Ning Wei, Dong Wang, Zhifeng Zhao

In this paper, we propose a novel decomposition-based multi-agent distributional RL method by approximating the globally shared noisy reward by a Gaussian mixture model (GMM) and decomposing it into the combination of individual distributional local rewards, with which each agent can be updated locally through distributional RL.

Distributional Reinforcement Learning Multi-agent Reinforcement Learning +2

Paper
Add Code

Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning

no code implementations • 12 Dec 2023 • Jiawei Sun, Bin Zhao, Dong Wang, Zhigang Wang, Jie Zhang, Nektarios Koukourakis, Juergen W. Czarske, Xuelong Li

Quantitative phase imaging (QPI) through multi-core fibers (MCFs) has been an emerging in vivo label-free endoscopic imaging modality with minimal invasiveness.

Retrieval

Paper
Add Code

Intelligent Virtual Assistants with LLM-based Process Automation

no code implementations • 4 Dec 2023 • Yanchu Guan, Dong Wang, Zhixuan Chu, Shiyu Wang, Feiyue Ni, Ruihua Song, Longfei Li, Jinjie Gu, Chenyi Zhuang

This paper proposes a novel LLM-based virtual assistant that can automatically perform multi-step operations within mobile apps based on high-level user requests.

Language Modelling Large Language Model

Paper
Add Code

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting

no code implementations • 20 Nov 2023 • Chi Yan, Delin Qu, Dan Xu, Bin Zhao, Zhigang Wang, Dong Wang, Xuelong Li

This strategy is essential to extend 3D Gaussian representation to reconstruct the whole scene rather than synthesize a static object in existing methods.

Pose Tracking Simultaneous Localization and Mapping

Paper
Add Code

Implicit Event-RGBD Neural SLAM

no code implementations • 18 Nov 2023 • Delin Qu, Chi Yan, Dong Wang, Jie Yin, Dan Xu, Bin Zhao, Xuelong Li

To address these challenges, we propose EN-SLAM, the first event-RGBD implicit neural SLAM framework, which effectively leverages the high rate and high dynamic range advantages of event data for tracking and mapping.

Paper
Add Code

Analysis and Applications of Deep Learning with Finite Samples in Full Life-Cycle Intelligence of Nuclear Power Generation

no code implementations • 7 Nov 2023 • Chenwei Tang, Wenqiang Zhou, Dong Wang, Caiyang Yu, Zhenan He, Jizhe Zhou, Shudong Huang, Yi Gao, Jianming Chen, Wentao Feng, Jiancheng Lv

The advent of Industry 4. 0 has precipitated the incorporation of Artificial Intelligence (AI) methods within industrial contexts, aiming to realize intelligent manufacturing, operation as well as maintenance, also known as industrial intelligence.

Few-Shot Learning Open Set Learning +1

Paper
Add Code

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs

2 code implementations • 6 Nov 2023 • Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li

Generalizable articulated object manipulation is essential for home-assistant robots.

Imitation Learning In-Context Learning +1

Paper
Code

LlamaRec: Two-Stage Recommendation using Large Language Models for Ranking

1 code implementation • 25 Oct 2023 • Zhenrui Yue, Sara Rabhi, Gabriel de Souza Pereira Moreira, Dong Wang, Even Oldridge

Recently, large language models (LLMs) have exhibited significant progress in language understanding and generation.

Movie Recommendation

Paper
Code

A Glance is Enough: Extract Target Sentence By Looking at A keyword

no code implementations • 9 Oct 2023 • Ying Shi, Dong Wang, Lantian Li, Jiqing Han

This paper investigates the possibility of extracting a target sentence from multi-talker speech using only a keyword as input.

Sentence

Paper
Add Code

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

5 code implementations • 4 Oct 2023 • Yiwen Tang, Ray Zhang, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li

To this end, we introduce Point-PEFT, a novel framework for adapting point cloud pre-trained models with minimal learnable parameters.

Paper
Code

Linear Recurrent Units for Sequential Recommendation

1 code implementation • 3 Oct 2023 • Zhenrui Yue, Yueqi Wang, Zhankui He, Huimin Zeng, Julian McAuley, Dong Wang

State-of-the-art sequential recommendation relies heavily on self-attention-based recommender models.

Language Modelling Sequential Recommendation

Paper
Code

Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

no code implementations • 20 Sep 2023 • Xuyang Chen, Dong Wang, Konrad Schindler, Mingwei Sun, Yongliang Wang, Nicolo Savioli, Liqiu Meng

Recently, Transformer-based text detection techniques have sought to predict polygons by encoding the coordinates of individual boundary vertices using distinct query features.

regression Text Detection

Paper
Add Code

Leveraging the Power of Data Augmentation for Transformer-based Tracking

no code implementations • 15 Sep 2023 • Jie Zhao, Johan Edstedt, Michael Felsberg, Dong Wang, Huchuan Lu

Due to long-distance correlation and powerful pretrained models, transformer-based methods have initiated a breakthrough in visual object tracking performance.

Data Augmentation Visual Object Tracking

Paper
Add Code

BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction

no code implementations • 17 Aug 2023 • Dong Wang, Kavé Salamatian, Yunqing Xia, Weiwei Deng, Qi Zhiang

Although deep pre-trained language models have shown promising benefit in a large set of industrial scenarios, including Click-Through-Rate (CTR) prediction, how to integrate pre-trained language models that handle only textual signals into a prediction pipeline with non-textual features is challenging.

Click-Through Rate Prediction Dimensionality Reduction +1

Paper
Add Code

Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking

no code implementations • ICCV 2023 • Ben Kang, Xin Chen, Dong Wang, Houwen Peng, Huchuan Lu

The Bridge Module incorporates the high-level information of deep features into the shallow large-resolution features.

Position Visual Tracking

Paper
Add Code

Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking

2 code implementations • 1 Aug 2023 • Mingzhan Yang, Guangxin Han, Bin Yan, Wenhua Zhang, Jinqing Qi, Huchuan Lu, Dong Wang

Also, our method shows strong generalization for diverse trackers and scenarios in a plug-and-play and training-free manner.

Ranked #9 on Multi-Object Tracking on DanceTrack

Multi-Object Tracking Multiple Object Tracking +1

6,094

Paper
Code

Tracking Anything in High Quality

1 code implementation • 26 Jul 2023 • Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li

To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.

Ranked #5 on Semi-Supervised Video Object Segmentation on YouTube-VOS 2019 (using extra training data)

Object Semantic Segmentation +3

733

Paper
Code

Topology-Preserving Automatic Labeling of Coronary Arteries via Anatomy-aware Connection Classifier

1 code implementation • 22 Jul 2023 • Zhixing Zhang, Ziwei Zhao, Dong Wang, Shishuang Zhao, Yuhang Liu, Jia Liu, LiWei Wang

Automatic labeling of coronary arteries is an essential task in the practical diagnosis process of cardiovascular diseases.

Anatomy

Paper
Code

Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model

no code implementations • 17 Jul 2023 • Rongke Liu, Dong Wang, Yizhi Ren, Zhen Wang, Kaitian Guo, Qianqian Qin, Xiaolei Liu

Therefore, the attack models in existing MIAs are difficult to effectively train with the knowledge of the target model, resulting in sub-optimal attacks.

Paper
Add Code

Cross-Element Combinatorial Selection for Multi-Element Creative in Display Advertising

no code implementations • 4 Jul 2023 • Wei zhang, Ping Zhang, Jian Dong, Yongkang Wang, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang

The effectiveness of ad creatives is greatly influenced by their visual appearance.

Paper
Add Code

A Collaborative Transfer Learning Framework for Cross-domain Recommendation

no code implementations • 26 Jun 2023 • Wei zhang, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang

The disadvantage of the former is that the data from other domains is not utilized by a single domain model, while the latter leverage all the data from different domains, but the fine-tuned model of transfer learning may trap the model in a local optimum of the source domain, making it difficult to fit the target domain.

Click-Through Rate Prediction Recommendation Systems +1

Paper
Add Code

HSR-Diff:Hyperspectral Image Super-Resolution via Conditional Diffusion Models

no code implementations • 21 Jun 2023 • Chanyue Wu, Dong Wang, Hanyu Mao, Ying Li

Despite the proven significance of hyperspectral images (HSIs) in performing various computer vision tasks, its potential is adversely affected by the low-resolution (LR) property in the spatial domain, resulting from multiple physical factors.

Denoising Image Super-Resolution

Paper
Add Code

Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers

no code implementations • 12 Jun 2023 • AnLan Sun, Zhao Zhang, Meng Lei, Yuting Dai, Dong Wang, LiWei Wang

The coherence loss uses the feature centers generated by the static images to guide the frame attention in the video model.

Video Classification

Paper
Add Code

Graph Based Long-Term And Short-Term Interest Model for Click-Through Rate Prediction

no code implementations • 5 Jun 2023 • Huinan Sun, Guangliang Yu, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang

It consists of a multi-interest graph structure for capturing long-term user behavior, a multi-scenario heterogeneous sequence model for modeling short-term information, then an adaptive fusion mechanism to fused information from long-term and short-term behaviors.

Click-Through Rate Prediction

Paper
Add Code

Safe Offline Reinforcement Learning with Real-Time Budget Constraints

1 code implementation • 1 Jun 2023 • Qian Lin, Bo Tang, Zifan Wu, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang

Aiming at promoting the safe real-world deployment of Reinforcement Learning (RL), research on safe RL has made significant progress in recent years.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Mining Negative Temporal Contexts For False Positive Suppression In Real-Time Ultrasound Lesion Detection

1 code implementation • 29 May 2023 • Haojun Yu, Youcheng Li, Quanlin Wu, Ziwei Zhao, Dengbo Chen, Dong Wang, LiWei Wang

To address this issue, we propose to extract contexts from previous frames, including NTC, with the guidance of inverse optical flow.

Lesion Detection object-detection +2

141

Paper
Code

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

1 code implementation • NeurIPS 2023 • Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li

Specifically, we propose Multi-Task Diffusion Model (\textsc{MTDiff}), a diffusion-based method that incorporates Transformer backbones and prompt learning for generative planning and data synthesis in multi-task offline settings.

Reinforcement Learning (RL)

Paper
Code

Spot keywords from very noisy and mixed speech

no code implementations • 28 May 2023 • Ying Shi, Dong Wang, Lantian Li, Jiqing Han, Shi Yin

We propose a novel Mix Training (MT) strategy that encourages the model to discover low-energy keywords from noisy and mixed speech.

Data Augmentation Keyword Spotting

Paper
Add Code

Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning

1 code implementation • 27 May 2023 • Zhenrui Yue, Huimin Zeng, Mengfei Lan, Heng Ji, Dong Wang

With emerging online topics as a source for numerous new events, detecting unseen / rare event types presents an elusive challenge for existing event detection methods, where only limited data access is provided for training.

Event Detection Meta-Learning

Paper
Code

Ordered and Binary Speaker Embedding

no code implementations • 25 May 2023 • Jiaying Wang, Xianglong Wang, Namin Wang, Lantian Li, Dong Wang

Modern speaker recognition systems represent utterances by embedding vectors.

Clustering Retrieval +2

Paper
Add Code

CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition

no code implementations • 25 May 2023 • Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang

A comprehensive study was conducted to compare CN-Celeb-AV with two popular public AVPR benchmark datasets, and the results demonstrated that CN-Celeb-AV is more in line with real-world scenarios and can be regarded as a new benchmark dataset for AVPR research.

Person Recognition

Paper
Add Code

Neural Image Re-Exposure

1 code implementation • 23 May 2023 • Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Huchuan Lu

In this work, we aim to re-expose the captured photo in post-processing to provide a more flexible way of addressing those issues within a unified framework.

Ranked #4 on Deblurring on GoPro (using extra training data)

Deblurring Joint Deblur and Frame Interpolation +5

Paper
Code

Towards Benchmarking and Assessing Visual Naturalness of Physical World Adversarial Attacks

1 code implementation • CVPR 2023 • Simin Li, Shuing Zhang, Gujun Chen, Dong Wang, Pu Feng, Jiakai Wang, Aishan Liu, Xin Yi, Xianglong Liu

First, to benchmark attack naturalness, we contribute the first Physical Attack Naturalness (PAN) dataset with human rating and gaze.

Adversarial Attack Autonomous Driving +1

Paper
Code

MetaAdapt: Domain Adaptive Few-Shot Misinformation Detection via Meta Learning

1 code implementation • 22 May 2023 • Zhenrui Yue, Huimin Zeng, Yang Zhang, Lanyu Shang, Dong Wang

As such, MetaAdapt can learn how to adapt the misinformation detection model and exploit the source data for improved performance in the target domain.

Meta-Learning Misinformation +1

Paper
Code

Subspace-Configurable Networks

1 code implementation • 22 May 2023 • Olga Saukh, Dong Wang, Xiaoxi He, Lothar Thiele

The obtained subspace is low-dimensional and has a surprisingly simple structure even for complex, non-invertible transformations of the input, leading to an exceptionally high efficiency of subspace-configurable networks (SCNs) when limited storage and computing resources are at stake.

Audio Signal Processing Data Augmentation

Paper
Code

Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking

1 code implementation • CVPR 2023 • Xin Chen, Ben Kang, Jiawen Zhu, Dong Wang, Houwen Peng, Huchuan Lu

In this paper, we introduce a new sequence-to-sequence learning framework for RGB-based and multi-modal object tracking.

Ranked #1 on Rgb-T Tracking on LasHeR

Object Rgb-T Tracking +1

Paper
Code

Label-free timing analysis of SiPM-based modularized detectors with physics-constrained deep learning

no code implementations • 24 Apr 2023 • Pengcheng Ai, Le Xiao, Zhi Deng, Yi Wang, Xiangming Sun, Guangming Huang, Dong Wang, Yulei Li, Xinchi Ran

We mathematically demonstrate the existence of the optimal function desired by the method, and give a systematic algorithm for training and calibration of the model.

Paper
Add Code

MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed

no code implementations • 17 Apr 2023 • Xiaowen Shi, Ze Wang, Yuanying Cai, Xiaoxu Wu, Fan Yang, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang

There are two types of data employed to train reinforcement learning (RL) model for position allocation, named strategy data and random data.

Imitation Learning Position +2

Paper
Add Code

Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results

1 code implementation • 12 Apr 2023 • Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng

Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR2023 workshop.

Face Anti-Spoofing Face Recognition

21,255

Paper
Code

One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field

no code implementations • CVPR 2023 • Weichuang Li, Longhao Zhang, Dong Wang, Bin Zhao, Zhigang Wang, Mulin Chen, Bang Zhang, Zhongjian Wang, Liefeng Bo, Xuelong Li

Talking head generation aims to generate faces that maintain the identity information of the source image and imitate the motion of the driving image.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement

1 code implementation • ICCV 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao

The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its application to diverse downstream vision tasks.

Computational Efficiency Few-Shot Learning

120

Paper
Code

Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction

1 code implementation • ICCV 2023 • Delin Qu, Yizhen Lao, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li

This paper addresses the problem of rolling shutter correction in complex nonlinear and dynamic scenes with extreme occlusion.

Rolling Shutter Correction

Paper
Code

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance

7 code implementations • 29 Mar 2023 • Zoey Guo, Yiwen Tang, Ray Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li

In this paper, we propose ViewRefer, a multi-view framework for 3D visual grounding exploring how to grasp the view knowledge from both text and 3D modalities.

Visual Grounding

Paper
Code

Propagate And Calibrate: Real-time Passive Non-line-of-sight Tracking

no code implementations • CVPR 2023 • Yihao Wang, Zhigang Wang, Bin Zhao, Dong Wang, Mulin Chen, Xuelong Li

In contrast, we propose a purely passive method to track a person walking in an invisible room by only observing a relay wall, which is more in line with real application scenarios, e. g., security.

Paper
Add Code

Visual Prompt Multi-Modal Tracking

1 code implementation • CVPR 2023 • Jiawen Zhu, Simiao Lai, Xin Chen, Dong Wang, Huchuan Lu

To inherit the powerful representations of the foundation model, a natural modus operandi for multi-modal tracking is full fine-tuning on the RGB-based parameters.

Ranked #10 on Rgb-T Tracking on LasHeR

Object Tracking Rgb-T Tracking

227

Paper
Code

Fully Self-Supervised Depth Estimation from Defocus Clue

1 code implementation • CVPR 2023 • Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang, Xuelong Li

We show that our framework circumvents the needs for the depth and AIF image ground-truth, and receives superior predictions, thus closing the gap between the theoretical success of DFD works and their applications in the real world.

Depth Estimation

Paper
Code

Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation

1 code implementation • 17 Mar 2023 • Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu

To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.

Object object-detection +1

Paper
Code

Universal Instance Perception as Object Discovery and Retrieval

1 code implementation • CVPR 2023 • Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu

All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks.

Ranked #1 on Referring Expression Segmentation on RefCoCo val (using extra training data)

Described Object Detection Generalized Referring Expression Comprehension +15

1,438

Paper
Code

PIER: Permutation-Level Interest-Based End-to-End Re-ranking Framework in E-commerce

1 code implementation • 6 Feb 2023 • Xiaowen Shi, Fan Yang, Ze Wang, Xiaoxu Wu, Muzhi Guan, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang

Then we design a novel omnidirectional attention mechanism in OCPM to capture the context information in the permutation.

Re-Ranking

Paper
Code

A Deep Behavior Path Matching Network for Click-Through Rate Prediction

no code implementations • 1 Feb 2023 • Jian Dong, Yisong Yu, Yapeng Zhang, Yimin Lv, Shuli Wang, Beihong Jin, Yongkang Wang, Xingxing Wang, Dong Wang

User behaviors on an e-commerce app not only contain different kinds of feedback on items but also sometimes imply the cognitive clue of the user's decision-making.

Click-Through Rate Prediction Contrastive Learning +1

Paper
Add Code

Decision-Making Context Interaction Network for Click-Through Rate Prediction

no code implementations • 29 Jan 2023 • Xiang Li, Shuwei Chen, Jian Dong, Jin Zhang, Yongkang Wang, Xingxing Wang, Dong Wang

Click-through rate (CTR) prediction is crucial in recommendation and online advertising systems.

Click-Through Rate Prediction Decision Making

Paper
Add Code

Representation Learning for Visual Object Tracking by Masked Appearance Transfer

1 code implementation • CVPR 2023 • Haojie Zhao, Dong Wang, Huchuan Lu

However, for the template, we make the decoder reconstruct the target appearance within the search region.

Representation Learning Visual Object Tracking

Paper
Code

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding

no code implementations • ICCV 2023 • Zoey Guo, Yiwen Tang, Ray Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li

In this paper, we propose ViewRefer, a multi-view framework for 3D visual grounding exploring how to grasp the view knowledge from both text and 3D modalities.

Visual Grounding

Paper
Add Code

HSR-Diff: Hyperspectral Image Super-Resolution via Conditional Diffusion Models

no code implementations • ICCV 2023 • Chanyue Wu, Dong Wang, Yunpeng Bai, Hanyu Mao, Ying Li, Qiang Shen

Denoising Hyperspectral Image Super-Resolution +1

Paper
Add Code

Direct Heterogeneous Causal Learning for Resource Allocation Problems in Marketing

no code implementations • 28 Nov 2022 • Hao Zhou, Shaoming Li, Guibin Jiang, Jiaqi Zheng, Dong Wang

Our key intuition is that we introduce the decision factor to establish a bridge between ML and OR such that the solution can be directly obtained in OR by only performing the sorting or comparison operations on the decision factor.

Decision Making Marketing

Paper
Add Code

Wasserstein Archetypal Analysis

no code implementations • 25 Oct 2022 • Katy Craig, Braxton Osting, Dong Wang, Yiming Xu

We prove a consistency result for the regularized problem, ensuring that if the data are iid samples from a probability measure, then as the number of samples is increased, a subsequence of the archetype points converges to the archetype points for the limiting data distribution, almost surely.

Paper
Add Code

QA Domain Adaptation using Hidden Space Augmentation and Self-Supervised Contrastive Adaptation

1 code implementation • 19 Oct 2022 • Zhenrui Yue, Huimin Zeng, Bernhard Kratzwald, Stefan Feuerriegel, Dong Wang

Unlike existing approaches, we generate pseudo labels and propose to train the model via a novel attention-based contrastive adaptation method.

Contrastive Learning Data Augmentation +2

Paper
Code

Unsupervised Domain Adaptation for COVID-19 Information Service with Contrastive Adversarial Domain Mixup

no code implementations • 6 Oct 2022 • Huimin Zeng, Zhenrui Yue, Ziyi Kou, Lanyu Shang, Yang Zhang, Dong Wang

Moreover, we leverage the power of domain adversarial examples to establish an intermediate domain mixup, where the latent representations of the input text from both domains could be mixed during the training process.

Contrastive Learning Misinformation +1

Paper
Add Code

On Attacking Out-Domain Uncertainty Estimation in Deep Neural Networks

no code implementations • 3 Oct 2022 • Huimin Zeng, Zhenrui Yue, Yang Zhang, Ziyi Kou, Lanyu Shang, Dong Wang

In many applications with real-world consequences, it is crucial to develop reliable uncertainty estimation for the predictions made by the AI decision systems.

Adversarial Attack

Paper
Add Code

Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression

no code implementations • 26 Sep 2022 • Tingyu Fan, Linyao Gao, Yiling Xu, Dong Wang, Zhu Li

Besides, we propose a residual coding framework for the compression of the latent variable, which explores the spatial correlation of each layer by progressive downsampling, and model the corresponding residual with a fully-factorized entropy model.

Paper
Add Code

Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection

no code implementations • 13 Sep 2022 • Ziwei Zhao, Dong Wang, Yihong Chen, Ziteng Wang, LiWei Wang

In mammogram mass detection, modeling pairwise lesion correspondence explicitly is particularly important.

Lesion Detection

Paper
Add Code

PointScatter: Point Set Representation for Tubular Structure Extraction

1 code implementation • 13 Sep 2022 • Dong Wang, Zhao Zhang, Ziwei Zhao, Yuhang Liu, Yihong Chen, LiWei Wang

Inspired by this, we propose PointScatter, an alternative to the segmentation models for the tubular structure extraction task.

Segmentation

Paper
Code

Domain Adaptation for Question Answering via Question Classification

1 code implementation • COLING 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

In this work, we investigate the potential benefits of question classification for QA domain adaptation.

Domain Adaptation General Classification +1

Paper
Code

TFN: An Interpretable Neural Network with Time-Frequency Transform Embedded for Intelligent Fault Diagnosis

1 code implementation • 5 Sep 2022 • Qian Chen, Xingjian Dong, Guowei Tu, Dong Wang, Baoxuan Zhao, Zhike Peng

However, the CNN is a typical black-box model, and the mechanism of CNN's decision-making are not clear, which limits its application in high-reliability-required fault diagnosis scenarios.

Decision Making

Paper
Code

Contrastive Domain Adaptation for Early Misinformation Detection: A Case Study on COVID-19

2 code implementations • 20 Aug 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

However, early misinformation often demonstrates both conditional and label shifts against existing misinformation data (e. g., class imbalance in COVID-19 datasets), rendering such methods less effective for detecting early misinformation.

Domain Adaptation Misinformation

Paper
Code

Task Aligned Meta-learning based Augmented Graph for Cold-Start Recommendation

no code implementations • 11 Aug 2022 • Yuxiang Shi, Yue Ding, Bo Chen, YuYang Huang, Ruiming Tang, Dong Wang

In this paper, we propose a Task aligned Meta-learning based Augmented Graph (TMAG) to address cold-start recommendation.

Meta-Learning Recommendation Systems

Paper
Add Code

Defending Substitution-Based Profile Pollution Attacks on Sequential Recommenders

1 code implementation • 19 Jul 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

Additionally, we design an adversarial training method tailored for sequential recommender systems.

Adversarial Attack Adversarial Defense +1

Paper
Code

Towards Grand Unification of Object Tracking

1 code implementation • 14 Jul 2022 • Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu

We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters.

Ranked #2 on Multi-Object Tracking and Segmentation on BDD100K val

Multi-Object Tracking Multi-Object Tracking and Segmentation +3

942

Paper
Code

SRRT: Search Region Regulation Tracking

no code implementations • 10 Jul 2022 • Jiawen Zhu, Xin Chen, Pengyu Zhang, Xinying Wang, Dong Wang, Wenda Zhao, Huchuan Lu

Trackers tend to lose the target object due to the limited search region or be interfered with by distractors due to the excessive search region.

Paper
Add Code

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

3 code implementations • 28 May 2022 • Renrui Zhang, Ziyu Guo, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li, Peng Gao

By fine-tuning on downstream tasks, Point-M2AE achieves 86. 43% accuracy on ScanObjectNN, +3. 36% to the second-best, and largely benefits the few-shot classification, part segmentation and 3D object detection with the hierarchical pre-training scheme.

Ranked #4 on 3D Point Cloud Linear Classification on ModelNet40 (using extra training data)

3D Object Detection 3D Point Cloud Linear Classification +5

198

Paper
Code

Vision-based Anti-UAV Detection and Tracking

1 code implementation • 22 May 2022 • Jie Zhao, Jingshu Zhang, Dongdong Li, Dong Wang

It contains a detection dataset with a total of 10, 000 images and a tracking dataset with 20 videos that include short-term and long-term sequences.

Paper
Code

NMA: Neural Multi-slot Auctions with Externalities for Online Advertising

no code implementations • 20 May 2022 • Guogang Liao, Xuejian Li, Ze Wang, Fan Yang, Muzhi Guan, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang

Although VCG-based multi-slot auctions (e. g., VCG, WVCG) make it theoretically possible to model global externalities (e. g., the order and positions of ads and so on), they lack an efficient balance of both revenue and social welfare.

Paper
Add Code

D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction

1 code implementation • 2 May 2022 • Tingyu Fan, Linyao Gao, Yiling Xu, Zhu Li, Dong Wang

This paper proposes a novel 3D sparse convolution-based Deep Dynamic Point Cloud Compression (D-DPCC) network to compensate and compress the DPC geometry with 3D motion estimation and motion compensation in the feature space.

Motion Compensation Motion Estimation +2

Paper
Code

Evolving Programmable Computational Metamaterials

1 code implementation • 19 Apr 2022 • Atoosa Parsa, Dong Wang, Corey S. O'Hern, Mark D. Shattuck, Rebecca Kramer-Bottiglio, Josh Bongard

Granular metamaterials are a promising choice for the realization of mechanical computing devices.

Paper
Code

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

1 code implementation • CVPR 2022 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiang Ruan

With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to achieve robust performance and wider application scenarios with the guidance of objects' temperature information.

Ranked #2 on Rgb-T Tracking on GTOT

Attribute Object Tracking +1

Paper
Code

Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks

no code implementations • 2 Apr 2022 • Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang

With the recent prevalence of reinforcement learning (RL), there have been tremendous interests in utilizing RL for ads allocation in recommendation platforms (e. g., e-commerce and news feed sites).

Contrastive Learning Reinforcement Learning (RL)

Paper
Add Code

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation

no code implementations • 2 Apr 2022 • Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang

Ads allocation, which involves allocating ads and organic items to limited slots in feed with the purpose of maximizing platform revenue, has become a research hotspot.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation

no code implementations • 1 Apr 2022 • Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang

A mixed list of ads and organic items is usually displayed in feed and how to allocate the limited slots to maximize the overall revenue is a key problem.

Click-Through Rate Prediction reinforcement-learning +1

Paper
Add Code

Efficient Localness Transformer for Smart Sensor-Based Energy Disaggregation

no code implementations • 29 Mar 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

Modern smart sensor-based energy management systems leverage non-intrusive load monitoring (NILM) to predict and optimize appliance load distribution in real-time.

energy management Inductive Bias +2

Paper
Add Code

Balanced Multimodal Learning via On-the-fly Gradient Modulation

1 code implementation • CVPR 2022 • Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu

Multimodal learning helps to comprehensively understand the world, by integrating different senses.

198

Paper
Code

Efficient Visual Tracking via Hierarchical Cross-Attention Transformer

1 code implementation • 25 Mar 2022 • Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu

Most state-of-the-art trackers are satisfied with the real-time speed on powerful GPUs.

Visual Tracking

Paper
Code

High-Performance Transformer Tracking

1 code implementation • 25 Mar 2022 • Xin Chen, Bin Yan, Jiawen Zhu, Huchuan Lu, Xiang Ruan, Dong Wang

First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head.

Vocal Bursts Intensity Prediction

Paper
Code

Neural Topic Modeling with Deep Mutual Information Estimation

no code implementations • 12 Mar 2022 • Kang Xu, Xiaoqiu Lu, Yuan-Fang Li, Tongtong Wu, Guilin Qi, Ning Ye, Dong Wang, Zheng Zhou

NTM-DMIE is a neural network method for topic learning which maximizes the mutual information between the input documents and their latent topic representation.

Mutual Information Estimation Text Clustering +1

Paper
Add Code

Rényi State Entropy for Exploration Acceleration in Reinforcement Learning

1 code implementation • 8 Mar 2022 • Mingqi Yuan, Man-on Pun, Dong Wang

One of the most critical challenges in deep reinforcement learning is to maintain the long-term exploration capability of the agent.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Show, Deconfound and Tell: Image Captioning With Causal Inference

1 code implementation • CVPR 2022 • Bing Liu, Dong Wang, Xu Yang, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao

In the encoding stage, the IOD is able to disentangle the region-based visual features by deconfounding the visual confounder.

Causal Inference Image Captioning

Paper
Code

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation

1 code implementation • 21 Oct 2021 • Yepeng Liu, Zaiwang Gu, Shenghua Gao, Dong Wang, Yusheng Zeng, Jun Cheng

Very often, the pose is estimated after the face detection.

Face Detection Face Recognition +1

133

Paper
Code

Real Additive Margin Softmax for Speaker Verification

1 code implementation • 18 Oct 2021 • Lantian Li, Ruiqian Nai, Dong Wang

The additive margin softmax (AM-Softmax) loss has delivered remarkable performance in speaker verification.

Speaker Verification

Paper
Code

CycleFlow: Purify Information Factors by Cycle Loss

no code implementations • 18 Oct 2021 • Haoran Sun, Chen Chen, Lantian Li, Dong Wang

SpeechFlow is a powerful factorization model based on information bottleneck (IB), and its effectiveness has been reported by several studies.

Voice Conversion

Paper
Add Code

Generating High-Fidelity Privacy-Conscious Synthetic Patient Data for Causal Effect Estimation with Multiple Treatments

no code implementations • 29 Sep 2021 • Jingpu Shi, Dong Wang, Gino Tesei, Beau Norgeot

Validation of these models, however, has been a challenge because the ground truth is unknown: only one treatment-outcome pair for each person can be observed.

Causal Inference

Paper
Add Code

Speech-MLP: a simple MLP architecture for speech processing

no code implementations • 29 Sep 2021 • Chao Xing, Dong Wang, LiRong Dai, Qun Liu, Anderson Avila

Overparameterized transformer-based architectures have shown remarkable performance in recent years, achieving state-of-the-art results in speech processing tasks such as speech recognition, speech synthesis, keyword spotting, and speech enhancement et al.

Keyword Spotting Speech Enhancement +3

Paper
Add Code

Extracting Attentive Social Temporal Excitation for Sequential Recommendation

no code implementations • 28 Sep 2021 • Yunzhe Li, Yue Ding, Bo Chen, Xin Xin, Yule Wang, Yuxiang Shi, Ruiming Tang, Dong Wang

In this paper, we propose a novel time-aware sequential recommendation framework called Social Temporal Excitation Networks (STEN), which introduces temporal point processes to model the fine-grained impact of friends' behaviors on the user s dynamic interests in an event-level direct paradigm.

Collaborative Filtering Graph Embedding +2

Paper
Add Code

ICPE: An Item Cluster-Wise Pareto-Efficient Framework for Recommendation Debiasing

no code implementations • 27 Sep 2021 • Yule Wang, Xin Xin, Yue Ding, Yunzhe Li, Dong Wang

In detail, we define our item cluster-wise optimization target as the recommender model should balance all item clusters that differ in popularity, thus we set the model learning on each item cluster as a unique optimization objective.

counterfactual Counterfactual Inference +2

Paper
Add Code

DemiNet: Dependency-Aware Multi-Interest Network with Self-Supervised Graph Learning for Click-Through Rate Prediction

no code implementations • 26 Sep 2021 • Yule Wang, Qiang Luo, Yue Ding, Yunzhe Li, Dong Wang, Hongbo Deng

In this paper, we propose a novel model named DemiNet (short for DEpendency-Aware Multi-Interest Network) to address the above two issues.

Click-Through Rate Prediction Denoising +2

Paper
Add Code

Cross DQN: Cross Deep Q Network for Ads Allocation in Feed

1 code implementation • 9 Sep 2021 • Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang

Our model results in higher revenue and better user experience than state-of-the-art baselines in offline experiments.

Paper
Code

TPRM: A Topic-based Personalized Ranking Model for Web Search

no code implementations • 13 Aug 2021 • Minghui Huang, Wei Peng, Dong Wang

Ranking models have achieved promising results, but it remains challenging to design personalized ranking systems to leverage user profiles and semantic representations between queries and documents.

Document Ranking

Paper
Add Code

GQE-PRF: Generative Query Expansion with Pseudo-Relevance Feedback

no code implementations • 13 Aug 2021 • Minghui Huang, Dong Wang, Shuang Liu, Meizhen Ding

To leverage the strength of text generation for information retrieval, in this article, we propose a novel approach which effectively integrates text generation models into PRF-based query expansion.

Information Retrieval Retrieval +1

Paper
Add Code

Probabilistic methods for approximate archetypal analysis

no code implementations • 12 Aug 2021 • Ruijian Han, Braxton Osting, Dong Wang, Yiming Xu

Archetypal analysis is an unsupervised learning method for exploratory data analysis.

Paper
Add Code

Video Annotation for Visual Tracking via Selection and Refinement

1 code implementation • ICCV 2021 • Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang

Deep learning based visual trackers entail offline pre-training on large volumes of video datasets with accurate bounding box annotations that are labor-expensive to achieve.

Visual Tracking

Paper
Code

OLR 2021 Challenge: Datasets, Rules and Baselines

no code implementations • 23 Jul 2021 • Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang

In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning

no code implementations • 19 Jul 2021 • Mingqi Yuan, Mon-on Pun, Dong Wang, Yi Chen, Haojun Li

Furthermore, we leverage a variational auto-encoder (VAE) model to capture the life-long novelty of states, which is combined with the global JFI score to form multimodal intrinsic rewards.

Efficient Exploration Fairness +2

Paper
Add Code

Gradient Importance Learning for Incomplete Observations

1 code implementation • ICLR 2022 • Qitong Gao, Dong Wang, Joshua D. Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic

Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification.

Imputation Reinforcement Learning (RL) +2

Paper
Code

Oriental Language Recognition (OLR) 2020: Summary and Analysis

no code implementations • 5 Jul 2021 • Jing Li, Binling Wang, Yiming Zhi, Zheng Li, Lin Li, Qingyang Hong, Dong Wang

The fifth Oriental Language Recognition (OLR) Challenge focuses on language recognition in a variety of complex environments to promote its development.

Dialect Identification valid

Paper
Add Code

Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization

1 code implementation • 2 Jul 2021 • Qing Guo, Junya Chen, Dong Wang, Yuewei Yang, Xinwei Deng, Lawrence Carin, Fan Li, Jing Huang, Chenyang Tao

Successful applications of InfoNCE and its variants have popularized the use of contrastive variational mutual information (MI) estimators in machine learning.

Mutual Information Estimation

Paper
Code

CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding

1 code implementation • ACL 2021 • Dong Wang, Ning Ding, Piji Li, Hai-Tao Zheng

Recent works aimed to improve the robustness of pre-trained models mainly focus on adversarial training from perturbed examples with similar semantics, neglecting the utilization of different or even opposite semantics.

Contrastive Learning Natural Language Understanding +3

Paper
Code

AOMD: An Analogy-aware Approach to Offensive Meme Detection on Social Media

no code implementations • 21 Jun 2021 • Lanyu Shang, Yang Zhang, Yuheng Zha, Yingxi Chen, Christina Youn, Dong Wang

To address the above challenges, we develop a deep learning based Analogy-aware Offensive Meme Detection (AOMD) framework to learn the implicit analogy from the multi-modal contents of the meme and effectively detect offensive analogy memes.

Paper
Add Code

Voting for the right answer: Adversarial defense for speaker verification

1 code implementation • 15 Jun 2021 • Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang, Hung-Yi Lee

Automatic speaker verification (ASV) is a well developed technology for biometric identification, and has been ubiquitous implemented in security-critic applications, such as banking and access control.

Adversarial Defense Speaker Verification

Paper
Code

A Dataset And Benchmark Of Underwater Object Detection For Robot Picking

no code implementations • 10 Jun 2021 • Chongwei Liu, Haojie Li, Shuchang Wang, Ming Zhu, Dong Wang, Xin Fan, Zhihui Wang

Towards these challenges we introduce a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark, based on the collection and re-annotation of all relevant datasets.

object-detection Object Detection

Paper
Add Code

A Streaming End-to-End Framework For Spoken Language Understanding

no code implementations • 20 May 2021 • Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen

In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.

Intent Detection Keyword Spotting +3

Paper
Add Code

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search

1 code implementation • CVPR 2021 • Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu

Object tracking has achieved significant progress over the past few years.

Neural Architecture Search Object +1

381

Paper
Code

Learning Spatio-Temporal Transformer for Visual Tracking

1 code implementation • ICCV 2021 • Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu

In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component.

Ranked #18 on Visual Object Tracking on TrackingNet

Visual Object Tracking Visual Tracking

610

Paper
Code

Transformer Tracking

1 code implementation • CVPR 2021 • Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu

The correlation operation is a simple fusion manner to consider the similarity between the template and the search region.

Ranked #5 on Visual Tracking on TNL2K

Visual Object Tracking Visual Tracking

554

Paper
Code

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

1 code implementation • 19 Mar 2021 • Zhe Xie, Chengxuan Liu, Yichi Zhang, Hongtao Lu, Dong Wang, Yue Ding

To solve the above problem, in this work, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation.

Collaborative Filtering Sequential Recommendation

Paper
Code

FAST-Dynamic-Vision: Detection and Tracking Dynamic Objects with Event and Depth Sensing

1 code implementation • 10 Mar 2021 • Botao He, Haojia Li, Siyuan Wu, Dong Wang, Zhiwei Zhang, Qianli Dong, Chao Xu, Fei Gao

The bottleneck of solving this problem is the accurate perception of rapid dynamic objects.

Motion Compensation Robust Object Detection +1 Robotics

126

Paper
Code

Pyramid Spatial-Temporal Aggregation for Video-Based Person Re-Identification

1 code implementation • ICCV 2021 • Yingquan Wang, Pingping Zhang, Shang Gao, Xia Geng, Hu Lu, Dong Wang

Video-based person re-identification aims to associate the video clips of the same person across multiple non-overlapping cameras.

Ranked #1 on Person Re-Identification on DukeMTMC-VideoReID

Video-Based Person Re-Identification

Paper
Code

Towards Robust and Efficient Contrastive Textual Representation Learning

no code implementations • 1 Jan 2021 • Liqun Chen, Yizhe Zhang, Dianqi Li, Chenyang Tao, Dong Wang, Lawrence Carin

There has been growing interest in representation learning for text data, based on theoretical arguments and empirical evidence.

Contrastive Learning Representation Learning

Paper
Add Code

eTREE: Learning Tree-structured Embeddings

1 code implementation • 20 Dec 2020 • Faisal M. Almutairi, Yunlong Wang, Dong Wang, Emily Zhao, Nicholas D. Sidiropoulos

In many applications, the categories of items exhibit a hierarchical tree structure.

Recommendation Systems

Paper
Code

Wasserstein Contrastive Representation Distillation

no code implementations • CVPR 2021 • Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin

The primary goal of knowledge distillation (KD) is to encapsulate the information of a model learned from a teacher network into a student network, with the latter being more compact than the former.

Ranked #10 on Knowledge Distillation on CIFAR-100

Contrastive Learning Knowledge Distillation +2

Paper
Add Code

Temporal Relational Modeling with Self-Supervision for Action Segmentation

1 code implementation • 14 Dec 2020 • Dong Wang, Di Hu, Xingjian Li, Dejing Dou

The main reason is that large number of nodes (i. e., video frames) makes GCNs hard to capture and model temporal relations in videos.

Ranked #23 on Action Segmentation on Breakfast

Action Recognition Action Segmentation +1

Paper
Code

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation

1 code implementation • CVPR 2021 • Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang

Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation.

Ranked #15 on Semi-Supervised Video Object Segmentation on VOT2020

Semi-Supervised Video Object Segmentation Visual Object Tracking

188

Paper
Code

Multi-modal Visual Tracking: Review and Experimental Comparison

2 code implementations • 8 Dec 2020 • Pengyu Zhang, Dong Wang, Huchuan Lu

Visual object tracking, as a fundamental task in computer vision, has drawn much attention in recent years.

Rgb-T Tracking Visual Object Tracking

Paper
Code

Proactive Pseudo-Intervention: Causally Informed Contrastive Learning For Interpretable Vision Models

no code implementations • 6 Dec 2020 • Dong Wang, Yuewei Yang, Chenyang Tao, Zhe Gan, Liqun Chen, Fanjie Kong, Ricardo Henao, Lawrence Carin

Deep neural networks excel at comprehending complex visual signals, delivering on par or even superior performance to that of human experts.

Contrastive Learning Data Integration +1

Paper
Add Code

Integrating User History into Heterogeneous Graph for Dialogue Act Recognition

no code implementations • COLING 2020 • Dong Wang, Ziran Li, Haitao Zheng, Ying Shen

Dialogue Act Recognition (DAR) is a challenging problem in Natural Language Understanding, which aims to attach Dialogue Act (DA) labels to each utterance in a conversation.

Dialogue Act Classification

Paper
Add Code

Summarize before Aggregate: A Global-to-local Heterogeneous Graph Inference Network for Conversational Emotion Recognition

no code implementations • COLING 2020 • Dongming Sheng, Dong Wang, Ying Shen, Haitao Zheng, Haozhuang Liu

Local dependencies, which captures short-term emotional effects between neighbouring utterances, are further injected via an Aggregation Graph to distinguish the subtle differences between utterances containing emotional phrases.

Ranked #29 on Emotion Recognition in Conversation on IEMOCAP

Emotion Recognition in Conversation

Paper
Add Code

Can We Trust Deep Speech Prior?

no code implementations • 4 Nov 2020 • Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han

Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture.

Speech Enhancement

Paper
Add Code

Deep generative LDA

1 code implementation • 30 Oct 2020 • Yunqi Cai, Dong Wang

Limited by its linear form and the underlying Gaussian assumption, however, LDA is not applicable in situations where the data distribution is complex.

Dimensionality Reduction Speaker Recognition

Paper
Code

Deep Speaker Vector Normalization with Maximum Gaussianality Training

1 code implementation • 30 Oct 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel

In this paper, we argue that this problem is largely attributed to the maximum-likelihood (ML) training criterion of the DNF model, which aims to maximize the likelihood of the observations but not necessarily improve the Gaussianality of the latent codes.

Speaker Recognition

Paper
Code

Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification

no code implementations • 27 Oct 2020 • Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang

Domain mismatch often occurs in real applications and causes serious performance reduction on speaker verification systems.

Speaker Verification

Paper
Add Code

Deep generative factorization for speech signal

no code implementations • 27 Oct 2020 • Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang

Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks.

Paper
Add Code

Consistency of archetypal analysis

no code implementations • 16 Oct 2020 • Braxton Osting, Dong Wang, Yiming Xu, Dominique Zosso

Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data.

Paper
Add Code

Remarks on Optimal Scores for Speaker Recognition

no code implementations • 10 Oct 2020 • Dong Wang

In this article, we first establish the theory of optimal scores for speaker recognition.

Speaker Identification Speaker Recognition +1

Paper
Add Code

WANA: Symbolic Execution of Wasm Bytecode for Cross-Platform Smart Contract Vulnerability Detection

1 code implementation • 30 Jul 2020 • Dong Wang, Bo Jiang, W. K. Chan

Furthermore, WANA proposes a set of test oracles to detect the vulnerabilities in EOSIO and Ethereum smart contracts based on WebAssembly bytecode analysis.

Software Engineering D.2.5

Paper
Code

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation

1 code implementation • 4 Jul 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang

In recent years, the multiple-stage strategy has become a popular trend for visual tracking.

Visual Tracking

188

Paper
Code

Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking

no code implementations • 4 Jul 2020 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang

In this study, we propose a novel RGB-T tracking framework by jointly modeling both appearance and motion cues.

Ranked #4 on Rgb-T Tracking on GTOT

Rgb-T Tracking

Paper
Add Code

A Characteristic Function-based Algorithm for Geodesic Active Contours

no code implementations • 1 Jul 2020 • Jun Ma, Dong Wang, Xiao-Ping Wang, Xiaoping Yang

Active contour models have been widely used in image segmentation, and the level set method (LSM) is the most popular approach for solving the models, via implicitly representing the contour by a level set function.

Image Segmentation Lesion Segmentation +2

Paper
Add Code

AP20-OLR Challenge: Three Tasks and Their Baselines

no code implementations • 4 Jun 2020 • Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Li-Ming Song, Cheng Yang

Based on Kaldi and Pytorch, recipes for i-vector and x-vector systems are also conducted as baselines for the three tasks.

Dialect Identification

Paper
Add Code

DASC: Towards A Road Damage-Aware Social-Media-Driven Car Sensing Framework for Disaster Response Applications

no code implementations • 4 Jun 2020 • Md Tahmid Rashid, Daniel, Zhang, Dong Wang

iii) How to efficiently guide the cars to the event locations with little prior knowledge of the road damage caused by the disaster, while also handling the dynamics of the physical world and social media?

Disaster Response

Paper
Add Code

Improve bone age assessment by learning from anatomical local regions

no code implementations • 27 May 2020 • Dong Wang, Kexin Zhang, Jia Ding, Li-Wei Wang

In the clinical practice, Tanner and Whitehouse (TW2) method is a widely-used method for radiologists to perform BAA.

Paper
Add Code

An efficient iterative method for reconstructing surface from point clouds

no code implementations • 25 May 2020 • Dong Wang

In this paper, we develop an efficient iterative method on a variational model for the surface reconstruction from point clouds.

Surface Reconstruction

Paper
Add Code

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning

1 code implementation • 25 May 2020 • Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng

Domain generalization remains a critical problem for speaker recognition, even with the state-of-the-art architectures based on deep neural nets.

Audio and Speech Processing

Paper
Code

Conversational Word Embedding for Retrieval-Based Dialog System

1 code implementation • ACL 2020 • Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, Guoping Hu

Human conversations contain many types of information, e. g., knowledge, common sense, and language habits.

Common Sense Reasoning Machine Translation +4

Paper
Code

Stabilizing Training of Generative Adversarial Nets via Langevin Stein Variational Gradient Descent

no code implementations • 22 Apr 2020 • Dong Wang, Xiaoqian Qin, Fengyi Song, Li Cheng

Generative adversarial networks (GANs), famous for the capability of learning complex underlying data distribution, are however known to be tricky in the training process, which would probably result in mode collapse or performance deterioration.

Variational Inference

Paper
Add Code

CovidSens: A Vision on Reliable Social Sensing for COVID-19

no code implementations • 9 Apr 2020 • Md Tahmid Rashid, Dong Wang

In this vision paper, we discuss the roles of CovidSens and identify potential challenges in developing reliable social sensing based risk alert systems.

Misinformation

Paper
Add Code

Deep Normalization for Speaker Vectors

1 code implementation • 7 Apr 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel

Deep speaker embedding has demonstrated state-of-the-art performance in speaker recognition tasks.

Speaker Recognition

Paper
Code

High-Performance Long-Term Tracking with Meta-Updater

2 code implementations • CVPR 2020 • Kenan Dai, Yunhua Zhang, Dong Wang, Jianhua Li, Huchuan Lu, Xiaoyun Yang

Most top-ranked long-term trackers adopt the offline-trained Siamese architectures, thus, they cannot benefit from great progress of short-term trackers with online update.

Ranked #10 on Visual Object Tracking on LaSOT-ext

Visual Object Tracking Visual Tracking +1

257

Paper
Code

Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises

1 code implementation • CVPR 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang

An effective and efficient perturbation generator is trained with a carefully designed adversarial loss, which can simultaneously cool hot regions where the target exists on the heatmaps and force the predicted bounding box to shrink, making the tracked target invisible to trackers.

Adversarial Attack

Paper
Code

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

no code implementations • CVPR 2020 • Dong Wang, Yuan Zhang, Kexin Zhang, Li-Wei Wang

Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine.

Lung Nodule Detection medical image detection

Paper
Add Code

Group Activity Recognition by Using Effective Multiple Modality Relation Representation With Temporal-Spatial Attention

no code implementations • IEEE Access 2020 • Dezhong Xu, HENG FU, Lifang Wu, Meng Jian, Dong Wang, AND XU LIU

Then, we propose two types of inference models, opt-GRU and relation-GRU, which are used to encode the object relationship and motion representation effectively, and form the discriminative frame-level feature representation.

Ranked #4 on Group Activity Recognition on Volleyball

Autonomous Vehicles Group Activity Recognition +3

Paper
Add Code

Graph Representation Learning for Merchant Incentive Optimization in Mobile Payment Marketing

no code implementations • 27 Feb 2020 • Ziqi Liu, Dong Wang, Qianyu Yu, Zhiqiang Zhang, Yue Shen, Jian Ma, Wenliang Zhong, Jinjie Gu, Jun Zhou, Shuang Yang, Yuan Qi

In this paper, we present a graph representation learning method atop of transaction networks for merchant incentive optimization in mobile payment marketing.

Graph Representation Learning Marketing

Paper
Add Code

Deep Variational Luenberger-type Observer for Stochastic Video Prediction

no code implementations • 12 Feb 2020 • Dong Wang, Feng Zhou, Zheng Yan, Guang Yao, Zongxuan Liu, Wennan Ma, Cewu Lu

Our model builds upon an variational encoder which transforms the input video into a latent feature space and a Luenberger-type observer which captures the dynamic evolution of the latent features.

Representation Learning Video Prediction +1

Paper
Add Code

Curriculum Audiovisual Learning

no code implementations • 26 Jan 2020 • Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou

Associating sound and its producer in complex audiovisual scene is a challenging task, especially when we are lack of annotated training data.

Clustering

Paper
Add Code

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard

no code implementations • 20 Dec 2019 • Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar

21 teams submit results for Task 1, 23 teams submit results for Task 2, 24 teams submit results for Task 3, and 13 teams submit results for Task 4.

Line Detection Task 2

Paper
Add Code

On Fenchel Mini-Max Learning

1 code implementation • NeurIPS 2019 • Chenyang Tao, Liqun Chen, Shuyang Dai, Junya Chen, Ke Bai, Dong Wang, Jianfeng Feng, Wenlian Lu, Georgiy Bobashev, Lawrence Carin

Inference, estimation, sampling and likelihood evaluation are four primary goals of probabilistic modeling.

Density Estimation

Paper
Code

OpenArray v1.0: a simple operator library for the decoupling of ocean modeling and parallel computing

1 code implementation • Geoscientific Model Development 2019 • Xiaomeng Huang, Xing Huang, Dong Wang, Qi Wu, Yi Li, Shixun Zhang, YuWen Chen, Mingqing Wang, Yuan Gao, Qiang Tang, Yue Chen, Zheng Fang, Zhenya Song, Guangwen Yang

In this work, we design a simple computing library to bridge the gap and decouple the work of ocean modeling from parallel computing.

334

Paper
Code

Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

no code implementations • 10 Nov 2019 • Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shijing Si, Dinghan Shen, Dong Wang, Lawrence Carin

Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks.

Machine Translation Natural Language Understanding +2

Paper
Add Code

ROI Pooled Correlation Filters for Visual Tracking

1 code implementation • CVPR 2019 • Yuxuan Sun, Chong Sun, Dong Wang, You He, Huchuan Lu

The ROI (region-of-interest) based pooling method performs pooling operations on the cropped ROI regions for various samples and has shown great success in the object detection methods.

object-detection Object Detection +1

Paper
Code

CN-CELEB: a challenging Chinese speaker recognition dataset

2 code implementations • 31 Oct 2019 • Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang

These datasets tend to deliver over optimistic performance and do not meet the request of research on speaker recognition in unconstrained conditions.

Speaker Recognition

Paper
Code

On Investigation of Unsupervised Speech Factorization Based on Normalization Flow

no code implementations • 29 Oct 2019 • Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang

Speech signals are complex composites of various information, including phonetic content, speaker traits, channel effect, etc.

Paper
Add Code

VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results

1 code implementation • International Conference on Computer Vision Workshops 2019 • Dawei Du, Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Lin, QinGhua Hu, Tao Peng, Jiayu Zheng, Xinyao Wang, Yue Zhang, Liefeng Bo, Hailin Shi, Rui Zhu, Aashish Kumar, Aijin Li, Almaz Zinollayev, Anuar Askergaliyev, Arne Schumann, Binjie Mao, Byeongwon Lee, Chang Liu, Changrui Chen, Chunhong Pan, Chunlei Huo, Da Yu, Dechun Cong, Dening Zeng, Dheeraj Reddy Pailla, Di Li, Dong Wang, Donghyeon Cho, Dongyu Zhang, Furui Bai, George Jose, Guangyu Gao, Guizhong Liu, Haitao Xiong, Hao Qi, Haoran Wang, Heqian Qiu, Hongliang Li, Huchuan Lu, Ildoo Kim, Jaekyum Kim, Jane Shen, Jihoon Lee, Jing Ge, Jingjing Xu, Jingkai Zhou, Jonas Meier, Jun Won Choi, Junhao Hu, Junyi Zhang, Junying Huang, Kaiqi Huang, Keyang Wang, Lars Sommer, Lei Jin, Lei Zhang

Results of 33 object detection algorithms are presented.

Object object-detection +1

12,059

Paper
Code

GradNet: Gradient-Guided Network for Visual Object Tracking

2 code implementations • ICCV 2019 • Peixia Li, Bo-Yu Chen, Wanli Ouyang, Dong Wang, Xiaoyun Yang, Huchuan Lu

In this work, we propose a novel gradient-guided network to exploit the discriminative information in gradients and update the template in the siamese network through feed-forward and backward operations.

Ranked #3 on Visual Object Tracking on OTB-2015 (Precision metric)

Object Template Matching +2

Paper
Code

An Online Reinforcement Learning Approach to Quality-Cost-Aware Task Allocation for Multi-Attribute Social Sensing

no code implementations • 11 Sep 2019 • Yang Zhang, Daniel Zhang, Nathan Vance, Dong Wang

Social sensing has emerged as a new sensing paradigm where humans (or devices on their behalf) collectively report measurements about the physical world.

Attribute

Paper
Add Code

'Skimming-Perusal' Tracking: A Framework for Real-Time and Robust Long-term Tracking

1 code implementation • ICCV 2019 • Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang

In this work, we present a novel robust and real-time long-term tracking framework based on the proposed skimming and perusal modules.

124

Paper
Code

VAE-based Domain Adaptation for Speaker Verification

no code implementations • 27 Aug 2019 • Xueyi Wang, Lantian Li, Dong Wang

By enforcing the neural model to discriminate the speakers in the training set, deep speaker embedding (called `x-vectors`) can be derived from the hidden layers.

Domain Adaptation Speaker Verification

Paper
Add Code

signADAM: Learning Confidences for Deep Neural Networks

1 code implementation • 21 Jul 2019 • Dong Wang, Yicheng Liu, Wenwo Tang, Fanhua Shang, Hongying Liu, Qigong Sun, Licheng Jiao

In this paper, we propose a new first-order gradient-based algorithm to train deep neural networks.

Paper
Code

Towards Reliable Online Clickbait Video Detection: A Content-Agnostic Approach

no code implementations • 17 Jul 2019 • Lanyu Shang, Daniel Zhang, Michael Wang, Shuyue Lai, Dong Wang

Current clickbait detection solutions that mainly focus on analyzing the text of the title, the image of the thumbnail, or the content of the video are shown to be suboptimal in detecting the online clickbait videos.

Clickbait Detection

Paper
Add Code

AP19-OLR Challenge: Three Tasks and Their Baselines

no code implementations • 16 Jul 2019 • Zhiyuan Tang, Dong Wang, Li-Ming Song

The participants can refer to these online-published recipes to deploy LID systems for convenience.

Paper
Add Code

C^3 Framework: An Open-source PyTorch Code for Crowd Counting

3 code implementations • 5 Jul 2019 • Junyu. Gao, Wei. Lin, Bin Zhao, Dong Wang, Chenyu Gao, Jun Wen

This technical report attempts to provide efficient and solid kits addressed on the field of crowd counting, which is denoted as Crowd Counting Code Framework (C$^3$F).

Crowd Counting

687

Paper
Code

LMVP: Video Predictor with Leaked Motion Information

no code implementations • 24 Jun 2019 • Dong Wang, Yitong Li, Wei Cao, Liqun Chen, Qi Wei, Lawrence Carin

We propose a Leaked Motion Video Predictor (LMVP) to predict future frames by capturing the spatial and temporal dependencies from given inputs.

Paper
Add Code

A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks

no code implementations • 18 Jun 2019 • Dong Wang, Lei Zhou, Xiao Bai, Jun Zhou

Our method accelerates the network in one-step pruning-recovery manner with a novel optimization objective function, which achieves higher accuracy with much less cost compared with existing pruning methods.

Paper
Add Code

A Preliminary Study on Data Augmentation of Deep Learning for Image Classification

no code implementations • 9 Jun 2019 • Benlin Hu, Cheng Lei, Dong Wang, Shu Zhang, Zhenyu Chen

Deep learning models have a large number of freeparameters that need to be calculated by effective trainingof the models on a great deal of training data to improvetheir generalization performance.

Data Augmentation General Classification +1

Paper
Add Code

Exploiting Persona Information for Diverse Generation of Conversational Responses

1 code implementation • 29 May 2019 • Haoyu Song, Wei-Nan Zhang, Yiming Cui, Dong Wang, Ting Liu

Giving conversational context with persona information to a chatbot, how to exploit the information to generate diverse and sustainable conversations is still a non-trivial task.

Chatbot

Paper
Code

Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems

no code implementations • 28 May 2019 • Tianle Cai, Ruiqi Gao, Jikai Hou, Siyu Chen, Dong Wang, Di He, Zhihua Zhang, Li-Wei Wang

First-order methods such as stochastic gradient descent (SGD) are currently the standard algorithm for training deep neural networks.

regression Second-order methods

Paper
Add Code

Globally Soft Filter Pruning For Efficient Convolutional Neural Networks

no code implementations • ICLR 2019 • Ke Xu, Xiao-Yun Wang, Qun Jia, Jianjing An, Dong Wang

Therefore, accumulating the saliency of the filter over the entire data set can provide more accurate guidance for pruning.

Paper
Add Code

Early Action Prediction with Generative Adversarial Networks

no code implementations • 30 Apr 2019 • Dong Wang, Yuan Yuan, Qi. Wang

Action Prediction is aimed to determine what action is occurring in a video as early as possible, which is crucial to many online applications, such as predicting a traffic accident before it happens and detecting malicious actions in the monitoring system.

Early Action Prediction Generative Adversarial Network

Paper
Add Code

Anomaly Detection in Traffic Scenes via Spatial-aware Motion Reconstruction

no code implementations • 30 Apr 2019 • Yuan Yuan, Dong Wang, Qi. Wang

3) Results of motion orientation and magnitude are adaptively weighted and fused by a Bayesian model, which makes the proposed method more robust and handle more kinds of abnormal events.

Anomaly Detection Autonomous Vehicles

Paper
Add Code

Cross-Modal Message Passing for Two-stream Fusion

no code implementations • 30 Apr 2019 • Dong Wang, Yuan Yuan, Qi. Wang

The classification object ensures that each modal network predicts the true action category while the competing objective encourages each modal network to outperform the other one.

Action Recognition General Classification +3

Paper
Add Code

Memory-Augmented Temporal Dynamic Learning for Action Recognition

no code implementations • 30 Apr 2019 • Yuan Yuan, Dong Wang, Qi. Wang

Human actions captured in video sequences contain two crucial factors for action recognition, i. e., visual appearance and motion dynamics.

Action Recognition Temporal Action Localization

Paper
Add Code

The iterative convolution-thresholding method (ICTM) for image segmentation

no code implementations • 24 Apr 2019 • Dong Wang, Xiao-Ping Wang

In this paper, we propose a novel iterative convolution-thresholding method (ICTM) that is applicable to a range of variational models for image segmentation.

Image Segmentation Segmentation +1

Paper
Add Code

Listen to the Image

no code implementations • CVPR 2019 • Di Hu, Dong Wang, Xuelong. Li, Feiping Nie, Qi. Wang

different encoding schemes indicate that using machine model to accelerate optimization evaluation and reduce experimental cost is feasible to some extent, which could dramatically promote the upgrading of encoding scheme then help the blind to improve their visual perception ability.

Translation

Paper
Add Code

VAE-based regularization for deep speaker embedding

no code implementations • 7 Apr 2019 • Yang Zhang, Lantian Li, Dong Wang

Deep speaker embedding has achieved state-of-the-art performance in speaker recognition.

Speaker Recognition

Paper
Add Code

Neural IR Meets Graph Embedding: A Ranking Model for Product Search

no code implementations • 24 Jan 2019 • Yuan Zhang, Dong Wang, Yan Zhang

Recently, neural models for information retrieval are becoming increasingly popular.

Graph Embedding Information Retrieval +2

Paper
Add Code

CONet: A Cognitive Ocean Network

no code implementations • 9 Jan 2019 • Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar

The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.

Paper
Add Code

Phonetic-attention scoring for deep speaker features in speaker verification

no code implementations • 8 Nov 2018 • Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang

This score reflects the similarity of the two frames in phonetic content, and is used to weigh the contribution of this frame pair in the utterance-based scoring.

Machine Translation Speaker Verification +1

Paper
Add Code

Gaussian-Constrained training for speaker verification

no code implementations • 8 Nov 2018 • Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang

This paper proposes a Gaussian-constrained training approach that (1) discards the parametric classifier, and (2) enforces the distribution of the derived speaker vectors to be Gaussian.

Speaker Verification

Paper
Add Code

Learning regression and verification networks for long-term visual tracking

3 code implementations • 12 Sep 2018 • Yunhua Zhang, Dong Wang, Lijun Wang, Jinqing Qi, Huchuan Lu

Compared with short-term tracking, the long-term tracking task requires determining the tracked object is present or absent, and then estimating the accurate bounding box if present or conducting image-wide re-detection if absent.

General Classification Object +3

203

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.