Search Results for author: Yang Long

Found 25 papers, 12 papers with code

Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm without Real Data Exposure

no code implementations • 14 Mar 2024 • Fan Wan, Xingyu Miao, Haoran Duan, Jingjing Deng, Rui Gao, Yang Long

With increasing concerns over data privacy and model copyrights, especially in the context of collaborations between AI service providers and data owners, an innovative SG-ZSL paradigm is proposed in this work.

Zero-Shot Learning

Paper
Add Code

ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields

1 code implementation • 2 Feb 2024 • Xingyu Miao, Yang Bai, Haoran Duan, Fan Wan, Yawen Huang, Yang Long, Yefeng Zheng

Most of the existing works on arbitrary 3D NeRF style transfer required retraining on each single style condition.

Style Transfer

Paper
Code

CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video

no code implementations • 10 Jan 2024 • Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Yang Long, Yefeng Zheng

The goal of our work is to generate high-quality novel views from monocular videos of complex and dynamic scenes.

Paper
Add Code

Dual Feature Augmentation Network for Generalized Zero-shot Learning

1 code implementation • 25 Sep 2023 • Lei Xiang, Yuan Zhou, Haoran Duan, Yang Long

To address these issues, we propose a novel Dual Feature Augmentation Network (DFAN), which comprises two feature augmentation modules, one for visual features and the other for semantic features.

Attribute Generalized Zero-Shot Learning

Paper
Code

DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume

1 code implementation • 14 Aug 2023 • Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Xinxing Xu, Yang Long, Yefeng Zheng

Nevertheless, the dynamic cost volume inevitably generates extra occlusions and noise, thus we alleviate this by designing a fusion module that makes static and dynamic cost volumes compensate for each other.

Ranked #3 on Unsupervised Monocular Depth Estimation on Cityscapes

Monocular Depth Estimation Optical Flow Estimation +1

Paper
Code

Graph-based Representation for Image based on Granular-ball

1 code implementation • 4 Mar 2023 • Xia Shuyin, Dai Dawei, Yang Long, Zhany Li, Lan Danf, Zhu hao, Wang Guoy

Current image processing methods usually operate on the finest-granularity unit; that is, the pixel, which leads to challenges in terms of efficiency, robustness, and understandability in deep learning models.

Attribute Image Classification

Paper
Code

On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning

1 code implementation • 18 Dec 2022 • Chenghao Xiao, Yang Long, Noura Al Moubayed

In this paper, we aim to help guide future designs of sentence representation learning methods by taking a closer look at contrastive SRL through the lens of isotropy, contextualization and learning dynamics.

Contrastive Learning Representation Learning +2

Paper
Code

Evolutionary Generalized Zero-Shot Learning

no code implementations • 23 Nov 2022 • Dubing Chen, Haofeng Zhang, Yuming Shen, Yang Long, Ling Shao

In this work, we propose a novel Evolutionary Generalized Zero-Shot Learning setting, which (i) avoids the domain shift problem in inductive GZSL, and (ii) is more in line with the needs of real-world deployments than transductive GZSL.

Generalized Zero-Shot Learning

Paper
Add Code

Knowing the Past to Predict the Future: Reinforcement Virtual Learning

no code implementations • 2 Nov 2022 • Peng Zhang, Yawen Huang, Bingzhang Hu, Shizheng Wang, Haoran Duan, Noura Al Moubayed, Yefeng Zheng, Yang Long

Reinforcement Learning (RL)-based control system has received considerable attention in recent decades.

Reinforcement Learning (RL)

Paper
Add Code

Action Quality Assessment with Temporal Parsing Transformer

1 code implementation • 19 Jul 2022 • Yang Bai, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang

Action Quality Assessment(AQA) is important for action understanding and resolving the task poses unique challenges due to subtle visual differences.

Action Quality Assessment Action Understanding +1

Paper
Code

Absolute Zero-Shot Learning

no code implementations • 23 Feb 2022 • Rui Gao, Fan Wan, Daniel Organisciak, Jiyao Pu, Junyan Wang, Haoran Duan, Peng Zhang, Xingsong Hou, Yang Long

Considering the increasing concerns about data copyright and privacy issues, we present a novel Absolute Zero-Shot Learning (AZSL) paradigm, i. e., training a classifier with zero real data.

Transfer Learning Zero-Shot Learning

Paper
Add Code

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

no code implementations • 6 Jan 2022 • Yang Long, Gui-Song Xia, Liangpei Zhang, Gong Cheng, Deren Li

Finally, we perform ASP by unifying the tile-level scene classification and object-based image analysis to achieve pixel-wise semantic labeling.

Aerial Scene Classification Benchmarking +4

Paper
Add Code

Semi-Supervised Crowd Counting from Unlabeled Data

no code implementations • 31 Aug 2021 • Haoran Duan, Fan Wan, Rui Sun, Zeyu Wang, Varun Ojha, Yu Guan, Hubert P. H. Shum, Bingzhang Hu, Yang Long

Our method achieved competitive performance in semi-supervised learning approaches on these crowd counting datasets.

Crowd Counting

Paper
Add Code

Discriminative Latent Semantic Graph for Video Captioning

1 code implementation • 8 Aug 2021 • Yang Bai, Junyan Wang, Yang Long, Bingzhang Hu, Yang song, Maurice Pagnucco, Yu Guan

Video captioning aims to automatically generate natural language sentences that can describe the visual contents of a given video.

Object Sentence +2

Paper
Code

EfficientTDNN: Efficient Architecture Search for Speaker Recognition

1 code implementation • 25 Mar 2021 • Rui Wang, Zhihua Wei, Haoran Duan, Shouling Ji, Yang Long, Zhen Hong

Compared with hand-designed approaches, neural architecture search (NAS) appears as a practical technique in automating the manual architecture design process and has attracted increasing interest in spoken language processing tasks such as speaker recognition.

Data Augmentation Network Pruning +2

Paper
Code

Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models

no code implementations • 8 Mar 2021 • Sam Bond-Taylor, Adam Leach, Yang Long, Chris G. Willcocks

Deep generative models are a class of techniques that train deep neural networks to model the distribution of training samples.

Paper
Add Code

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

no code implementations • 19 Aug 2020 • Junyan Wang, Yang Bai, Yang Long, Bingzhang Hu, Zhenhua Chai, Yu Guan, Xiaolin Wei

Video summarization aims to select representative frames to retain high-level information, which is usually solved by predicting the segment-wise importance score via a softmax function.

Meta-Learning Video Summarization

Paper
Add Code

On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID

1 code implementation • 22 Jun 2020 • Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li

After reviewing existing benchmark datasets in the research community of RS image interpretation, this article discusses the problem of how to efficiently prepare a suitable benchmark dataset for RS image interpretation.

General Classification Image Classification +1

Paper
Code

Order Matters: Shuffling Sequence Generation for Video Prediction

1 code implementation • 20 Jul 2019 • Junyan Wang, Bingzhang Hu, Yang Long, Yu Guan

Predicting future frames in natural video sequences is a new challenge that is receiving increasing attention in the computer vision community.

Video Generation Video Prediction

Paper
Code

Towards Reliable, Automated General Movement Assessment for Perinatal Stroke Screening in Infants Using Wearable Accelerometers

no code implementations • 21 Feb 2019 • Yan Gao, Yang Long, Yu Guan, Anna Basu, Jessica Baggaley, Thomas Ploetz

We demonstrate the effectiveness of our approach in a study with 34 newborns (21 typically developing infants and 13 PS infants with abnormal movements).

Paper
Add Code

Learning RoI Transformer for Detecting Oriented Objects in Aerial Images

1 code implementation • 1 Dec 2018 • Jian Ding, Nan Xue, Yang Long, Gui-Song Xia, Qikai Lu

Especially when detecting densely packed objects in aerial images, methods relying on horizontal proposals for common object detection often introduce mismatches between the Region of Interests (RoIs) and objects.

Ranked #48 on Object Detection In Aerial Images on DOTA (using extra training data)

General Classification Object +4

191

Paper
Code

Robust Cross-View Gait Recognition with Evidence: A Discriminant Gait GAN (DiGGAN) Approach

1 code implementation • 26 Nov 2018 • BingZhang Hu, Yu Guan, Yan Gao, Yang Long, Nicholas Lane, Thomas Ploetz

Gait as a biometric trait has attracted much attention in many security and privacy applications such as identity recognition and authentication, during the last few decades.

Gait Identification Gait Recognition +1

Paper
Code

ActionXPose: A Novel 2D Multi-view Pose-based Algorithm for Real-time Human Action Recognition

no code implementations • 29 Oct 2018 • Federico Angelini, Zeyu Fu, Yang Long, Ling Shao, Syed Mohsen Naqvi

We present ActionXPose, a novel 2D pose-based algorithm for posture-level Human Action Recognition (HAR).

Action Recognition Temporal Action Localization

Paper
Add Code

Towards Universal Representation for Unseen Action Recognition

no code implementations • CVPR 2018 • Yi Zhu, Yang Long, Yu Guan, Shawn Newsam, Ling Shao

Unseen Action Recognition (UAR) aims to recognise novel action categories without training examples.

Ranked #14 on Action Recognition on ActivityNet

Action Recognition Multiple Instance Learning +2

Paper
Add Code

From Zero-shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

no code implementations • CVPR 2017 • Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han

Using the proposed Unseen Visual Data Synthesis (UVDS) algorithm, semantic attributes are effectively utilised as an intermediate clue to synthesise unseen visual features at the training stage.

General Classification Object Recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.