Search Results for author: Fan Zhang

Found 235 papers, 63 papers with code

Research on the Natural Language Normalness of Software Identifiers

no code implementations CCL 2021 Dongzhen Wen, Fan Zhang, Xiao Zhang, Liang Yang, Yuan Lin, Bo Xu, Hongfei Lin

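“Understanding software source code is central to collaborative software development and maintenance, and understanding identifiers, which make up more than half of source code, plays an important role in software comprehension. Traditional software engineering mainly studies how naming conventions can constrain the identifier naming process so as to construct identifiers that are easier to understand and communicate. Building on a survey and analysis of the naming conventions of common programming languages, this paper proposes a new evaluation standard for identifier comprehensibility. Specifically, the paper first summarizes the naming conventions of mainstream programming languages and, by analogy with the concept of morphemes in natural language, proposes a software-morpheme-based account of identifier formation, in which forming an identifier is viewed as a process of generating, arranging, and concatenating software morphemes. On this basis, the paper proposes a method for evaluating the normalness of software identifiers that draws on natural language corpora, to measure whether an identifier is easy to understand. Finally, the paper runs validation experiments for the normalness metric on a source code comprehension dataset and on open-source projects from the GitHub platform; the results show that the proposed normalness score measures the comprehensibility of software projects well.”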

Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits

no code implementations ICML 2020 Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

We first present a policy evaluation procedure in the ambiguous environment and also give a heuristic algorithm to solve the distributionally robust policy learning problems efficiently.

Multi-Armed Bandits

Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline

1 code implementation 22 Mar 2024 Shuhao Li, Yue Cui, Jingyi Xu, Libin Li, Lingkai Meng, Weidong Yang, Fan Zhang, Xiaofang Zhou

Traffic prediction has long been a focal and pivotal area of research, with significant strides from city-level to road-level prediction in recent years.

Autonomous Driving Traffic Prediction

Gradient-Aware Logit Adjustment Loss for Long-tailed Classifier

1 code implementation 14 Mar 2024 Fan Zhang, Wei Qin, Weijieying Ren, Lei Wang, Zetong Chen, Richang Hong

Additionally, we find that most solutions to long-tailed problems remain biased towards head classes in the end, and we propose a simple post hoc prediction re-balancing strategy to further mitigate the bias toward head classes.
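
As a rough illustration of what a post hoc prediction re-balancing step can look like, the sketch below uses the generic logit-adjustment recipe: subtract a scaled log class prior from each logit at inference time. The abstract does not spell out the authors' exact strategy, so this is not presented as their method; the function name `rebalance_logits`, the temperature `tau`, and the class counts are illustrative assumptions.

```python
import math

def rebalance_logits(logits, class_counts, tau=1.0):
    """Subtract tau * log(prior) from each logit (generic post hoc logit adjustment)."""
    total = sum(class_counts)
    priors = [count / total for count in class_counts]
    return [z - tau * math.log(p) for z, p in zip(logits, priors)]

def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)

# Hypothetical long-tailed training set: one head class and two tail classes.
counts = [900, 90, 10]
# Hypothetical raw scores that slightly favor the head class.
logits = [2.0, 1.9, 1.8]

adjusted = rebalance_logits(logits, counts)
print(argmax(logits), argmax(adjusted))
```

With these made-up numbers the raw prediction is the head class (index 0), while the adjusted prediction moves to the rarest class (index 2), because rare classes receive the largest correction. Setting `tau=0` leaves the logits unchanged.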

LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

no code implementations 9 Mar 2024 QiHao Zhao, Yalun Dai, Hao Li, Wei Hu, Fan Zhang, Jun Liu

Long-tail recognition is challenging because it requires the model to learn good representations from tail categories and address imbalances across all categories.

Immersive Video Compression using Implicit Neural Representations

1 code implementation 2 Feb 2024 Ho Man Kwan, Fan Zhang, Andrew Gower, David Bull

In this paper we, for the first time, extend their application to immersive (multi-view) videos, by proposing MV-HiNeRV, a new INR-based immersive video codec.

Video Compression

3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework

no code implementations 14 Jan 2024 Fan Zhang, Shuyi Mao, Qing Li, Xiaojiang Peng

Comparative evaluations with popular point-based methods on HPoint103 and the public dataset DHP19 demonstrate the dramatic outperformance of our D-CPT.

Pose Estimation Virtual Try-on

MIMIC: Mask Image Pre-training with Mix Contrastive Fine-tuning for Facial Expression Recognition

no code implementations 14 Jan 2024 Fan Zhang, Xiaobao Guo, Xiaojiang Peng, Alex Kot

In addition, when compared with the domain disparity existing between face datasets and FER datasets, the divergence between general datasets and FER datasets is more pronounced.

Contrastive Learning Face Recognition +3

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

2 code implementations 10 Jan 2024 Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu

Next, we discuss several key challenges to achieve intelligent, efficient and secure Personal LLM Agents, followed by a comprehensive survey of representative solutions to address these challenges.

Compressing Deep Image Super-resolution Models

no code implementations 31 Dec 2023 YuXuan Jiang, Jakub Nawala, Fan Zhang, David Bull

Deep learning techniques have been applied in the context of image super-resolution (SR), achieving remarkable advances in terms of reconstruction performance.

Image Super-Resolution Knowledge Distillation

Emage: Non-Autoregressive Text-to-Image Generation

no code implementations 22 Dec 2023 Zhangyin Feng, Runyi Hu, Liangxin Liu, Fan Zhang, Duyu Tang, Yong Dai, Xiaocheng Feng, Jiwei Li, Bing Qin, Shuming Shi

Compared with autoregressive baselines that need to run one thousand times, our model only runs 16 times to generate images of competitive quality with an order of magnitude lower inference latency.

Denoising Text-to-Image Generation

GreenScan: Towards large-scale monitoring the health of urban trees using mobile sensing

no code implementations 22 Dec 2023 Akshit Gupta, Simone Mora, Fan Zhang, Martine Rutten, R. Venkatesha Prasad, Carlo Ratti

Healthy urban greenery is a fundamental asset to mitigate climate change phenomena such as extreme heat and air pollution.

Generative Multimodal Models are In-Context Learners

1 code implementation 20 Dec 2023 Quan Sun, Yufeng Cui, Xiaosong Zhang, Fan Zhang, Qiying Yu, Zhengxiong Luo, Yueze Wang, Yongming Rao, Jingjing Liu, Tiejun Huang, Xinlong Wang

The human ability to easily solve multimodal tasks in context (i.e., with only a few demonstrations or simple instructions) is what current multimodal systems have largely struggled to imitate.

In-Context Learning Question Answering +2

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

1 code implementation 19 Dec 2023 Fan Zhang, ShaoDi You, Yu Li, Ying Fu

Nonetheless, the performance of these methods is often constrained by the domain gap and looser constraints.

Monocular Depth Estimation Style Transfer +1

Full-reference Video Quality Assessment for User Generated Content Transcoding

no code implementations 19 Dec 2023 Zihao Qi, Chen Feng, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull

In this work, we observe that existing full-/no-reference quality metrics fail to accurately predict the perceptual quality difference between transcoded UGC content and the corresponding unpristine references.

Video Quality Assessment Visual Question Answering (VQA)

Device Scheduling for Relay-assisted Over-the-Air Aggregation in Federated Learning

no code implementations 15 Dec 2023 Fan Zhang, Jining Chen, Kunlun Wang, Wen Chen

We formulate a joint device scheduling and power allocation problem to maximize the number of scheduled devices.

Federated Learning Scheduling

BVI-Artefact: An Artefact Detection Benchmark Dataset for Streamed Videos

no code implementations 14 Dec 2023 Chen Feng, Duolikun Danier, Fan Zhang, Alex Mackin, Andy Collins, David Bull

Professionally generated content (PGC) streamed online can contain visual artefacts that degrade the quality of user experience.

RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment

no code implementations 14 Dec 2023 Chen Feng, Duolikun Danier, Haoran Wang, Fan Zhang, Benoit Vallade, Alex Mackin, David Bull

Deep learning-based video quality assessment (deep VQA) has demonstrated significant potential in surpassing conventional metrics, with promising improvements in terms of correlation with human perception.

Knowledge Distillation Model Compression +2

A Simple Framework to Enhance the Adversarial Robustness of Deep Learning-based Intrusion Detection System

no code implementations 6 Dec 2023 Xinwei Yuan, Shu Han, Wei Huang, Hongliang Ye, Xianglong Kong, Fan Zhang

In this paper, we propose a novel IDS architecture that can enhance the robustness of IDS against adversarial attacks by combining conventional machine learning (ML) models and Deep Learning models.

Adversarial Attack Adversarial Robustness +1

Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation

no code implementations 5 Dec 2023 Tianhao Peng, Ge Gao, Heming Sun, Fan Zhang, David Bull

In recent years, end-to-end learnt video codecs have demonstrated their potential to compete with conventional coding algorithms in terms of compression efficiency.

Video Compression

How does spatial structure affect psychological restoration? A method based on Graph Neural Networks and Street View Imagery

1 code implementation 29 Nov 2023 Haoran Ma, Yan Zhang, Pengyuan Liu, Fan Zhang, Pengyu Zhu

In this work, a spatially dependent graph neural network (GNN) approach is proposed to reveal the relation between spatial structure and restoration quality on an urban scale.

VBench: Comprehensive Benchmark Suite for Video Generative Models

1 code implementation 29 Nov 2023 Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, LiMin Wang, Dahua Lin, Yu Qiao, Ziwei Liu

We will open-source VBench, including all prompts, evaluation methods, generated videos, and human preference annotations, and also include more video generation models in VBench to drive forward the field of video generation.

Image Generation Video Generation

A Novel Deep Clustering Framework for Fine-Scale Parcellation of Amygdala Using dMRI Tractography

no code implementations 25 Nov 2023 Haolin He, Ce Zhu, Le Zhang, Yipeng Liu, Xiao Xu, Yuqian Chen, Leo Zekelman, Jarrett Rushmore, Yogesh Rathi, Nikos Makris, Lauren J. O'Donnell, Fan Zhang

The amygdala plays a vital role in emotional processing and exhibits structural diversity that necessitates fine-scale parcellation for a comprehensive understanding of its anatomico-functional correlations.

Clustering Deep Clustering +1

Cross-Domain Dual-Functional OFDM Waveform Design for Accurate Sensing/Positioning

no code implementations 8 Nov 2023 Fan Zhang, Tianqi Mao, Ruiqi Liu, Zhu Han, Sheng Chen, Zhaocheng Wang

For the communication-centric design, to maximize the achievable data rate, a fraction of REs are optimally allocated for communications according to prior knowledge of the communication channel.

CapsFusion: Rethinking Image-Text Data at Scale

1 code implementation 31 Oct 2023 Qiying Yu, Quan Sun, Xiaosong Zhang, Yufeng Cui, Fan Zhang, Yue Cao, Xinlong Wang, Jingjing Liu

To provide higher-quality and more scalable multimodal pretraining data, we propose CapsFusion, an advanced framework that leverages large language models to consolidate and refine information from both web-based image-text pairs and synthetic captions.

World Knowledge

Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

1 code implementation 29 Oct 2023 Zhen Qian, Min Chen, Zhuo Sun, Fan Zhang, Qingsong Xu, Jinzhao Guo, Zhiwei Xie, Zhixin Zhang

Understanding urban dynamics and promoting sustainable development requires comprehensive insights about buildings.

Planning with Logical Graph-based Language Model for Instruction Generation

no code implementations 26 Aug 2023 Fan Zhang, Kebing Jin, Hankz Hankui Zhuo

Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to capture implied rules from free-form texts.

Language Modelling Text Generation +1

Ultrafast and Ultralight Network-Based Intelligent System for Real-time Diagnosis of Ear Diseases in Any Devices

no code implementations 21 Aug 2023 Yubiao Yue, Xinyu Zeng, Xiaoqiang Shi, Meiping Zhang, Haihua Liang, Fan Zhang, Yanmei Chen, Zefeng Xie, Wenrui Wu, Zhenzhang Li

Employing transfer learning and five-fold cross-validation with 22,581 images from Hospital-1, the model achieves an impressive 95.23% accuracy.

Transfer Learning
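
For readers unfamiliar with the evaluation protocol mentioned in this entry, here is a minimal sketch of five-fold cross-validation index splitting over a dataset of 22,581 items. Only the fold count and the image count come from the abstract; the helper `k_fold_indices`, the shuffling seed, and the rule for distributing the remainder are assumptions, not the authors' pipeline.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # The first (n_samples % k) folds each receive one extra sample.
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size

# Validation fold sizes for a dataset of 22,581 images.
fold_sizes = [len(val) for _, val in k_fold_indices(22581, k=5)]
print(fold_sizes)
```

Each image lands in exactly one validation fold, so a model trained five times is validated once on every image; the reported accuracy in such a setup is typically an average over the folds.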

MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

1 code implementation ICCV 2023 QiHao Zhao, Chen Jiang, Wei Hu, Fan Zhang, Jun Liu

In the analysis and ablation study, we demonstrate that our method compared with previous work can effectively increase the diversity of experts, significantly reduce the variance of the model, and improve recognition accuracy.

Long-tail Learning

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model

no code implementations 11 Aug 2023 Fan Zhang, Naye Ji, Fuxing Gao, Siyuan Zhao, Zhaohan Wang, Shunman Li

Firstly, considering that speech audio not only contains acoustic and semantic features but also conveys personality traits, emotions, and more subtle information related to accompanying gestures, we pioneer the adaptation of WavLM, a large-scale pre-trained model, to extract low-level and high-level audio information.

Gesture Generation

Deep neural networks from the perspective of ergodic theory

no code implementations 4 Aug 2023 Fan Zhang

The design of deep neural networks remains somewhat of an art rather than precise science.

TractCloud: Registration-free tractography parcellation with a novel local-global streamline point cloud representation

no code implementations 18 Jul 2023 Tengfei Xue, Yuqian Chen, Chaoyi Zhang, Alexandra J. Golby, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

TractCloud achieves efficient and consistent whole-brain white matter parcellation across the lifespan (from neonates to elderly subjects, including brain tumor patients) without the need for registration.

Anatomy

Data-Driven Optimal Control of Tethered Space Robot Deployment with Learning Based Koopman Operator

no code implementations 15 Jul 2023 Ao Jin, Fan Zhang, Panfeng Huang

To avoid complex constraints of the traditional nonlinear method for tethered space robot (TSR) deployment, this paper proposes a data-driven optimal control framework with an improved deep learning based Koopman operator that could be applied to complex environments.

ATWM: Defense against adversarial malware based on adversarial training

no code implementations 11 Jul 2023 Kun Li, Fan Zhang, Wei Guo

In order to defend against malware attacks, researchers have proposed many Windows malware detection models based on deep learning.

Adversarial Defense Malware Detection

Generative Pretraining in Multimodality

2 code implementations 11 Jul 2023 Quan Sun, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Yueze Wang, Hongcheng Gao, Jingjing Liu, Tiejun Huang, Xinlong Wang

We present Emu, a Transformer-based multimodal foundation model, which can seamlessly generate images and texts in multimodal context.

Image Captioning Temporal/Casual QA +4

TractGeoNet: A geometric deep learning framework for pointwise analysis of tract microstructure to predict language assessment performance

no code implementations 8 Jul 2023 Yuqian Chen, Leo R. Zekelman, Chaoyi Zhang, Tengfei Xue, Yang Song, Nikos Makris, Yogesh Rathi, Alexandra J. Golby, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

We evaluate the effectiveness of the proposed method by predicting individual performance on two neuropsychological assessments of language using a dataset of 20 association white matter fiber tracts from 806 subjects from the Human Connectome Project.

regression

SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills

no code implementations 28 Jun 2023 Zhangyin Feng, Yong Dai, Fan Zhang, Duyu Tang, Xiaocheng Feng, Shuangzhi Wu, Bing Qin, Yunbo Cao, Shuming Shi

Traditional multitask learning methods can basically exploit common knowledge only task-wise or language-wise, which loses either cross-language or cross-task knowledge.

Natural Language Understanding

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

1 code implementation NeurIPS 2023 Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

Learning-based video compression is currently a popular research topic, offering the potential to compete with conventional standard video codecs.

Model Compression Quantization +1

FGAM: Fast Adversarial Malware Generation Method Based on Gradient Sign

no code implementations 22 May 2023 Kun Li, Fan Zhang, Wei Guo

Adversarial attacks deceive a deep learning model by generating adversarial samples.

Malware Detection

Doppler-Resilient Design of CAZAC Sequences for mmWave/THz Sensing Applications

no code implementations 12 May 2023 Fan Zhang, Tianqi Mao, Zhaocheng Wang

For an arbitrary-length ZC sequence, a feasible range of the root index is derived to satisfy the requirement of PSLR within the scope of RoI.
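
As background for this entry, the sketch below generates a Zadoff-Chu (ZC) sequence from its standard textbook definition and checks the constant-amplitude, zero-autocorrelation (CAZAC) property that the title refers to. It does not reproduce the paper's derivation of the feasible root-index range, and neither PSLR nor the RoI is modeled; the function names and the example root and lengths are illustrative assumptions.

```python
import cmath
from math import gcd

def zadoff_chu(root, length):
    """Zadoff-Chu sequence of the given length; root must be coprime to length."""
    if gcd(root, length) != 1:
        raise ValueError("root index must be coprime to the sequence length")
    parity = length % 2  # 1 for odd lengths, 0 for even lengths
    return [cmath.exp(-1j * cmath.pi * root * n * (n + parity) / length)
            for n in range(length)]

def periodic_autocorrelation(seq, lag):
    """Periodic (circular) autocorrelation of seq at the given lag."""
    n = len(seq)
    return sum(seq[(i + lag) % n] * seq[i].conjugate() for i in range(n))

# Hypothetical example: root index 5, odd length 63.
seq = zadoff_chu(root=5, length=63)
max_amplitude_error = max(abs(abs(x) - 1.0) for x in seq)
max_sidelobe = max(abs(periodic_autocorrelation(seq, lag)) for lag in range(1, 63))
print(max_amplitude_error, max_sidelobe)
```

Both printed values should be zero up to floating-point rounding: every sample has unit magnitude, and the periodic autocorrelation vanishes at every nonzero lag. This ideal property holds without Doppler shift, which is the effect the paper sets out to handle.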

Label-Free Multi-Domain Machine Translation with Stage-wise Training

no code implementations 6 May 2023 Fan Zhang, Mei Tu, Sangha Kim, Song Liu, Jinyao Yan

Our model is composed of three parts: a backbone model, a domain discriminator taking responsibility to discriminate data from different domains, and a set of experts that transfer the decoded features from generic to specific.

Machine Translation Translation

UPDExplainer: an Interpretable Transformer-based Framework for Urban Physical Disorder Detection Using Street View Imagery

no code implementations 4 May 2023 Chuanbo Hu, Shan Jia, Fan Zhang, Changjiang Xiao, Mindi Ruan, Jacob Thrasher, Xin Li

Experimental results on the re-annotated Place Pulse 2.0 dataset demonstrate promising detection performance of the proposed method, with an accuracy of 79.9%.

Semantic Segmentation

Understand Waiting Time in Transaction Fee Mechanism: An Interdisciplinary Perspective

1 code implementation 4 May 2023 Luyao Zhang, Fan Zhang

Our study identified NFT drops as a unique source of market congestion -- holiday effects -- beyond trend and season effects.

Causal Inference Computer Security +2

A fast and flexible algorithm for microstructure reconstruction combining simulated annealing and deep learning

1 code implementation 25 Apr 2023 Zhenchuan Ma, Xiaohai He, Pengcheng Yan, Fan Zhang, Qizhi Teng

The proposed algorithm is flexible and can complete training and reconstruction in a short time with only one two-dimensional image.

Co-GRU Enhanced End-to-End Design for Long-haul Coherent Transmission Systems

no code implementations 23 Apr 2023 Jiayu Zheng, Tianhong Zhang, Yu Wenjing, Weiqin Zhou, Chuanchuan Yang, Fan Zhang

In recent years, the end-to-end (E2E) scheme based on deep learning (DL) has been proposed as a potential scheme to jointly optimize the encoder and the decoder parameters of the optical communication system.

Mpox-AISM: AI-Mediated Super Monitoring for Mpox and Like-Mpox

no code implementations 17 Mar 2023 Yubiao Yue, Minghua Jiang, Xinyue Zhang, Jialong Xu, Huacong Ye, Fan Zhang, Zhenzhang Li, Yang Li

With the help of the Internet and communication terminal, Mpox-AISM can perform a real-time, low-cost, and convenient diagnosis for earlier-stage mpox in various real-world settings, thereby effectively curbing the spread of mpox virus.

Data Augmentation Decision Making +2

Fiber Tract Shape Measures Inform Prediction of Non-Imaging Phenotypes

no code implementations 16 Mar 2023 Wan Liu, Yuqian Chen, Chuyang Ye, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

In this paper, we investigate the potential of fiber tract shape features for predicting non-imaging phenotypes, both individually and in combination with traditional features.

LDMVFI: Video Frame Interpolation with Latent Diffusion Models

2 code implementations 16 Mar 2023 Duolikun Danier, Fan Zhang, David Bull

Existing works on video frame interpolation (VFI) mostly employ deep neural networks that are trained by minimizing the L1, L2, or deep feature space distance (e.g. VGG loss) between their outputs and ground-truth frames.

Video Frame Interpolation

Efficient Self-supervised Continual Learning with Progressive Task-correlated Layer Freezing

no code implementations 13 Mar 2023 Li Yang, Sen Lin, Fan Zhang, Junshan Zhang, Deliang Fan

Inspired by the success of Self-supervised learning (SSL) in learning visual representations from unlabeled data, a few recent works have studied SSL in the context of continual learning (CL), where multiple tasks are learned sequentially, giving rise to a new paradigm, namely self-supervised continual learning (SSCL).

Continual Learning Self-Supervised Learning

GeoLab: Geometry-based Tractography Parcellation of Superficial White Matter

1 code implementation 2 Mar 2023 Nabil Vindas, Nicole Labra Avila, Fan Zhang, Tengfei Xue, Lauren J. O'Donnell, Jean-François Mangin

Superficial white matter (SWM) has been less studied than long-range connections despite being of interest to clinical research, and few tractography parcellation methods have been adapted to SWM.

Few-shots Portrait Generation with Style Enhancement and Identity Preservation

1 code implementation 1 Mar 2023 Runchuan Zhu, Naye Ji, Youbing Zhao, Fan Zhang

Nowadays, the wide application of virtual digital humans promotes the comprehensive prosperity and development of digital culture supported by the digital economy.

Cultural Vocal Bursts Intensity Prediction

ST-MFNet Mini: Knowledge Distillation-Driven Frame Interpolation

1 code implementation 16 Feb 2023 Crispian Morris, Duolikun Danier, Fan Zhang, Nantheera Anantrasirichai, David R. Bull

Currently, one of the major challenges in deep learning-based video frame interpolation (VFI) is the large model size and high computational complexity associated with many high performance VFI approaches.

Knowledge Distillation Network Pruning +1

SimCGNN: Simple Contrastive Graph Neural Network for Session-based Recommendation

no code implementations 8 Feb 2023 Yuan Cao, Xudong Zhang, Fan Zhang, Feifei Kou, Josiah Poon, Xiongnan Jin, Yongheng Wang, Jinpeng Chen

The session-based recommendation (SBR) problem, which focuses on next-item prediction for anonymous users, has received increasing attention from researchers.

Contrastive Learning Session-Based Recommendations

DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model

1 code implementation 24 Jan 2023 Fan Zhang, Naye Ji, Fuxing Gao, Yongping Li

Speech-driven gesture synthesis is a field of growing interest in virtual human creation.

Denoising

TractGraphCNN: anatomically informed graph CNN for classification using diffusion MRI tractography

no code implementations 5 Jan 2023 Yuqian Chen, Fan Zhang, Leo R. Zekelman, Tengfei Xue, Chaoyi Zhang, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'Donnell

This work shows the potential of incorporating anatomical information, especially known anatomical similarities between input features, to guide convolutions in neural networks.

Urban Visual Intelligence: Studying Cities with AI and Street-level Imagery

no code implementations 2 Jan 2023 Fan Zhang, Arianna Salazar Miranda, Fábio Duarte, Lawrence Vale, Gary Hack, Min Chen, Yu Liu, Michael Batty, Carlo Ratti

The visual dimension of cities has been a fundamental subject in urban studies, since the pioneering work of scholars such as Sitte, Lynch, Arnheim, and Jacobs.

Learning Rain Location Prior for Nighttime Deraining

1 code implementation ICCV 2023 Fan Zhang, ShaoDi You, Yu Li, Ying Fu

This learned prior contains location information of rain streaks and, when injected into deraining models, can significantly improve their performance.

Rain Removal

Biomedical image analysis competitions: The state of current participation practice

no code implementations16 Dec 2022 Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Vivek Singh Bawa, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Jinwook Choi, Olivier Commowick, Marie Daum, Adrien Depeursinge, Reuben Dorent, Jan Egger, Hannah Eichhorn, Sandy Engelhardt, Melanie Ganz, Gabriel Girard, Lasse Hansen, Mattias Heinrich, Nicholas Heller, Alessa Hering, Arnaud Huaulmé, Hyunjeong Kim, Bennett Landman, Hongwei Bran Li, Jianning Li, Jun Ma, Anne Martel, Carlos Martín-Isla, Bjoern Menze, Chinedu Innocent Nwoye, Valentin Oreiller, Nicolas Padoy, Sarthak Pati, Kelly Payette, Carole Sudre, Kimberlin Van Wijnen, Armine Vardazaryan, Tom Vercauteren, Martin Wagner, Chuanbo Wang, Moi Hoon Yap, Zeyun Yu, Chun Yuan, Maximilian Zenk, Aneeq Zia, David Zimmerer, Rina Bao, Chanyeol Choi, Andrew Cohen, Oleh Dzyubachyk, Adrian Galdran, Tianyuan Gan, Tianqi Guo, Pradyumna Gupta, Mahmood Haithami, Edward Ho, Ikbeom Jang, Zhili Li, Zhengbo Luo, Filip Lux, Sokratis Makrogiannis, Dominik Müller, Young-tack Oh, Subeen Pang, Constantin Pape, Gorkem Polat, Charlotte Rosalie Reed, Kanghyun Ryu, Tim Scherr, Vajira Thambawita, Haoyu Wang, Xinliang Wang, Kele Xu, Hung Yeh, Doyeob Yeo, Yixuan Yuan, Yan Zeng, Xin Zhao, Julian Abbing, Jannes Adam, Nagesh Adluru, Niklas Agethen, Salman Ahmed, Yasmina Al Khalil, Mireia Alenyà, Esa Alhoniemi, Chengyang An, Talha Anwar, Tewodros Weldebirhan Arega, Netanell Avisdris, Dogu Baran Aydogan, Yingbin Bai, Maria Baldeon Calisto, Berke Doga Basaran, Marcel Beetz, Cheng Bian, Hao Bian, Kevin Blansit, Louise Bloch, Robert Bohnsack, Sara Bosticardo, Jack Breen, Mikael Brudfors, Raphael 
Brüngel, Mariano Cabezas, Alberto Cacciola, Zhiwei Chen, Yucong Chen, Daniel Tianming Chen, Minjeong Cho, Min-Kook Choi, Chuantao Xie Chuantao Xie, Dana Cobzas, Julien Cohen-Adad, Jorge Corral Acero, Sujit Kumar Das, Marcela de Oliveira, Hanqiu Deng, Guiming Dong, Lars Doorenbos, Cory Efird, Sergio Escalera, Di Fan, Mehdi Fatan Serj, Alexandre Fenneteau, Lucas Fidon, Patryk Filipiak, René Finzel, Nuno R. Freitas, Christoph M. Friedrich, Mitchell Fulton, Finn Gaida, Francesco Galati, Christoforos Galazis, Chang Hee Gan, Zheyao Gao, Shengbo Gao, Matej Gazda, Beerend Gerats, Neil Getty, Adam Gibicar, Ryan Gifford, Sajan Gohil, Maria Grammatikopoulou, Daniel Grzech, Orhun Güley, Timo Günnemann, Chunxu Guo, Sylvain Guy, Heonjin Ha, Luyi Han, Il Song Han, Ali Hatamizadeh, Tian He, Jimin Heo, Sebastian Hitziger, SeulGi Hong, Seungbum Hong, Rian Huang, Ziyan Huang, Markus Huellebrand, Stephan Huschauer, Mustaffa Hussain, Tomoo Inubushi, Ece Isik Polat, Mojtaba Jafaritadi, SeongHun Jeong, Bailiang Jian, Yuanhong Jiang, Zhifan Jiang, Yueming Jin, Smriti Joshi, Abdolrahim Kadkhodamohammadi, Reda Abdellah Kamraoui, Inha Kang, Junghwa Kang, Davood Karimi, April Khademi, Muhammad Irfan Khan, Suleiman A. Khan, Rishab Khantwal, Kwang-Ju Kim, Timothy Kline, Satoshi Kondo, Elina Kontio, Adrian Krenzer, Artem Kroviakov, Hugo Kuijf, Satyadwyoom Kumar, Francesco La Rosa, Abhi Lad, Doohee Lee, Minho Lee, Chiara Lena, Hao Li, Ling Li, Xingyu Li, Fuyuan Liao, Kuanlun Liao, Arlindo Limede Oliveira, Chaonan Lin, Shan Lin, Akis Linardos, Marius George Linguraru, Han Liu, Tao Liu, Di Liu, Yanling Liu, João Lourenço-Silva, Jingpei Lu, Jiangshan Lu, Imanol Luengo, Christina B. 
Lund, Huan Minh Luu, Yi Lv, Uzay Macar, Leon Maechler, Sina Mansour L., Kenji Marshall, Moona Mazher, Richard McKinley, Alfonso Medela, Felix Meissen, Mingyuan Meng, Dylan Miller, Seyed Hossein Mirjahanmardi, Arnab Mishra, Samir Mitha, Hassan Mohy-ud-Din, Tony Chi Wing Mok, Gowtham Krishnan Murugesan, Enamundram Naga Karthik, Sahil Nalawade, Jakub Nalepa, Mohamed Naser, Ramin Nateghi, Hammad Naveed, Quang-Minh Nguyen, Cuong Nguyen Quoc, Brennan Nichyporuk, Bruno Oliveira, David Owen, Jimut Bahan Pal, Junwen Pan, Wentao Pan, Winnie Pang, Bogyu Park, Vivek Pawar, Kamlesh Pawar, Michael Peven, Lena Philipp, Tomasz Pieciak, Szymon Plotka, Marcel Plutat, Fattaneh Pourakpour, Domen Preložnik, Kumaradevan Punithakumar, Abdul Qayyum, Sandro Queirós, Arman Rahmim, Salar Razavi, Jintao Ren, Mina Rezaei, Jonathan Adam Rico, ZunHyan Rieu, Markus Rink, Johannes Roth, Yusely Ruiz-Gonzalez, Numan Saeed, Anindo Saha, Mostafa Salem, Ricardo Sanchez-Matilla, Kurt Schilling, Wei Shao, Zhiqiang Shen, Ruize Shi, Pengcheng Shi, Daniel Sobotka, Théodore Soulier, Bella Specktor Fadida, Danail Stoyanov, Timothy Sum Hon Mun, Xiaowu Sun, Rong Tao, Franz Thaler, Antoine Théberge, Felix Thielke, Helena Torres, Kareem A. Wahid, Jiacheng Wang, Yifei Wang, Wei Wang, Xiong Wang, Jianhui Wen, Ning Wen, Marek Wodzinski, Ye Wu, Fangfang Xia, Tianqi Xiang, Chen Xiaofei, Lizhan Xu, Tingting Xue, Yuxuan Yang, Lin Yang, Kai Yao, Huifeng Yao, Amirsaeed Yazdani, Michael Yip, Hwanseung Yoo, Fereshteh Yousefirizi, Shunkai Yu, Lei Yu, Jonathan Zamora, Ramy Ashraf Zeineldin, Dewen Zeng, Jianpeng Zhang, Bokai Zhang, Jiapeng Zhang, Fan Zhang, Huahong Zhang, Zhongchen Zhao, Zixuan Zhao, Jiachen Zhao, Can Zhao, Qingshuo Zheng, Yuheng Zhi, Ziqi Zhou, Baosheng Zou, Klaus Maier-Hein, Paul F. Jäger, Annette Kopp-Schneider, Lena Maier-Hein

Of these, 84% were based on standard architectures.

Benchmarking

Text-Guided Mask-free Local Image Retouching

no code implementations 15 Dec 2022 Zerun Liu, Fan Zhang, Jingxuan He, Jin Wang, Zhangye Wang, Lechao Cheng

In the realm of multi-modality, text-guided image retouching techniques emerged with the advent of deep learning.

Image Retouching

Purifier: Defending Data Inference Attacks via Transforming Confidence Scores

no code implementations 1 Dec 2022 Ziqi Yang, Lijin Wang, Da Yang, Jie Wan, Ziming Zhao, Ee-Chien Chang, Fan Zhang, Kui Ren

Besides, our further experiments show that PURIFIER is also effective in defending adversarial model inversion attacks and attribute inference attacks.

Attribute Inference Attack +1

Tractography-Based Parcellation of Cerebellar Dentate Nuclei via a Deep Nonnegative Matrix Factorization Clustering Method

no code implementations 18 Nov 2022 Xiao Xu, Yuqian Chen, Leo Zekelman, Yogesh Rathi, Nikos Makris, Fan Zhang, Lauren J. O'Donnell

In this paper, we investigate a deep nonnegative matrix factorization clustering method (DNMFC) for parcellation of the human DN based on its structural connectivity using diffusion MRI tractography.

Clustering

Line Drawing Guided Progressive Inpainting of Mural Damages

1 code implementation 12 Nov 2022 Luxi Li, Qin Zou, Fan Zhang, Hongkai Yu, Long Chen, Chengfang Song, Xianfeng Huang, Xiaoguang Wang

Mural image inpainting refers to repairing the damaged or missing areas in a mural image to restore the visual appearance.

Image Inpainting

Memory recall by controlling chaos

no code implementations 10 Nov 2022 Fan Zhang

By incorporating feedback loops that engender amplification and damping, so that output is not proportional to input, biological neural networks become highly nonlinear and thus very likely chaotic in nature.

Unsupervised Graph Outlier Detection: Problem Revisit, New Insight, and Superior Method

1 code implementation 24 Oct 2022 Yihong Huang, Liping Wang, Fan Zhang, Xuemin Lin

In addition, we observe that existing algorithms have a performance drop with the mitigated data leakage issue.

Attribute Graph Outlier Detection

GTAV-NightRain: Photometric Realistic Large-scale Dataset for Night-time Rain Streak Removal

1 code implementation 10 Oct 2022 Fan Zhang, ShaoDi You, Yu Li, Ying Fu

In this paper, we propose the GTAV-NightRain dataset, which is a large-scale synthetic night-time rain streak removal dataset.

BVI-VFI: A Video Quality Database for Video Frame Interpolation

2 code implementations 3 Oct 2022 Duolikun Danier, Fan Zhang, David Bull

In order to narrow this research gap, we have developed a new video quality database named BVI-VFI, which contains 540 distorted sequences generated by applying five commonly used VFI algorithms to 36 diverse source videos with various spatial resolutions and frame rates.

Video Frame Interpolation

Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction

no code implementations 30 Sep 2022 Jianyi Zhang, Ang Li, Minxue Tang, Jingwei Sun, Xiang Chen, Fan Zhang, Changyou Chen, Yiran Chen, Hai Li

Based on this measure, we also design a computation-efficient client sampling strategy, such that the actively selected clients will generate a more class-balanced grouped dataset with theoretical guarantees.

Federated Learning Privacy Preserving

Enhancing HDR Video Compression through CNN-based Effective Bit Depth Adaptation

1 code implementation18 Jul 2022 Chen Feng, Zihao Qi, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull

In this work, we modify the MFRNet network architecture to enable multiple frame processing, and the new network, multi-frame MFRNet, has been integrated into the EBDA framework using two Versatile Video Coding (VVC) host codecs: VTM 16.2 and the Fraunhofer Versatile Video Encoder (VVenC 1.4.0).

Video Compression

FD-GATDR: A Federated-Decentralized-Learning Graph Attention Network for Doctor Recommendation Using EHR

no code implementations11 Jul 2022 Luning Bi, Yunlong Wang, Fan Zhang, Zhuqing Liu, Yong Cai, Emily Zhao

In the past decade, with the development of big data technology, an increasing amount of patient information has been stored as electronic health records (EHRs).

Graph Attention Recommendation Systems

White Matter Tracts are Point Clouds: Neuropsychological Score Prediction and Critical Region Localization via Geometric Deep Learning

no code implementations6 Jul 2022 Yuqian Chen, Fan Zhang, Chaoyi Zhang, Tengfei Xue, Leo R. Zekelman, Jianzhong He, Yang Song, Nikos Makris, Yogesh Rathi, Alexandra J. Golby, Weidong Cai, Lauren J. O'Donnell

In this paper, we propose a deep-learning-based framework for neuropsychological score prediction using microstructure measurements estimated from diffusion magnetic resonance imaging (dMRI) tractography, focusing on predicting performance on a receptive vocabulary assessment task based on a critical fiber tract for language, the arcuate fasciculus (AF).

TractoFormer: A Novel Fiber-level Whole Brain Tractography Analysis Framework Using Spectral Embedding and Vision Transformers

no code implementations5 Jul 2022 Fan Zhang, Tengfei Xue, Weidong Cai, Yogesh Rathi, Carl-Fredrik Westin, Lauren J O'Donnell

Whole brain tractography (WBT) data contains over hundreds of thousands of individual fiber streamlines (estimated brain connections), and this data is usually parcellated to create compact representations for data analysis applications such as disease classification.

Data Augmentation Ensemble Learning

CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks

no code implementations4 Jun 2022 Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jinrong Jiang, Qianjin Guo, Linghan Zheng

Detection of object anomalies is crucial in industrial processes, but unsupervised anomaly detection and localization is particularly important due to the difficulty of obtaining a large number of defective samples and the unpredictable types of anomalies in real life.

Unsupervised Anomaly Detection

Phased Progressive Learning with Coupling-Regulation-Imbalance Loss for Imbalanced Data Classification

no code implementations24 May 2022 Liang Xu, Yi Cheng, Fan Zhang, Bingxuan Wu, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

This loss is effective in addressing quantity imbalances and outliers, while regulating the focus of attention on samples with varying classification difficulties.

Classification imbalanced classification +1

Enhancing VVC with Deep Learning based Multi-Frame Post-Processing

no code implementations19 May 2022 Duolikun Danier, Chen Feng, Fan Zhang, David Bull

This paper describes a CNN-based multi-frame post-processing approach based on a perceptually-inspired Generative Adversarial Network architecture, CVEGAN.

Generative Adversarial Network Image Compression

A Saliency-Guided Street View Image Inpainting Framework for Efficient Last-Meters Wayfinding

1 code implementation14 May 2022 Chuanbo Hu, Shan Jia, Fan Zhang, Xin Li

However, due to the large diversity of geographic context and acquisition conditions, the captured SVI always contains various distracting objects (e.g., pedestrians and vehicles), which will distract human visual attention from efficiently finding the destination in the last few meters.

Image Inpainting object-detection +2

One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code

no code implementations12 May 2022 Yong Dai, Duyu Tang, Liangxin Liu, Minghuan Tan, Cong Zhou, Jingquan Wang, Zhangyin Feng, Fan Zhang, Xueyu Hu, Shuming Shi

Moreover, our model supports self-supervised pretraining with the same sparsely activated way, resulting in better initialized parameters for different modalities.

Image Retrieval Retrieval

Multi-Graph based Multi-Scenario Recommendation in Large-scale Online Video Services

no code implementations5 May 2022 Fan Zhang, Qiuying Peng, Yulin Wu, Zheng Pan, Rong Zeng, Da Lin, Yue Qi

Recently, industrial recommendation services have been boosted by the continual upgrade of deep learning methods.

Data Integration Graph Learning

Deep fiber clustering: Anatomically informed fiber clustering with self-supervised deep learning for fast and effective tractography parcellation

1 code implementation2 May 2022 Yuqian Chen, Chaoyi Zhang, Tengfei Xue, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

In this work, we propose a novel deep learning framework for white matter fiber clustering, Deep Fiber Clustering (DFC), which solves the unsupervised clustering problem as a self-supervised learning task with a domain-specific pretext task to predict pairwise fiber distances.

Anatomy Clustering +3

SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

no code implementations26 Apr 2022 Junwei Liao, Duyu Tang, Fan Zhang, Shuming Shi

We present SkillNet-NLG, a sparsely activated approach that handles many natural language generation tasks with one model.

Multi-Task Learning Text Generation

Enhancing Non-mass Breast Ultrasound Cancer Classification With Knowledge Transfer

no code implementations18 Apr 2022 Yangrun Hu, Yuanfan Guo, Fan Zhang, Mingda Wang, Tiancheng Lin, Rong Wu, Yi Xu

Based on the insight that mass data is sufficient and shares the same knowledge structure with non-mass data of identifying the malignancy of a lesion based on the ultrasound image, we propose a novel transfer learning framework to enhance the generalizability of the DNN model for non-mass BUS with the help of mass BUS.

Classification Transfer Learning

Global Attitude Synchronization of Networked Rigid Bodies Under Directed Topologies

no code implementations30 Mar 2022 Fan Zhang, Deyuan Meng, Jingyao Zhang

Simulations for networked spacecraft are presented to show the global synchronization performances under different directed topologies.

STICC: A multivariate spatial clustering method for repeated geographic pattern discovery with consideration of spatial contiguity

1 code implementation17 Mar 2022 Yuhao Kang, Kunlin Wu, Song Gao, Ignavier Ng, Jinmeng Rao, Shan Ye, Fan Zhang, Teng Fei

In this paper, we propose a Spatial Toeplitz Inverse Covariance-Based Clustering (STICC) method that considers both attributes and spatial relationships of geographic objects for multivariate spatial clustering.

Attribute Clustering

Mixed Reality Depth Contour Occlusion Using Binocular Similarity Matching and Three-dimensional Contour Optimisation

no code implementations4 Mar 2022 Naye Ji, Fan Zhang, Haoxiang Zhang, Youbing Zhao, Dingguo Yu

To evaluate the effectiveness of the algorithm, we demonstrate a time consumption statistical analysis for each stage of the DCO algorithm execution.

Mixed Reality Optical Flow Estimation +1

A CNN-based Post-Processor for Perceptually-Optimized Immersive Media Compression

no code implementations25 Feb 2022 Angeliki Katsenou, Fan Zhang, David Bull

In recent years, resolution adaptation based on deep neural networks has enabled significant performance gains for conventional (2D) video codecs.

Double Thompson Sampling in Finite stochastic Games

no code implementations21 Feb 2022 Shuqing Shi, Xiaobin Wang, Zhiyou Yang, Fan Zhang, Hong Qu

This algorithm achieves a total regret bound of $\tilde{\mathcal{O}}(D\sqrt{SAT})$ in time horizon $T$ with $S$ states, $A$ actions and diameter $D$.

Thompson Sampling

RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training

no code implementations17 Feb 2022 Chen Feng, Duolikun Danier, Fan Zhang, David Bull

In recent years, deep learning techniques have shown significant potential for improving video quality assessment (VQA), achieving higher correlation with subjective opinions compared to conventional approaches.

Video Quality Assessment Visual Question Answering (VQA)

Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN

no code implementations15 Feb 2022 Duolikun Danier, Fan Zhang, David Bull

This paper presents a new deformable convolution-based video frame interpolation (VFI) method, using a coarse to fine 3D CNN to enhance the multi-flow prediction.

Video Frame Interpolation

A Subjective Quality Study for Video Frame Interpolation

no code implementations15 Feb 2022 Duolikun Danier, Fan Zhang, David Bull

Video frame interpolation (VFI) is one of the fundamental research areas in video processing and there has been extensive research on novel and enhanced interpolation algorithms.

SSIM Video Frame Interpolation

SupWMA: Consistent and Efficient Tractography Parcellation of Superficial White Matter with Deep Learning

1 code implementation29 Jan 2022 Tengfei Xue, Fan Zhang, Chaoyi Zhang, Yuqian Chen, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'Donnell

Most parcellation methods focus on the deep white matter (DWM), while fewer methods address the superficial white matter (SWM) due to its complexity.

Contrastive Learning

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

3 code implementations CVPR 2022 Duolikun Danier, Fan Zhang, David Bull

Video frame interpolation (VFI) is currently a very active research topic, with applications spanning computer vision, post production and video encoding.

Texture Synthesis Video Frame Interpolation

Can Graph Neural Networks Learn to Solve MaxSAT Problem?

no code implementations15 Nov 2021 Minghao Liu, Fuqi Jia, Pei Huang, Fan Zhang, Yuchen Sun, Shaowei Cai, Feifei Ma, Jian Zhang

With the rapid development of deep learning techniques, various recent work has tried to apply graph neural networks (GNNs) to solve NP-hard problems such as Boolean Satisfiability (SAT), which shows the potential in bridging the gap between machine learning and symbolic reasoning.

Branch and Bound in Mixed Integer Linear Programming Problems: A Survey of Techniques and Trends

no code implementations5 Nov 2021 Lingying Huang, Xiaomeng Chen, Wei Huo, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi

In order to improve the speed of B&B algorithms, learning techniques have been introduced in this algorithm recently.

Variable Selection

Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision

no code implementations6 Oct 2021 Ruijie Ren, Mohit Gurnani Rajesh, Jordi Sanchez-Riera, Fan Zhang, Yurun Tian, Antonio Agudo, Yiannis Demiris, Krystian Mikolajczyk, Francesc Moreno-Noguer

We show that training our network solely with synthetic data and the proposed DA yields results competitive with models trained on real data.

Domain Adaptation

Coded Computation across Shared Heterogeneous Workers with Communication Delay

no code implementations23 Sep 2021 Yuxuan Sun, Fan Zhang, Junlin Zhao, Sheng Zhou, Zhisheng Niu, Deniz Gündüz

In this work, we consider a multi-master heterogeneous-worker distributed computing scenario, where multiple matrix multiplication tasks are encoded and allocated to workers for parallel computation.

Distributed Computing

Efficient Context-Aware Network for Abdominal Multi-organ Segmentation

1 code implementation22 Sep 2021 Fan Zhang, Yu Wang, Hua Yang

For the context block, we propose a strip pooling module to capture the anisotropic and long-range contextual information that exists in abdominal scenes.

Organ Segmentation

DSNet: A Dual-Stream Framework for Weakly-Supervised Gigapixel Pathology Image Analysis

no code implementations13 Sep 2021 Tiange Xiang, Yang Song, Chaoyi Zhang, Dongnan Liu, Mei Chen, Fan Zhang, Heng Huang, Lauren O'Donnell, Weidong Cai

With image-level labels only, patch-wise classification would be sub-optimal due to inconsistency between the patch appearance and image-level label.

Classification whole slide images

Ultralow complexity long short-term memory network for fiber nonlinearity mitigation in coherent optical communication systems

no code implementations12 Aug 2021 Hao Ming, Xinyu Chen, Xiansong Fang, Lei Zhang, Chenjia Li, Fan Zhang

In this paper, we propose a center-oriented long short-term memory network (Co-LSTM) incorporating a simplified mode with a recycling mechanism in the equalization operation, which can mitigate fiber nonlinearity in coherent optical communication systems with ultralow complexity.

SR-HetGNN: Session-based Recommendation with Heterogeneous Graph Neural Network

no code implementations12 Aug 2021 Jinpeng Chen, Haiyang Li, Xudong Zhang, Fan Zhang, Senzhang Wang, Kaimin Wei, Jiaqi Ji

The current studies generally learn user preferences according to the transitions of items in the user's session sequence.

Session-Based Recommendations

On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation

no code implementations ACL 2021 Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

An explainable two-dimensional single model deep learning approach for Alzheimer's disease diagnosis and brain atrophy localization

no code implementations28 Jul 2021 Fan Zhang, Bo Pan, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

In this research, we propose a novel end-to-end deep learning approach for automated diagnosis of AD and localization of important brain regions related to the disease from sMRI data.

Data Augmentation

Deep Fiber Clustering: Anatomically Informed Unsupervised Deep Learning for Fast and Effective White Matter Parcellation

no code implementations11 Jul 2021 Yuqian Chen, Chaoyi Zhang, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O'Donnell

White matter fiber clustering (WMFC) enables parcellation of white matter tractography for applications such as disease classification and anatomical tract segmentation.

Clustering Segmentation +1

Learning Temporal Consistency for Low Light Video Enhancement From Single Images

1 code implementation CVPR 2021 Fan Zhang, Yu Li, ShaoDi You, Ying Fu

Based on this idea, we propose our method which can infer motion prior for single image low light video enhancement and enforce temporal consistency.

Optical Flow Estimation Video Enhancement

Quality assessment methods for perceptual video compression

no code implementations15 Jun 2021 Fan Zhang, David R. Bull

This paper describes a quality assessment model for perceptual video compression applications (PVM), which simulates visual masking and distortion-artefact perception using an adaptive combination of noticeable distortions and blurring artefacts.

Video Compression

An adaptive Lagrange multiplier determination method for rate-distortion optimisation in hybrid video codecs

no code implementations15 Jun 2021 Fan Zhang, David R. Bull

This paper describes an adaptive Lagrange multiplier determination method for rate-quality optimisation in video compression.

Video Compression

Perceptually-inspired super-resolution of compressed videos

no code implementations15 Jun 2021 Di Ma, Mariana Afonso, Fan Zhang, David R. Bull

Spatial resolution adaptation is a technique which has often been employed in video compression to enhance coding efficiency.

Generative Adversarial Network Super-Resolution +1

On Sample Based Explanation Methods for NLP: Efficiency, Faithfulness, and Semantic Evaluation

no code implementations9 Jun 2021 Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

no code implementations8 Jun 2021 Xiaocheng Tang, Zhiwei Qin, Fan Zhang, Zhaodong Wang, Zhe Xu, Yintai Ma, Hongtu Zhu, Jieping Ye

In this work, we propose a deep reinforcement learning based solution for order dispatching and we conduct large scale online A/B tests on DiDi's ride-dispatching platform to show that the proposed method achieves significant improvement on both total driver income and user experience related metrics.

reinforcement-learning Reinforcement Learning (RL) +1

Deception Detection in Videos using the Facial Action Coding System

no code implementations28 May 2021 Hammad Ud Din Ahmed, Usama Ijaz Bajwa, Fan Zhang, Muhammad Waqas Anwar

We specifically use long short-term memory (LSTM) which we trained using the real-life trial dataset and it provided one of the best facial only approaches to deception detection.

Deception Detection In Videos Decision Making

Quantitative mapping of the brain's structural connectivity using diffusion MRI tractography: a review

no code implementations23 Apr 2021 Fan Zhang, Alessandro Daducci, Yong He, Simona Schiavi, Caio Seguin, Robert Smith, Chun-Hung Yeh, Tengda Zhao, Lauren J. O'Donnell

Diffusion magnetic resonance imaging (dMRI) tractography is an advanced imaging technique that enables in vivo mapping of the brain's white matter connections at macro scale.

Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection

no code implementations NAACL 2021 Sihao Chen, Fan Zhang, Kazoo Sone, Dan Roth

Despite significant progress in neural abstractive summarization, recent studies have shown that the current models are prone to generating summaries that are unfaithful to the original context.

Abstractive Text Summarization Hallucination

Individually Fair Gradient Boosting

no code implementations ICLR 2021 Alexander Vargo, Fan Zhang, Mikhail Yurochkin, Yuekai Sun

Gradient boosting is a popular method for machine learning from tabular data, which arise often in applications where algorithmic fairness is a concern.

Fairness

Robustifying Conditional Portfolio Decisions via Optimal Transport

no code implementations30 Mar 2021 Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

Despite the non-linearity of the objective function in the probability measure, we show that the distributionally robust portfolio allocation with side information problem can be reformulated as a finite-dimensional optimization problem.

A Subjective Study on Videos at Various Bit Depths

no code implementations18 Mar 2021 Alex Mackin, Di Ma, Fan Zhang, David Bull

Bit depth adaptation, where the bit depth of a video sequence is reduced before transmission and up-sampled during display, can potentially reduce data rates with limited impact on perceptual quality.

VMAF-based Bitrate Ladder Estimation for Adaptive Streaming

no code implementations12 Mar 2021 Angeliki V. Katsenou, Fan Zhang, Kyle Swanson, Mariana Afonso, Joel Sole, David R. Bull

In HTTP Adaptive Streaming, video content is conventionally encoded by adapting its spatial resolution and quantization level to best match the prevailing network state and display characteristics.

Quantization

Enhancing VMAF through New Feature Integration and Model Combination

no code implementations10 Mar 2021 Fan Zhang, Angeliki Katsenou, Christos Bampis, Lukas Krasula, Zhi Li, David Bull

VMAF is a machine learning based video quality assessment method, originally designed for streaming applications, which combines multiple quality metrics and video features through SVM regression.

regression Video Quality Assessment

Three-dimensional charge density wave and robust zero-bias conductance peak inside the superconducting vortex core of a kagome superconductor CsV$_3$Sb$_5$

no code implementations8 Mar 2021 Zuowei Liang, Xingyuan Hou, Wanru Ma, Fan Zhang, Ping Wu, Zongyuan Zhang, Fanghang Yu, J.-J. Ying, Kun Jiang, Lei Shan, Zhenyu Wang, X.-H. Chen

The transition-metal-based kagome metals provide a versatile platform for correlated topological phases hosting various electronic instabilities.

Superconductivity Strongly Correlated Electrons

Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning

no code implementations8 Mar 2021 Yan Jiao, Xiaocheng Tang, Zhiwei Qin, Shuaiji Li, Fan Zhang, Hongtu Zhu, Jieping Ye

We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms.

reinforcement-learning Reinforcement Learning (RL)

Sensing population distribution from satellite imagery via deep learning: model selection, neighboring effect, and systematic biases

no code implementations3 Mar 2021 Xiao Huang, Di Zhu, Fan Zhang, Tao Liu, Xiao Li, Lei Zou

The rapid development of remote sensing techniques provides rich, large-coverage, and high-temporal information of the ground, which can be coupled with the emerging deep learning approaches that enable latent features and hidden geographical patterns to be extracted.

Model Selection

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation

no code implementations3 Feb 2021 Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang

In this paper, we apply multiscale area attention in a deep convolutional neural network to attend emotional characteristics with varied granularities and therefore the classifier can benefit from an ensemble of attentions with different scales.

Data Augmentation Speech Emotion Recognition

Room-Temperature Superconductivity in Boron-Nitrogen Doped Lanthanum Superhydride

no code implementations24 Dec 2020 Yanfeng Ge, Fan Zhang, Russell J. Hemley

Recent theoretical and experimental studies of hydrogen-rich materials at megabar pressures (i.e., >100 GPa) have led to the discovery of very high-temperature superconductivity in these materials.

Superconductivity Materials Science

Two-fluid Modeling of Acoustic Wave Propagation in Gravitationally Stratified Isothermal Media

no code implementations26 Nov 2020 Fan Zhang, Stefaan Poedts, Andrea Lani, Błażej Kuźma, Kris Murawski

In the present numerical simulations, the initial density is specified to reach hydrostatic equilibrium, and as a comparison, chemical equilibrium is also taken into account to provide a density profile that differs from typical hydrostatic equilibrium profiles.

Plasma Physics Solar and Stellar Astrophysics

Hole-Doped Room-Temperature Superconductivity in H$_{3}$S$_{1-x}$Z$_x$ (Z=C, Si)

no code implementations25 Nov 2020 Yanfeng Ge, Fan Zhang, Ranga P. Dias, Russell J. Hemley, Yugui Yao

We examine the effects of the low-level substitution of S atoms by C and Si atoms on the superconductivity of H$_3$S with the $Im\bar{3}m$ structure at megabar pressure.

Superconductivity Materials Science

Distributionally Robust Local Non-parametric Conditional Estimation

no code implementations NeurIPS 2020 Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

Conditional estimation given specific covariate values (i.e., local conditional estimation or functional estimation) is ubiquitously useful with applications in engineering, social and natural sciences.

A simulation environment for drone cinematography

no code implementations3 Oct 2020 Fan Zhang, David Hall, Tao Xu, Stephen Boyle, David Bull

Methods for environmental image capture, 3D reconstruction (photogrammetry) and the creation of foreground assets are presented along with a flexible and user-friendly simulation interface.

3D Reconstruction

Video Compression with CNN-based Post Processing

no code implementations16 Sep 2020 Fan Zhang, Di Ma, Chen Feng, David R. Bull

In recent years, video compression techniques have been significantly challenged by the rapidly increased demands associated with high quality and immersive video content.

Video Compression

P-DIFF: Learning Classifier with Noisy Labels based on Probability Difference Distributions

1 code implementation14 Sep 2020 Wei Hu, QiHao Zhao, Yangyu Huang, Fan Zhang

Learning deep neural network (DNN) classifier with noisy labels is a challenging task because the DNN can easily over-fit on these noisy labels due to its high capability.

PDAM: A Panoptic-Level Feature Alignment Framework for Unsupervised Domain Adaptive Instance Segmentation in Microscopy Images

1 code implementation11 Sep 2020 Dongnan Liu, Donghao Zhang, Yang Song, Fan Zhang, Lauren O'Donnell, Heng Huang, Mei Chen, Weidong Cai

In this work, we present an unsupervised domain adaptation (UDA) method, named Panoptic Domain Adaptive Mask R-CNN (PDAM), for unsupervised instance segmentation in microscopy images.

Instance Segmentation Segmentation +3

Joint Bandwidth Allocation and Path Selection in WANs with Path Cardinality Constraints

no code implementations10 Aug 2020 Jinxin Wang, Fan Zhang, Zhonglin Xie, Gong Zhang, Zaiwen Wen

Almost all existing works deal with such a problem using relaxation techniques to transform it to be a convex optimization problem.

Fairness

Video compression with low complexity CNN-based spatial resolution adaptation

no code implementations29 Jul 2020 Di Ma, Fan Zhang, David R. Bull

It has recently been demonstrated that spatial resolution adaptation can be integrated within video compression to improve overall coding performance by spatially down-sampling before encoding and super-resolving at the decoder.

Super-Resolution Video Compression

MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering

no code implementations14 Jul 2020 Di Ma, Fan Zhang, David R. Bull

Each MFRB extracts features from multiple convolutional layers using dense connections and a multi-level residual learning structure.

Video Compression

MediaPipe Hands: On-device Real-time Hand Tracking

4 code implementations18 Jun 2020 Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann

We present a real-time on-device hand tracking pipeline that predicts a hand skeleton from a single RGB camera for AR/VR applications.

BlazePose: On-device Real-time Body Pose tracking

7 code implementations17 Jun 2020 Valentin Bazarevsky, Ivan Grishchenko, Karthik Raveendran, Tyler Zhu, Fan Zhang, Matthias Grundmann

We present BlazePose, a lightweight convolutional neural network architecture for human pose estimation that is tailored for real-time inference on mobile devices.

2D Human Pose Estimation 3D Human Pose Estimation +4

Distributionally Robust Batch Contextual Bandits

no code implementations10 Jun 2020 Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

Leveraging this evaluation scheme, we further propose a novel learning algorithm that is able to learn a policy that is robust to adversarial perturbations and unknown covariate shifts with a performance guarantee based on the theory of uniform convergence.

Multi-Armed Bandits

Integrating global spatial features in CNN based Hyperspectral/SAR imagery classification

no code implementations30 May 2020 Fan Zhang, MinChao Yan, Chen Hu, Jun Ni, Fei Ma

In addition, a dual-branch convolutional neural network (CNN) classification method is designed in combination with the global information to mine the pixel features of the image.

Classification General Classification +3

Active Fuzzing for Testing and Securing Cyber-Physical Systems

1 code implementation28 May 2020 Yuqi Chen, Bohan Xuan, Christopher M. Poskitt, Jun Sun, Fan Zhang

Cyber-physical systems (CPSs) in critical infrastructure face a pervasive threat from attackers, motivating research into a variety of countermeasures for securing them.

Active Learning

Defending Model Inversion and Membership Inference Attacks via Prediction Purification

no code implementations8 May 2020 Ziqi Yang, Bin Shao, Bohan Xuan, Ee-Chien Chang, Fan Zhang

Neural networks are susceptible to data inference attacks such as the model inversion attack and the membership inference attack, where the attacker could infer the reconstruction and the membership of a data sample from the confidence scores predicted by the target classifier.

Inference Attack Membership Inference Attack

Encoding in the Dark Grand Challenge: An Overview

no code implementations7 May 2020 Nantheera Anantrasirichai, Fan Zhang, Alexandra Malyugina, Paul Hill, Angeliki Katsenou

In this paper, we present an overview of the proposed challenge, and test state-of-the-art methods that will be part of the benchmark methods at the stage of the participants' deliverable assessment.

Denoising Image Enhancement

TRAKO: Efficient Transmission of Tractography Data for Visualization

1 code implementation26 Apr 2020 Daniel Haehn, Loraine Franke, Fan Zhang, Suheyla Cetin Karayumak, Steve Pieper, Lauren O'Donnell, Yogesh Rathi

Fiber tracking produces large tractography datasets that are tens of gigabytes in size consisting of millions of streamlines.

BVI-DVC: A Training Database for Deep Video Compression

no code implementations30 Mar 2020 Di Ma, Fan Zhang, David R. Bull

Deep learning methods are increasingly being applied in the optimisation of video compression algorithms and can achieve significantly enhanced coding gains, compared to conventional approaches.

Video Compression

BVI-CC: A Dataset for Research on Video Compression and Quality Assessment

no code implementations23 Mar 2020 Angeliki V. Katsenou, Fan Zhang, Mariana Afonso, Goce Dimitrov, David R. Bull

The compression efficiency of the codecs was evaluated with commonly used objective quality metrics, and the subjective quality of their reconstructed content was also evaluated through psychophysical experiments.

Video Compression

Residual-Recursion Autoencoder for Shape Illustration Images

no code implementations6 Feb 2020 Qianwei Zhou, Peng Tao, Xiaoxin Li, Sheng-Yong Chen, Fan Zhang, Haigen Hu

Shape illustration images (SIIs) are common and important in describing the cross-sections of industrial products.

Efficient Scenario Generation for Heavy-tailed Chance Constrained Optimization

no code implementations6 Feb 2020 Jose Blanchet, Fan Zhang, Bert Zwart

We consider a generic class of chance-constrained optimization problems with heavy-tailed (i.e., power-law type) risk factors.

Optimization and Control Probability

Concurrently Extrapolating and Interpolating Networks for Continuous Model Generation

1 code implementation12 Jan 2020 Lijun Zhao, Jinjing Zhang, Fan Zhang, Anhong Wang, Huihui Bai, Yao Zhao

Most deep image smoothing operators are always trained repetitively when different explicit structure-texture pairs are employed as label images for each algorithm configured with different parameters.

image smoothing

Mitigate Parasitic Resistance in Resistive Crossbar-based Convolutional Neural Networks

no code implementations17 Dec 2019 Fan Zhang, Miao Hu

We demonstrated the proposed methods with implementations of a 4-layer CNN on MNIST and ResNet (20, 32, and 56) on CIFAR-10.

Defects Mitigation in Resistive Crossbars for Analog Vector Matrix Multiplication

no code implementations17 Dec 2019 Fan Zhang, Miao Hu

With storage and computation happening at the same place, computing in resistive crossbars minimizes data movement and avoids the memory bottleneck issue.

Hepatocellular Carcinoma Intra-arterial Treatment Response Prediction for Improved Therapeutic Decision-Making

no code implementations 1 Dec 2019 Junlin Yang, Nicha C. Dvornek, Fan Zhang, Julius Chapiro, MingDe Lin, Aaron Abajian, James S. Duncan

This work proposes a pipeline to predict treatment response to intra-arterial therapy of patients with Hepatocellular Carcinoma (HCC) for improved therapeutic decision-making.

Decision Making

Inexact Primal-Dual Gradient Projection Methods for Nonlinear Optimization on Convex Set

no code implementations 18 Nov 2019 Fan Zhang, Hao Wang, Jiashan Wang, Kai Yang

In this paper, we propose a novel primal-dual inexact gradient projection method for nonlinear optimization problems with a convex-set constraint.

ViSTRA2: Video Coding using Spatial Resolution and Effective Bit Depth Adaptation

no code implementations 7 Nov 2019 Fan Zhang, Mariana Afonso, David R. Bull

Our results show consistent and significant compression gains against HM and VVC based on Bjøntegaard Delta measurements, with average BD-rate savings of 12.6% (PSNR) and 19.5% (VMAF) over HM and 5.5% (PSNR) and 8.6% (VMAF) over VTM.

Video Compression

ACFNet: Attentional Class Feature Network for Semantic Segmentation

1 code implementation ICCV 2019 Fan Zhang, Yanqin Chen, Zhihang Li, Zhibin Hong, Jingtuo Liu, Feifei Ma, Junyu Han, Errui Ding

Recent works have made great progress in semantic segmentation by exploiting richer context, most of which are designed from a spatial perspective.

Segmentation Semantic Segmentation

Edge AIBench: Towards Comprehensive End-to-end Edge Computing Benchmarking

no code implementations 6 Aug 2019 Tianshu Hao, Yunyou Huang, Xu Wen, Wanling Gao, Fan Zhang, Chen Zheng, Lei Wang, Hainan Ye, Kai Hwang, Zujie Ren, Jianfeng Zhan

In edge computing scenarios, the distribution of data and collaboration of workloads on different layers are serious concerns for performance, privacy, and security issues.

Performance Distributed, Parallel, and Cluster Computing

A Survey of Deep Learning-based Object Detection

no code implementations 11 Jul 2019 Licheng Jiao, Fan Zhang, Fang Liu, Shuyuan Yang, Lingling Li, Zhixi Feng, Rong Qu

Object detection is one of the most important and challenging branches of computer vision. It has been widely applied in people's lives, for example in security monitoring and autonomous driving, with the purpose of locating instances of semantic objects of a certain class.

Autonomous Driving Object +2

Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model

no code implementations 1 Jul 2019 Fan Zhang, Tong Wu, Yunlong Wang, Yong Cai, Cao Xiao, Emily Zhao, Lucas Glass, Jimeng Sun

Many computational models have been proposed to extract temporal patterns from clinical time series for each patient and among patient groups for predictive healthcare.

Time Series Time Series Analysis

MediaPipe: A Framework for Building Perception Pipelines

2 code implementations 14 Jun 2019 Camillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris McClanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-Teh Chang, Wei Hua, Manfred Georg, Matthias Grundmann

A developer can use MediaPipe to build prototypes by combining existing perception components, to advance them to polished cross-platform applications and measure system performance and resource consumption on target platforms.

Distributed, Parallel, and Cluster Computing

A Distributionally Robust Boosting Algorithm

no code implementations 20 May 2019 Jose Blanchet, Yang Kang, Fan Zhang, Zhangyi Hu

Distributionally Robust Optimization (DRO) has been shown to provide a flexible framework for decision making under uncertainty and statistical estimation.

Decision Making Decision Making Under Uncertainty

Noise-Tolerant Paradigm for Training Face Recognition CNNs

2 code implementations CVPR 2019 Wei Hu, Yangyu Huang, Fan Zhang, Ruirui Li

Benefiting from large-scale training datasets, deep Convolutional Neural Networks (CNNs) have achieved impressive results in face recognition (FR).

Face Recognition

Quantifying Legibility of Indoor Spaces Using Deep Convolutional Neural Networks: Case Studies in Train Stations

no code implementations 22 Jan 2019 Zhoutong Wang, Qianhui Liang, Fabio Duarte, Fan Zhang, Louis Charron, Lenna Johnsen, Bill Cai, Carlo Ratti

Evaluating legibility is particularly desirable in indoor spaces, since it has a large impact on human behavior and the efficiency of space utilization.

Nonconvex and Nonsmooth Sparse Optimization via Adaptively Iterative Reweighted Methods

no code implementations 24 Oct 2018 Hao Wang, Fan Zhang, Yuanming Shi, Yaohua Hu

We propose a general formulation of nonconvex and nonsmooth sparse optimization problems with a convex set constraint, which can take into account most existing types of nonconvex sparsity-inducing terms, bringing strong applicability to a wide range of applications.

Optimal Transport Based Distributionally Robust Optimization: Structural Properties and Iterative Schemes

1 code implementation 4 Oct 2018 Jose Blanchet, Karthyek Murthy, Fan Zhang

We consider optimal transport based distributionally robust optimization (DRO) problems with locally strongly convex transport cost functions and affine decision rules.

Optimization and Control Primary: 90C15, Secondary: 65K05, 90C47

Memristor-based Deep Convolution Neural Network: A Case Study

no code implementations 14 Sep 2018 Fan Zhang, Miao Hu

In this paper, we first introduce a method to efficiently implement large-scale high-dimensional convolution with realistic memristor-based circuit components.
