Search Results for author: Yang Yang

Found 329 papers, 111 papers with code

A Progressive Framework for Role-Aware Rumor Resolution

1 code implementation COLING 2022 Lei Chen, Guanying Li, Zhongyu Wei, Yang Yang, Baohua Zhou, Qi Zhang, Xuanjing Huang

Existing works on rumor resolution have shown great potential in recognizing word appearance and user participation.

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

1 code implementation28 Mar 2024 Wei Dong, Xing Zhang, Bihui Chen, Dawei Yan, Zhijun Lin, Qingsen Yan, Peng Wang, Yang Yang

Parameter-efficient fine-tuning for pre-trained Vision Transformers aims to adeptly tailor a model to downstream tasks by learning a minimal set of new adaptation parameters while preserving the frozen majority of pre-trained parameters.

High-Resolution Image Translation Model Based on Grayscale Redefinition

no code implementations26 Mar 2024 Xixian Wu, Dian Chao, Yang Yang

Image-to-image translation is a technique that focuses on transferring images from one domain to another while maintaining the essential content representations.

Image-to-Image Translation Translation

Semi-Supervised Image Captioning Considering Wasserstein Graph Matching

no code implementations26 Mar 2024 Yang Yang

Image captioning can automatically generate captions for the given images, and the key challenge is to learn a mapping function from visual features to natural language features.

Data Augmentation Graph Matching +2

Solution for Emotion Prediction Competition of Workshop on Emotionally and Culturally Intelligent AI

no code implementations26 Mar 2024 Shengdong Xu, Zhouyang Chi, Yang Yang

In order to address this issue, we propose a simple yet effective approach called single-multi modal with Emotion-Cultural specific prompt(ECSP), which focuses on using the single modal message to enhance the performance of multimodal models and a well-designed prompt to reduce cultural differences problem.

XLM-R

Solution for Point Tracking Task of ICCV 1st Perception Test Challenge 2023

no code implementations26 Mar 2024 Hongpeng Pan, Yang Yang, Zhongtian Fu, Yuxuan Zhang, Shian Du, Yi Xu, Xiangyang Ji

To address this issue, we propose a simple yet effective approach called TAP with confident static points (TAPIR+), which focuses on rectifying the tracking of the static point in the videos shot by a static camera.

Motion Detection Point Tracking +2

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing

no code implementations25 Mar 2024 Ziwei Chai, Guoyin Wang, Jing Su, Tianjie Zhang, Xuanwen Huang, Xuwu Wang, Jingjing Xu, Jianbo Yuan, Hongxia Yang, Fei Wu, Yang Yang

We present Expert-Token-Routing, a unified generalist framework that facilitates seamless integration of multiple expert LLMs.

Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments

1 code implementation20 Mar 2024 Yang Yang, Wenhai Wang, Zhe Chen, Jifeng Dai, Liang Zheng

However, in the real-world where test ground truths are not provided, it is non-trivial to find out whether bounding boxes are accurate, thus preventing us from assessing the detector generalization ability.

object-detection Object Detection +1

Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges

no code implementations18 Mar 2024 Yang Yang, Mingzhe Chen, Yufei Blankenship, Jemin Lee, Zabih Ghassemlooy, Julian Cheng, Shiwen Mao

The purpose of this paper is to provide a comprehensive overview of existing works and new trends in the field of positioning techniques from both the academic and industrial perspectives.

Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction

no code implementations18 Mar 2024 Hongxiao Wang, Yang Yang, Zhuo Zhao, Pengfei Gu, Nishchal Sapkota, Danny Z. Chen

For predicting cancer survival outcomes, standard approaches in clinical research are often based on two main modalities: pathology images for observing cell morphology features, and genomic (e. g., bulk RNA-seq) for quantifying gene expressions.

Survival Prediction

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

1 code implementation18 Mar 2024 Yang Yang, Wen Wang, Liang Peng, Chaotian Song, Yao Chen, Hengjia Li, Xiaolong Yang, Qinglin Lu, Deng Cai, Boxi Wu, Wei Liu

Customization generation techniques have significantly advanced the synthesis of specific concepts across varied contexts.

An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem

no code implementations17 Mar 2024 Yang Yang

As an important part of genetic algorithms (GAs), mutation operators is widely used in evolutionary algorithms to solve $\mathcal{NP}$-hard problems because it can increase the population diversity of individual.

Evolutionary Algorithms Math

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

no code implementations15 Mar 2024 Meixuan Li, Tianyu Li, Guoqing Wang, Peng Wang, Yang Yang, Heng Tao Shen

Aligning these distributions between corresponding regions from different tasks imparts higher flexibility and capacity to capture intra-region structures, accommodating a broader range of tasks.

Depth Estimation Semantic Segmentation +1

SEMRes-DDPM: Residual Network Based Diffusion Modelling Applied to Imbalanced Data

no code implementations9 Mar 2024 Ming Zheng, Yang Yang, Zhi-Hang Zhao, Shan-Chao Gan, Yang Chen, Si-Kai Ni, Yang Lu

In the current oversampling methods based on generative networks, the methods based on GANs can capture the true distribution of data, but there is the problem of pattern collapse and training instability in training; in the oversampling methods based on denoising diffusion probability models, the neural network of the inverse diffusion process using the U-Net is not applicable to tabular data, and although the MLP can be used to replace the U-Net, the problem exists due to the simplicity of the structure and the poor effect of removing noise.

Denoising

Towards Efficient and Effective Unlearning of Large Language Models for Recommendation

1 code implementation6 Mar 2024 Hangyu Wang, Jianghao Lin, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu

However, in order to protect user privacy and optimize utility, it is also crucial for LLMRec to intentionally forget specific user data, which is generally referred to as recommendation unlearning.

World Knowledge

Event-Driven Learning for Spiking Neural Networks

no code implementations1 Mar 2024 Wenjie Wei, Malu Zhang, Jilin Zhang, Ammar Belatreche, Jibin Wu, Zijing Xu, Xuerui Qiu, Hong Chen, Yang Yang, Haizhou Li

Specifically, we introduce two novel event-driven learning methods: the spike-timing-dependent event-driven (STD-ED) and membrane-potential-dependent event-driven (MPD-ED) algorithms.

Can GNN be Good Adapter for LLMs?

2 code implementations20 Feb 2024 Xuanwen Huang, Kaiqiao Han, Yang Yang, Dezheng Bao, Quanjin Tao, Ziwei Chai, Qi Zhu

In terms of efficiency, the GNN adapter introduces only a few trainable parameters and can be trained with low computation costs.

Node Classification Recommendation Systems +2

Brant-2: Foundation Model for Brain Signals

1 code implementation15 Feb 2024 Zhizhang Yuan, Daoze Zhang, Junru Chen, Gefei Gu, Yang Yang

Foundational models benefit from pre-training on large amounts of unlabeled data and enable strong performance in a wide variety of applications with a small amount of labeled data.

Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

1 code implementation14 Feb 2024 Linfeng Cao, Haoran Deng, Yang Yang, Chunping Wang, Lei Chen

In this paper, we argue that properly fetching and condensing the background nodes from massive web graph data might be a more economical shortcut to tackle the obstacles fundamentally.

Feature Correlation Graph Mining +1

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

1 code implementation6 Feb 2024 Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

We introduce MobileVLM V2, a family of significantly improved vision language models upon MobileVLM, which proves that a delicate orchestration of novel architectural design, an improved training scheme tailored for mobile VLMs, and rich high-quality dataset curation can substantially benefit VLMs' performance.

AutoML Language Modelling

Unveiling Latent Causal Rules: A Temporal Point Process Approach for Abnormal Event Explanation

no code implementations3 Feb 2024 Yiling Kuang, Chao Yang, Yang Yang, Shuang Li

In the M-step, we update both the rule set and model parameters to enhance the likelihood function's lower bound.

Point Processes

One Graph Model for Cross-domain Dynamic Link Prediction

no code implementations3 Feb 2024 Xuanwen Huang, Wei Chow, Yang Wang, Ziwei Chai, Chunping Wang, Lei Chen, Yang Yang

Extensive experiments on eight untrained graphs demonstrate that DyExpert achieves state-of-the-art performance in cross-domain link prediction.

Dynamic Link Prediction

Are Synthetic Time-series Data Really not as Good as Real Data?

no code implementations1 Feb 2024 Fanzhe Fu, Junru Chen, Jing Zhang, Carl Yang, Lvbin Ma, Yang Yang

Time-series data presents limitations stemming from data quality issues, bias and vulnerabilities, and generalization problem.

Representation Learning Time Series

Binaural Angular Separation Network

no code implementations16 Jan 2024 Yang Yang, George Sung, Shao-Fu Shih, Hakan Erdogan, Chehung Lee, Matthias Grundmann

We propose a neural network model that can separate target speech sources from interfering sources at different angular regions using two microphones.

Robust Semi-Supervised Learning for Self-learning Open-World Classes

1 code implementation15 Jan 2024 Wenjuan Xi, Xin Song, Weili Guo, Yang Yang

Existing semi-supervised learning (SSL) methods assume that labeled and unlabeled data share the same class space.

Open-World Semi-Supervised Learning Self-Learning

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

1 code implementation10 Jan 2024 Xueyu Hu, Ziyu Zhao, Shuang Wei, Ziwei Chai, Qianli Ma, Guoyin Wang, Xuwu Wang, Jing Su, Jingjing Xu, Ming Zhu, Yao Cheng, Jianbo Yuan, Jiwei Li, Kun Kuang, Yang Yang, Hongxia Yang, Fei Wu

In this paper, we introduce InfiAgent-DABench, the first benchmark specifically designed to evaluate LLM-based agents on data analysis tasks.

Benchmarking

StreamVC: Real-Time Low-Latency Voice Conversion

no code implementations5 Jan 2024 Yang Yang, Yury Kartynnik, Yunpeng Li, Jiuqiang Tang, Xing Li, George Sung, Matthias Grundmann

We present StreamVC, a streaming voice conversion solution that preserves the content and prosody of any source speech while matching the voice timbre from any target speech.

Speech Synthesis Voice Conversion

GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation

1 code implementation4 Jan 2024 Xuehao Gao, Yang Yang, Zhenyu Xie, Shaoyi Du, Zhongqian Sun, Yang Wu

The whole text-driven human motion synthesis problem is then divided into multiple abstraction levels and solved with a multi-stage generation framework with a cascaded latent diffusion model: an initial generator first generates the coarsest human motion guess from a given text description; then, a series of successive generators gradually enrich the motion details based on the textual description and the previous synthesized results.

Motion Synthesis

OFDM-Based Digital Semantic Communication with Importance Awareness

no code implementations4 Jan 2024 Chuanhong Liu, Caili Guo, Yang Yang, Wanli Ni, Tony Q. S. Quek

Based on semantic importance, we formulate a sub-carrier and bit allocation problem to maximize communication performance.

Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns

1 code implementation21 Dec 2023 Yifei Sun, Qi Zhu, Yang Yang, Chunping Wang, Tianyu Fan, Jiajun Zhu, Lei Chen

In this paper, we identify the fundamental cause of structural divergence as the discrepancy of generative patterns between the pre-training and downstream graphs.

Graph Mining Transfer Learning

Towards Fair Graph Federated Learning via Incentive Mechanisms

1 code implementation20 Dec 2023 Chenglu Pan, Jiarong Xu, Yue Yu, Ziqi Yang, Qingbiao Wu, Chunping Wang, Lei Chen, Yang Yang

Extensive experiments show that our model achieves the best trade-off between accuracy and the fairness of model gradient, as well as superior payoff fairness.

Fairness Federated Learning +1

Generalized Damping Torque Analysis of Ultra-Low Frequency Oscillation in the Jerk Space

no code implementations7 Dec 2023 Yichen Zhou, Yang Yang, Tao Zhou, Yonggang Li

A multi-information variable is constructed to transform the system into a new state space, where it is found that the jerk dynamics of the turbine-generator cascaded system is a second-order differential equation.

A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

no code implementations1 Dec 2023 Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed.

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model

1 code implementation20 Nov 2023 Lei Geng, Xu Yan, Ziqiang Cao, Juntao Li, Wenjie Li, Sujian Li, Xinjie Zhou, Yang Yang, Jun Zhang

We achieve a biomedical multilingual corpus by incorporating three granularity knowledge alignments (entity, fact, and passage levels) into monolingual corpora.

Relation XLM-R

Better with Less: A Data-Active Perspective on Pre-Training Graph Neural Networks

1 code implementation NeurIPS 2023 Jiarong Xu, Renhong Huang, Xin Jiang, Yuxuan Cao, Carl Yang, Chunping Wang, Yang Yang

The proposed pre-training pipeline is called the data-active graph pre-training (APT) framework, and is composed of a graph selector and a pre-training model.

Technical Note: Feasibility of translating 3.0T-trained Deep-Learning Segmentation Models Out-of-the-Box on Low-Field MRI 0.55T Knee-MRI of Healthy Controls

no code implementations26 Oct 2023 Rupsa Bhattacharjee, Zehra Akkaya, Johanna Luitjens, Pan Su, Yang Yang, Valentina Pedoia, Sharmila Majumdar

The current study assesses the performance of standard in-practice bone, and cartilage segmentation algorithms at 0. 55T, both qualitatively and quantitatively, in terms of comparing segmentation performance, areas of improvement, and compartment-wise cartilage thickness values between 0. 55T vs. 3. 0T.

Image Segmentation Segmentation +1

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

no code implementations24 Oct 2023 Yinjie Lei, Zixuan Wang, Feng Chen, Guoqing Wang, Peng Wang, Yang Yang

Multi-modal 3D scene understanding has gained considerable attention due to its wide applications in many areas, such as autonomous driving and human-computer interaction.

Autonomous Driving Scene Understanding

Non-Autoregressive Sentence Ordering

1 code implementation19 Oct 2023 Yi Bin, Wenhao Shi, Bin Ji, Jipeng Zhang, Yujuan Ding, Yang Yang

Existing sentence ordering approaches generally employ encoder-decoder frameworks with the pointer net to recover the coherence by recurrently predicting each sentence step-by-step.

Sentence Sentence Ordering

Solving Math Word Problems with Reexamination

1 code implementation14 Oct 2023 Yi Bin, Wenhao Shi, Yujuan Ding, Yang Yang, See-Kiong Ng

Math word problem (MWP) solving aims to understand the descriptive math problem and calculate the result, for which previous efforts are mostly devoted to upgrade different technical modules.

Descriptive Math

Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023

no code implementations10 Oct 2023 Xiangyu Wu, Yang Yang, Shengdong Xu, Yifeng Wu, QingGuo Chen, Jianfeng Lu

At the data level, inspired by the challenge paper, we categorized the whole questions into eight types and utilized the llama-2-chat model to directly generate the type for each question in a zero-shot manner.

object-detection Object Detection +3

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model

1 code implementation9 Oct 2023 Ziwei Chai, Tianjie Zhang, Liang Wu, Kaiqiao Han, Xiaohai Hu, Xuanwen Huang, Yang Yang

This synergy equips LLMs with the ability to proficiently interpret and reason on graph data, harnessing the superior expressive power of graph learning models.

Graph Learning Language Modelling +1

Towards Scalable Wireless Federated Learning: Challenges and Solutions

no code implementations8 Oct 2023 Yong Zhou, Yuanming Shi, Haibo Zhou, Jingjing Wang, Liqun Fu, Yang Yang

The explosive growth of smart devices (e. g., mobile phones, vehicles, drones) with sensing, communication, and computation capabilities gives rise to an unprecedented amount of data.

Federated Learning Privacy Preserving

Twin Graph-based Anomaly Detection via Attentive Multi-Modal Learning for Microservice System

1 code implementation7 Oct 2023 Jun Huang, Yang Yang, Hang Yu, Jianguo Li, Xiao Zheng

The MST graph provides a virtual representation of the status and scheduling relationships among service instances of a real-world microservice system.

Anomaly Detection Scheduling

CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis

no code implementations6 Oct 2023 Xiaoxiao Sun, Xingjian Leng, Zijian Wang, Yang Yang, Zi Huang, Liang Zheng

Analyzing model performance in various unseen environments is a critical research problem in the machine learning community.

Benchmarking Domain Generalization +1

Functional Geometry Guided Protein Sequence and Backbone Structure Co-Design

1 code implementation6 Oct 2023 Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Yang Yang, Lei LI

In this paper, we propose NAEPro, a model to jointly design Protein sequence and structure based on automatically detected functional sites.

Joint Design of Protein Sequence and Structure based on Motifs

no code implementations4 Oct 2023 Zhenqiao Song, Yunlong Zhao, Yufei Song, Wenxian Shi, Yang Yang, Lei LI

Designing novel proteins with desired functions is crucial in biology and chemistry.

Learning to Generate Lumped Hydrological Models

1 code implementation18 Sep 2023 Yang Yang, Ting Fong May Chui

Overall, this study demonstrates that the hydrological behavior of a catchment can be effectively described using a small number of latent variables, and that well-fitting hydrologic model functions can be reconstructed from these variables.

How to Generate Popular Post Headlines on Social Media?

no code implementations18 Sep 2023 Zhouxiang Fang, Min Yu, Zhendong Fu, Boning Zhang, Xuanwen Huang, Xiaoqi Tang, Yang Yang

Observation results demonstrate that trends and personal styles are widespread in headlines on social medias and have significant contribution to posts's popularity.

Headline Generation

Cross-Utterance Conditioned VAE for Speech Generation

no code implementations8 Sep 2023 Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun

Experimental results on the LibriTTS datasets demonstrate that our proposed models significantly enhance speech synthesis and editing, producing more natural and expressive speech.

Speech Synthesis

SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search

no code implementations15 Aug 2023 Wen Zan, Yaopeng Han, Xiaotian Jiang, Yao Xiao, Yang Yang, Dayao Chen, Sheng Chen

At pretraining stage, we propose an effective pretraining method that employs both query and multiple fields of document as inputs, including an effective information compression method for lengthy fields.

Language Modelling

Routing Recovery for UAV Networks with Deliberate Attacks: A Reinforcement Learning based Approach

no code implementations14 Aug 2023 Sijie He, Ziye Jia, Chao Dong, Wei Wang, Yilu Cao, Yang Yang, Qihui Wu

The unmanned aerial vehicle (UAV) network is popular these years due to its various applications.

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination

1 code implementation8 Aug 2023 Haoxuan Li, Yi Bin, Junrong Liao, Yang Yang, Heng Tao Shen

Most existing image-text matching methods adopt triplet loss as the optimization objective, and choosing a proper negative sample for the triplet of <anchor, positive, negative> is important for effectively training the model, e. g., hard negatives make the model learn efficiently and effectively.

Image-text matching Representation Learning +1

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval

1 code implementation8 Aug 2023 Yi Bin, Haoxuan Li, Yahui Xu, Xing Xu, Yang Yang, Heng Tao Shen

Specifically, on two key tasks, \textit{i. e.}, image-to-text and text-to-image retrieval, HAT achieves 7. 6\% and 16. 7\% relative score improvement of Recall@1 on MSCOCO, and 4. 4\% and 11. 6\% on Flickr30k respectively.

Cross-Modal Retrieval Image Retrieval +1

A Novel DDPM-based Ensemble Approach for Energy Theft Detection in Smart Grids

no code implementations30 Jul 2023 Xun Yuan, Yang Yang, Asif Iqbal, Prosanta Gope, Biplab Sikdar

To address these challenges, several unsupervised ETD methods have been proposed, focusing on learning the normal patterns from honest users, specifically the reconstruction of input.

Denoising

MBrain: A Multi-channel Self-Supervised Learning Framework for Brain Signals

no code implementations15 Jun 2023 Donghong Cai, Junru Chen, Yang Yang, Teng Liu, Yafeng Li

Intuitively, brain signals, generated by the firing of neurons, are transmitted among different connecting structures in human brain.

EEG Seizure Detection +1

Accelerating Dynamic Network Embedding with Billions of Parameter Updates to Milliseconds

1 code implementation15 Jun 2023 Haoran Deng, Yang Yang, Jiahe Li, Haoyang Cai, ShiLiang Pu, Weihao Jiang

Network embedding, a graph representation learning method illustrating network topology by mapping nodes into lower-dimension vectors, is challenging to accommodate the ever-changing dynamic graphs in practice.

Graph Reconstruction Graph Representation Learning +3

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

1 code implementation11 Jun 2023 Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Yang Yang, Hongyin Tang, Keqing He, Jiahao Liu, Jingang Wang, Shu Zhao, Peng Zhang, Jie Tang

Currently, the reduction in the parameter scale of large-scale pre-trained language models (PLMs) through knowledge distillation has greatly facilitated their widespread deployment on various devices.

General Knowledge Knowledge Distillation +1

Probabilistic Multi-Dimensional Classification

1 code implementation10 Jun 2023 Vu-Linh Nguyen, Yang Yang, Cassio de Campos

We propose a formal framework for probabilistic MDC in which learning an optimal multi-dimensional classifier can be decomposed, without loss of generality, into learning a set of (smaller) single-variable multi-class probabilistic classifiers and a directed acyclic graph.

Classification

A Novel Correlation-optimized Deep Learning Method for Wind Speed Forecast

1 code implementation3 Jun 2023 Yang Yang, Jin Lang, Jian Wu, Yanyan Zhang, Xiang Zhao

Finally, the effectiveness of the proposed method is verified by three wind prediction cases from a wind farm in Liaoning, China.

Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning

1 code implementation1 Jun 2023 Shengqin Jiang, Yaoyu Fang, Haokui Zhang, Qingshan Liu, Yuankai Qi, Yang Yang, Peng Wang

Rehearsal-based video incremental learning often employs knowledge distillation to mitigate catastrophic forgetting of previously learned data.

Incremental Learning Knowledge Distillation +1

Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism

no code implementations31 May 2023 Haoxuan Xu, Songning Lai, Xianyang Li, Yang Yang

To address these issues, we propose cross-domain Car Detection Model with integrated convolutional block Attention mechanism(CDMA) that we apply to car recognition for autonomous driving and other areas.

Autonomous Driving object-detection +1

PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models

no code implementations30 May 2023 Zhuocheng Gong, Jiahao Liu, Qifan Wang, Yang Yang, Jingang Wang, Wei Wu, Yunsen Xian, Dongyan Zhao, Rui Yan

While transformer-based pre-trained language models (PLMs) have dominated a number of NLP applications, these models are heavy to deploy and expensive to use.

Quantization

Deep Neural Networks in Video Human Action Recognition: A Review

no code implementations25 May 2023 Zihan Wang, Yang Yang, Zhi Liu, Yifan Zheng

Our current related research addresses multiple novel proposed research works and compares their advantages and disadvantages between the derived deep learning frameworks rather than machine learning frameworks.

Action Recognition Optical Flow Estimation +1

Breaking the Curse of Quality Saturation with User-Centric Ranking

no code implementations24 May 2023 Zhuokai Zhao, Yang Yang, Wenyu Wang, Chihuang Liu, Yu Shi, Wenjie Hu, Haotian Zhang, Shuang Yang

A key puzzle in search, ads, and recommendation is that the ranking model can only utilize a small portion of the vastly available user interaction data.

Faster Video Moment Retrieval with Point-Level Supervision

no code implementations23 May 2023 Xun Jiang, Zailei Zhou, Xing Xu, Yang Yang, Guoqing Wang, Heng Tao Shen

Existing VMR methods suffer from two defects: (1) massive expensive temporal annotations are required to obtain satisfying performance; (2) complicated cross-modal interaction modules are deployed, which lead to high computational cost and low efficiency for the retrieval process.

Moment Retrieval Natural Language Queries +1

Task-agnostic Distillation of Encoder-Decoder Language Models

no code implementations21 May 2023 Chen Zhang, Yang Yang, Jingang Wang, Dawei Song

Finetuning pretrained language models (LMs) have enabled appealing performance on a diverse array of tasks.

Abstractive Text Summarization

Lifting the Curse of Capacity Gap in Distilling Language Models

1 code implementation20 May 2023 Chen Zhang, Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang, Dawei Song

However, when the capacity gap between the teacher and the student is large, a curse of capacity gap appears, invoking a deficiency in distilling LMs.

Knowledge Distillation

Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era

no code implementations10 May 2023 Chenghao Li, Chaoning Zhang, Atish Waghwase, Lik-Hang Lee, Francois Rameau, Yang Yang, Sung-Ho Bae, Choong Seon Hong

AI generated content) has made remarkable progress in the past few years, among which text-guided content generation is the most practical one since it enables the interaction between human instruction and AIGC.

Scene Generation Text to 3D +1

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

2 code implementations9 May 2023 Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, LiMin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Different from existing interactive systems that rely on pure language, by incorporating pointing instructions, the proposed iGPT significantly improves the efficiency of communication between users and chatbots, as well as the accuracy of chatbots in vision-centric tasks, especially in complicated visual scenarios where the number of objects is greater than 2.

Language Modelling

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure

1 code implementation8 May 2023 Yi Bin, Mengqun Han, Wenhao Shi, Lei Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen

For evaluating the possible expression variants, we design a path-based metric to evaluate the partial accuracy of expressions of a unified tree.

Math valid

MrTF: Model Refinery for Transductive Federated Learning

1 code implementation7 May 2023 Xin-Chun Li, Yang Yang, De-Chuan Zhan

We propose a novel learning paradigm named transductive federated learning (TFL) to simultaneously consider the structural information of the to-be-inferred data.

Federated Learning

A Simulation-Augmented Benchmarking Framework for Automatic RSO Streak Detection in Single-Frame Space Images

no code implementations30 Apr 2023 Zhe Chen, Yang Yang, Anne Bettens, Youngho Eun, Xiaofeng Wu

In our framework, by making the best use of the hardware parameters of the sensor that captures real-world space images, we first develop a high-fidelity RSO simulator that can generate various realistic space images.

Benchmarking object-detection +1

Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

1 code implementation23 Apr 2023 Bo Li, Gexiang Fang, Yang Yang, Quansen Wang, Wei Ye, Wen Zhao, Shikun Zhang

The capability of Large Language Models (LLMs) like ChatGPT to comprehend user intent and provide reasonable responses has made them extremely popular lately.

CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

no code implementations19 Apr 2023 Yang Yang, Weijie Ma, Hao Chen, Linlin Ou, Xinyi Yu

The combination of LiDAR and camera modalities is proven to be necessary and typical for 3D object detection according to recent studies.

3D Object Detection Depth Estimation +1

CoVLR: Coordinating Cross-Modal Consistency and Intra-Modal Structure for Vision-Language Retrieval

no code implementations15 Apr 2023 Yang Yang, Zhongtian Fu, Xiangyu Wu, Wenjie Li

To address this challenge, in this paper, we experimentally observe that the vision-language divergence may cause the existence of strong and weak modalities, and the hard cross-modal consistency cannot guarantee that strong modal instances' relationships are not affected by weak modality, resulting in the strong modal instances' relationships perturbed despite learned consistent representations. To this end, we propose a novel and directly Coordinated VisionLanguage Retrieval method (dubbed CoVLR), which aims to study and alleviate the desynchrony problem between the cross-modal alignment and single-modal cluster-preserving tasks.

Cross-Modal Retrieval Instance Search +1

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement

1 code implementation CVPR 2023 Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, Heng Tao Shen

To address this issue, we propose a novel semantic-aware knowledge-guided framework (SKF) that can assist a low-light enhancement model in learning rich and diverse priors encapsulated in a semantic segmentation model.

Low-Light Image Enhancement Semantic Segmentation

$\text{DC}^2$: Dual-Camera Defocus Control by Learning to Refocus

no code implementations6 Apr 2023 Hadi AlZayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements.

Deblurring

When to Pre-Train Graph Neural Networks? From Data Generation Perspective!

1 code implementation29 Mar 2023 Yuxuan Cao, Jiarong Xu, Carl Yang, Jiaan Wang, Yunchao Zhang, Chunping Wang, Lei Chen, Yang Yang

All convex combinations of graphon bases give rise to a generator space, from which graphs generated form the solution space for those downstream data that can benefit from pre-training.

Learning a Deep Color Difference Metric for Photographic Images

1 code implementation CVPR 2023 Haoyu Chen, Zhihua Wang, Yang Yang, Qilin Sun, Kede Ma

Most well-established and widely used color difference (CD) metrics are handcrafted and subject-calibrated against uniformly colored patches, which do not generalize well to photographic images characterized by natural scene complexities.

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding

1 code implementation23 Mar 2023 Ziyang Lu, Yunqiang Pei, Guoqing Wang, Yang Yang, Zheng Wang, Heng Tao Shen

Despite their effectiveness, existing methods suffer from the difficulty of low recognition accuracy in cases of multiple adjacent objects with similar appearances. To address this issue, this work intuitively introduces the human-robot interaction as a cue to facilitate the development of 3D visual grounding.

Visual Grounding

Learning Behavior Recognition in Smart Classroom with Multiple Students Based on YOLOv5

no code implementations20 Mar 2023 Zhifeng Wang, Jialong Yao, Chunyan Zeng, Wanxuan Wu, Hongmin Xu, Yang Yang

The use of computer vision technology to identify students' learning behavior in the classroom can reduce the workload of traditional teachers in supervising students in the classroom, and ensure greater accuracy and comprehensiveness.

Rt-Track: Robust Tricks for Multi-Pedestrian Tracking

no code implementations16 Mar 2023 Yukuan Zhang, Yunhua Jia, Housheng Xie, Mengzhen Li, Limin Zhao, Yang Yang, Shan Zhao

However, modeling the motion and appearance models of objects in complex scenes still faces various challenging issues.

Multi-Object Tracking Object +1

Guided Speech Enhancement Network

no code implementations13 Mar 2023 Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, Matthias Grundmann

Multi-microphone speech enhancement problem is often decomposed into two decoupled steps: a beamformer that provides spatial filtering and a single-channel speech enhancement model that cleans up the beamformer output.

Denoising Speech Enhancement

Graph Neural Networks Enhanced Smart Contract Vulnerability Detection of Educational Blockchain

no code implementations8 Mar 2023 Zhifeng Wang, Wanxuan Wu, Chunyan Zeng, Jialong Yao, Yang Yang, Hongmin Xu

With the development of blockchain technology, more and more attention has been paid to the intersection of blockchain and education, and various educational evaluation systems and E-learning systems are developed based on blockchain technology.

Vulnerability Detection

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

no code implementations ICCV 2023 Benzhi Wang, Yang Yang, Jinlin Wu, Guo-Jun Qi, Zhen Lei

On the other hand, the similarity of cross-scale images is often smaller than that of images with the same scale for a person, which will increase the difficulty of matching.

Person Search

Bayesian Structure Scores for Probabilistic Circuits

1 code implementation23 Feb 2023 Yang Yang, Gennaro Gala, Robert Peharz

Probabilistic circuits (PCs) are a prominent representation of probability distributions with tractable inference.

Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection

no code implementations17 Feb 2023 Yang Yang, Kaixiong Xu, Kaizheng Wang

On the other hand, the cross-modal attention feature fusion module mines the features of both Color and Thermal modalities to complement each other, then the global features are constructed by adding the cross-modal complemented features element by element, which are attentionally weighted to achieve the effective fusion of the two modal features.

Pedestrian Detection

Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance

no code implementations5 Feb 2023 Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang, Chuanhong Liu

Specifically, we first propose semantic importance weight calculation method, which is based on the gradient of intelligent task's perception results with respect to the features.

VQNet 2.0: A New Generation Machine Learning Framework that Unifies Classical and Quantum

no code implementations9 Jan 2023 Huanyu Bian, Zhilong Jia, Menghan Dou, Yuan Fang, Lei LI, Yiming Zhao, Hanchao Wang, Zhaohui Zhou, Wei Wang, Wenyu Zhu, Ye Li, Yang Yang, Weiming Zhang, Nenghai Yu, Zhaoyun Chen, Guoping Guo

Therefore, based on VQNet 1. 0, we further propose VQNet 2. 0, a new generation of unified classical and quantum machine learning framework that supports hybrid optimization.

Quantum Machine Learning Unity

DC2: Dual-Camera Defocus Control by Learning To Refocus

no code implementations CVPR 2023 Hadi AlZayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements.

Deblurring

Multilateral Semantic Relations Modeling for Image Text Retrieval

no code implementations CVPR 2023 Zheng Wang, Zhenwei Gao, Kangshuai Guo, Yang Yang, Xiaoming Wang, Heng Tao Shen

Specifically, a given query is first mapped as a probabilistic embedding to learn its true semantic distribution based on Mahalanobis distance.

Retrieval Text Retrieval

Visible-Infrared Person Re-Identification via Semantic Alignment and Affinity Inference

1 code implementation ICCV 2023 Xingye Fang, Yang Yang, Ying Fu

We propose a Semantic Alignment and Affinity Inference framework (SAAI), which aims to align latent semantic part features with the learnable prototypes and improve inference with affinity information.

Person Re-Identification

C2ST: Cross-Modal Contextualized Sequence Transduction for Continuous Sign Language Recognition

no code implementations ICCV 2023 Huaiwen Zhang, Zihang Guo, Yang Yang, Xin Liu, De Hu

In this paper, we propose a Cross-modal Contextualized Sequence Transduction (C2ST) for CSLR, which effectively incorporates the knowledge of gloss sequence into the process of video representation learning and sequence transduction.

Language Modelling Representation Learning +1

Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction

no code implementations CVPR 2023 Xuehao Gao, Shaoyi Du, Yang Wu, Yang Yang

Encouraged by the effectiveness of encoding temporal dynamics within the frequency domain, recent human motion prediction systems prefer to first convert the motion representation from the original pose space into the frequency space.

Human motion prediction motion prediction +1

Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning

no code implementations26 Dec 2022 Jiwei Wei, Yang Yang, Zeyu Ma, Jingjing Li, Xing Xu, Heng Tao Shen

In this paper, we provide a new semantic enhanced knowledge graph that contains both expert knowledge and categories semantic correlation.

Zero-Shot Learning

Holistic risk assessment of inference attacks in machine learning

no code implementations15 Dec 2022 Yang Yang

As far as concerned, researchers have studied and analyzed in depth several types of inference attacks, albeit in isolation, but there is still a lack of a holistic rick assessment of inference attacks against machine learning models, such as their application in different scenarios, the common factors affecting the performance of these attacks and the relationship among the attacks.

Attribute Inference Attack +1

FakeEdge: Alleviate Dataset Shift in Link Prediction

1 code implementation29 Nov 2022 Kaiwen Dong, Yijun Tian, Zhichun Guo, Yang Yang, Nitesh V. Chawla

In this paper, we first identify the dataset shift problem in the link prediction task and provide theoretical analyses on how existing link prediction methods are vulnerable to it.

Link Prediction

Reconstructing high-order sequence features of dynamic functional connectivity networks based on diversified covert attention patterns for Alzheimer's disease classification

no code implementations19 Nov 2022 Zhixiang Zhang, Biao Jie, Zhengdong Wang, Jie zhou, Yang Yang

Recent studies have applied deep learning methods such as convolutional recurrent neural networks (CRNs) and Transformers to brain disease classification based on dynamic functional connectivity networks (dFCNs), such as Alzheimer's disease (AD), achieving better performance than traditional machine learning methods.

Classification

Interpretable Dimensionality Reduction by Feature Preserving Manifold Approximation and Projection

no code implementations17 Nov 2022 Yang Yang, Hongjian Sun, Jialei Gong, Yali Du, Di Yu

Based on the embedding tangent space, featMAP enables the interpretability by locally demonstrating the source features and feature importance.

Dimensionality Reduction Feature Importance +2

WiserVR: Semantic Communication Enabled Wireless Virtual Reality Delivery

no code implementations2 Nov 2022 Le Xia, Yao Sun, Chengsi Liang, Daquan Feng, Runze Cheng, Yang Yang, Muhammad Ali Imran

Virtual reality (VR) over wireless is expected to be one of the killer applications in next-generation communication networks.

Generating Accurate and Faithful Discharge Instructions: Task, Dataset, and Model

2 code implementations23 Oct 2022 Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu sun, Yang Yang, David A. Clifton

We build a benchmark clinical dataset and propose the Re3Writer, which imitates the working patterns of physicians to first retrieve related working experience from historical PIs written by physicians, then reason related medical knowledge.

DOMFN: A Divergence-Orientated Multi-Modal Fusion Network for Resume Assessment

1 code implementation MM '22: Proceedings of the 30th ACM International Conference on Multimedia 2022 Yang Yang, Jingshuai Zhang, Fan Gao, Xiaoru Gao, HengShu Zhu

Inspired by practical resume evaluations that consider both the content and layout, we construct the multi-modalities from resumes but face a new challenge that sometimes the performance of multi-modal fusion is even worse than the best uni-modality.

SA-DNet: A on-demand semantic object registration network adapting to non-rigid deformation

1 code implementation18 Oct 2022 Housheng Xie, Junhui Qiu, Yuan Dai, Yang Yang, Changcheng Xiang, Yukuan Zhang

After utilizing TPS to transform infrared and visible images based on the corresponding feature points in sROI, the registered images are fused using image fusion module (IFM) to achieve a fully functional registration and fusion network.

Image Registration

MMGA: Multimodal Learning with Graph Alignment

no code implementations18 Oct 2022 Xuan Yang, Quanjin Tao, Xiao Feng, Donghong Cai, Xiang Ren, Yang Yang

In this paper, we propose MMGA (Multimodal learning with Graph Alignment), a novel multimodal pre-training framework to incorporate information from graph (social network), image and text modalities on social media to enhance user representation learning.

Representation Learning

InterFace:Adjustable Angular Margin Inter-class Loss for Deep Face Recognition

no code implementations5 Oct 2022 Meng Sang, Jiaxuan Chen, Mengzhen Li, Pan Tan, Anning Pan, Shan Zhao, Yang Yang

In the field of face recognition, it is always a hot research topic to improve the loss solution to make the face features extracted by the network have greater discriminative power.

Face Model Face Recognition

Universal Prompt Tuning for Graph Neural Networks

1 code implementation NeurIPS 2023 Taoran Fang, Yunchao Zhang, Yang Yang, Chunping Wang, Lei Chen

In this paper, we introduce a universal prompt-based tuning method called Graph Prompt Feature (GPF) for pre-trained GNN models under any pre-training strategy.

Real-Time Cattle Interaction Recognition via Triple-stream Network

no code implementations6 Sep 2022 Yang Yang, Mizuka Komatsu, Kenji Oyama, Takenao Ohkawa

Based on this, we tackle the challenging task of real-time recognizing interactions between cattle in a single frame in this paper.

Self-Supervised Learning

Deep Joint Source-Channel Coding Based on Semantics of Pixels

no code implementations24 Aug 2022 Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang, Chuanhong Liu

The semantic information of the image for intelligent tasks is hidden behind the pixels, and slight changes in the pixels will affect the performance of intelligent tasks.

Rain Removal from Light Field Images with 4D Convolution and Multi-scale Gaussian Process

1 code implementation16 Aug 2022 Tao Yan, Mingyue Li, Bin Li, Yang Yang, Rynson W. H. Lau

However, making full use of the abundant information available from LFIs, such as 2D array of sub-views and the disparity map of each sub-view, for effective rain removal is still a challenging problem.

Depth Estimation Rain Removal

Continual Unsupervised Domain Adaptation for Semantic Segmentation using a Class-Specific Transfer

no code implementations12 Aug 2022 Robert A. Marsden, Felix Wiewel, Mario Döbler, Yang Yang, Bin Yang

In this work, we focus on UDA and additionally address the case of adapting not only to a single domain, but to a sequence of target domains.

Data Augmentation Semantic Segmentation +2

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark under Heterogeneous AI Computing Platforms

no code implementations11 Aug 2022 Jia-Xin Zhuang, Xiansong Huang, Yang Yang, Jiancong Chen, Yue Yu, Wei Gao, Ge Li, Jie Chen, Tong Zhang

In this paper, we present OpenMedIA, an open-source toolbox library containing a rich set of deep learning methods for medical image analysis under heterogeneous Artificial Intelligence (AI) computing platforms.

Image Classification Medical Image Classification +2

MobileCodec: Neural Inter-frame Video Compression on Mobile Devices

no code implementations18 Jul 2022 Hoang Le, Liang Zhang, Amir Said, Guillaume Sautiere, Yang Yang, Pranav Shrestha, Fei Yin, Reza Pourreza, Auke Wiggers

Realizing the potential of neural video codecs on mobile devices is a big technological challenge due to the computational complexity of deep networks and the power-constrained mobile hardware.

Video Compression

Neural Topological Ordering for Computation Graphs

no code implementations13 Jul 2022 Mukul Gagrani, Corrado Rainone, Yang Yang, Harris Teague, Wonseok Jeon, Herke van Hoof, Weiliang Will Zeng, Piero Zappi, Christopher Lott, Roberto Bondesan

Recent works on machine learning for combinatorial optimization have shown that learning based approaches can outperform heuristic methods in terms of speed and performance.

BIG-bench Machine Learning Combinatorial Optimization

DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection

no code implementations30 Jun 2022 Xuanwen Huang, Yang Yang, Yang Wang, Chunping Wang, Zhisheng Zhang, Jiarong Xu, Lei Chen, Michalis Vazirgiannis

Since GAD emphasizes the application and the rarity of anomalous samples, enriching the varieties of its datasets is fundamental work.

Graph Anomaly Detection

Towards KAB2S: Learning Key Knowledge from Single-Objective Problems to Multi-Objective Problem

no code implementations26 Jun 2022 Xu Wendi, Wang Xianpeng, Guo Qingxin, Song Xiangman, Zhao Ren, Zhao Guodong, Yang Yang, Xu Te, He Dakuo

As "a new frontier in evolutionary computation research", evolutionary transfer optimization(ETO) will overcome the traditional paradigm of zero reuse of related experience and knowledge from solved past problems in researches of evolutionary computation.

Multiobjective Optimization Scheduling

Multi-View Clustering for Open Knowledge Base Canonicalization

2 code implementations22 Jun 2022 Wei Shen, Yang Yang, Yinan Liu

In this paper, we propose CMVC, a novel unsupervised framework that leverages these two views of knowledge jointly for canonicalizing OKBs without the need of manually annotated labels.

Clustering Open Information Extraction +1

MiniDisc: Minimal Distillation Schedule for Language Model Compression

1 code implementation29 May 2022 Chen Zhang, Yang Yang, Qifan Wang, Jiahao Liu, Jingang Wang, Wei Wu, Dawei Song

In particular, motivated by the finding that the performance of the student is positively correlated to the scale-performance tradeoff of the teacher assistant, MiniDisc is designed with a $\lambda$-tradeoff to measure the optimality of the teacher assistant without trial distillation to the student.

Knowledge Distillation Language Modelling +2

Measuring Perceptual Color Differences of Smartphone Photographs

1 code implementation26 May 2022 Zhihua Wang, Keshuo Xu, Yang Yang, Jianlei Dong, Shuhang Gu, Lihao Xu, Yuming Fang, Kede Ma

Measuring perceptual color differences (CDs) is of great importance in modern smartphone photography.

Multi-Agent Feedback Enabled Neural Networks for Intelligent Communications

1 code implementation22 May 2022 Fanglei Sun, Yang Li, Ying Wen, Jingchen Hu, Jun Wang, Yang Yang, Kai Li

The design of MAFENN framework and algorithm are dedicated to enhance the learning capability of the feedfoward DL networks or their variations with the simple data feedback.

Denoising Intelligent Communication

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

1 code implementation ACL 2022 Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Sun, Weiqin Zu, Ying Wen, Yang Yang, Jun Wang

Modelling prosody variation is critical for synthesizing natural and expressive speech in end-to-end text-to-speech (TTS) systems.

A Bottom-Up End-User Intelligent Assistant Approach to Empower Gig Workers against AI Inequality

no code implementations29 Apr 2022 Toby Jia-Jun Li, Yuwen Lu, Jaylexia Clark, Meng Chen, Victor Cox, Meng Jiang, Yang Yang, Tamara Kay, Danielle Wood, Jay Brockman

The AI inequality is caused by (1) the technology divide in who has access to AI technologies in gig work; and (2) the data divide in who owns the data in gig work leads to unfair working conditions, growing pay gap, neglect of workers' diverse preferences, and workers' lack of trust in the platforms.

Position

Continual Learning with Bayesian Model based on a Fixed Pre-trained Feature Extractor

no code implementations28 Apr 2022 Yang Yang, Zhiying Cui, Junjie Xu, Changhong Zhong, Wei-Shi Zheng, Ruixuan Wang

In this case, updating the intelligent system with data of new diseases would inevitably downgrade its performance on previously learned diseases.

Class Incremental Learning Image Classification +1

DropMessage: Unifying Random Dropping for Graph Neural Networks

2 code implementations21 Apr 2022 Taoran Fang, Zhiqing Xiao, Chunping Wang, Jiarong Xu, Xuan Yang, Yang Yang

First, it is challenging to find a universal method that are suitable for all cases considering the divergence of different datasets and models.

Graph Representation Learning

Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications

no code implementations19 Apr 2022 Chuanhong Liu, Caili Guo, Yang Yang, Nan Jiang

To solve the problem, both compression ratio and resource allocation are optimized for the task-oriented communication system to maximize the success probability of tasks.

Invertible Mask Network for Face Privacy-Preserving

no code implementations19 Apr 2022 Yang Yang, Yiyang Huang, Ming Shi, Kejiang Chen, Weiming Zhang, Nenghai Yu

Then, put the "Mask" face onto the protected face and generate the masked face, in which the masked face is indistinguishable from "Mask" face.

Privacy Preserving

Positioning Using Visible Light Communications: A Perspective Arcs Approach

no code implementations18 Apr 2022 Zhiyu Zhu, Caili Guo, Rongzhen Bao, Mingzhe Chen, Walid Saad, Yang Yang

In this paper, the arc feature of the circular luminaire and the coordinate information obtained via visible light communication (VLC) are jointly used for VLC-enabled indoor positioning, and a novel perspective arcs approach is proposed.

GNN-encoder: Learning a Dual-encoder Architecture via Graph Neural Networks for Dense Passage Retrieval

no code implementations18 Apr 2022 Jiduan Liu, Jiahao Liu, Yang Yang, Jingang Wang, Wei Wu, Dongyan Zhao, Rui Yan

To enhance the performance of dense retrieval models without loss of efficiency, we propose a GNN-encoder model in which query (passage) information is fused into passage (query) representations via graph neural networks that are constructed by queries and their top retrieved passages.

Natural Questions Passage Retrieval +2

Completing Partial Point Clouds with Outliers by Collaborative Completion and Segmentation

no code implementations18 Mar 2022 Changfeng Ma, Yang Yang, Jie Guo, Chongjun Wang, Yanwen Guo

We propose in this paper an end-to-end network, named CS-Net, to complete the point clouds contaminated by noises or containing outliers.

Point Cloud Completion Segmentation

Medium Transmission Map Matters for Learning to Restore Real-World Underwater Images

1 code implementation17 Mar 2022 Yan Kai, Liang Lanyue, Zheng Ziqiang, Wang Guoqing, Yang Yang

Underwater visual perception is essentially important for underwater exploration, archeology, ecosystem and so on.

Image Enhancement Image Restoration

Adaptive Information Bottleneck Guided Joint Source and Channel Coding for Image Transmission

no code implementations12 Mar 2022 Lunan Sun, Yang Yang, Mingzhe Chen, Caili Guo, Walid Saad, H. Vincent Poor

In particular, a new IB objective for image transmission is proposed so as to minimize the distortion and the transmission rate.

Image Reconstruction

Region-of-Interest Based Neural Video Compression

no code implementations3 Mar 2022 Yura Perugachi-Diaz, Guillaume Sautière, Davide Abati, Yang Yang, Amirhossein Habibian, Taco S Cohen

To the best of our knowledge, our proposals are the first solutions that integrate ROI-based capabilities into neural video compression models.

Quantization Video Compression

An Exploratory Study of Stock Price Movements from Earnings Calls

no code implementations31 Jan 2022 Sourav Medya, Mohammad Rasoolinejad, Yang Yang, Brian Uzzi

Third, the semantic features of transcripts are more predictive of stock price movements than sales and earnings per share, i. e., traditional hard data in most of the cases.

MHSnet: Multi-head and Spatial Attention Network with False-Positive Reduction for Pulmonary Nodules Detection

no code implementations31 Jan 2022 Juanyun Mai, Minghao Wang, Jiayin Zheng, Yanbo Shao, Zhaoqi Diao, Xinliang Fu, Yulong Chen, Jianyu Xiao, Jian You, Airu Yin, Yang Yang, Xiangcheng Qiu, Jinsheng Tao, Bo wang, Hua Ji

The false positive reduction module significantly decreases the average number of candidates generated per scan by 68. 11% and the false discovery rate by 13. 48%, which is promising to reduce distracted proposals for the downstream tasks based on the detection results.

Head Detection

Semantic-assisted image compression

no code implementations29 Jan 2022 Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Xijun Xue

Experimental results show that the proposed SAIC method can retain more semantic-level information and achieve better performance of downstream AI tasks compared to the traditional deep learning-based method and the advanced perceptual method at the same compression ratio.

Image Compression

Bandwidth and Power Allocation for Task-Oriented SemanticCommunication

no code implementations26 Jan 2022 Chuanhong Liu, Caili Guo, Yang Yang, Jiujiu Chen

The first subproblem is a compression ratio optimization problem with a given resource allocation scheme, which is solved by a enumeration algorithm.

Privacy-aware Early Detection of COVID-19 through Adversarial Training

no code implementations9 Jan 2022 Omid Rohanian, Samaneh Kouchaki, Andrew Soltan, Jenny Yang, Morteza Rohanian, Yang Yang, David Clifton

One of our main contributions is that we specifically target the development of effective COVID-19 detection models with built-in mechanisms in order to selectively protect sensitive attributes against adversarial attacks.

Real-time Rail Recognition Based on 3D Point Clouds

no code implementations8 Jan 2022 Xinyi Yu, Weiqi He, Xuecheng Qian, Yang Yang, Linlin Ou

Accurate rail location is a crucial part in the railway support driving system for safety monitoring.

Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation

no code implementations5 Jan 2022 Yang Zhang, Yang Yang, Chenyun Xiong, Guodong Sun, Yanwen Guo

Encoder-decoder models have been widely used in RGBD semantic segmentation, and most of them are designed via a two-stream network.

RGBD Semantic Segmentation Segmentation +1

Self-Augmented Unpaired Image Dehazing via Density and Depth Decomposition

1 code implementation CVPR 2022 Yang Yang, Chaoyue Wang, Risheng Liu, Lin Zhang, Xiaojie Guo, DaCheng Tao

With estimated scene depth, our method is capable of re-rendering hazy images with different thicknesses which further benefits the training of the dehazing network.

Image Dehazing

Conditional Generative Data-free Knowledge Distillation

no code implementations31 Dec 2021 Xinyi Yu, Ling Yan, Yang Yang, Libo Zhou, Linlin Ou

In this paper, we propose a conditional generative data-free knowledge distillation (CGDD) framework for training lightweight networks without any training data.

Conditional Image Generation Data-free Knowledge Distillation +1

VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction

no code implementations8 Dec 2021 Dan Li, Yang Yang, Hongyin Tang, Jingang Wang, Tong Xu, Wei Wu, Enhong Chen

With the booming of pre-trained transformers, representation-based models based on Siamese transformer encoders have become mainstream techniques for efficient text matching.

Text Matching

ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval

no code implementations24 Nov 2021 Hao Ren, Ziqiang Zheng, Yang Wu, Hong Lu, Yang Yang, Ying Shan, Sai-Kit Yeung

The huge domain gap between sketches and photos and the highly abstract sketch representations pose challenges for sketch-based image retrieval (\underline{SBIR}).

Retrieval Sketch-Based Image Retrieval

Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning

1 code implementation8 Nov 2021 Qinkai Zheng, Xu Zou, Yuxiao Dong, Yukuo Cen, Da Yin, Jiarong Xu, Yang Yang, Jie Tang

To bridge this gap, we present the Graph Robustness Benchmark (GRB) with the goal of providing a scalable, unified, modular, and reproducible evaluation for the adversarial robustness of GML models.

Adversarial Robustness Benchmarking +1

Neural Embeddings of Urban Big Data Reveal Emergent Structures in Cities

no code implementations24 Oct 2021 Chao Fan, Yang Yang, Ali Mostafavi

In this study, we propose using a neural embedding model-graph neural network (GNN)- that leverages the heterogeneous features of urban areas and their interactions captured by human mobility network to obtain vector representations of these areas.

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning

no code implementations22 Oct 2021 Yang Yang, Hongchen Wei, HengShu Zhu, dianhai yu, Hui Xiong, Jian Yang

In detail, considering that the heterogeneous gap between modalities always leads to the supervision difficulty of using the global embedding directly, CPRC turns to transform both the raw image and corresponding generated sentence into the shared semantic space, and measure the generated sentence from two aspects: 1) Prediction consistency.

Image Captioning Informativeness +2

Region Semantically Aligned Network for Zero-Shot Learning

no code implementations14 Oct 2021 Ziyang Wang, Yunhao Gou, Jingjing Li, Yu Zhang, Yang Yang

Zero-shot learning (ZSL) aims to recognize unseen classes based on the knowledge of seen classes.

Attribute Transfer Learning +1

Transformer-based Transform Coding

3 code implementations ICLR 2022 Yinhao Zhu, Yang Yang, Taco Cohen

Neural data compression based on nonlinear transform coding has made great progress over the last few years, mainly due to improvements in prior models, quantization methods and nonlinear transforms.

Computational Efficiency Data Compression +3

Semantic Communications With AI Tasks

no code implementations29 Sep 2021 Yang Yang, Caili Guo, Fangfang Liu, Chuanhong Liu, Lunan Sun, Qizheng Sun, Jiujiu Chen

A radical paradigm shift of wireless networks from ``connected things'' to ``connected intelligence'' undergoes, which coincides with the Shanno and Weaver's envisions: Communications will transform from the technical level to the semantic level.

Defect Detection

Few-Shot Classification with Task-Adaptive Semantic Feature Learning

no code implementations29 Sep 2021 Meihong Pan, Chunqiu Xia, Hongyi Xin, Yang Yang, Xiaoyong Pan, Hong-Bin Shen

Such approach could lead to information imbalance between support and query samples, which confounds model generalization from support to query samples.

Classification

Extended Successive Convex Approximation for Phase Retrieval with Dictionary Learning

no code implementations13 Sep 2021 Tianyi Liu, Andreas M. Tillmann, Yang Yang, Yonina C. Eldar, Marius Pesavento

The second algorithm, referred to as SCAphase, uses auxiliary variables and is favorable in the case of highly diverse mixture models.

Dictionary Learning Retrieval

Boosting Graph Search with Attention Network for Solving the General Orienteering Problem

no code implementations10 Sep 2021 Zongtao Liu, Jing Xu, Jintao Su, Tao Xiao, Yang Yang

We propose a novel combination of a variant beam search algorithm and a learned heuristic for solving the general orienteering problem.

Prior-Guided Deep Interference Mitigation for FMCW Radars

no code implementations30 Aug 2021 JianPing Wang, Runlong Li, Yuan He, Yang Yang

The effectiveness and accuracy of our proposed complex-valued fully convolutional network (CV-FCN) based interference mitigation approach are verified and analyzed through both simulated and measured radar signals.

Joint LED Selection and Precoding Optimization for Multiple-User Multiple-Cell VLC Systems

no code implementations29 Aug 2021 Yang Yang, Yujie Yang, Mingzhe Chen, Chunyan Feng, Hailun Xia, Shuguang Cui, H. Vincent Poor

First, a MU-MC-VLC system model is established, and then a sum-rate maximization problem under dimming level and illumination uniformity constraints is formulated.

Rethinking the Misalignment Problem in Dense Object Detection

1 code implementation27 Aug 2021 Yang Yang, Min Li, Bo Meng, Junxing Ren, Degang Sun, Zihao Huang

On the basis of SALT and SDR loss, we propose SALT-Net, which explicitly exploits task-aligned point-set features for accurate detection results.

Dense Object Detection Object +2

Crypto Wash Trading

no code implementations24 Aug 2021 Lin William Cong, Xi Li, Ke Tang, Yang Yang

We introduce systematic tests exploiting robust statistical and behavioral patterns in trading to detect fake transactions on 29 cryptocurrency exchanges.

RGB Image Classification with Quantum Convolutional Ansaetze

no code implementations23 Jul 2021 Yu Jing, Xiaogang Li, Yang Yang, Chonghang Wu, Wenbing Fu, Wei Hu, Yuanyuan Li, Hua Xu

With the rapid growth of qubit numbers and coherence times in quantum hardware technology, implementing shallow neural networks on the so-called Noisy Intermediate-Scale Quantum (NISQ) devices has attracted a lot of interest.

Classification Image Classification

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos

no code implementations CVPR 2021 Mingxing Zhang, Yang Yang, Xinghan Chen, Yanli Ji, Xing Xu, Jingjing Li, Heng Tao Shen

Then for a moment candidate, we concatenate the starting/middle/ending representations of its starting/middle/ending elements respectively to form the final moment representation.

Sentence

Language Scaling for Universal Suggested Replies Model

no code implementations NAACL 2021 Qianlan Ying, Payal Bajaj, Budhaditya Deb, Yu Yang, Wei Wang, Bojia Lin, Milad Shokouhi, Xia Song, Yang Yang, Daxin Jiang

Faced with increased compute requirements and low resources for language expansion, we build a single universal model for improving the quality and reducing run-time costs of our production system.

Continual Learning Cross-Lingual Transfer

Objects as Extreme Points

no code implementations29 Apr 2021 Yang Yang, Min Li, Bo Meng, Zihao Huang, Junxing Ren, Degang Sun

We also propose a new metric to measure the similarity between two groups of extreme points, namely, Extreme Intersection over Union (EIoU), and incorporate this EIoU as a new regression loss.

Clustering Object +2

Semi-Supervised Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

no code implementations17 Apr 2021 Yang Yang, Zhao-Yang Fu, De-Chuan Zhan, Zhi-Bin Liu, Yuan Jiang

Moreover, we introduce the extrinsic unlabeled multi-modal multi-instance data, and propose the M3DNS, which considers the instance-level auto-encoder for single modality and modified bag-level optimal transport to strengthen the consistency among modalities.

Attribute-Based Robotic Grasping with One-Grasp Adaptation

no code implementations6 Apr 2021 Yang Yang, YuanHao Liu, Hengyue Liang, Xibai Lou, Changhyun Choi

In this work, we introduce an end-to-end learning method of attribute-based robotic grasping with one-grasp adaptation capability.

Attribute Object +1

Collision-Aware Target-Driven Object Grasping in Constrained Environments

no code implementations1 Apr 2021 Xibai Lou, Yang Yang, Changhyun Choi

Grasping a novel target object in constrained environments (e. g., walls, bins, and shelves) requires intensive reasoning about grasp pose reachability to avoid collisions with the surrounding structures.

Object Robotic Grasping

Self-supervised Discriminative Feature Learning for Deep Multi-view Clustering

1 code implementation28 Mar 2021 Jie Xu, Yazhou Ren, Huayi Tang, Zhimeng Yang, Lili Pan, Yang Yang, Xiaorong Pu

To leverage the multi-view complementary information, we concatenate all views' embedded features to form the global features, which can overcome the negative impact of some views' unclear clustering structures.

Clustering

Toward Tweet Entity Linking with Heterogeneous Information Networks

1 code implementation IEEE Transactions on Knowledge and Data Engineering 2021 Wei Shen, Yuwei Yin, Yang Yang, Jiawei Han, Jianyong Wang, Xiaojie Yuan

The task of linking an entity mention in a tweet with its corresponding entity in a heterogeneous information network is of great importance, for the purpose of enriching heterogeneous information networks with the abundant and fresh knowledge embedded in tweets.

Entity Linking Metric Learning

Optimization of User Selection and Bandwidth Allocation for Federated Learning in VLC/RF Systems

no code implementations5 Mar 2021 Chuanhong Liu, Caili Guo, Yang Yang, Mingzhe Chen, H. Vincent Poor, Shuguang Cui

Then, the problem of user selection and bandwidth allocation is studied for FL implemented over a hybrid VLC/RF system aiming to optimize the FL performance.

Federated Learning

Towards Unbiased COVID-19 Lesion Localisation and Segmentation via Weakly Supervised Learning

1 code implementation1 Mar 2021 Yang Yang, Jiancong Chen, Ruixuan Wang, Ting Ma, Lingwei Wang, Jie Chen, Wei-Shi Zheng, Tong Zhang

Despite tremendous efforts, it is very challenging to generate a robust model to assist in the accurate quantification assessment of COVID-19 on chest CT images.

Generative Adversarial Network Weakly-supervised Learning

CogDL: A Comprehensive Library for Graph Deep Learning

1 code implementation1 Mar 2021 Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

In CogDL, we propose a unified design for the training and evaluation of GNN models for various graph tasks, making it unique among existing graph learning libraries.

Graph Classification Graph Embedding +5

Transform Network Architectures for Deep Learning based End-to-End Image/Video Coding in Subsampled Color Spaces

no code implementations27 Feb 2021 Hilmi E. Egilmez, Ankitesh K. Singh, Muhammed Coban, Marta Karczewicz, Yinhao Zhu, Yang Yang, Amir Said, Taco S. Cohen

Most of the existing deep learning based end-to-end image/video coding (DLEC) architectures are designed for non-subsampled RGB color format.

Integrating Pre-trained Model into Rule-based Dialogue Management

no code implementations17 Feb 2021 Jun Quan, Meng Yang, Qiang Gan, Deyi Xiong, Yiming Liu, Yuchen Dong, Fangxin Ouyang, Jun Tian, Ruiling Deng, Yongzhi Li, Yang Yang, Daxin Jiang

Rule-based dialogue management is still the most popular solution for industrial task-oriented dialogue systems for their interpretablility.

Dialogue Management Management +1

A proof by foliation that Lawson's cones are $A_Φ$-minimizing

no code implementations16 Feb 2021 Connor Mooney, Yang Yang

We also analyze the behavior at infinity of the leaves in the foliations.

Analysis of PDEs Differential Geometry

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction

3 code implementations ICLR 2021 Yuhang Li, Ruihao Gong, Xu Tan, Yang Yang, Peng Hu, Qi Zhang, Fengwei Yu, Wei Wang, Shi Gu

To further employ the power of quantization, the mixed precision technique is incorporated in our framework by approximating the inter-layer and intra-layer sensitivity.

Image Classification object-detection +2

Progressive Neural Image Compression with Nested Quantization and Latent Ordering

no code implementations4 Feb 2021 Yadong Lu, Yinhao Zhu, Yang Yang, Amir Said, Taco S Cohen

We present PLONQ, a progressive neural image compression scheme which pushes the boundary of variable bitrate compression by allowing quality scalable coding with a single bitstream.

Image Compression Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.