Search Results for author: Xiaojie Wang

Found 55 papers, 27 papers with code

Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet Extraction

1 code implementation ACL 2022 Hao Chen, Zepeng Zhai, Fangxiang Feng, Ruifan Li, Xiaojie Wang

Specifically, we first define ten types of relations for ASTE task, and then adopt a biaffine attention module to embed these relations as an adjacent tensor between words in a sentence.

Aspect Sentiment Triplet Extraction Relation +1

Grouped-Attention for Content-Selection and Content-Plan Generation

no code implementations Findings (EMNLP) 2021 Bayu Distiawan Trisedya, Xiaojie Wang, Jianzhong Qi, Rui Zhang, Qingjun Cui

A key component of the GSC-attention is grouped-attention, which is token-level attention constrained within each input attribute that enables our proposed model captures both local and global context.

Attribute Data-to-Text Generation

Task-Oriented Clustering for Dialogues

1 code implementation Findings (EMNLP) 2021 Chenxu Lv, Hengtong Lu, Shuyu Lei, Huixing Jiang, Wei Wu, Caixia Yuan, Xiaojie Wang

A reliable clustering algorithm for task-oriented dialogues can help developer analysis and define dialogue tasks efficiently.

Clustering Representation Learning +1

Learn to Adapt for Generalized Zero-Shot Text Classification

1 code implementation ACL 2022 Yiwen Zhang, Caixia Yuan, Xiaojie Wang, Ziwei Bai, Yongbin Liu

Generalized zero-shot text classification aims to classify textual instances from both previously seen classes and incrementally emerging unseen classes.

Generalized Zero-Shot Learning Meta-Learning +3

DiffHarmony: Latent Diffusion Model Meets Image Harmonization

no code implementations9 Apr 2024 Pengfei Zhou, Fangxiang Feng, Xiaojie Wang

To deal with these issues, in this paper, we first adapt a pre-trained latent diffusion model to the image harmonization task to generate the harmonious but potentially blurry initial images.

Image Compression Image Harmonization +2

In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model

no code implementations10 Mar 2024 Junhui Yin, Xinyu Zhang, Lin Wu, Xianghua Xie, Xiaojie Wang

To this end, we explore the concept of test-time prompt tuning (TTPT), which enables the adaptation of the CLIP model to novel downstream tasks through only one step of optimization on an unsupervised objective that involves the test sample.

In-Context Learning Language Modelling +1

Towards Robustness Analysis of E-Commerce Ranking System

no code implementations7 Mar 2024 Ningfei Wang, Yupin Huang, Han Cheng, Jiri Gesi, Xiaojie Wang, Vivek Mittal

As e-commerce retailers use various techniques to improve the quality of search results, we hope that this research offers valuable guidance for measuring the robustness of the ranking systems.

Information Retrieval

Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications

no code implementations8 Dec 2023 Zhaolong Ning, Hao Hu, Xiaojie Wang, Qingqing Wu, Chau Yuen, F. Richard Yu, Yan Zhang

To address the aforementioned challenges, we propose a new optimization algorithm for joint IRS-user association, trajectory optimization of UAVs, successive interference cancellation (SIC) decoding order scheduling and power allocation to maximize system energy efficiency.

Q-Learning Scheduling

A Survey on Trustworthy Edge Intelligence: From Security and Reliability To Transparency and Sustainability

no code implementations27 Oct 2023 Xiaojie Wang, Beibei Wang, Yu Wu, Zhaolong Ning, Song Guo, Fei Richard Yu

Edge Intelligence (EI) integrates Edge Computing (EC) and Artificial Intelligence (AI) to push the capabilities of AI to the network edge for real-time, efficient and secure intelligent decision-making and computation.

Decision Making Edge-computing

Explicit Alignment and Many-to-many Entailment Based Reasoning for Conversational Machine Reading

no code implementations20 Oct 2023 Yangyang Luo, Shiyu Tian, Caixia Yuan, Xiaojie Wang

Conversational Machine Reading (CMR) requires answering a user's initial question through multi-turn dialogue interactions based on a given document.

Decision Making Question Generation +2

A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

1 code implementation1 Oct 2023 Lucen Zhong, Hengtong Lu, Caixia Yuan, Xiaojie Wang, Jiashen Sun, Ke Zeng, Guanglu Wan

A global policy consistency task is designed to capture the multi-turn dialog policy sequential relation, and an act-based contrastive learning task is designed to capture similarities among samples with the same dialog policy.

Contrastive Learning

AKEM: Aligning Knowledge Base to Queries with Ensemble Model for Entity Recognition and Linking

no code implementations12 Sep 2023 Di Lu, Zhongping Liang, Caixia Yuan, Xiaojie Wang

This paper presents a novel approach to address the Entity Recognition and Linking Challenge at NLPCC 2015.

regression

Whether you can locate or not? Interactive Referring Expression Generation

1 code implementation19 Aug 2023 Fulong Ye, Yuxing Long, Fangxiang Feng, Xiaojie Wang

Referring Expression Generation (REG) aims to generate unambiguous Referring Expressions (REs) for objects in a visual scene, with a dual task of Referring Expression Comprehension (REC) to locate the referred object.

Referring Expression Referring Expression Comprehension +1

Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark

1 code implementation26 May 2023 Yuxing Long, Binyuan Hui, Caixia Yuan1, Fei Huang, Yongbin Li, Xiaojie Wang

Existing multimodal task-oriented dialog data fails to demonstrate the diverse expressions of user subjective preferences and recommendation acts in the real-life shopping scenario.

Multimodal Recommendation

An Asynchronous Updating Reinforcement Learning Framework for Task-oriented Dialog System

1 code implementation4 May 2023 Sai Zhang, Yuwei Hu, Xiaojie Wang, Caixia Yuan

The errors from DST might misguide the dialog policy, and the system action brings extra difficulties for the DST module.

dialog state tracking reinforcement-learning

CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising

1 code implementation journal 2023 Kang Xu, Weixin Li, Xia Wang, Xiaoyan Hu, Ke Yan, Xiaojie Wang, Xuan Dong

Based on the prior that, for each pixel, its similar pixels are usually spatially close, our insights are that (1) we partition the image into non-overlapped windows and perform regional self-attention to reduce the search range of each pixel, and (2) we encourage pixels across different windows to communicate with each other.

Image Denoising Jpeg Compression Artifact Reduction +1

SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph

1 code implementation5 Jan 2023 Yuxing Long, Binyuan Hui, Fulong Ye, Yanyang Li, Zhuoxin Han, Caixia Yuan, Yongbin Li, Xiaojie Wang

Existing multimodal conversation agents have shown impressive abilities to locate absolute positions or retrieve attributes in simple scenarios, but they fail to perform well when complex relative positions and information alignments are involved, which poses a bottleneck in response quality.

Question Answering

Towards Unifying Reference Expression Generation and Comprehension

1 code implementation24 Oct 2022 Duo Zheng, Tao Kong, Ya Jing, Jiaan Wang, Xiaojie Wang

Additionally, IRTF could generate pseudo input regions for the REC task to enable a uniform way for sharing the identical representation space across the REC and REG.

Language Modelling Masked Language Modeling +1

GR-GAN: Gradual Refinement Text-to-image Generation

1 code implementation23 May 2022 Bo Yang, Fangxiang Feng, Xiaojie Wang

We also introduce a new metric Cross-Model Distance (CMD) for simultaneously evaluating image quality and image-text consistency.

Generative Adversarial Network Image-text matching +3

Question-Driven Graph Fusion Network For Visual Question Answering

no code implementations3 Apr 2022 Yuxi Qian, Yuncong Hu, Ruonan Wang, Fangxiang Feng, Xiaojie Wang

It first models semantic, spatial, and implicit visual relations in images by three graph attention networks, then question information is utilized to guide the aggregation process of the three graphs, further, our QD-GFN adopts an object filtering mechanism to remove question-irrelevant objects contained in the image.

Graph Attention Object +4

Co-VQA : Answering by Interactive Sub Question Sequence

no code implementations Findings (ACL) 2022 Ruonan Wang, Yuxi Qian, Fangxiang Feng, Xiaojie Wang, Huixing Jiang

Most existing approaches to Visual Question Answering (VQA) answer questions directly, however, people usually decompose a complex question into a sequence of simple sub questions and finally obtain the answer to the original question after answering the sub question sequence(SQS).

Question Answering Visual Question Answering +1

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots

1 code implementation Findings (ACL) 2022 Sai Zhang, Yuwei Hu, Yuchuan Wu, Jiaman Wu, Yongbin Li, Jian Sun, Caixia Yuan, Xiaojie Wang

We find some new linguistic phenomena and interactive manners in SSTOD which raise critical challenges of building dialog agents for the task.

 Ranked #1 on SSTOD on SSD_NAME

SSTOD

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene

1 code implementation16 Mar 2022 Duo Zheng, Fandong Meng, Qingyi Si, Hairun Fan, Zipeng Xu, Jie zhou, Fangxiang Feng, Xiaojie Wang

Visual dialog has witnessed great progress after introducing various vision-oriented goals into the conversation, especially such as GuessWhich and GuessWhat, where the only image is visible by either and both of the questioner and the answerer, respectively.

Visual Dialog

Deep Keyphrase Completion

no code implementations29 Oct 2021 Yu Zhao, Jia Song, Huali Feng, Fuzhen Zhuang, Qing Li, Xiaojie Wang, Ji Liu

Keyphrase provides accurate information of document content that is highly compact, concise, full of meanings, and widely used for discourse comprehension, organization, and text retrieval.

Keyphrase Extraction Keyphrase Generation +2

Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization

1 code implementation Findings (EMNLP) 2021 Junpeng Liu, Yanyan Zou, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Caixia Yuan, Xiaojie Wang

To capture the various topic information of a conversation and outline salient facts for the captured topics, this work proposes two topic-aware contrastive learning objectives, namely coherence detection and sub-summary generation objectives, which are expected to implicitly model the topic change and handle information scattering challenges for the dialogue summarization task.

Abstractive Dialogue Summarization Contrastive Learning +3

Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser

1 code implementation Findings (EMNLP) 2021 Duo Zheng, Zipeng Xu, Fandong Meng, Xiaojie Wang, Jiaan Wang, Jie zhou

To enhance VD Questioner: 1) we propose a Related entity enhanced Questioner (ReeQ) that generates questions under the guidance of related entities and learns entity-based questioning strategy from human dialogs; 2) we propose an Augmented Guesser (AugG) that is strong and is optimized for the VD setting especially.

Reinforcement Learning (RL) Visual Dialog

Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis

1 code implementation ACL 2021 Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang, Eduard Hovy

To overcome these challenges, in this paper, we propose a dual graph convolutional networks (DualGCN) model that considers the complementarity of syntax structures and semantic correlations simultaneously.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Multi-stage Pre-training over Simplified Multimodal Pre-training Models

1 code implementation ACL 2021 Tongtong Liu, Fangxiang Feng, Xiaojie Wang

Experimental results show that our method achieves comparable performance to the original LXMERT model in all downstream tasks, and even outperforms the original model in Image-Text Retrieval task.

Retrieval Sentence +1

Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue

1 code implementation12 Jul 2021 Zipeng Xu, Fandong Meng, Xiaojie Wang, Duo Zheng, Chenxu Lv, Jie zhou

In Reinforcement Learning, it is crucial to represent states and assign rewards based on the action-caused transitions of states.

reinforcement-learning Reinforcement Learning (RL)

Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue

1 code implementation4 Feb 2021 Han Liu, Caixia Yuan, Xiaojie Wang, Yushu Yang, Huixing Jiang, Zhongyuan Wang

We propose a novel task, Multi-Document Driven Dialogue (MD3), in which an agent can guess the target document that the user is interested in by leading a dialogue.

Attribute

Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

no code implementations15 Oct 2020 Peng Lu, Jiahui Liu, Xujun Peng, Xiaojie Wang

In order to tackle this problem, a weakly supervised cropping frame- work is proposed, where the distribution dissimilarity between high quality images and cropped images is used to guide the coordinate predictor’s training and the ground truths of cropping windows are not required by the proposed method.

Image Cropping

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue

1 code implementation1 Oct 2020 Zipeng Xu, Fangxiang Feng, Xiaojie Wang, Yushu Yang, Huixing Jiang, Zhongyuan Wang

In this paper, we propose an Answer-Driven Visual State Estimator (ADVSE) to impose the effects of different answers on visual states.

Question Generation Question-Generation +1

Label-Wise Document Pre-Training for Multi-Label Text Classification

1 code implementation15 Aug 2020 Han Liu, Caixia Yuan, Xiaojie Wang

A major challenge of multi-label text classification (MLTC) is to stimulatingly exploit possible label differences and label correlations.

 Ranked #1 on Multi-Label Text Classification on AAPD (Micro F1 metric)

Document Classification General Classification +3

Connecting Embeddings for Knowledge Graph Entity Typing

1 code implementation ACL 2020 Yu Zhao, Anxiang Zhang, Ruobing Xie, Kang Liu, Xiaojie Wang

In this paper, we propose a novel approach for KG entity typing which is trained by jointly utilizing local typing knowledge from existing entity type assertions and global triple knowledge from KGs.

Entity Typing Knowledge Graph Completion +1

Cycle-CNN for Colorization towards Real Monochrome-Color Camera Systems

1 code implementation AAAI Technical Track: Vision 2020 Xuan Dong, Weixin Li, Xiaojie Wang, Yunhong Wang

We present a new CNN model, named cycle CNN, which can directly use the real data from monochrome-color camera systems for training.

Colorization

Guessing State Tracking for Visual Dialogue

no code implementations ECCV 2020 Wei Pang, Xiaojie Wang

This paper proposes a guessing state for the Guesser, and regards guess as a process with change of guessing state through a dialogue.

Visual Grounding

Visual Dialogue State Tracking for Question Generation

no code implementations12 Nov 2019 Wei Pang, Xiaojie Wang

A visual dialogue state is defined as the distribution on objects in the image as well as representations of objects.

Dialogue State Tracking Question Generation +2

Cascaded LSTMs based Deep Reinforcement Learning for Goal-driven Dialogue

1 code implementation31 Oct 2019 Yue Ma, Xiaojie Wang, Zhenjiang Dong, Hong Chen

Dialogue embeddings are learned by a LSTM at the middle of the network, and updated by the feeding of all turn embeddings.

Dialogue Management Management +3

Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random

no code implementations1 Jun 2019 Xiaojie Wang, Rui Zhang, Yu Sun, Jianzhong Qi

In recommender systems, usually the ratings of a user to most items are missing and a critical problem is that the missing ratings are often missing not at random (MNAR) in reality.

Imputation Recommendation Systems

Deep Reason: A Strong Baseline for Real-World Visual Reasoning

no code implementations24 May 2019 Chenfei Wu, Yanzhao Zhou, Gen Li, Nan Duan, Duyu Tang, Xiaojie Wang

This paper presents a strong baseline for real-world visual reasoning (GQA), which achieves 60. 93% in GQA 2019 challenge and won the sixth place.

Visual Reasoning

KDGAN: Knowledge Distillation with Generative Adversarial Networks

no code implementations NeurIPS 2018 Xiaojie Wang, Rui Zhang, Yu Sun, Jianzhong Qi

An alternative method is to adversarially train the classifier against a discriminator in a two-player game akin to generative adversarial networks (GAN), which can ensure the classifier to learn the true data distribution at the equilibrium of this game.

Knowledge Distillation Multi-Label Learning

Chain of Reasoning for Visual Question Answering

no code implementations NeurIPS 2018 Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong

A chain of reasoning (CoR) is constructed for supporting multi-step and dynamic reasoning on changed relations and objects.

Object Question Answering +3

Cannot find the paper you are looking for? You can Submit a new open access paper.