Search Results for author: Peng Xu

Found 144 papers, 68 papers with code

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System

1 code implementation ACL (dialdoc) 2021 Yan Xu, Etsuko Ishii, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung

Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users’ needs, which.

Data Augmentation Response Generation

CACA Agent: Capability Collaboration based AI Agent

no code implementations22 Mar 2024 Peng Xu, Haoran Wang, Chuang Wang, Xu Liu

As AI Agents based on Large Language Models (LLMs) have shown potential in practical applications across various fields, how to quickly deploy an AI agent and how to conveniently expand the application scenario of AI agents has become a challenge.

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

no code implementations16 Mar 2024 Deyi Ji, Siqi Gao, Lanyun Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao

In this paper, we address the challenge of multi-object tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT.

Homography Estimation Multi-Object Tracking +1

Learning-driven Physically-aware Large-scale Circuit Gate Sizing

no code implementations13 Mar 2024 Yuyang Ye, Peng Xu, Lizheng Ren, Tinghuan Chen, Hao Yan, Bei Yu, Longxing Shi

Gate sizing plays an important role in timing optimization after physical design.

IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding

no code implementations28 Feb 2024 Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu

Despite achieving rapid developments and with widespread applications, Large Vision-Language Models (LVLMs) confront a serious challenge of being prone to generating hallucinations.

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

2 code implementations18 Feb 2024 Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo

Large language models (LLMs) have demonstrated outstanding performance in various tasks, such as text summarization, text question-answering, and etc.

Question Answering Text Summarization

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

no code implementations2 Feb 2024 Jian Guan, Wei Wu, Zujie Wen, Peng Xu, Hongning Wang, Minlie Huang

We present AMOR, an agent framework based on open-source LLMs, which reasons with external knowledge bases and adapts to specific domains through human supervision to the reasoning process.

ChatQA: Building GPT-4 Level Conversational QA Models

no code implementations18 Jan 2024 Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro

In this work, we introduce ChatQA, a family of conversational question answering (QA) models that obtain GPT-4 level accuracies.

Conversational Question Answering Retrieval

p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models

1 code implementation17 Dec 2023 Haoyuan Wu, Xinyun Zhang, Peng Xu, Peiyu Liao, Xufeng Yao, Bei Yu

In this paper, we present a novel modeling framework that recasts adapter tuning after attention as a graph message passing process on attention graphs, where the projected query and value features and attention matrix constitute the node features and the graph adjacency matrix, respectively.

Image Captioning Question Answering +3

Large Model Based Referring Camouflaged Object Detection

no code implementations28 Nov 2023 Shupeng Cheng, Ge-Peng Ji, Pengda Qin, Deng-Ping Fan, BoWen Zhou, Peng Xu

Our motivation is to make full use of the semantic intelligence and intrinsic knowledge of recent Multimodal Large Language Models (MLLMs) to decompose this complex task in a human-like way.

Object object-detection +2

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

1 code implementation11 Oct 2023 Boxin Wang, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro

After instruction tuning on Retro, InstructRetro demonstrates significant improvement over the instruction tuned GPT on a wide range of zero-shot tasks.

Question Answering Reading Comprehension +2

Generative quantum machine learning via denoising diffusion probabilistic models

1 code implementation9 Oct 2023 Bingzhi Zhang, Peng Xu, Xiaohui Chen, Quntao Zhuang

Inspired by the classical counterpart, we propose the quantum denoising diffusion probabilistic model (QuDDPM) to enable efficiently trainable generative learning of quantum data.

Denoising Quantum Machine Learning +1

Retrieval meets Long Context Large Language Models

no code implementations4 Oct 2023 Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro

Perhaps surprisingly, we find that LLM with 4K context window using simple retrieval-augmentation at generation can achieve comparable performance to finetuned LLM with 16K context window via positional interpolation on long context tasks, while taking much less computation.

16k 4k +4

When Super-Resolution Meets Camouflaged Object Detection: A Comparison Study

no code implementations8 Aug 2023 Juan Wen, Shupeng Cheng, Peng Xu, BoWen Zhou, Radu Timofte, Weiyan Hou, Luc van Gool

Super Resolution (SR) and Camouflaged Object Detection (COD) are two hot topics in computer vision with various joint applications.

Object object-detection +2

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

1 code implementation7 Aug 2023 Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

Secondly, it conducts an in-depth analysis of LVLMs' predictions using the ChatGPT Ensemble Evaluation (CEE), which leads to a robust and accurate evaluation and exhibits improved alignment with human evaluation compared to the word matching approach.

Hallucination Visual Reasoning

Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

no code implementations4 Jul 2023 Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions.

Conformal Prediction Language Modelling +1

Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching

1 code implementation27 Jun 2023 Peng Xu, Zhiyu Xiang, Chenyu Qiao, Jingyun Fu, Tianyu Pu

Existing works simply assume that the ground-truth distributions are uni-modal, which ignores the fact that most of the edge pixels can be multi-modal.

Domain Generalization Stereo Matching

Language to Rewards for Robotic Skill Synthesis

no code implementations14 Jun 2023 Wenhao Yu, Nimrod Gileadi, Chuyuan Fu, Sean Kirmani, Kuang-Huei Lee, Montse Gonzalez Arenas, Hao-Tien Lewis Chiang, Tom Erez, Leonard Hasenclever, Jan Humplik, Brian Ichter, Ted Xiao, Peng Xu, Andy Zeng, Tingnan Zhang, Nicolas Heess, Dorsa Sadigh, Jie Tan, Yuval Tassa, Fei Xia

However, since low-level robot actions are hardware-dependent and underrepresented in LLM training corpora, existing efforts in applying LLMs to robotics have largely treated LLMs as semantic planners or relied on human-engineered control primitives to interface with the robot.

In-Context Learning Logical Reasoning

Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks

1 code implementation23 May 2023 Peng Xu, Lin Zhang, Xuanzhou Liu, Jiaqi Sun, Yue Zhao, Haiqin Yang, Bei Yu

Neural architecture search (NAS) for Graph neural networks (GNNs), called NAS-GNNs, has achieved significant performance over manually designed GNN architectures.

Neural Architecture Search

Feature Expansion for Graph Neural Networks

1 code implementation10 May 2023 Jiaqi Sun, Lin Zhang, Guangyi Chen, Kun Zhang, Peng Xu, Yujiu Yang

Graph neural networks aim to learn representations for graph-structured data and show impressive performance, particularly in node classification.

Node Classification Representation Learning

Advances in Deep Concealed Scene Understanding

1 code implementation21 Apr 2023 Deng-Ping Fan, Ge-Peng Ji, Peng Xu, Ming-Ming Cheng, Christos Sakaridis, Luc van Gool

Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive objects exhibiting camouflage.

Scene Understanding Semantic Segmentation

SAM Struggles in Concealed Scenes -- Empirical Study on "Segment Anything"

no code implementations12 Apr 2023 Ge-Peng Ji, Deng-Ping Fan, Peng Xu, Ming-Ming Cheng, BoWen Zhou, Luc van Gool

Segmenting anything is a ground-breaking step toward artificial general intelligence, and the Segment Anything Model (SAM) greatly fosters the foundation models for computer vision.

High speed free-space optical communication using standard fiber communication component without optical amplification

no code implementations27 Feb 2023 Yao Zhang, Hua-Ying Liu, Xiaoyi Liu, Peng Xu, Xiang Dong, Pengfei Fan, Xiaohui Tian, Hua Yu, Dong Pan, Zhijun Yin, Guilu Long, Shi-Ning Zhu, Zhenda Xie

Free-space optical communication (FSO) can achieve fast, secure and license-free communication without need for physical cables, making it a cost-effective, energy-efficient and flexible solution when the fiber connection is unavailable.

Short-length SSVEP data extension by a novel generative adversarial networks based framework

1 code implementation13 Jan 2023 Yudong Pan, Ning li, Yangsong Zhang, Peng Xu, Dezhong Yao

This study substantiates the feasibility of the proposed method to extend the data length for short-time SSVEP signals for developing a high-performance BCI system.

EEG SSVEP

Evaluating Parameter Efficient Learning for Generation

no code implementations25 Oct 2022 Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan J. Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro

For cross-domain and cross-dataset cases, we show that (a) Adapter (Houlsby et al., 2019) performs the best amongst all the PERMs studied here, and (b) it outperforms finetuning if the task dataset is below a certain size.

Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization

no code implementations19 Oct 2022 Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

This problem is challenging, as it requires planning wiping actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations.

reinforcement-learning Reinforcement Learning (RL)

A Transformer-based deep neural network model for SSVEP classification

1 code implementation9 Oct 2022 Jianbo Chen, Yangsong Zhang, Yudong Pan, Peng Xu, Cuntai Guan

The proposed model validates the feasibility of deep learning models based on Transformer structure for SSVEP classification task, and could serve as a potential model to alleviate the calibration procedure in the practical application of SSVEP-based BCI systems.

Classification EEG +1

Multimodal Learning with Transformers: A Survey

no code implementations13 Jun 2022 Peng Xu, Xiatian Zhu, David A. Clifton

Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks.

Factuality Enhanced Language Models for Open-Ended Text Generation

3 code implementations9 Jun 2022 Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro

In this work, we measure and improve the factual accuracy of large-scale LMs for open-ended text generation.

Misconceptions Sentence +2

Attention Mechanism based Cognition-level Scene Understanding

no code implementations17 Apr 2022 Xuejiao Tang, Tai Le Quy, Eirini Ntoutsi, Kea Turner, Vasile Palade, Israat Haque, Peng Xu, Chris Brown, Wenbin Zhang

Given a question-image input, the Visual Commonsense Reasoning (VCR) model can predict an answer with the corresponding rationale, which requires inference ability from the real world.

Question Answering Scene Understanding +2

QA4QG: Using Question Answering to Constrain Multi-Hop Question Generation

1 code implementation14 Feb 2022 Dan Su, Peng Xu, Pascale Fung

Multi-hop question generation (MQG) aims to generate complex questions which require reasoning over multiple pieces of information of the input passage.

Multi-hop Question Answering Question Answering +2

How to Understand Masked Autoencoders

no code implementations8 Feb 2022 Shuhao Cao, Peng Xu, David A. Clifton

"Masked Autoencoders (MAE) Are Scalable Vision Learners" revolutionizes the self-supervised learning method in that it not only achieves the state-of-the-art for image pre-training, but is also a milestone that bridges the gap between visual and linguistic masked autoencoding (BERT-style) pre-trainings.

Self-Supervised Learning

Unsupervised Long-Term Person Re-Identification with Clothes Change

no code implementations7 Feb 2022 Mingkun Li, Shupeng Cheng, Peng Xu, Xiatian Zhu, Chun-Guang Li, Jun Guo

We investigate unsupervised person re-identification (Re-ID) with clothes change, a new challenging problem with more practical usability and scalability to real-world deployment.

Clustering Unsupervised Long Term Person Re-Identification +2

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

1 code implementation LREC 2022 Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

We further conduct experiments with Fairseq S2T Transformer, a state-of-the-art ASR model, on the biggest existing dataset, Common Voice zh-HK, and our proposed MDCC, and the results show the effectiveness of our dataset.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

2 code implementations LREC 2022 Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Peng Xu, Xu Yan, Zihan Liu, Rita Frieske, Tiezheng Yu, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

ASCEND (A Spontaneous Chinese-English Dataset) is a high-quality Mandarin Chinese-English code-switching corpus built on spontaneous multi-turn conversational dialogue sources collected in Hong Kong.

Hierarchical Neural Data Synthesis for Semantic Parsing

no code implementations4 Dec 2021 Wei Yang, Peng Xu, Yanshuai Cao

Moreover, even the questions pertinent to a given domain, which are the input of a semantic parsing system, might not be readily available, especially in cross-domain semantic parsing.

Data Augmentation Semantic Parsing +1

Attention-guided Generative Models for Extractive Question Answering

no code implementations12 Oct 2021 Peng Xu, Davis Liang, Zhiheng Huang, Bing Xiang

We propose a simple strategy to obtain an extractive answer span from the generative model by leveraging the decoder cross-attention patterns.

Extractive Question-Answering Hallucination +2

Multiplicative Position-aware Transformer Models for Language Understanding

no code implementations27 Sep 2021 Zhiheng Huang, Davis Liang, Peng Xu, Bing Xiang

Transformer models, which leverage architectural improvements like self-attention, perform remarkably well on Natural Language Processing (NLP) tasks.

Position

Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering

1 code implementation ACL 2021 Alexander Hanbo Li, Patrick Ng, Peng Xu, Henghui Zhu, Zhiguo Wang, Bing Xiang

However, a large amount of world's knowledge is stored in structured databases, and need to be accessed using query languages such as SQL.

Open-Domain Question Answering

CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue System

1 code implementation7 Jun 2021 Etsuko Ishii, Yan Xu, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung

Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users' needs, which.

Data Augmentation Response Generation

X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing

1 code implementation ACL (RepL4NLP) 2021 Zihan Liu, Genta Indra Winata, Peng Xu, Pascale Fung

Experimental results illustrate that our model can significantly outperform existing strong baselines in cross-lingual and cross-domain settings, and our model can also achieve a good generalization ability on target languages of target domains.

Semantic Parsing

BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling

1 code implementation5 Jun 2021 Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Peng Xu, Feijun Jiang, Yuxiang Hu, Chen Shi, Pascale Fung

However, existing datasets for end-to-end ToD modeling are limited to a single language, hindering the development of robust end-to-end ToD systems for multilingual countries and regions.

Cross-Lingual Transfer Transfer Learning

DeepChange: A Large Long-Term Person Re-Identification Benchmark with Clothes Change

1 code implementation31 May 2021 Peng Xu, Xiatian Zhu

Currently, one of the most significant limitations in this field is the lack of a large realistic benchmark.

Person Identification Person Re-Identification

Revealing and controlling nuclear dynamics following inner-shell photoionization of N2

no code implementations11 Mar 2021 Qingli Jing, Hong Qian, Peng Xu

In this work, we apply the Monte Carlo wave packet method to study the ultrafast nuclear dynamics following inner-shell photoionization of N2 exposed to an ultrashort intense X-ray pulse.

Atomic Physics Atomic and Molecular Clusters

Optimizing Deeper Transformers on Small Datasets

1 code implementation ACL 2021 Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J. D. Prince, Yanshuai Cao

This work shows that this does not always need to be the case: with proper initialization and optimization, the benefits of very deep transformers can carry over to challenging tasks with small datasets, including Text-to-SQL semantic parsing and logical reading comprehension.

Reading Comprehension Semantic Parsing +2

Mobile Robots Exploration via Deep Reinforcement Learning

no code implementations CUHK Course IERG5350 2020 Han Ma, Peng Xu

n this paper, we try to solve the mobile robot exploration problem in a 2D indoor office environment by deep reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Unsupervised Regionalization of Particle-resolved Aerosol Mixing State Indices on the Global Scale

no code implementations6 Dec 2020 Zhonghua Zheng, Joseph Ching, Jeffrey H. Curtis, Yu Yao, Peng Xu, Matthew West, Nicole Riemer

Here we developed a simple but effective unsupervised learning approach to regionalize predictions of global aerosol mixing state indices.

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment

1 code implementation EMNLP 2020 Zihan Liu, Genta Indra Winata, Peng Xu, Zhaojiang Lin, Pascale Fung

Despite the promising results of current cross-lingual models for spoken language understanding systems, they still suffer from imperfect cross-lingual representation alignments between the source and target languages, which makes the performance sub-optimal.

Sentence Spoken Language Understanding

Embedding-based Zero-shot Retrieval through Query Generation

1 code implementation22 Sep 2020 Davis Liang, Peng Xu, Siamak Shakeri, Cicero Nogueira dos Santos, Ramesh Nallapati, Zhiheng Huang, Bing Xiang

In some cases, our model trained on synthetic data can even outperform the same model trained on real data

Passage Retrieval Retrieval

EmoGraph: Capturing Emotion Correlations using Graph Networks

no code implementations21 Aug 2020 Peng Xu, Zihan Liu, Genta Indra Winata, Zhaojiang Lin, Pascale Fung

Most emotion recognition methods tackle the emotion understanding task by considering individual emotion independently while ignoring their fuzziness nature and the interconnections among them.

Classification Emotion Classification +3

Extension of causal decomposition in the mutual complex dynamic process

no code implementations17 Aug 2020 Yi Zhang, Qin Yang, Lifu Zhang, Branko Celler, Steven Su, Peng Xu, Dezhong Yao

Causal decomposition depicts a cause-effect relationship that is not based on the concept of prediction, but based on the phase dependence of time series.

Time Series Time Series Analysis

On Learning Semantic Representations for Million-Scale Free-Hand Sketches

1 code implementation7 Jul 2020 Peng Xu, Yongye Huang, Tongtong Yuan, Tao Xiang, Timothy M. Hospedales, Yi-Zhe Song, Liang Wang

Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to accommodate both the abstract and messy traits of sketches.

Deep Hashing Learning Semantic Representations +1

Variational Transformers for Diverse Response Generation

2 code implementations28 Mar 2020 Zhaojiang Lin, Genta Indra Winata, Peng Xu, Zihan Liu, Pascale Fung

Despite the great promise of Transformers in many sequence modeling tasks (e. g., machine translation), their deterministic nature hinders them from generalizing to high entropy tasks such as dialogue response generation.

Machine Translation Response Generation +1

TRANS-BLSTM: Transformer with Bidirectional LSTM for Language Understanding

no code implementations16 Mar 2020 Zhiheng Huang, Peng Xu, Davis Liang, Ajay Mishra, Bing Xiang

Prior to the transformer era, bidirectional Long Short-Term Memory (BLSTM) has been the dominant modeling architecture for neural machine translation and question answering.

Machine Translation Natural Language Inference +5

Learning Fast Adaptation on Cross-Accented Speech Recognition

1 code implementation4 Mar 2020 Genta Indra Winata, Samuel Cahyawijaya, Zihan Liu, Zhaojiang Lin, Andrea Madotto, Peng Xu, Pascale Fung

The great variability and complex characteristics of accents creates a major challenge for training a robust and accent-agnostic automatic speech recognition (ASR) system.

Audio and Speech Processing Sound

Learning to Walk in the Real World with Minimal Human Effort

1 code implementation20 Feb 2020 Sehoon Ha, Peng Xu, Zhenyu Tan, Sergey Levine, Jie Tan

In this paper, we develop a system for learning legged locomotion policies with deep RL in the real world with minimal human effort.

Multi-Task Learning

Deep Self-Supervised Representation Learning for Free-Hand Sketch

1 code implementation3 Feb 2020 Peng Xu, Zeyu Song, Qiyue Yin, Yi-Zhe Song, Liang Wang

In this paper, we tackle for the first time, the problem of self-supervised representation learning for free-hand sketches.

Representation Learning Retrieval +1

EEG-based Brain-Computer Interfaces (BCIs): A Survey of Recent Studies on Signal Sensing Technologies and Computational Intelligence Approaches and their Applications

no code implementations28 Jan 2020 Xiaotong Gu, Zehong Cao, Alireza Jolfaei, Peng Xu, Dongrui Wu, Tzyy-Ping Jung, Chin-Teng Lin

Recent technological advances such as wearable sensing devices, real-time data streaming, machine learning, and deep learning approaches have increased interest in electroencephalographic (EEG) based BCI for translational and healthcare applications.

Brain Computer Interface EEG +1

Deep Learning for Free-Hand Sketch: A Survey

2 code implementations8 Jan 2020 Peng Xu, Timothy M. Hospedales, Qiyue Yin, Yi-Zhe Song, Tao Xiang, Liang Wang

Free-hand sketches are highly illustrative, and have been widely used by humans to depict objects or stories from ancient times to the present.

Multi-Graph Transformer for Free-Hand Sketch Recognition

1 code implementation24 Dec 2019 Peng Xu, Chaitanya K. Joshi, Xavier Bresson

In this work, we propose a new representation of sketches as multiple sparsely connected graphs.

Sketch Recognition

SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong

no code implementations13 Dec 2019 Steven Schwarcz, Peng Xu, David D'Ambrosio, Juhana Kangaspunta, Anelia Angelova, Huong Phan, Navdeep Jaitly

The corpus consists of ping pong play with three main annotation streams that can be used to learn tracking and action recognition models -- tracking of the ping pong ball and poses of humans in the videos and the spin of the ball being hit by humans.

Action Recognition Pose Estimation +1

Fault Detection and Identification using Bayesian Recurrent Neural Networks

no code implementations11 Nov 2019 Weike Sun, Antonio R. C. Paiva, Peng Xu, Anantha Sundaram, Richard D. Braatz

In processing and manufacturing industries, there has been a large push to produce higher quality products and ensure maximum efficiency of processes.

Fault Detection

Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables

no code implementations IJCNLP 2019 Zihan Liu, Jamin Shin, Yan Xu, Genta Indra Winata, Peng Xu, Andrea Madotto, Pascale Fung

Despite the surging demands for multilingual task-oriented dialog systems (e. g., Alexa, Google Home), there has been less research done in multilingual or cross-lingual scenarios.

Intent Detection Natural Language Understanding +2

Domain Adaptation with BERT-based Domain Classification and Data Selection

no code implementations WS 2019 Xiaofei Ma, Peng Xu, Zhiguo Wang, Ramesh Nallapati, Bing Xiang

The performance of deep neural models can deteriorate substantially when there is a domain shift between training and test data.

Classification Domain Adaptation +2

Generalizing Question Answering System with Pre-trained Language Model Fine-tuning

no code implementations WS 2019 Dan Su, Yan Xu, Genta Indra Winata, Peng Xu, Hyeondey Kim, Zihan Liu, Pascale Fung

With a large number of datasets being released and new techniques being proposed, Question answering (QA) systems have witnessed great breakthroughs in reading comprehension (RC)tasks.

Language Modelling Multi-Task Learning +2

MoEL: Mixture of Empathetic Listeners

5 code implementations IJCNLP 2019 Zhaojiang Lin, Andrea Madotto, Jamin Shin, Peng Xu, Pascale Fung

Previous research on empathetic dialogue systems has mostly focused on generating responses given certain emotions.

Getting To Know You: User Attribute Extraction from Dialogues

1 code implementation LREC 2020 Chien-Sheng Wu, Andrea Madotto, Zhaojiang Lin, Peng Xu, Pascale Fung

User attributes provide rich and useful information for user understanding, yet structured and easy-to-use attributes are often sparsely populated.

Attribute Attribute Extraction +1

Multi-view Clustering with the Cooperation of Visible and Hidden Views

no code implementations12 Aug 2019 Zhaohong Deng, Ruixiu Liu, Te Zhang, Peng Xu, Kup-Sze Choi, Bin Qin, Shitong Wang

The existing algorithms usually focus on the cooperation of different views in the original space but neglect the influence of the hidden information among these different visible views, or they only consider the hidden information between the views.

Clustering

Multi-View Fuzzy Clustering with The Alternative Learning between Shared Hidden Space and Partition

no code implementations12 Aug 2019 Zhaohong Deng, Chen Cui, Peng Xu, Ling Liang, Haoran Chen, Te Zhang, Shitong Wang

How to exploit the relation-ship between different views effectively using the characteristic of multi-view data has become a crucial challenge.

Clustering

Generating Empathetic Responses by Looking Ahead the User's Sentiment

1 code implementation20 Jun 2019 Jamin Shin, Peng Xu, Andrea Madotto, Pascale Fung

Hence, in this paper, we propose Sentiment Look-ahead, which is a novel perspective for empathy that models the future user emotional state.

Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer

1 code implementation28 May 2019 Yanshuai Cao, Peng Xu

In this work, we develop a novel regularizer to improve the learning of long-range dependency of sequence data.

General Classification Inductive Bias +3

On Variational Learning of Controllable Representations for Text without Supervision

1 code implementation ICML 2020 Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao

The variational autoencoder (VAE) can learn the manifold of natural images on certain datasets, as evidenced by meaningful interpolating or extrapolating in the continuous latent space.

Style Transfer Text Style Transfer

Multi-view Information-theoretic Co-clustering for Co-occurrence Data

1 code implementation25 May 2019 Peng Xu, Zhaohong Deng, Kup-Sze Choi, Longbing Cao, Shitong Wang

More specifically, it exploits the agreement and disagreement among views by sharing a common clustering results along the sample dimension and keeping the clustering results of each view specific along the feature dimension.

Clustering

Deep Image Feature Learning with Fuzzy Rules

no code implementations25 May 2019 Xiang Ma, Liangzhe Chen, Zhaohong Deng, Peng Xu, Qisheng Yan, Kup-Sze Choi, Shitong Wang

The method progressively learns image features through a layer-by-layer manner based on fuzzy rules, so the feature learning process can be better explained by the generated rules.

Joint Information Preservation for Heterogeneous Domain Adaptation

no code implementations22 May 2019 Peng Xu, Zhaohong Deng, Kup-Sze Choi, Jun Wang, Shitong Wang

The two domains often lie in different feature spaces due to diverse data collection methods, which leads to the more challenging task of heterogeneous domain adaptation (HDA).

Domain Adaptation

Passage Ranking with Weak Supervision

no code implementations ICLR Workshop LLD 2019 Peng Xu, Xiaofei Ma, Ramesh Nallapati, Bing Xiang

In this paper, we propose a \textit{weak supervision} framework for neural ranking tasks based on the data programming paradigm \citep{Ratner2016}, which enables us to leverage multiple weak supervision signals from different sources.

Passage Ranking

Deep Zero-Shot Learning for Scene Sketch

no code implementations11 May 2019 Yao Xie, Peng Xu, Zhanyu Ma

We introduce a novel problem of scene sketch zero-shot learning (SSZSL), which is a challenging task, since (i) different from photo, the gap between common semantic domain (e. g., word vector) and sketch is too huge to exploit common semantic knowledge as the bridge for knowledge transfer, and (ii) compared with single-object sketch, more expressive feature representation for scene sketch is required to accommodate its high-level of abstraction and complexity.

Transfer Learning Zero-Shot Learning

Concise Fuzzy System Modeling Integrating Soft Subspace Clustering and Sparse Learning

no code implementations24 Apr 2019 Peng Xu, Zhaohong Deng, Chen Cui, Te Zhang, Kup-Sze Choi, Gu Suhang, Jun Wang, Shitong Wang

Furthermore, for highly nonlinear modeling task, it is usually necessary to use a large number of rules which further weakens the clarity and interpretability of TSK FS.

Clustering Sparse Learning

A novel repetition normalized adversarial reward for headline generation

no code implementations19 Feb 2019 Peng Xu, Pascale Fung

While reinforcement learning can effectively improve language generation models, it often suffers from generating incoherent and repetitive phrases \cite{paulus2017deep}.

Headline Generation reinforcement-learning +1

Transfer Representation Learning with TSK Fuzzy System

no code implementations9 Jan 2019 Peng Xu, Zhaohong Deng, Jun Wang, Qun Zhang, Shitong Wang

A core issue in transfer learning is to learn a shared feature space in where the distributions of the data from two domains are matched.

Dimensionality Reduction Representation Learning +1

Trust Region Based Adversarial Attack on Neural Networks

2 code implementations CVPR 2019 Zhewei Yao, Amir Gholami, Peng Xu, Kurt Keutzer, Michael Mahoney

To address this problem, we present a new family of trust region based adversarial attacks, with the goal of computing adversarial perturbations efficiently.

Adversarial Attack

Newton-MR: Inexact Newton Method With Minimum Residual Sub-problem Solver

no code implementations30 Sep 2018 Fred Roosta, Yang Liu, Peng Xu, Michael W. Mahoney

We consider a variant of inexact Newton Method, called Newton-MR, in which the least-squares sub-problems are solved approximately using Minimum Residual method.

Anomaly Detection for Skin Disease Images Using Variational Autoencoder

no code implementations3 Jul 2018 Yuchen Lu, Peng Xu

If we focus on specific diseases, the model is able to detect melanoma with 0. 864 AUCROC and detect actinic keratosis with 0. 872 AUCROC, even if it only sees the images of nevus.

Anomaly Detection

SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

1 code implementation CVPR 2018 Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo

Key to our network design is the embedding of unique characteristics of human sketch, where (i) a two-branch CNN-RNN architecture is adapted to explore the temporal ordering of strokes, and (ii) a novel hashing loss is specifically designed to accommodate both the temporal and abstract traits of sketches.

Deep Hashing Sketch Recognition

Neural Fine-Grained Entity Type Classification with Hierarchy-Aware Loss

3 code implementations NAACL 2018 Peng Xu, Denilson Barbosa

The task of Fine-grained Entity Type Classification (FETC) consists of assigning types from a hierarchy to entity mentions in text.

General Classification Multi-Label Classification +3

Investigations on Knowledge Base Embedding for Relation Prediction and Extraction

no code implementations6 Feb 2018 Peng Xu, Denilson Barbosa

We report an evaluation of the effectiveness of the existing knowledge base embedding models for relation prediction and for relation extraction on a wide range of benchmarks.

Relation Relation Extraction

GIANT: Globally Improved Approximate Newton Method for Distributed Optimization

no code implementations NeurIPS 2018 Shusen Wang, Farbod Roosta-Khorasani, Peng Xu, Michael W. Mahoney

For distributed computing environment, we consider the empirical risk minimization problem and propose a distributed and communication-efficient Newton-type optimization method.

Distributed Computing Distributed Optimization

Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study

no code implementations25 Aug 2017 Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney

While first-order optimization methods such as stochastic gradient descent (SGD) are popular in machine learning (ML), they come with well-known deficiencies, including relatively-slow convergence, sensitivity to the settings of hyper-parameters such as learning rate, stagnation at high training errors, and difficulty in escaping flat regions and saddle points.

BIG-bench Machine Learning Second-order methods

Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information

no code implementations23 Aug 2017 Peng Xu, Fred Roosta, Michael W. Mahoney

In this light, we consider the canonical problem of finite-sum minimization, provide appropriate uniform and non-uniform sub-sampling strategies to construct such Hessian approximations, and obtain optimal iteration complexity for the corresponding sub-sampled trust-region and cubic regularization methods.

Vocal Bursts Type Prediction

Accelerated Stochastic Power Iteration

2 code implementations10 Jul 2017 Christopher De Sa, Bryan He, Ioannis Mitliagkas, Christopher Ré, Peng Xu

We propose a simple variant of the power iteration with an added momentum term, that achieves both the optimal sample and iteration complexity.

Dimensionality Reduction

Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data

no code implementations25 Oct 2016 Paroma Varma, Bryan He, Dan Iter, Peng Xu, Rose Yu, Christopher De Sa, Christopher Ré

Prior work has explored learning accuracies for these sources even without ground truth labels, but they assume that a single accuracy parameter is sufficient to model the behavior of these sources over the entire training set.

Relation Extraction

Sub-sampled Newton Methods with Non-uniform Sampling

no code implementations NeurIPS 2016 Peng Xu, Jiyan Yang, Farbod Roosta-Khorasani, Christopher Ré, Michael W. Mahoney

As second-order methods prove to be effective in finding the minimizer to a high-precision, in this work, we propose randomized Newton-type algorithms that exploit \textit{non-uniform} sub-sampling of $\{\nabla^2 f_i(w)\}_{i=1}^{n}$, as well as inexact updates, as means to reduce the computational complexity.

Second-order methods

Learning Subclass Representations for Visually-varied Image Classification

no code implementations12 Jan 2016 Xinchao Li, Peng Xu, Yue Shi, Martha Larson, Alan Hanjalic

The novelty of the approach is that subclass representations make use of not only the content of the photos themselves, but also information on the co-occurrence of their tags, which determines membership in both subclasses and top-level classes.

Classification General Classification +2

Short-Term Depression in VLSI Stochastic Synapse

no code implementations NeurIPS 2008 Peng Xu, Timothy K. Horiuchi, Pamela A. Abshire

We report a compact realization of short-term depression (STD) in a VLSI stochastic synapse.

Cannot find the paper you are looking for? You can Submit a new open access paper.