Search Results for author: Kai Sun

Found 56 papers, 21 papers with code

A Transformational Biencoder with In-Domain Negative Sampling for Zero-Shot Entity Linking

no code implementations • Findings (ACL) 2022 • Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

As like previous work, we rely on negative entities to encourage our model to discriminate the golden entities during training.

Entity Linking

Paper
Add Code

Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval

no code implementations • EMNLP 2021 • Xueguang Ma, Minghan Li, Kai Sun, Ji Xin, Jimmy Lin

Recent work has shown that dense passage retrieval techniques achieve better ranking accuracy in open-domain question answering compared to sparse retrieval techniques such as BM25, but at the cost of large space and memory requirements.

Open-Domain Question Answering Passage Retrieval +2

Paper
Add Code

Advancing Geometric Problem Solving: A Comprehensive Benchmark for Multimodal Model Evaluation

no code implementations • 7 Apr 2024 • Kai Sun, Yushi Bai, Nianyi Lin

In this work, we present the MM-MATH dataset, a novel benchmark developed to rigorously evaluate the performance of advanced large language and multimodal models - including but not limited to GPT-4, GPT-4V, and Claude - within the domain of geometric computation.

Math

Paper
Add Code

A Semi-Analytical Approach for State-Space Electromagnetic Transient Simulation Using the Differential Transformation

no code implementations • 19 Dec 2023 • Min Xiong, Kaiyang Huang, Yang Liu, Rui Yao, Kai Sun, Feng Qiu

Case studies are conducted on EMT models of the IEEE 39-bus system and a synthetic 390-bus system to demonstrate the merits of the new simulation approach against traditional methods.

Paper
Add Code

AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation

no code implementations • 6 Dec 2023 • Xinzhou Wang, Yikai Wang, Junliang Ye, Zhengyi Wang, Fuchun Sun, Pengkun Liu, Ling Wang, Kai Sun, Xintong Wang, Bin He

Extensive experiments demonstrate the capability of our method in generating high-flexibility text-guided 3D models from the monocular video, while also showing improved reconstruction performance over existing non-rigid reconstruction methods.

3D Generation Denoising +1

Paper
Add Code

Anaphor Assisted Document-Level Relation Extraction

1 code implementation • 28 Oct 2023 • Chonggang Lu, Richong Zhang, Kai Sun, Jaein Kim, Cunwang Zhang, Yongyi Mao

Existing methods focus on building a heterogeneous document graph to model the internal structure of an entity and the external interaction between entities.

Document-level Relation Extraction Relation +1

Paper
Code

Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?

no code implementations • 20 Aug 2023 • Kai Sun, Yifan Ethan Xu, Hanwen Zha, Yue Liu, Xin Luna Dong

Since the recent prosperity of Large Language Models (LLMs), there have been interleaved discussions regarding how to reduce hallucinations from LLM responses, how to increase the factuality of LLMs, and whether Knowledge Graphs (KGs), which store the world knowledge in a symbolic form, will be replaced with LLMs.

Knowledge Graphs World Knowledge

Paper
Add Code

The Potential and Pitfalls of using a Large Language Model such as ChatGPT or GPT-4 as a Clinical Assistant

no code implementations • 16 Jul 2023 • Jingqing Zhang, Kai Sun, Akshay Jagadeesh, Mahta Ghahfarokhi, Deepa Gupta, Ashok Gupta, Vibhor Gupta, Yike Guo

Recent studies have demonstrated promising performance of ChatGPT and GPT-4 on several medical domain tasks.

Language Modelling Large Language Model +1

Paper
Add Code

Trading Syntax Trees for Wordpieces: Target-oriented Opinion Words Extraction with Wordpieces and Aspect Enhancement

no code implementations • 18 May 2023 • Samuel Mensah, Kai Sun, Nikolaos Aletras

State-of-the-art target-oriented opinion word extraction (TOWE) models typically use BERT-based text encoders that operate on the word level, along with graph convolutional networks (GCNs) that incorporate syntactic information extracted from syntax trees.

Sentence target-oriented opinion words extraction

Paper
Add Code

Self-training through Classifier Disagreement for Cross-Domain Opinion Target Extraction

no code implementations • 28 Feb 2023 • Kai Sun, Richong Zhang, Samuel Mensah, Nikolaos Aletras, Yongyi Mao, Xudong Liu

Inspired by the theoretical foundations in domain adaptation [2], we propose a new SSL approach that opts for selecting target samples whose model output from a domain-specific teacher and student network disagree on the unlabelled target data, in an effort to boost the target domain performance.

Aspect Extraction Domain Adaptation +1

Paper
Add Code

Semi-Analytical Electromagnetic Transient Simulation Using Differential Transformation

no code implementations • 18 Feb 2023 • Min Xiong, Rui Yao, Yang Liu, Kai Sun, Feng Qiu

For electromagnetic transient (EMT) simulation of a power system, a state-space-based approach needs to solve state-space EMT equations by using numerical integration methods, e. g., the Euler method, Runge-Kutta methods, and trapezoidal-rule method, at small time steps.

Numerical Integration

Paper
Add Code

Digital Asset Valuation: A Study on Domain Names, Email Addresses, and NFTs

no code implementations • 6 Oct 2022 • Kai Sun

We further propose improvements to make the model more aware of the time sensitivity of transactions and the popularity of assets.

Paper
Add Code

Preparing data for pathological artificial intelligence with clinical-grade performance

no code implementations • 22 May 2022 • Yuanqing Yang, Kai Sun, Yanhua Gao, Kuangsong Wang, Gang Yu

The digital pathology is fundamental of clinical-grade PAI, and the techniques of data standardization and weakly supervised learning methods based on whole slide image (WSI) are effective ways to overcome obstacles of performance reproduction.

Weakly-supervised Learning

Paper
Add Code

Machine Learning based Optimal Feedback Control for Microgrid Stabilization

no code implementations • 9 Mar 2022 • Tianwei Xia, Kai Sun, Wei Kang

A case study is carried out for a microgrid model based on a modified Kundur two-area system to test the real-time performance of the proposed control scheme.

BIG-bench Machine Learning

Paper
Add Code

Data-Driven Computational Methods for the Domain of Attraction and Zubov's Equation

no code implementations • 29 Dec 2021 • Wei Kang, Kai Sun, Liang Xu

We prove that a neural network approximation exists for the Lyapunov function of power systems such that the approximation error is a cubic polynomial of the number of generators.

Paper
Add Code

PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving

no code implementations • 23 Dec 2021 • Pengchuan Xiao, Zhenlei Shao, Steven Hao, Zishuo Zhang, Xiaolin Chai, Judy Jiao, Zesong Li, Jian Wu, Kai Sun, Kun Jiang, Yunlong Wang, Diange Yang

The accelerating development of autonomous driving technology has placed greater demands on obtaining large amounts of high-quality data.

3D Object Detection Autonomous Driving +5

Paper
Add Code

An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction

1 code implementation • EMNLP 2021 • Samuel Mensah, Kai Sun, Nikolaos Aletras

Target-oriented opinion words extraction (TOWE) (Fan et al., 2019b) is a new subtask of target-oriented sentiment analysis that aims to extract opinion words for a given aspect in text.

Position target-oriented opinion words extraction +1

Paper
Code

XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data

2 code implementations • 26 May 2021 • Eloise Withnell, XiaoYu Zhang, Kai Sun, Yike Guo

To the best of our knowledge, XOmiVAE is one of the first activation level-based interpretable deep learning models explaining novel clusters generated by VAE.

Classification Clustering +1

Paper
Code

Multi-scale super-resolution generation of low-resolution scanned pathological images

1 code implementation • 15 May 2021 • Kai Sun, Yanhua Gao, Ting Xie, Xun Wang, Qingqing Yang, Le Chen, Kuansong Wang, Gang Yu

We design a strategy to scan slides with low resolution (5X) and a super-resolution method is proposed to restore the image details when in diagnosis.

Generative Adversarial Network SSIM +1

Paper
Code

A Replication Study of Dense Passage Retriever

1 code implementation • 12 Apr 2021 • Xueguang Ma, Kai Sun, Ronak Pradeep, Jimmy Lin

Text retrieval using learned dense representations has recently emerged as a promising alternative to "traditional" text retrieval using sparse bag-of-words representations.

Open-Domain Question Answering Retrieval +1

1,458

Paper
Code

Deep Convolutional Sparse Coding Network for Pansharpening with Guidance of Side Information

1 code implementation • 10 Mar 2021 • Shuang Xu, Jiangshe Zhang, Kai Sun, Zixiang Zhao, Lu Huang, Junmin Liu, Chunxia Zhang

Pansharpening is a fundamental issue in remote sensing field.

Pansharpening Rolling Shutter Correction

Paper
Code

Deep Gradient Projection Networks for Pan-sharpening

1 code implementation • CVPR 2021 • Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang

Specifically, two optimization problems regularized by the deep prior are formulated, and they are separately responsible for the generative models for panchromatic images and low resolution multispectral images.

Paper
Code

OmiEmbed: a unified multi-task deep learning framework for multi-omics data

1 code implementation • 3 Feb 2021 • XiaoYu Zhang, Yuting Xing, Kai Sun, Yike Guo

To tackle this problem and pave the way for machine learning aided precision medicine, we proposed a unified multi-task deep learning framework named OmiEmbed to capture biomedical information from high-dimensional omics data with the deep embedding and downstream task modules.

BIG-bench Machine Learning Decision Making +2

Paper
Code

Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data

no code implementations • Findings (EMNLP) 2021 • Dian Yu, Kai Sun, Dong Yu, Claire Cardie

In spite of much recent research in the area, it is still unclear whether subject-area question-answering data is useful for machine reading comprehension (MRC) tasks.

Machine Reading Comprehension Multiple-choice +1

Paper
Add Code

A Blockchain-based Trust System for Decentralised Applications: When trustless needs trust

no code implementations • 26 Jan 2021 • Nguyen Truong, Gyu Myoung Lee, Kai Sun, Florian Guitton, Yike Guo

Blockchain technology has been envisaged to commence an era of decentralised applications and services (DApps) without the need for a trusted intermediary.

Cryptography and Security Distributed, Parallel, and Cluster Computing

Paper
Add Code

FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Filter

no code implementations • 31 Dec 2020 • Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Kai Sun, Lu Huang, Junmin Liu, Chunxia Zhang

In addition, the latent information of features can be preserved effectively through adversarial training.

Generative Adversarial Network Image Enhancement +1

Paper
Add Code

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy

1 code implementation • 29 Dec 2020 • Shuang Xu, Lizhen Ji, Zhe Wang, Pengfei Li, Kai Sun, Chunxia Zhang, Jiangshe Zhang

According to the idea that each local region in the fused image should be similar to the sharpest one among source images, this paper presents an optimization-based approach to reduce defocus spread effects.

SSIM

Paper
Code

Aligning geographic entities from historical maps for building knowledge graphs

no code implementations • 5 Dec 2020 • Kai Sun, Yingjie Hu, Jia Song, Yunqiang Zhu

Despite their valuable content, it is often challenging to access and use the information in historical maps, due to their forms of paper-based maps or scanned images.

Knowledge Graphs

Paper
Add Code

Privacy Preservation in Federated Learning: An insightful survey from the GDPR Perspective

no code implementations • 10 Nov 2020 • Nguyen Truong, Kai Sun, Siyao Wang, Florian Guitton, Yike Guo

Furthermore, in the era of the Internet of Things and big data in which data is essentially distributed, transferring a vast amount of data to a data centre for processing seems to be a cumbersome solution.

Federated Learning Privacy Preserving

Paper
Add Code

Adding Chit-Chat to Enhance Task-Oriented Dialogues

1 code implementation • NAACL 2021 • Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie

Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e. g., booking hotels), open-domain chatbots aim at making socially engaging conversations.

Dialogue Generation Dialogue Understanding +1

Paper
Code

Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge

no code implementations • ACL 2022 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie

In this paper, we aim to extract commonsense knowledge to improve machine reading comprehension.

Machine Reading Comprehension

Paper
Add Code

When Image Decomposition Meets Deep Learning: A Novel Infrared and Visible Image Fusion Method

no code implementations • 2 Sep 2020 • Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Kai Sun, Chunxia Zhang, Junmin Liu

The core idea is that the encoder decomposes an image into base and detail feature maps with low- and high-frequency information, respectively, and that the decoder is responsible for the original image reconstruction.

Image Enhancement Image Reconstruction +1

Paper
Add Code

Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations

no code implementations • EMNLP 2020 • Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

The idea of using multi-task learning approaches to address the joint extraction of entity and relation is motivated by the relatedness between the entity recognition task and the relation classification task.

Ranked #8 on Relation Extraction on WebNLG

General Classification Multi-Task Learning +3

Paper
Add Code

Recurrent Interaction Network for Jointly Extracting Entities andClassifying Relations

no code implementations • arXiv 2020 • Kai Sun, 2 Richong Zhang, 2∗Samuel Mensah, 1, 2 Yongyi Mao, 3 Xudong Liu

In this study, we argue that an explicit interaction between the NER model and the RE model will better guide the training of both models.

Multi-Task Learning named-entity-recognition +3

Paper
Add Code

Dialogue-Based Relation Extraction

3 code implementations • ACL 2020 • Dian Yu, Kai Sun, Claire Cardie, Dong Yu

We present the first human-annotated dialogue-based relation extraction (RE) dataset DialogRE, aiming to support the prediction of relation(s) between two arguments that appear in a dialogue.

Ranked #6 on Dialog Relation Extraction on DialogRE (F1c (v1) metric)

Dialog Relation Extraction Relation +1

132

Paper
Code

CLUE: A Chinese Language Understanding Evaluation Benchmark

3 code implementations • COLING 2020 • Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, Zhenzhong Lan

The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE allows new NLU models to be evaluated across a diverse set of tasks.

General Classification Machine Reading Comprehension +4

3,825

Paper
Code

MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships

1 code implementation • CVPR 2020 • Yongjian Chen, Lei Tai, Kai Sun, Mingyang Li

Monocular 3D object detection is an essential component in autonomous driving while challenging to solve, especially for those occluded samples which are only partially visible.

Ranked #11 on Vehicle Pose Estimation on KITTI Cars Hard

Autonomous Driving Monocular 3D Object Detection +3

167

Paper
Code

Unsupervised Annotation of Phenotypic Abnormalities via Semantic Latent Representations on Electronic Health Records

1 code implementation • 10 Nov 2019 • Jingqing Zhang, Xiao-Yu Zhang, Kai Sun, Xian Yang, Chengliang Dai, Yike Guo

The extraction of phenotype information which is naturally contained in electronic health records (EHRs) has been found to be useful in various clinical informatics applications such as disease diagnosis.

Computational Efficiency

Paper
Code

Aspect-Level Sentiment Analysis Via Convolution over Dependency Tree

no code implementations • IJCNLP 2019 • Kai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu

We propose a method based on neural networks to identify the sentiment polarity of opinion words expressed on a specific aspect of a sentence.

Representation Learning Sentence +2

Paper
Add Code

Improving Pre-Trained Multilingual Model with Vocabulary Expansion

no code implementations • CONLL 2019 • Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu

However, in multilingual setting, it is extremely resource-consuming to pre-train a deep language model over large-scale corpora for each language.

Language Modelling Machine Reading Comprehension +6

Paper
Add Code

Improving Pre-Trained Multilingual Models with Vocabulary Expansion

no code implementations • 26 Sep 2019 • Hai Wang, Dian Yu, Kai Sun, Janshu Chen, Dong Yu

However, in multilingual setting, it is extremely resource-consuming to pre-train a deep language model over large-scale corpora for each language.

Language Modelling Machine Reading Comprehension +6

Paper
Add Code

Sem-LSD: A Learning-based Semantic Line Segment Detector

1 code implementation • 14 Sep 2019 • Yi Sun, Xushen Han, Kai Sun, Boren Li, Yongjiang Chen, Mingyang Li

Combined with high-level semantics, Sem-LS is more robust under cluttered environment compared with existing line-shaped representations.

Line Segment Detection Loop Closure Detection

Paper
Code

Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification

4 code implementations • 17 Aug 2019 • Xiao-Yu Zhang, Jingqing Zhang, Kai Sun, Xian Yang, Chengliang Dai, Yike Guo

The training procedure of OmiVAE is comprised of an unsupervised phase without the classifier and a supervised phase with the classifier.

Classification Decision Making +3

Paper
Code

AMAD: Adversarial Multiscale Anomaly Detection on High-Dimensional and Time-Evolving Categorical Data

no code implementations • 12 Jul 2019 • Zheng Gao, Lin Guo, Chi Ma, Xiao Ma, Kai Sun, Hang Xiang, Xiaoqiang Zhu, Hongsong Li, Xiaozhong Liu

Anomaly detection is facing with emerging challenges in many important industry domains, such as cyber security and online recommendation and advertising.

Anomaly Detection

Paper
Add Code

Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension

1 code implementation • TACL 2020 • Kai Sun, Dian Yu, Dong Yu, Claire Cardie

Machine reading comprehension tasks require a machine reader to answer questions relevant to the given document.

Data Augmentation Language Modelling +3

161

Paper
Code

DREAM: A Challenge Data Set and Models for Dialogue-Based Reading Comprehension

no code implementations • TACL 2019 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie

We present DREAM, the first dialogue-based multiple-choice reading comprehension data set.

Dialogue Understanding Multiple-choice +3

Paper
Add Code

Evidence Sentence Extraction for Machine Reading Comprehension

1 code implementation • CONLL 2019 • Hai Wang, Dian Yu, Kai Sun, Jianshu Chen, Dong Yu, David Mcallester, Dan Roth

Remarkable success has been achieved in the last few years on some limited machine reading comprehension (MRC) tasks.

Machine Reading Comprehension Multiple-choice +2

Paper
Code

Improving Question Answering with External Knowledge

1 code implementation • WS 2019 • Xiaoman Pan, Kai Sun, Dian Yu, Jianshu Chen, Heng Ji, Claire Cardie, Dong Yu

We focus on multiple-choice question answering (QA) tasks in subject areas such as science, where we require both broad background knowledge and the facts from the given subject-area reference corpus.

Multiple-choice Question Answering

Paper
Code

DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension

1 code implementation • 1 Feb 2019 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie

DREAM is likely to present significant challenges for existing reading comprehension systems: 84% of answers are non-extractive, 85% of questions require reasoning beyond a single sentence, and 34% of questions also involve commonsense knowledge.

Dialogue Understanding Multiple-choice +3

Paper
Code

Image Embedding of PMU Data for Deep Learning towards Transient Disturbance Classification

no code implementations • 22 Dec 2018 • Yongli Zhu, Chengxi Liu, Kai Sun

This paper presents a study on power grid disturbance classification by Deep Learning (DL).

Classification General Classification +2

Paper
Add Code

Improving Machine Reading Comprehension with General Reading Strategies

1 code implementation • NAACL 2019 • Kai Sun, Dian Yu, Dong Yu, Claire Cardie

Reading strategies have been shown to improve comprehension levels, especially for readers lacking adequate prior knowledge.

Ranked #7 on Question Answering on StoryCloze

Language Modelling Machine Reading Comprehension +2

Paper
Code

Visualizing and Understanding Deep Neural Networks in CTR Prediction

no code implementations • 22 Jun 2018 • Lin Guo, Hui Ye, Wenbo Su, Henhuan Liu, Kai Sun, Hang Xiang

Recently, many works have been done on visualizing and analyzing the mechanism of deep neural networks in the areas of image processing and natural language processing.

Click-Through Rate Prediction