Search Results for author: Dongsheng Wang

Found 33 papers, 12 papers with code

2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion

no code implementations26 Apr 2024 Dongsheng Wang, Xiaoqin Feng, Zeming Liu, Chuan Wang

To tackle this challenging MMNER task on the dataset, we introduce a new model called 2M-NER, which aligns the text and image representations using contrastive learning and integrates a multimodal collaboration module to effectively depict the interactions between the two modalities.

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

no code implementations5 Apr 2024 Ran Zmigrod, Dongsheng Wang, Mathieu Sibue, Yulong Pei, Petr Babkin, Ivan Brugere, Xiaomo Liu, Nacho Navarro, Antony Papadimitriou, William Watson, Zhiqiang Ma, Armineh Nourbakhsh, Sameena Shah

Several datasets exist for research on specific tasks of VRDU such as document classification (DC), key entity extraction (KEE), entity linking, visual question answering (VQA), inter alia.

Document Classification document understanding +5

Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency

no code implementations26 Mar 2024 Toyin Aguda, Suchetha Siddagangappa, Elena Kochkina, Simerjot Kaur, Dongsheng Wang, Charese Smiley, Sameena Shah

Collecting labeled datasets in finance is challenging due to scarcity of domain experts and higher cost of employing them.

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

1 code implementation20 Mar 2024 Di Wang, Jing Zhang, Minqiang Xu, Lin Liu, Dongsheng Wang, Erzhong Gao, Chengxi Han, HaoNan Guo, Bo Du, DaCheng Tao, Liangpei Zhang

However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as image classification or object discrimination tasks.

 Ranked #1 on Semantic Segmentation on SpaceNet 1 (using extra training data)

Aerial Scene Classification Building change detection for remote sensing images +12

DocGraphLM: Documental Graph Language Model for Information Extraction

no code implementations5 Jan 2024 Dongsheng Wang, Zhiqiang Ma, Armineh Nourbakhsh, Kang Gu, Sameena Shah

Advances in Visually Rich Document Understanding (VrDU) have enabled information extraction and question answering over documents with complex layouts.

document understanding Language Modelling +2

DocLLM: A layout-aware generative language model for multimodal document understanding

no code implementations31 Dec 2023 Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities.

document understanding Language Modelling

Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection

1 code implementation NeurIPS 2023 Ruiying Lu, Yujie Wu, Long Tian, Dongsheng Wang, Bo Chen, Xiyang Liu, Ruimin Hu

First, instead of learning the continuous representations, we preserve the typical normal patterns as discrete iconic prototypes, and confirm the importance of Vector Quantization in preventing the model from falling into the shortcut.

Quantization Unsupervised Anomaly Detection

PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image Classification

1 code implementation ICCV 2023 Miaoge Li, Dongsheng Wang, Xinyang Liu, Zequn Zeng, Ruiying Lu, Bo Chen, Mingyuan Zhou

We find that by formulating the multi-label classification as a CT problem, we can exploit the interactions between the image and label efficiently by minimizing the bidirectional CT cost.

Multi-Label Classification Multi-Label Image Classification

REFinD: Relation Extraction Financial Dataset

no code implementations22 May 2023 Simerjot Kaur, Charese Smiley, Akshat Gupta, Joy Sain, Dongsheng Wang, Suchetha Siddagangappa, Toyin Aguda, Sameena Shah

A number of datasets for Relation Extraction (RE) have been created to aide downstream tasks such as information retrieval, semantic search, question answering and textual entailment.

General Knowledge Information Retrieval +5

Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation

1 code implementation17 Mar 2023 Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu

To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.

Object object-detection +1

Patch-Token Aligned Bayesian Prompt Learning for Vision-Language Models

no code implementations16 Mar 2023 Xinyang Liu, Dongsheng Wang, Miaoge Li, Zhibin Duan, Yishi Xu, Bo Chen, Mingyuan Zhou

For downstream applications of vision-language pre-trained models, there has been significant interest in constructing effective prompts.

Prompt Engineering

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

1 code implementation CVPR 2023 Zequn Zeng, Hao Zhang, Zhengjue Wang, Ruiying Lu, Dongsheng Wang, Bo Chen

Zero-shot capability has been considered as a new revolution of deep learning, letting machines work on tasks without curated training data.

Image Captioning Language Modelling

HyperMiner: Topic Taxonomy Mining with Hyperbolic Embedding

1 code implementation16 Oct 2022 Yishi Xu, Dongsheng Wang, Bo Chen, Ruiying Lu, Zhibin Duan, Mingyuan Zhou

With the tree-likeness property of hyperbolic space, the underlying semantic hierarchy among words and topics can be better exploited to mine more interpretable topics.

Graph structure learning Topic Models

Knowledge-Aware Bayesian Deep Topic Model

1 code implementation20 Sep 2022 Dongsheng Wang, Yishi Xu, Miaoge Li, Zhibin Duan, Chaojie Wang, Bo Chen, Mingyuan Zhou

We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling.

Topic Models

Ordinal Graph Gamma Belief Network for Social Recommender Systems

no code implementations12 Sep 2022 Dongsheng Wang, Chaojie Wang, Bo Chen, Mingyuan Zhou

To build recommender systems that not only consider user-item interactions represented as ordinal variables, but also exploit the social network describing the relationships between the users, we develop a hierarchical Bayesian model termed ordinal graph factor analysis (OGFA), which jointly models user-item and user-user interactions.

Recommendation Systems

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

2 code implementations ICLR 2022 Dongsheng Wang, Dandan Guo, He Zhao, Huangjie Zheng, Korawat Tanwisuth, Bo Chen, Mingyuan Zhou

This paper introduces a new topic-modeling framework where each document is viewed as a set of word embedding vectors and each topic is modeled as an embedding vector in the same embedding space.

Word Embeddings

Short Range Correlation Transformer for Occluded Person Re-Identification

no code implementations4 Jan 2022 Yunbin Zhao, Songhao Zhu, Dongsheng Wang, Zhiwei Liang

However, the performance of vision transformer in extracting local features is inferior to that of convolutional neural network.

Person Re-Identification

TopicNet: Semantic Graph-Guided Topic Discovery

1 code implementation NeurIPS 2021 Zhibin Duan, Yishi Xu, Bo Chen, Dongsheng Wang, Chaojie Wang, Mingyuan Zhou

Existing deep hierarchical topic models are able to extract semantically meaningful topics from a text corpus in an unsupervised manner and automatically organize them into a topic hierarchy.

Inductive Bias Topic Models +1

Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network

1 code implementation30 Jun 2021 Zhibin Duan, Dongsheng Wang, Bo Chen, Chaojie Wang, Wenchao Chen, Yewen Li, Jie Ren, Mingyuan Zhou

However, they often assume in the prior that the topics at each layer are independently drawn from the Dirichlet distribution, ignoring the dependencies between the topics both at the same layer and across different layers.

Topic Models Variational Inference

Semantic Representation and Inference for NLP

no code implementations15 Jun 2021 Dongsheng Wang

This thesis investigates the use of deep learning for novel semantic representation and inference, and makes contributions in the following three areas: creating training data, improving semantic representations and extending inference learning.

Claim Verification Fact Checking +2

Image Inpainting with Edge-guided Learnable Bidirectional Attention Maps

1 code implementation25 Apr 2021 Dongsheng Wang, Chaohao Xie, Shaohui Liu, Zhenxing Niu, WangMeng Zuo

In this paper, we present an edge-guided learnable bidirectional attention map (Edge-LBAM) for improving image inpainting of irregular holes with several distinct merits.

Image Inpainting valid

Structural block driven - enhanced convolutional neural representation for relation extraction

no code implementations21 Mar 2021 Dongsheng Wang, Prayag Tiwari, Sahil Garg, Hongyin Zhu, Peter Bruza

In this paper, we propose a novel lightweight relation extraction approach of structural block driven - convolutional neural learning.

Relation Relation Extraction +1

Topic-aware Contextualized Transformers

no code implementations1 Jan 2021 Ruiying Lu, Bo Chen, Dan dan Guo, Dongsheng Wang, Mingyuan Zhou

Moving beyond conventional Transformers that ignore longer-range word dependencies and contextualize their word representations at the segment level, the proposed method not only captures global semantic coherence of all segments and global word concurrence patterns, but also enriches the representation of each token by adapting it to its local context, which is not limited to the segment it resides in and can be flexibly defined according to the task.

Word Embeddings

Multi-Head Self-Attention with Role-Guided Masks

1 code implementation22 Dec 2020 Dongsheng Wang, Casper Hansen, Lucas Chaves Lima, Christian Hansen, Maria Maistro, Jakob Grue Simonsen, Christina Lioma

The state of the art in learning meaningful semantic representations of words is the Transformer model and its attention mechanisms.

Machine Translation text-classification +2

Deep Relational Topic Modeling via Graph Poisson Gamma Belief Network

no code implementations NeurIPS 2020 Chaojie Wang, Hao Zhang, Bo Chen, Dongsheng Wang, Zhengjue Wang, Mingyuan Zhou

To analyze a collection of interconnected documents, relational topic models (RTMs) have been developed to describe both the link structure and document content, exploring their underlying relationships via a single-layer latent representation with limited expressive capability.

Topic Models

Denmark's Participation in the Search Engine TREC COVID-19 Challenge: Lessons Learned about Searching for Precise Biomedical Scientific Information on COVID-19

no code implementations25 Nov 2020 Lucas Chaves Lima, Casper Hansen, Christian Hansen, Dongsheng Wang, Maria Maistro, Birger Larsen, Jakob Grue Simonsen, Christina Lioma

This report describes the participation of two Danish universities, University of Copenhagen and Aalborg University, in the international search engine competition on COVID-19 (the 2020 TREC-COVID Challenge) organised by the U. S. National Institute of Standards and Technology (NIST) and its Text Retrieval Conference (TREC) division.

Retrieval Text Retrieval

QGAN: Quantize Generative Adversarial Networks to Extreme low-bits

no code implementations25 Sep 2019 Peiqi Wang, Yu Ji, Xinfeng Xie, Yongqiang Lyu, Dongsheng Wang, Yuan Xie

Despite the success in model reduction of convolutional neural networks (CNNs), neural network quantization methods have not yet been studied on GANs, which are mainly faced with the issues of both the effectiveness of quantization algorithms and the instability of training GAN models.

Quantization

Contextual Compositionality Detection with External Knowledge Bases andWord Embeddings

no code implementations20 Mar 2019 Dongsheng Wang, Quichi Li, Lucas Chaves Lima, Jakob Grue Simonsen, Christina Lioma

In this paper, we operationalize the viewpoint that compositionality is contextual rather than deterministic, i. e., that whether a phrase is compositional or non-compositional depends on its context.

QGAN: Quantized Generative Adversarial Networks

no code implementations24 Jan 2019 Peiqi Wang, Dongsheng Wang, Yu Ji, Xinfeng Xie, Haoxuan Song, XuXin Liu, Yongqiang Lyu, Yuan Xie

The intensive computation and memory requirements of generative adversarial neural networks (GANs) hinder its real-world deployment on edge devices such as smartphones.

Quantization

HitNet: Hybrid Ternary Recurrent Neural Network

no code implementations NeurIPS 2018 Peiqi Wang, Xinfeng Xie, Lei Deng, Guoqi Li, Dongsheng Wang, Yuan Xie

For example, we improve the perplexity per word (PPW) of a ternary LSTM on Penn Tree Bank (PTB) corpus from 126 (the state-of-the-art result to the best of our knowledge) to 110. 3 with a full precision model in 97. 2, and a ternary GRU from 142 to 113. 5 with a full precision model in 102. 7.

Quantization

Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design

no code implementations22 Sep 2017 Zhourui Song, Zhenyu Liu, Dongsheng Wang

The heavy burdens of computation and off-chip traffic impede deploying the large scale convolution neural network on embedded platforms.

Cannot find the paper you are looking for? You can Submit a new open access paper.