Search Results for author: Haoyu Dong

Found 25 papers, 14 papers with code

Rethinking Perceptual Metrics for Medical Image Translation

no code implementations10 Apr 2024 Nicholas Konz, YuWen Chen, Hanxue Gu, Haoyu Dong, Maciej A. Mazurowski

Modern medical image translation methods use generative models for tasks such as the conversion of CT images to MRI.

Translation

ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models

no code implementations16 Mar 2024 YuWen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee, Maciej A. Mazurowski

We evaluate our method by training a segmentation model on images translated from CT to MRI with their original CT masks and testing its performance on real MRIs.

Anatomy Translation

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries

no code implementations20 Feb 2024 Wei Zhao, Zhitao Hou, Siyuan Wu, Yan Gao, Haoyu Dong, Yao Wan, Hongyu Zhang, Yulei Sui, Haidong Zhang

Writing formulas on spreadsheets, such as Microsoft Excel and Google Sheets, is a widespread practice among users performing data analysis.

Natural Language Queries

Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models

1 code implementation7 Feb 2024 Nicholas Konz, YuWen Chen, Haoyu Dong, Maciej A. Mazurowski

Diffusion models have enabled remarkably high-quality medical image generation, yet it is challenging to enforce anatomical constraints in generated images.

counterfactual Image Generation +1

Rethinking Medical Report Generation: Disease Revealing Enhancement with Knowledge Graph

1 code implementation24 Jul 2023 Yixin Wang, Zihao Lin, Haoyu Dong

Knowledge Graph (KG) plays a crucial role in Medical Report Generation (MRG) because it reveals the relations among diseases and thus can be utilized to guide the generation process.

Medical Report Generation

A systematic study of the foreground-background imbalance problem in deep learning for object detection

no code implementations28 Jun 2023 Hanxue Gu, Haoyu Dong, Nicholas Konz, Maciej A. Mazurowski

We experimentally study the effects of different aspects of F-B imbalance (object size, number of objects, dataset size, object type) on detection performance.

Object object-detection +1

Unsupervised anomaly localization in high-resolution breast scans using deep pluralistic image completion

1 code implementation4 May 2023 Nicholas Konz, Haoyu Dong, Maciej A. Mazurowski

Given the scarcity of abnormal images and the abundance of normal images for this problem, an anomaly detection/localization approach could be well-suited.

Anomaly Detection

Segment Anything Model for Medical Image Analysis: an Experimental Study

2 code implementations20 Apr 2023 Maciej A. Mazurowski, Haoyu Dong, Hanxue Gu, Jichen Yang, Nicholas Konz, Yixin Zhang

We conclude that SAM shows impressive zero-shot segmentation performance for certain medical imaging datasets, but moderate to poor performance for others.

Image Segmentation Interactive Segmentation +5

The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning

1 code implementation6 Jul 2022 Nicholas Konz, Hanxue Gu, Haoyu Dong, Maciej A. Mazurowski

These results give a more principled underpinning for the intuition that radiological images can be more challenging to apply deep learning to than natural image datasets common to machine learning research.

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data

1 code implementation25 May 2022 Fan Zhou, Mengkang Hu, Haoyu Dong, Zhoujun Cheng, Shi Han, Dongmei Zhang

Existing auto-regressive pre-trained language models (PLMs) like T5 and BART, have been well applied to table question answering by UNIFIEDSKG and TAPEX, respectively, and demonstrated state-of-the-art results on multiple benchmarks.

Question Answering

PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation

1 code implementation25 May 2022 Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, Dongmei Zhang

However, directly learning the logical inference knowledge from table-text pairs is very difficult for neural models because of the ambiguity of natural language and the scarcity of parallel data.

Table-to-Text Generation

Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks

no code implementations24 Jan 2022 Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, Dongmei Zhang

Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs, and various other document types, a flurry of table pre-training frameworks have been proposed following the success of text and images, and they have achieved new state-of-the-arts on various tasks such as table question answering, table type recognition, column relation classification, table search, formula prediction, etc.

Denoising Question Answering +2

Lightweight Transformer Backbone for Medical Object Detection

no code implementations22 Nov 2021 Yifan Zhang, Haoyu Dong, Nicholas Konz, Hanxue Gu, Maciej A. Mazurowski

Specifically, we propose a novel modification of visual transformer (ViT) on image feature patches to connect the feature patches of a tumor with healthy backgrounds of breast images and form a more robust backbone for tumor detection.

Lesion Detection Medical Object Detection +2

FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining

1 code implementation ACL 2022 Zhoujun Cheng, Haoyu Dong, Ran Jia, Pengfei Wu, Shi Han, Fan Cheng, Dongmei Zhang

In this paper, we find that the spreadsheet formula, which performs calculations on numerical values in tables, is naturally a strong supervision of numerical reasoning.

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation

1 code implementation ACL 2022 Zhoujun Cheng, Haoyu Dong, Zhiruo Wang, Ran Jia, Jiaqi Guo, Yan Gao, Shi Han, Jian-Guang Lou, Dongmei Zhang

HiTab provides 10, 686 QA pairs and descriptive sentences with well-annotated quantity and entity alignment on 3, 597 tables with broad coverage of table hierarchies and numerical reasoning types.

Descriptive Entity Alignment +2

TableSense: Spreadsheet Table Detection with Convolutional Neural Networks

1 code implementation25 Jun 2021 Haoyu Dong, Shijie Liu, Shi Han, Zhouyu Fu, Dongmei Zhang

Spreadsheet table detection is the task of detecting all tables on a given sheet and locating their respective ranges.

Active Learning Boundary Detection +1

Trust It or Not: Confidence-Guided Automatic Radiology Report Generation

no code implementations21 Jun 2021 Yixin Wang, Zihao Lin, Zhe Xu, Haoyu Dong, Jiang Tian, Jie Luo, Zhongchao shi, Yang Zhang, Jianping Fan, Zhiqiang He

Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets.

Decision Making Image Captioning +2

Using Text to Teach Image Retrieval

no code implementations19 Nov 2020 Haoyu Dong, Ze Wang, Qiang Qiu, Guillermo Sapiro

Image retrieval relies heavily on the quality of the data modeling and the distance measurement in the feature space.

Image Retrieval Retrieval +2

TUTA: Tree-based Transformers for Generally Structured Table Pre-training

1 code implementation21 Oct 2020 Zhiruo Wang, Haoyu Dong, Ran Jia, Jia Li, Zhiyi Fu, Shi Han, Dongmei Zhang

First, we devise a unified tree-based structure, called a bi-dimensional coordinate tree, to describe both the spatial and hierarchical information of generally structured tables.

Rethinking Exposure Bias In Language Modeling

no code implementations13 Oct 2019 Yifan Xu, Kening Zhang, Haoyu Dong, Yuezhou Sun, Wenlong Zhao, Zhuowen Tu

Exposure bias describes the phenomenon that a language model trained under the teacher forcing schema may perform poorly at the inference stage when its predictions are conditioned on its previous predictions unseen from the training corpus.

Language Modelling Reinforcement Learning (RL)

Semantic Structure Extraction for Spreadsheet Tables with a Multi-task Learning Architecture

no code implementations NeurIPS Workshop Document_Intelligen 2019 Haoyu Dong, Shijie Liu, Zhouyu Fu, Shi Han, Dongmei Zhang

To learn spatial correlations and capture semantics on spreadsheets, we have developed a novel learning-based framework for spreadsheet semantic structure extraction.

Language Modelling Multi-Task Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.