Search Results for author: Dan Zhang

Found 60 papers, 25 papers with code

VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis

1 code implementation20 Mar 2024 Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva

Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content.

Generative Temporal Nursing Text-to-Video Generation +1

Renovating Names in Open-Vocabulary Segmentation Benchmarks

no code implementations14 Mar 2024 Haiwen Huang, Songyou Peng, Dan Zhang, Andreas Geiger

We further demonstrate that using our renovated names enables training of stronger open-vocabulary segmentation models.

Segmentation

MEGAnno+: A Human-LLM Collaborative Annotation System

no code implementations28 Feb 2024 Hannah Kim, Kushan Mitra, Rafael Li Chen, Sajjadur Rahman, Dan Zhang

Large language models (LLMs) can label data faster and cheaper than humans for various NLP tasks.

Management

OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining

no code implementations24 Feb 2024 Fanjin Zhang, Shijie Shi, Yifan Zhu, Bo Chen, Yukuo Cen, Jifan Yu, Yelin Chen, Lulu Wang, Qingfei Zhao, Yuqing Cheng, Tianyi Han, Yuwei An, Dan Zhang, Weng Lam Tam, Kun Cao, Yunhe Pang, Xinyu Guan, Huihui Yuan, Jian Song, Xiaoyan Li, Yuxiao Dong, Jie Tang

We envisage that OAG-Bench can serve as a common ground for the community to evaluate and compare algorithms in academic graph mining, thereby accelerating algorithm development and advancement in this field.

Graph Mining

Contrastive Learning of Shared Spatiotemporal EEG Representations Across Individuals for Naturalistic Neuroscience

no code implementations22 Feb 2024 Xinke Shen, Lingyi Tao, Xuyang Chen, Sen Song, Quanying Liu, Dan Zhang

Targeting the Electroencephalogram (EEG) technique, known for its rich spatial and temporal information, this study presents a general framework for Contrastive Learning of Shared SpatioTemporal EEG Representations across individuals (CL-SSTER).

Brain Decoding Contrastive Learning +1

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

1 code implementation16 Jan 2024 Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

Current L2I models either suffer from poor editability via text or weak alignment between the generated image and the input layout.

Domain Generalization Layout-to-Image Generation +1

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

1 code implementation15 Jan 2024 Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang

To bridge these gaps, we introduce SciGLM, a suite of scientific language models able to conduct college-level scientific reasoning.

Math Mathematical Reasoning

Beyond Top-Class Agreement: Using Divergences to Forecast Performance under Distribution Shift

no code implementations13 Dec 2023 Mona Schirmer, Dan Zhang, Eric Nalisnick

Knowing if a model will generalize to data 'in the wild' is crucial for safe deployment.

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

1 code implementation18 Oct 2023 Xiang Chen, Duanzheng Song, Honghao Gui, Chenxi Wang, Ningyu Zhang, Jiang Yong, Fei Huang, Chengfei Lv, Dan Zhang, Huajun Chen

Despite their impressive generative capabilities, LLMs are hindered by fact-conflicting hallucinations in real-world applications.

Benchmarking Hallucination

XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation

no code implementations12 Oct 2023 Qiang Li, Dan Zhang, Shengzhao Lei, Xun Zhao, Porawit Kamnoedboon, Weiwei Li, Junhao Dong, Shuyan Li

Despite the promising performance of existing visual models on public benchmarks, the critical assessment of their robustness for real-world applications remains an ongoing challenge.

Classification

DocPrompt: Large-scale continue pretrain for zero-shot and few-shot document question answering

no code implementations21 Aug 2023 Sijin Wu, Dan Zhang, Teng Hu, Shikun Feng

In this paper, we propose Docprompt for document question answering tasks with powerful zero-shot and few-shot performance.

Question Answering

Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation

no code implementations19 Aug 2023 Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li

Within the context of autonomous driving, encountering unknown objects becomes inevitable during deployment in the open world.

Anomaly Detection Autonomous Driving +3

Divide & Bind Your Attention for Improved Generative Semantic Nursing

1 code implementation20 Jul 2023 Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

To address the challenges posed by complex prompts or scenarios involving multiple entities and to achieve improved attribute binding, we propose Divide & Bind.

Attribute Generative Semantic Nursing +1

Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization

1 code implementation2 Jul 2023 Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, i. e., intra-source style augmentation (ISSA) effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Data Augmentation +3

Controlling Text-to-Image Diffusion by Orthogonal Finetuning

no code implementations NeurIPS 2023 Zeju Qiu, Weiyang Liu, Haiwen Feng, Yuxuan Xue, Yao Feng, Zhen Liu, Dan Zhang, Adrian Weller, Bernhard Schölkopf

To tackle this challenge, we introduce a principled finetuning method -- Orthogonal Finetuning (OFT), for adapting text-to-image diffusion models to downstream tasks.

Contrastive Representation Disentanglement for Clustering

no code implementations8 Jun 2023 Fei Ding, Dan Zhang, Yin Yang, Venkat Krovi, Feng Luo

We conduct a theoretical analysis of the proposed loss and highlight how it assigns different weights to negative samples during the process of disentangling the feature representation.

Clustering Contrastive Learning +2

High Dynamic Range Imaging with Context-aware Transformer

no code implementations10 Apr 2023 Fangfang Zhou, Dan Zhang, Zhenming Fu

In each Dual Transformer (DT), the global features are extracted by the window-based Transformer, while the local details are extracted using the channel attention mechanism with deformable CNNs.

Deblurring Vocal Bursts Intensity Prediction

MM-BSN: Self-Supervised Image Denoising for Real-World with Multi-Mask based on Blind-Spot Network

1 code implementation4 Apr 2023 Dan Zhang, Fangfang Zhou, Yuwen Jiang, Zhengming Fu

Our MM-BSN can be used to address the problem of large-noise denoising, which cannot be efficiently handled by other BSN methods.

Image Denoising

Self-Supervised Image Denoising for Real-World Images with Context-aware Transformer

no code implementations4 Apr 2023 Dan Zhang, Fangfang Zhou

In this paper, we propose a novel Denoise Transformer for real-world image denoising, which is mainly constructed with Context-aware Denoise Transformer (CADT) units and Secondary Noise Extractor (SNE) block.

Image Denoising SSIM

Identification of Systematic Errors of Image Classifiers on Rare Subgroups

no code implementations ICCV 2023 Jan Hendrik Metzen, Robin Hutmacher, N. Grace Hua, Valentyn Boreiko, Dan Zhang

Despite excellent average-case performance of many image classifiers, their performance can substantially deteriorate on semantically coherent subgroups of the data that were under-represented in the training data.

Adversarial Attack Fairness

Learning From Yourself: A Self-Distillation Method for Fake Speech Detection

no code implementations2 Mar 2023 Jun Xue, Cunhang Fan, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang, Zhao Lv

To address this problem, we propose using the deepest network instruct shallow network for enhancing shallow networks.

Language-Specific Representation of Emotion-Concept Knowledge Causally Supports Emotion Inference

1 code implementation19 Feb 2023 Ming Li, Yusheng Su, Hsiu-Yuan Huang, Jiali Cheng, Xin Hu, Xinmiao Zhang, Huadong Wang, Yujia Qin, Xiaozhi Wang, Kristen A. Lindquist, Zhiyuan Liu, Dan Zhang

Humans no doubt use language to communicate about their emotional experiences, but does language in turn help humans understand emotions, or is language just a vehicle of communication?

Attribute Language Modelling

Towards Multifaceted Human-Centered AI

no code implementations9 Jan 2023 Sajjadur Rahman, Hannah Kim, Dan Zhang, Estevam Hruschka, Eser Kandogan

Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks.

MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

no code implementations8 Jan 2023 Dan Zhang, Hannah Kim, Rafael Li Chen, Eser Kandogan, Estevam Hruschka

We present MEGAnno, a novel exploratory annotation framework designed for NLP researchers and practitioners.

Sentiment Analysis

GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

1 code implementation22 Dec 2022 Haiwen Huang, Andreas Geiger, Dan Zhang

We address the task of open-world class-agnostic object detection, i. e., detecting every object in an image by learning from a limited number of base object classes.

Class-agnostic Object Detection Object +2

Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model

no code implementations2 Nov 2022 Mingqi Li, Fei Ding, Dan Zhang, Long Cheng, Hongxin Hu, Feng Luo

In this paper, we propose Multi-level Multilingual Knowledge Distillation (MMKD), a novel method for improving multilingual language models.

Knowledge Distillation Language Modelling +2

Intra-Source Style Augmentation for Improved Domain Generalization

1 code implementation18 Oct 2022 Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Using the proposed masked noise encoder to randomize style and content combinations in the training set, ISSA effectively increases the diversity of training data and reduces spurious correlation.

Autonomous Driving Domain Generalization +1

TriangleNet: Edge Prior Augmented Network for Semantic Segmentation through Cross-Task Consistency

1 code implementation11 Oct 2022 Dan Zhang, Rui Zheng, Luosang Gadeng, Pei Yang

The proposed method underscores the significance of multi-task learning and explicit cross-task consistency enhancement for advancing semantic segmentation and highlights the potential of multitasking in real-time semantic segmentation.

Autonomous Driving Edge Detection +3

One-Shot Synthesis of Images and Segmentation Masks

1 code implementation15 Sep 2022 Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

To this end, inspired by the recent architectural developments of single-image GANs, we introduce our OSMIS model which enables the synthesis of segmentation masks that are precisely aligned to the generated images in the one-shot regime.

Data Augmentation Image Generation +2

Retinal Structure Detection in OCTA Image via Voting-based Multi-task Learning

1 code implementation23 Aug 2022 Jinkui Hao, Ting Shen, Xueli Zhu, Yonghuai Liu, Ardhendu Behera, Dan Zhang, Bang Chen, Jiang Liu, Jiong Zhang, Yitian Zhao

Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), are of great importance for understanding diseases of the eye and clinical decision-making.

Classification Decision Making +1

A Data Driven Method for Multi-step Prediction of Ship Roll Motion in High Sea States

no code implementations26 Jul 2022 Dan Zhang, Xi Zhou, Zi-Hao Wang, Yan Peng, Shao-Rong Xie

This paper presents a novel data-driven methodology to provide a multi-step prediction of ship roll motions in high sea states.

feature selection

Sparse-based Domain Adaptation Network for OCTA Image Super-Resolution Reconstruction

no code implementations25 Jul 2022 Huaying Hao, Cong Xu, Dan Zhang, Qifeng Yan, Jiong Zhang, Yue Liu, Yitian Zhao

To be more specific, we first perform a simple degradation of the 3x3 mm2/high-resolution (HR) image to obtain the synthetic LR image.

Domain Adaptation Image Super-Resolution

Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

no code implementations20 Sep 2021 Xinke Shen, Xianggen Liu, Xin Hu, Dan Zhang, Sen Song

Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones.

Contrastive Learning EEG +4

BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation

no code implementations27 Jul 2021 Qi Tang, Runmin Cong, Ronghui Sheng, Lingzhi He, Dan Zhang, Yao Zhao, Sam Kwong

The other is the content guidance bridge (CGBdg) designed for the depth map reconstruction process, which provides the content guidance learned from DSR task for MDE task.

Depth Map Super-Resolution Monocular Depth Estimation +1

A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators

no code implementations26 May 2021 Dan Zhang, Safeen Huda, Ebrahim Songhori, Kartik Prabhu, Quoc Le, Anna Goldie, Azalia Mirhoseini

The rapidly-changing deep learning landscape presents a unique opportunity for building inference accelerators optimized for specific datacenter-scale workloads.

Optical Character Recognition (OCR) Scheduling

Annotating Columns with Pre-trained Language Models

1 code implementation5 Apr 2021 Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çağatay Demiralp, Chen Chen, Wang-Chiew Tan

Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information.

Columns Property Annotation Column Type Annotation +3

Generating Novel Scene Compositions from Single Images and Videos

1 code implementation24 Mar 2021 Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva

In this work, we introduce SIV-GAN, an unconditional generative model that can generate new scene compositions from a single training image or a single video clip.

Image Generation Memorization

MITNet: GAN Enhanced Magnetic Induction Tomography Based on Complex CNN

no code implementations16 Feb 2021 Zuohui Chen, Qing Yuan, Xujie Song, Cheng Chen, Dan Zhang, Yun Xiang, Ruigang Liu, Qi Xuan

Magnetic induction tomography (MIT) is an efficient solution for long-term brain disease monitoring, which focuses on reconstructing bio-impedance distribution inside the human brain using non-intrusive electromagnetic fields.

Generative Adversarial Network Image Reconstruction

$\rm ^{83}Rb$/$\rm ^{83m}Kr$ production and cross-section measurement with 3.4 MeV and 20 MeV proton beams

no code implementations4 Feb 2021 Dan Zhang, Jingkai Xia, YiFan Li, Jingtao You, Yao Li, Changbo Fu, Jianglai Liu, Ning Zhou, Jie Bao, Huan Jia, Chenzhang Yuan, Yuan He, Weixing Xiong, Mengyun Guan

$\rm ^{83m}Kr$, with a short lifetime, is an ideal calibration source for liquid xenon or liquid argon detectors.

Nuclear Experiment Instrumentation and Detectors

You Only Need Adversarial Supervision for Semantic Image Synthesis

1 code implementation ICLR 2021 Vadim Sushko, Edgar Schönfeld, Dan Zhang, Juergen Gall, Bernt Schiele, Anna Khoreva

By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity with better alignment to their input label maps, making the use of the perceptual loss superfluous.

Image-to-Image Translation Semantic Segmentation

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

1 code implementation ICLR 2021 Kanil Patel, William Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

The goal of this paper is to resolve the identified issues of HB in order to provide calibrated confidence estimates using only a small holdout calibration dataset for bin optimization while preserving multi-class ranking accuracy.

Quantization

Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features

1 code implementation NeurIPS 2020 Robin Tibor Schirrmeister, Yuxuan Zhou, Tonio Ball, Dan Zhang

We refine previous investigations of this failure at anomaly detection for invertible generative networks and provide a clear explanation of it as a combination of model bias and domain prior: Convolutional networks learn similar low-level feature distributions when trained on any natural image dataset and these low-level features dominate the likelihood.

Anomaly Detection

MGA: Momentum Gradient Attack on Network

no code implementations26 Feb 2020 Jinyin Chen, Yixian Chen, Haibin Zheng, Shijing Shen, Shanqing Yu, Dan Zhang, Qi Xuan

The adversarial attack methods based on gradient information can adequately find the perturbations, that is, the combinations of rewired links, thereby reducing the effectiveness of the deep learning model based graph embedding algorithms, but it is also easy to fall into a local optimum.

Social and Information Networks

On-manifold Adversarial Data Augmentation Improves Uncertainty Calibration

no code implementations16 Dec 2019 Kanil Patel, William Beluch, Dan Zhang, Michael Pfeiffer, Bin Yang

Uncertainty estimates help to identify ambiguous, novel, or anomalous inputs, but the reliable quantification of uncertainty has proven to be challenging for modern deep networks.

Adversarial Attack Data Augmentation

Sato: Contextual Semantic Type Detection in Tables

1 code implementation14 Nov 2019 Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan

Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search.

Column Type Annotation Information Retrieval +3

Group Pruning using a Bounded-Lp norm for Group Gating and Regularization

no code implementations9 Aug 2019 Chaithanya Kumar Mummadi, Tim Genewein, Dan Zhang, Thomas Brox, Volker Fischer

We achieve state-of-the-art pruning results for ResNet-50 with higher accuracy on ImageNet.

PolSAR Image Classification based on Polarimetric Scattering Coding and Sparse Support Matrix Machine

no code implementations17 Jun 2019 Xu Liu, Licheng Jiao, Dan Zhang, Fang Liu

In this paper, a novel POLSAR image classification method is proposed based on polarimetric scattering coding and sparse support matrix machine.

Classification General Classification +1

Unsupervised Euclidean Distance Attack on Network Embedding

no code implementations27 May 2019 Qi Xuan, Jun Zheng, Lihong Chen, Shanqing Yu, Jinyin Chen, Dan Zhang, Qingpeng Zhang Member

Since a large number of downstream network algorithms, such as community detection and node classification, rely on the Euclidean distance between nodes to evaluate the similarity between them in the embedding space, EDA can be considered as a universal attack on a variety of network algorithms.

Social and Information Networks Physics and Society

E-LSTM-D: A Deep Learning Framework for Dynamic Network Link Prediction

1 code implementation22 Feb 2019 Jinyin Chen, Jian Zhang, Xuanheng Xu, Chengbo Fu, Dan Zhang, Qingpeng Zhang, Qi Xuan

Predicting the potential relations between nodes in networks, known as link prediction, has long been a challenge in network science.

Link Prediction Time Series Analysis

Progressive Augmentation of GANs

1 code implementation NeurIPS 2019 Dan Zhang, Anna Khoreva

Training of Generative Adversarial Networks (GANs) is notoriously fragile, requiring to maintain a careful balance between the generator and the discriminator in order to perform well.

Image Generation

PA-GAN: Improving GAN Training by Progressive Augmentation

no code implementations27 Sep 2018 Dan Zhang, Anna Khoreva

Despite recent progress, Generative Adversarial Networks (GANs) still suffer from training instability, requiring careful consideration of architecture design choices and hyper-parameter tuning.

Image Generation

Polarimetric Convolutional Network for PolSAR Image Classification

1 code implementation9 Jul 2018 Xu Liu, Licheng Jiao, Xu Tang, Qigong Sun, Dan Zhang

Based on sparse scattering coding and convolution neural network, the polarimetric convolutional network is proposed to classify PolSAR images by making full use of polarimetric information.

Classification General Classification +1

Partial Labeled Gastric Tumor Segmentation via patch-based Reiterative Learning

no code implementations20 Dec 2017 Yang Nan, Gianmarc Coppola, Qiaokang Liang, Kunglin Zou, Wei Sun, Dan Zhang, Yaonan Wang, Guanzhen Yu

Gastric cancer is the second leading cause of cancer-related deaths worldwide, and the major hurdle in biomedical image analysis is the determination of the cancer extent.

Image Segmentation Tumor Segmentation

Principled Evaluation of Differentially Private Algorithms using DPBench

1 code implementation15 Dec 2015 Michael Hay, Ashwin Machanavajjhala, Gerome Miklau, Yan Chen, Dan Zhang

Differential privacy has become the dominant standard in the research community for strong privacy protection.

Databases Cryptography and Security

Cannot find the paper you are looking for? You can Submit a new open access paper.