Search Results for author: Xin Zhu

Found 18 papers, 7 papers with code

The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Massive Random Access

no code implementations • 8 Mar 2024 • Xin Zhu, Ahmet Enis Cetin

Furthermore, with the assistance of the block MHT layer, the proposed blind normalized SVGD algorithm achieves a higher preamble detection accuracy and throughput than other state-of-the-art detection methods.

Denoising

A Probabilistic Hadamard U-Net for MRI Bias Field Correction

no code implementations • 8 Mar 2024 • Xin Zhu, Hongyi Pan, Yury Velichko, Adam B. Murphy, Ashley Ross, Baris Turkbey, Ahmet Enis Cetin, Ulas Bagci

Random samples drawn from latent space are then incorporated with a prototypical corrected image to generate multiple plausible images.

MRI segmentation

Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style

no code implementations • 20 Dec 2023 • Haohan Wang, Wei Feng, Yang Lu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Lixing Bo, Jingping Shao

Furthermore, for products with specific and fine-grained requirements in layout, elements, etc., a Personality-Wise Generator is devised to learn such personalized style directly from a reference image to resolve textual ambiguities, and is trained in a self-supervised manner for more efficient training data usage.

Planning and Rendering: Towards End-to-End Product Poster Generation

no code implementations • 14 Dec 2023 • Zhaochen Li, Fengheng Li, Wei Feng, Honghe Zhu, An Liu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao, Zhenglu Yang

At the planning stage, we propose a PlanNet to generate the layout of the product and other visual components considering both the appearance features of the product and semantic features of the text, which improves the diversity and rationality of the layouts.

Image Inpainting

A novel asymmetrical autoencoder with a sparsifying discrete cosine Stockwell transform layer for gearbox sensor data compression

no code implementations • 4 Oct 2023 • Xin Zhu, Daoguang Yang, Hongyi Pan, Hamid Reza Karimi, Didem Ozevin, Ahmet Enis Cetin

In comparison to the linear layer, the DCST layer reduces the number of trainable parameters and improves the accuracy of data reconstruction.

Data Compression

Domain Generalization with Fourier Transform and Soft Thresholding

1 code implementation • 18 Sep 2023 • Hongyi Pan, Bin Wang, Zheyuan Zhang, Xin Zhu, Debesh Jha, Ahmet Enis Cetin, Concetto Spampinato, Ulas Bagci

However, it neglects background interference in the amplitude spectrum.

Domain Generalization, Image Augmentation +2

Stein Variational Gradient Descent-based Detection For Random Access With Preambles In MTC

no code implementations • 15 Sep 2023 • Xin Zhu, Hongyi Pan, Salih Atici, Ahmet Enis Cetin

Traditional preamble detection algorithms have low accuracy in the grant-based random access scheme in massive machine-type communication (mMTC).

Electroencephalogram Sensor Data Compression Using An Asymmetrical Sparse Autoencoder With A Discrete Cosine Transform Layer

no code implementations • 15 Sep 2023 • Xin Zhu, Hongyi Pan, Shuaiang Rong, Ahmet Enis Cetin

The latent space data is transmitted to the receiver.

Data Compression, EEG

Mutual Query Network for Multi-Modal Product Image Segmentation

1 code implementation • 26 Jun 2023 • Yun Guo, Wei Feng, Zheng Zhang, Xiancong Ren, Yaoyu Li, Jingjing Lv, Xin Zhu, Zhangang Lin, Jingping Shao

Product image segmentation is vital in e-commerce.

Image Segmentation, Segmentation +1

Relation-Aware Diffusion Model for Controllable Poster Layout Generation

1 code implementation • 15 Jun 2023 • Fengheng Li, An Liu, Wei Feng, Honghe Zhu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao

To advance research in this field, we have constructed a poster layout dataset named CGL-Dataset V2.

Relation

A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer

1 code implementation • 27 May 2023 • Hongyi Pan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

In this paper, we propose a novel Hadamard Transform (HT)-based neural network layer for hybrid quantum-classical computing.

Multichannel Orthogonal Transform-Based Perceptron Layers for Efficient ResNets

no code implementations • 13 Mar 2023 • Hongyi Pan, Emadeldeen Hamdan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

Trainable soft-thresholding layers, which remove noise in the transform domain, bring nonlinearity to the transform domain layers.

CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection

1 code implementation • 5 Dec 2022 • Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin, Jinghe Hu, Jingping Shao

Recently, segmentation-based methods have become quite popular in scene text detection; they mainly consist of two steps: text kernel segmentation and expansion.

Scene Text Detection, Segmentation +1

Real-time Wireless ECG-derived Respiration Rate Estimation Using an Autoencoder with a DCT Layer

1 code implementation • 15 Nov 2022 • Hongyi Pan, Xin Zhu, Zhilu Ye, Pai-Yen Chen, Ahmet Enis Cetin

To improve the estimation precision, we propose a neural network that uses a novel Discrete Cosine Transform (DCT) layer to denoise and decorrelate the data.

DCT Perceptron Layer: A Transform Domain Approach for Convolution Layer

no code implementations • 15 Nov 2022 • Hongyi Pan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

In this paper, we propose a novel Discrete Cosine Transform (DCT)-based neural network layer, which we call DCT-perceptron, to replace the $3\times3$ Conv2D layers in the Residual Neural Network (ResNet).

Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition

no code implementations • 22 Apr 2019 • Guanbin Li, Xin Zhu, Yirui Zeng, Qing Wang, Liang Lin

Specifically, by analyzing the symbiosis and mutual exclusion of AUs in various facial expressions, we organize the facial AUs in the form of a structured knowledge graph and integrate a Gated Graph Neural Network (GGNN) into a multi-scale CNN framework to propagate node information through the graph and generate enhanced AU representations.

Facial Action Unit Detection, Representation Learning