Search Results for author: Jerry Yao-Chieh Hu

Found 9 papers, 7 papers with code

Nonparametric Modern Hopfield Models

1 code implementation5 Apr 2024 Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu, Feng Ruan, Han Liu

We present a nonparametric construction for deep learning compatible modern Hopfield models and utilize this framework to debut an efficient variant.

BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model

1 code implementation4 Apr 2024 Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu

We introduce the \textbf{B}i-Directional \textbf{S}parse \textbf{Hop}field Network (\textbf{BiSHop}), a novel end-to-end framework for deep tabular learning.

Representation Learning

Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

1 code implementation4 Apr 2024 Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu

Interestingly, this memory model manifests a model-based interpretation of an outlier-efficient attention mechanism ($\text{Softmax}_1$): it is an approximation of the memory retrieval process of $\mathtt{OutEffHop}$.

Benchmarking Quantization +1

Uniform Memory Retrieval with Larger Capacity for Modern Hopfield Models

1 code implementation4 Apr 2024 Dennis Wu, Jerry Yao-Chieh Hu, Teng-Yun Hsiao, Han Liu

Specifically, we accomplish this by constructing a separation loss $\mathcal{L}_\Phi$ that separates the local minima of kernelized energy by separating stored memory patterns in kernel space.

Retrieval

On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis

no code implementations7 Feb 2024 Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song, Han Liu

Specifically, we establish an upper bound criterion for the norm of input query patterns and memory patterns.

Retrieval

STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction

1 code implementation28 Dec 2023 Dennis Wu, Jerry Yao-Chieh Hu, Weijian Li, Bo-Yu Chen, Han Liu

We present STanHop-Net (Sparse Tandem Hopfield Network) for multivariate time series prediction with memory-enhanced capabilities.

Retrieval Time Series +1

On Sparse Modern Hopfield Model

1 code implementation NeurIPS 2023 Jerry Yao-Chieh Hu, Donglin Yang, Dennis Wu, Chenwei Xu, Bo-Yu Chen, Han Liu

Building upon this, we derive the sparse memory retrieval dynamics from the sparse energy function and show its one-step approximation is equivalent to the sparse-structured attention.

Retrieval

Feature Programming for Multivariate Time Series Prediction

1 code implementation9 Jun 2023 Alex Reneau, Jerry Yao-Chieh Hu, Chenwei Xu, Weijian Li, Ammar Gilani, Han Liu

We introduce the concept of programmable feature engineering for time series modeling and propose a feature programming framework.

Automated Feature Engineering Feature Engineering +3

Cannot find the paper you are looking for? You can Submit a new open access paper.