Search Results for author: Jerry Yao-Chieh Hu

Found 9 papers, 7 papers with code

Nonparametric Modern Hopfield Models

1 code implementation • 5 Apr 2024 • Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu, Feng Ruan, Han Liu

We present a nonparametric construction for deep learning compatible modern Hopfield models and utilize this framework to debut an efficient variant.

Paper
Code

BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model

1 code implementation • 4 Apr 2024 • Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu

We introduce the \textbf{B}i-Directional \textbf{S}parse \textbf{Hop}field Network (\textbf{BiSHop}), a novel end-to-end framework for deep tabular learning.

Representation Learning

Paper
Code

Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

1 code implementation • 4 Apr 2024 • Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu

Interestingly, this memory model manifests a model-based interpretation of an outlier-efficient attention mechanism ($\text{Softmax}_1$): it is an approximation of the memory retrieval process of $\mathtt{OutEffHop}$.

Ranked #1 on Quantization on Wiki-40B

Benchmarking Quantization +1

Paper
Code

Uniform Memory Retrieval with Larger Capacity for Modern Hopfield Models

1 code implementation • 4 Apr 2024 • Dennis Wu, Jerry Yao-Chieh Hu, Teng-Yun Hsiao, Han Liu

Specifically, we accomplish this by constructing a separation loss $\mathcal{L}_\Phi$ that separates the local minima of kernelized energy by separating stored memory patterns in kernel space.

Retrieval

Paper
Code

On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis

no code implementations • 7 Feb 2024 • Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song, Han Liu

Specifically, we establish an upper bound criterion for the norm of input query patterns and memory patterns.

Retrieval

Paper
Add Code

Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e

no code implementations • 28 Dec 2023 • Chenwei Xu, Jerry Yao-Chieh Hu, Aakaash Narayanan, Mattson Thieme, Vladimir Nagaslaev, Mark Austin, Jeremy Arnold, Jose Berlioz, Pierrick Hanlet, Aisha Ibrahim, Dennis Nicklaus, Jovan Mitrevski, Jason Michael St. John, Gauri Pradhan, Andrea Saewert, Kiyomi Seiya, Brian Schupbach, Randy Thurman-Keup, Nhan Tran, Rui Shi, Seda Ogrenci, Alexis Maya-Isabelle Shuping, Kyle Hazelwood, Han Liu

We introduce a novel Proximal Policy Optimization (PPO) algorithm aimed at addressing the challenge of maintaining a uniform proton beam intensity delivery in the Muon to Electron Conversion Experiment (Mu2e) at Fermi National Accelerator Laboratory (Fermilab).

Reinforcement Learning (RL)

Paper
Add Code

STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction

1 code implementation • 28 Dec 2023 • Dennis Wu, Jerry Yao-Chieh Hu, Weijian Li, Bo-Yu Chen, Han Liu

We present STanHop-Net (Sparse Tandem Hopfield Network) for multivariate time series prediction with memory-enhanced capabilities.

Retrieval Time Series +1

Paper
Code

On Sparse Modern Hopfield Model

1 code implementation • NeurIPS 2023 • Jerry Yao-Chieh Hu, Donglin Yang, Dennis Wu, Chenwei Xu, Bo-Yu Chen, Han Liu

Building upon this, we derive the sparse memory retrieval dynamics from the sparse energy function and show its one-step approximation is equivalent to the sparse-structured attention.

Retrieval

Paper
Code

Feature Programming for Multivariate Time Series Prediction

1 code implementation • 9 Jun 2023 • Alex Reneau, Jerry Yao-Chieh Hu, Chenwei Xu, Weijian Li, Ammar Gilani, Han Liu

We introduce the concept of programmable feature engineering for time series modeling and propose a feature programming framework.

Automated Feature Engineering Feature Engineering +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.