no code implementations • 21 Sep 2021 • Yuwen Li, Zhengqi Liu, Ludmil Zikatanov
The training process of neural networks usually optimize weights and bias parameters of linear transformations, while nonlinear activation functions are pre-specified and fixed.