Understanding BatchNorm in Ternary Training
Abstract
Neural networks are composed of two components: weights and
activation functions. Ternary weight neural networks (TNNs) achieve
good performance and offer up to a 16x compression ratio. TNNs
are difficult to train without BatchNorm, yet no prior study has
clarified the role of BatchNorm in a ternary network. Building
on a study of binary networks, we show how BatchNorm helps
resolve the exploding-gradients issue.
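To make the compression claim concrete, below is a minimal NumPy sketch of one common threshold-based ternarization scheme; the 0.7 * mean(|W|) threshold is an assumption for illustration and is not necessarily the scheme studied in this paper. Storing each ternary weight in 2 bits instead of a 32-bit float yields the up-to-16x ratio.

    # Minimal sketch of threshold-based weight ternarization.
    # The 0.7 * mean(|W|) threshold is an assumed heuristic,
    # not necessarily the scheme used in this paper.
    import numpy as np

    def ternarize(w: np.ndarray) -> np.ndarray:
        """Map full-precision weights to {-1, 0, +1} by thresholding."""
        delta = 0.7 * np.mean(np.abs(w))  # assumed threshold heuristic
        t = np.zeros_like(w)
        t[w > delta] = 1.0
        t[w < -delta] = -1.0
        return t

    w = np.random.randn(4, 4).astype(np.float32)
    print(ternarize(w))
    # Each ternary weight needs only 2 bits versus 32 bits for
    # float32, which is where the up-to-16x compression comes from.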