TY - GEN
T1 - NATURAL STATISTICS OF NETWORK ACTIVATIONS AND IMPLICATIONS FOR KNOWLEDGE DISTILLATION
AU - Rotman, Michael
AU - Wolf, Lior
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - In a matter that is analogous to the study of natural image statistics, we study the natural statistics of the deep neural network activations at various layers. As we show, these statistics, similar to image statistics, follow a power law. We also show, both analytically and empirically, that with depth the exponent of this power law increases at a linear rate. As a direct implication of our discoveries, we present a method for performing Knowledge Distillation (KD). While classical KD methods consider the logits of the teacher network, more recent methods obtain a leap in performance by considering the activation maps. This, however, uses metrics that are suitable for comparing images. We propose to employ two additional loss terms that are based on the spectral properties of the intermediate activation maps. The proposed method obtains state of the art results on multiple image recognition KD benchmarks.
AB - In a matter that is analogous to the study of natural image statistics, we study the natural statistics of the deep neural network activations at various layers. As we show, these statistics, similar to image statistics, follow a power law. We also show, both analytically and empirically, that with depth the exponent of this power law increases at a linear rate. As a direct implication of our discoveries, we present a method for performing Knowledge Distillation (KD). While classical KD methods consider the logits of the teacher network, more recent methods obtain a leap in performance by considering the activation maps. This, however, uses metrics that are suitable for comparing images. We propose to employ two additional loss terms that are based on the spectral properties of the intermediate activation maps. The proposed method obtains state of the art results on multiple image recognition KD benchmarks.
KW - Image statistics
KW - Knowledge distillation
UR - http://www.scopus.com/inward/record.url?scp=85125559589&partnerID=8YFLogxK
U2 - 10.1109/ICIP42928.2021.9506486
DO - 10.1109/ICIP42928.2021.9506486
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85125559589
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 399
EP - 403
BT - 2021 IEEE International Conference on Image Processing, ICIP 2021 - Proceedings
PB - IEEE Computer Society
T2 - 2021 IEEE International Conference on Image Processing, ICIP 2021
Y2 - 19 September 2021 through 22 September 2021
ER -