TY - JOUR
T1 - Channel-level acceleration of deep face representations
AU - Polyak, Adam
AU - Wolf, Lior
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/10/26
Y1 - 2015/10/26
N2 - A major challenge in biometrics is performing the test at the client side, where hardware resources are often limited. Deep learning approaches pose a unique challenge: while such architectures dominate the field of face recognition with regard to accuracy, they require elaborate, multi-stage computations. Recently, there has been some work on compressing networks for the purpose of reducing run time and network size. However, it is not clear that these compression methods would work in deep face nets, which are, generally speaking, less redundant than the object recognition networks, i.e., they are already relatively lean. We propose two novel methods for compression: one based on eliminating lowly active channels and the other on coupling pruning with repeated use of already computed elements. Pruning of entire channels is an appealing idea, since it leads to direct saving in run time in almost every reasonable architecture.
AB - A major challenge in biometrics is performing the test at the client side, where hardware resources are often limited. Deep learning approaches pose a unique challenge: while such architectures dominate the field of face recognition with regard to accuracy, they require elaborate, multi-stage computations. Recently, there has been some work on compressing networks for the purpose of reducing run time and network size. However, it is not clear that these compression methods would work in deep face nets, which are, generally speaking, less redundant than the object recognition networks, i.e., they are already relatively lean. We propose two novel methods for compression: one based on eliminating lowly active channels and the other on coupling pruning with repeated use of already computed elements. Pruning of entire channels is an appealing idea, since it leads to direct saving in run time in almost every reasonable architecture.
KW - Face recognition
KW - Neural network compression
UR - http://www.scopus.com/inward/record.url?scp=84961358915&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2015.2494536
DO - 10.1109/ACCESS.2015.2494536
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:84961358915
SN - 2169-3536
VL - 3
SP - 2163
EP - 2175
JO - IEEE Access
JF - IEEE Access
M1 - 7303876
ER -