TY - GEN
T1 - Software-improved hardware lock elision
AU - Afek, Yehuda
AU - Levy, Amir
AU - Morrison, Adam
PY - 2014
Y1 - 2014
N2 - With hardware transactional memory (HTM) becoming available in mainstream processors, lock-based critical sections may now initiate a hardware transaction instead of taking the lock, enabling their concurrent execution unless a real data conflict occurs. However, just a few transactional aborts can cause the lock to be acquired non-transactionally resulting in the serialization of all the threads, severely degrading the amount of speedup obtained. In this paper we provide two software extension mechanisms that considerably improve the concurrency and speedup levels attained by lock based programs using HTM-based lock elision. The first sacrifices opacity to achieve higher levels of concurrency, and the second retains opacity while reaching slightly lower levels of concurrency. Evaluation on STAMP and on data structure benchmarks on an Intel Haswell processor shows that these techniques improve the speedup by up to 3.5 times and 10 times respectively, compared to using Haswell's hardware lock elision as is.
AB - With hardware transactional memory (HTM) becoming available in mainstream processors, lock-based critical sections may now initiate a hardware transaction instead of taking the lock, enabling their concurrent execution unless a real data conflict occurs. However, just a few transactional aborts can cause the lock to be acquired non-transactionally resulting in the serialization of all the threads, severely degrading the amount of speedup obtained. In this paper we provide two software extension mechanisms that considerably improve the concurrency and speedup levels attained by lock based programs using HTM-based lock elision. The first sacrifices opacity to achieve higher levels of concurrency, and the second retains opacity while reaching slightly lower levels of concurrency. Evaluation on STAMP and on data structure benchmarks on an Intel Haswell processor shows that these techniques improve the speedup by up to 3.5 times and 10 times respectively, compared to using Haswell's hardware lock elision as is.
KW - Lock elision
KW - Lock removal
UR - http://www.scopus.com/inward/record.url?scp=84905454462&partnerID=8YFLogxK
U2 - 10.1145/2611462.2611482
DO - 10.1145/2611462.2611482
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84905454462
SN - 9781450329446
T3 - Proceedings of the Annual ACM Symposium on Principles of Distributed Computing
SP - 212
EP - 221
BT - PODC 2014 - Proceedings of the 2014 ACM Symposium on Principles of Distributed Computing
PB - Association for Computing Machinery
T2 - 2014 ACM Symposium on Principles of Distributed Computing, PODC 2014
Y2 - 15 July 2014 through 18 July 2014
ER -