TY - GEN
T1 - Flat-combining NUMA locks
AU - Dice, Dave
AU - Marathe, Virendra J.
AU - Shavit, Niro
PY - 2011
Y1 - 2011
N2 - Multicore machines are growing in size, and accordingly shifting from simple bus-based designs to NUMA and CCNUMA architectures. With this shift, the need for scalable hierarchical locking algorithms is becoming crucial to performance. This paper presents a novel scalable hierarchical queue-lock algorithm based on the flat combining synchronization paradigm. At the core of the new algorithm is a scheme for building local queues of waiting threads in a highly efficient manner, and then merging them globally, all with little interconnect traffic and virtually no costly synchronization operations in the common case. In empirical testing on an Oracle SPARC Enterprise T5440 Server, a 256-way CC-NUMA machine, our new flat-combining hierarchical lock significantly outperforms all classic locking algorithms, and at high concurrency levels, provides up to a factor of two improvement over HCLH, the most efficient known hierarchical locking algorithm.
AB - Multicore machines are growing in size, and accordingly shifting from simple bus-based designs to NUMA and CCNUMA architectures. With this shift, the need for scalable hierarchical locking algorithms is becoming crucial to performance. This paper presents a novel scalable hierarchical queue-lock algorithm based on the flat combining synchronization paradigm. At the core of the new algorithm is a scheme for building local queues of waiting threads in a highly efficient manner, and then merging them globally, all with little interconnect traffic and virtually no costly synchronization operations in the common case. In empirical testing on an Oracle SPARC Enterprise T5440 Server, a 256-way CC-NUMA machine, our new flat-combining hierarchical lock significantly outperforms all classic locking algorithms, and at high concurrency levels, provides up to a factor of two improvement over HCLH, the most efficient known hierarchical locking algorithm.
KW - flat combining
KW - hierarchical locks
KW - queue locks
UR - http://www.scopus.com/inward/record.url?scp=79959640934&partnerID=8YFLogxK
U2 - 10.1145/1989493.1989502
DO - 10.1145/1989493.1989502
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:79959640934
SN - 9781450307437
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 65
EP - 74
BT - SPAA'11 - Proceedings of the 23rd Annual Symposium on Parallelism in Algorithms and Architectures
T2 - 23rd ACM Symposium on Parallelism in Algorithms and Architectures, SPAA'11
Y2 - 4 June 2011 through 6 June 2011
ER -