LL/SC and Atomic Copy: Constant Time, Space Efficient Implementations Using Only Pointer-Width CAS

Blelloch, Guy E.; Wei, Yuanhao

doi:10.4230/LIPIcs.DISC.2020.5

Abstract

When designing concurrent algorithms, Load-Link/Store-Conditional (LL/SC) is a very useful primitive since it avoids ABA problems. The full semantics of LL/SC are not supported in hardware by any modern architecture, so there has been a significant amount of work on simulations of LL/SC using CAS. However, all previous algorithms that are constant time either use unbounded sequence numbers (and thus base objects of unbounded size), or require Ω(MP) space to implement M LL/SC objects for P processes.
We present the first constant time implementation of LL/SC from bounded-sized CAS objects using only constant space overhead per LL/SC variable. In particular, our implementation uses Θ(M+kP²) space, where k is the number of outstanding LL operations per process, and only requires pointer-width CAS operations. In most algorithms that use LL/SC, k is a small constant which reduces our additive space overhead to Θ(P²). Our algorithm can also be extended to implement L word LL/SC objects in Θ(L) time for LL and SC, O(1) time for VL, and Θ((M+kP²)L) space.
To achieve these bounds, our main technical contribution is implementing a new primitive called Single-Writer Copy which takes a pointer to a word sized memory location and atomically copies its contents into another object. The restriction is that only one process is allowed to write/copy into the destination object at a time. The ability to read from one memory location and write to another atomically, and in constant-time, is very powerful and we believe this primitive will be useful in designing other algorithms.

Cite As Get BibTex

Guy E. Blelloch and Yuanhao Wei. LL/SC and Atomic Copy: Constant Time, Space Efficient Implementations Using Only Pointer-Width CAS. In 34th International Symposium on Distributed Computing (DISC 2020). Leibniz International Proceedings in Informatics (LIPIcs), Volume 179, pp. 5:1-5:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020) https://doi.org/10.4230/LIPIcs.DISC.2020.5

Author Details

Guy E. Blelloch

Carnegie Mellon University, Pittsburgh, PA, USA

Yuanhao Wei

Carnegie Mellon University, Pittsburgh, PA, USA

Funding

This work was supported in part by NSF grants CCF-1901381, CCF-1910030, and CCF-1919223. Yuanhao Wei was also supported by a NSERC PGS-D Scholarship.

Acknowledgements

We would like to thank our anonymous reviewers for their helpful comments.

References

Yehuda Afek, Dalia Dauber, and Dan Touitou. Wait-free made fast. In Symposium on Theory of Computing (STOC), pages 538-547, 1995.
Zahra Aghazadeh, Wojciech Golab, and Philipp Woelfel. Making objects writable. In ACM Symposium on Principles of Distributed Computing (PODC), pages 385-395, 2014.
Zahra Aghazadeh and Philipp Woelfel. Space-and time-efficient long-lived test-and-set objects. In International Conference on Principles of Distributed Systems (OPODIS), pages 404-419. Springer, 2014.
Zahra Aghazadeh and Philipp Woelfel. Upper bounds for boundless tagging with bounded objects. In International Symposium on Distributed Computing (DISC), pages 442-457, 2016.
James H Anderson and Mark Moir. Universal constructions for large objects. In International Workshop on Distributed Algorithms (WDAG), pages 168-182. Springer, 1995.
James H Anderson and Mark Moir. Universal constructions for multi-object operations. In ACM Symposium on Principles of Distributed Computing (PODC), pages 184-193, 1995.
Maya Arbel-Raviv and Trevor Brown. Reuse, don't recycle: Transforming lock-free algorithms that throw away descriptors. arXiv preprint arXiv:1708.01797, 2017.
Hagit Attiya and Jennifer Welch. Distributed computing: fundamentals, simulations, and advanced topics, volume 19. John Wiley & Sons, 2004.
Henry G Baker Jr. List processing in real time on a serial computer. Communications of the ACM, 21(4):280-294, 1978.
Greg Barnes. A method for implementing lock-free shared-data structures. In ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pages 261-270, 1993.
Guy E. Blelloch and Yuanhao Wei. LL/SC and atomic copy: Constant time, space efficient implementations using only pointer-width CAS, 2019. URL: http://arxiv.org/abs/1911.09671.
Guy E. Blelloch and Yuanhao Wei. Concurrent reference counting and resource management in wait-free constant time, 2020. URL: http://arxiv.org/abs/2002.07053.
Simon Doherty, Maurice Herlihy, Victor Luchangco, and Mark Moir. Bringing practical lock-free synchronization to 64-bit applications. In ACM Symposium on Principles of Distributed Computing (PODC), pages 31-39. ACM, 2004.
Faith Ellen and Philipp Woelfel. An optimal implementation of fetch-and-increment. In International Symposium on Distributed Computing (DISC), pages 284-298. Springer, 2013.
Maurice Herlihy. Wait-free synchronization. ACM Transactions on Programming Languages and Systems (TOPLAS), 13(1):124-149, 1991.
Maurice Herlihy. A methodology for implementing highly concurrent data objects. ACM Transactions on Programming Languages and Systems (TOPLAS), 15(5):745-770, 1993.
Maurice P Herlihy and Jeannette M Wing. Linearizability: A correctness condition for concurrent objects. ACM Transactions on Programming Languages and Systems, 12(3):463-492, 1990.
IBM. IBM System/370 Extended Architecture, Principles of Operation, publication no. sa22-7085. 1983.
Amos Israeli and Lihu Rappoport. Disjoint-access-parallel implementations of strong shared memory primitives. In ACM Symposium on Principles of Distributed Computing (PODC), pages 151-160. ACM, 1994.
Prasad Jayanti and Srdjan Petrovic. Efficient and practical constructions of LL/SC variables. In ACM Symposium on Principles of Distributed Computing (PODC), pages 285-294. ACM, 2003.
Prasad Jayanti and Srdjan Petrovic. Efficient wait-free implementation of multiword LL/SC variables. In IEEE International Conference on Distributed Computing Systems (ICSCS), pages 59-68. IEEE, 2005.
Prasad Jayanti and Srdjan Petrovic. Efficiently implementing a large number of LL/SC objects. In International Conference on Principles of Distributed Systems (OPODIS), pages 17-31. Springer, 2005.
Prasad Jayanti and Srdjan Petrovic. Efficiently implementing LL/SC objects shared by an unknown number of processes. In International Workshop on Distributed Computing (IWDC), pages 45-56. Springer, 2005.
Maged M Michael. ABA prevention using single-word instructions. IBM Research Division, RC23089 (W0401-136), Tech. Rep, 2004.
Maged M Michael. Hazard pointers: Safe memory reclamation for lock-free objects. IEEE Transactions on Parallel and Distributed Systems, 15(6):491-504, 2004.
Maged M Michael. Practical lock-free and wait-free LL/SC/VL implementations using 64-bit CAS. In International Symposium on Distributed Computing (DISC), pages 144-158. Springer, 2004.
Mark Moir. Practical implementations of non-blocking synchronization primitives. In ACM Symposium on Principles of Distributed Computing (PODC), pages 219-228, 1997.

LL/SC and Atomic Copy: Constant Time, Space Efficient Implementations Using Only Pointer-Width CAS

Authors Guy E. Blelloch , Yuanhao Wei

File

Document Identifiers

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

Acknowledgements

References

Thanks for your feedback!

Could not send message

LL/SC and Atomic Copy: Constant Time, Space Efficient Implementations Using Only Pointer-Width CAS

Authors Guy E. Blelloch , Yuanhao Wei

File

Document Identifiers

Related Versions

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

Funding

Acknowledgements

References

Thanks for your feedback!

Could not send message