
15 11 2019

9:1 9:18


<book-part-wrapper xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="2.0" xml:lang="en" content-type="research-article">

<collection-meta collection-type="book-series">

<collection-id collection-id-type="doi">10.1145/acmotherconferences</collection-id>

<title-group>

<title>ACM Other Conferences</title>

</title-group>

</collection-meta>

<book-meta>

<book-id book-id-type="acm-id">0000000</book-id>

<book-id book-id-type="doi">10.5555/0000000</book-id>

<book-title-group>

<book-title>Proceedings of the 19th Symposium on Algorithmic Approaches for Transportation Modelling, Optimization, and Systems (ATMOS 2019)</book-title>

<alt-title alt-title-type="acronym">ATMOS 2019</alt-title>

</book-title-group>

</book-meta>

<book-part book-part-type="chapter" xml:lang="en">

<book-part-meta>

<book-part-id book-part-id-type="doi">10.4230/OASIcs.ATMOS.2019.9</book-part-id>

<book-part-id book-part-id-type="article-no">9</book-part-id>

<subj-group subj-group-type="ccs2012">

<compound-subject>

<compound-subject-part content-type="code">10003752.10003809.10010170.10010171</compound-subject-part>

<compound-subject-part content-type="text">Theory of computation~Shared memory algorithms</compound-subject-part>

<compound-subject-part content-type="weight">500</compound-subject-part>

</compound-subject>

</subj-group>

<title-group>

<title>Exploiting Amorphous Data Parallelism to Speed-Up Massive Time-Dependent Shortest-Path Computations</title>

</title-group>

<contrib-group>

<contrib>

<name>

<surname>Kontogiannis</surname>

<given-names>Spyros</given-names>

</name>

<aff>University of Ioannina, Greece</aff>

<email>kontog@uoi.gr</email>

<role>Author</role>

</contrib>

<contrib>

<name>

<surname>Papadopoulos</surname>

<given-names>Anastasios</given-names>

</name>

<aff>University of Patras, Greece</aff>

<email>anpapad@ceid.upatras.gr</email>

<role>Author</role>

</contrib>

<contrib>

<name>

<surname>Paraskevopoulos</surname>

<given-names>Andreas</given-names>

</name>

<aff>University of Patras, Greece</aff>

<email>paraskevop@ceid.upatras.gr</email>

<role>Author</role>

</contrib>

<contrib>

<name>

<surname>Zaroliagis</surname>

<given-names>Christos</given-names>

</name>

<aff>University of Patras, Greece</aff>

<email>zaro@ceid.upatras.gr</email>

<role>Author</role>

</contrib>

</contrib-group>

<pub-date date-type="publication">

<day>15</day>

<month>11</month>

<year>2019</year>

</pub-date>

<abstract>

<p>We aim to exploit parallelism in shared-memory multiprocessing systems to reduce execution time, with as little redundant work as possible, for an elementary task that arises frequently as a subroutine in the daily maintenance of large-scale time-dependent graphs representing real-world relationships or technological networks: the many-to-all time-dependent shortest paths (MATDSP) problem. Given an arbitrary collection of (origin, departure-time) pairs, MATDSP requires the computation of one time-dependent shortest-path tree (TDSPT) per pair, towards all reachable destinations in the graph.</p>

<p>Our goal is to explore the potential and highlight the limitations of amorphous data parallelism when dealing with MATDSP in multicore computing environments with a given number of processing elements and a shared memory to exploit. Apart from execution time, the consumption of resources (and energy) is also critical. Therefore, we aim to limit the work overhead for solving a MATDSP instance, as measured by the overall number of arc relaxations in shortest-path computations, while trying to minimize the overall execution time. To this end, we provide several algorithm-engineering interventions for solving MATDSP, concerning: (i) the compact representation of the instance; (ii) the choice and improvement of the time-dependent single-source shortest-path algorithm that is used as a subroutine; (iii) how the overall work is allocated to the processing elements; and (iv) the adoption of the amorphous data parallelism rationale, in order to avoid costly synchronization among the processing elements while each does its own part of the work.</p>

<p>Our experimental evaluations, on both real-world and synthetic benchmark instances of time-dependent road networks, provide insight into how one should organize heavy MATDSP computations, depending on the application scenario. This insight is in some cases rather unexpected. For instance, pure data parallelism (among otherwise totally independent processors) is not always the best choice for minimizing execution times. In certain cases it may be worthwhile to limit the level of data parallelism in favor of algorithmic parallelism, in order to achieve more efficient MATDSP computations.</p>

</abstract>

<kwd-group>

<kwd>amorphous data parallelism</kwd>

<kwd>delta-stepping algorithm</kwd>

<kwd>travel-time oracle</kwd>

<kwd>many-to-all shortest paths</kwd>

<kwd>time-dependent road networks</kwd>

</kwd-group>

</book-part-meta>

<back>

<ref-list specific-use="unparsed">

<ref>

<mixed-citation>R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows: Theory, Algorithms, and Applications. Prentice Hall, Englewood Cliffs, 1993.</mixed-citation>

</ref>

<ref>

<mixed-citation>M. Amber-Hassaan, M. Burtscher, and K. Pingali. Ordered vs. unordered: A comparison of parallelism and work-efficiency in irregular algorithms. ACM SIGPLAN Notices, 46:3-12, 2011.</mixed-citation>

</ref>

<ref>

<mixed-citation>G. V. Batz, R. Geisberger, P. Sanders, and C. Vetter. Minimum Time-Dependent Travel Times with Contraction Hierarchies. ACM Journal of Experimental Algorithmics, 18(1):1-43, 2013. URL: https://github.com/GVeitBatz/KaTCH.</mixed-citation>

</ref>

<ref>

<mixed-citation>R. Bellman. On a Routing Problem. Quarterly of Applied Mathematics, 16, 1958.</mixed-citation>

</ref>

<ref>

<mixed-citation>V. T. Chakaravarthy, F. Checconi, P. Murali, F. Petrini, and Y. Sabharwal. Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems. IEEE Transactions on Parallel and Distributed Systems, 28:2031-2045, 2017.</mixed-citation>

</ref>

<ref>

<mixed-citation>D. Delling, A. V. Goldberg, A. Nowatzyk, and R. F. Werneck. PHAST: Hardware-accelerated shortest path trees. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pages 921-931, 2011.</mixed-citation>

</ref>

<ref>

<mixed-citation>E. W. Dijkstra. A note on two problems in connexion with graphs. Numerische Mathematik, 1(1):269-271, 1959.</mixed-citation>

</ref>

<ref>

<mixed-citation>S. E. Dreyfus. An appraisal of some shortest-path algorithms. Operations Research, 17(3):395-412, 1969.</mixed-citation>

</ref>

<ref>

<mixed-citation>M. L. Fredman, R. Sedgewick, D. D. Sleator, and R. E. Tarjan. The pairing heap: A new form of self-adjusting heap. Algorithmica, 1:111-119, 1986.</mixed-citation>

</ref>

<ref>

<mixed-citation>L. R. Ford Jr. Network flow theory. Technical report, RAND Corporation, Santa Monica, CA, 1956.</mixed-citation>

</ref>

<ref>

<mixed-citation>S. Kontogiannis, G. Papastavrou, A. Paraskevopoulos, D. Wagner, and C. Zaroliagis. Improved Oracles for Time-Dependent Road Networks. Algorithmic Approaches for Transportation Modelling, Optimization, and Systems (ATMOS), 2017.</mixed-citation>

</ref>

<ref>

<mixed-citation>D. Larkin, S. Sen, and R. E. Tarjan. A Back-to-basics Empirical Study of Priority Queues. Algorithm Engineering and Experiments (ALENEX), pages 61-72, 2014.</mixed-citation>

</ref>

<ref>

<mixed-citation>G. Mali, P. Michail, A. Paraskevopoulos, and C. Zaroliagis. A new dynamic graph structure for large-scale transportation networks. Conference on Algorithms and Complexity (CIAC), pages 312-323, 2013. LNCS 7878.</mixed-citation>

</ref>

<ref>

<mixed-citation>U. Meyer and P. Sanders. Δ-stepping: a parallelizable shortest path algorithm. Journal of Algorithms, 49(1):114-152, 2003.</mixed-citation>

</ref>

<ref>

<mixed-citation>G. Nannicini, D. Delling, D. Schultes, and L. Liberti. Bidirectional A* search on time-dependent road networks. Networks, 59:240-251, 2012.</mixed-citation>

</ref>

<ref>

<mixed-citation>D. Nguyen, A. Lenharth, and K. Pingali. A lightweight infrastructure for graph analytics. ACM Symposium on Operating Systems Principles (SOSP), pages 456-471, 2013.</mixed-citation>

</ref>

<ref>

<mixed-citation>D. Nguyen, A. Lenharth, and K. Pingali. Deterministic Galois: On-demand, Portable and Parameterless. International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 499-512, 2014.</mixed-citation>

</ref>

<ref>

<mixed-citation>A. Orda and R. Rom. Shortest-path and minimum-delay algorithms in networks with time-dependent edge-length. Journal of the ACM, 37(3):607-625, 1990.</mixed-citation>

</ref>

<ref>

<mixed-citation>K. Pingali, D. Nguyen, M. Kulkarni, M. Burtscher, M. Amber-Hassaan, R. Kaleem, T.-H. Lee, A. Lenharth, R. Manevich, M. Méndez-Lojo, D. Prountzos, and X. Sui. The tao of parallelism in algorithms. ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), pages 12-25, 2011.</mixed-citation>

</ref>

<ref>

<mixed-citation>P. Sanders. Fast Priority Queues for Cached Memory. Journal of Experimental Algorithmics, 5(7), 2000. ACM, New York, NY, USA.</mixed-citation>

</ref>

<ref>

<mixed-citation>P. Sanders, D. Schultes, and C. Vetter. Mobile route planning. European Symposium on Algorithms (ESA), pages 732-743, 2008.</mixed-citation>

</ref>

<ref>

<mixed-citation>E. Strubell, A. Ganesh, and A. McCallum. Energy and Policy Considerations for Deep Learning in NLP. Annual Meeting of the Association for Computational Linguistics (ACL), 2019. URL: http://arxiv.org/abs/1906.02243.</mixed-citation>

</ref>

</ref-list>

</back>

</book-part>

</book-part-wrapper>