Database Theory in Action: Cypher, GQL, and Regular Path Queries

Gheerbrant, Amélie; Libkin, Leonid; Peterfreund, Liat; Rogova, Alexandra

doi:10.4230/LIPIcs.ICDT.2025.36

Database Theory in Action:
Cypher, GQL, and Regular Path Queries

Amélie Gheerbrant

Université Paris Cité, CNRS, IRIF, France Leonid Libkin

RelationalAI and IRIF, CNRS, Paris, France
University of Edinburgh, UK Liat Peterfreund

The Hebrew University of Jerusalem, Israel Alexandra Rogova

Université Paris Cité, CNRS, IRIF, France

Abstract

Cypher has so far been the most commonly used query language for property graphs, and served as the foundation of the recently standardized graph query language GQL. In designing the features of GQL, the standards committee addressed the perceived limitations of Cypher. One such limitation is the inability of Cypher, as originally designed, to express all regular path queries (RPQs). Despite this claim having been stated many times as a folklore result, we could not find any proof of it. In this note we formalize the core of Cypher’s pattern matching and formally prove that indeed it falls short of all RPQs, justifying the inclusion of new pattern matching features in GQL.

Keywords and phrases:

Regular path queries, Cypher, GQL, inexpressibility

Copyright and License:

2012 ACM Subject Classification:

Information systems

\rightarrow

Graph-based database models

Funding:

We acknowledge support from: VeriGraph project, ANR-21-CE48-0015 (L. Libkin); Israel Science Foundation 2355/24 (L. Peterfreund); NCN grant 2018/30/E/ST6/00042 (A. Rogova).

DOI:

10.4230/LIPIcs.ICDT.2025.36

Event:

28th International Conference on Database Theory (ICDT 2025)

Editors:

Sudeepa Roy and Ahmet Kara

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

One of the key goals of the design of the new standard Graph Query Language (ISO/IEC 39075:2024 Information technology – Database languages – GQL, www.iso.org/standard/76120.html) was to overcome Cypher’s perceived limitation with respect to regular path queries (RPQs). One finds statements that Cypher falls short of the full power of RPQs in many sources and surveys, e.g. [5, 2, 1]. There is a strong intuition behind this statement: in Cypher, the use of Kleene star $*$ is limited to edge labels. Essentially, one can say that there is a path of edges labeled $\ell$ between two nodes $n$ and $n^{\prime}$ , but it seems that one cannot say that there is a path $n=n_{0}\cdots n_{1}\cdots n_{2}\cdots n_{k-1}\cdots n_{k}=n^{\prime}$ so that each part of this path between $n_{i}$ and $n_{i+1}$ for $0\leq i<k$ , satisfies a pattern $\pi$ more complex than a labeled edge.

This observation led to the substantial extension of GQL’s pattern matching, which also coincide with available pattern matching in SQL/PGQ, a newly standardized extension of SQL with mechanisms for property graph querying (see their informal description in [3]). Namely, a bounded (between $n$ and $m$ times for $n<m\in{\mathbb{N}}$ ) or unbounded (at least $n$ times) repetition can be applied to an arbitrary pattern $\pi$ (currently such patterns cannot themselves contain repetitions, but a language opportunity¹¹1 In SQL and GQL standards, this means the committee plans to revisit a specific feature. exists to remove this restriction).

This substantial extension of the language was based on a belief that arbitrary regular properties of paths cannot be expressed in Cypher. Our goal is to justify this decision by providing a proof of the widely believed – but hitherto unproven – statement.

2 Graph databases and Cypher patterns: an abstraction

Cypher operates on property graphs: these are labeled graphs, potentially with multiple edges between two nodes, where both edges and nodes carry properties given as key/value pairs. The latter play no role in comparison with RPQs which only refer to labels; thus we use a simple model of graph databases where properties are disregarded.

Graph databases.

Assume pairwise disjoint countable sets $\mathcal{N}$ of node ids, $\mathcal{E}$ of edge ids and $\mathcal{L}$ of labels. A graph database is a tuple $G=\langle N,E,\lambda,\sigma,\tau\rangle$ where

$\blacksquare$

$N\subset\mathcal{N}$ is a finite set of node ids used in $G$ ;
$\blacksquare$

$E\subset\mathcal{E}$ is a finite set of directed edge ids used in $G$ ;
$\blacksquare$

$\lambda:N\cup E\to{\mathcal{L}}$ is a labeling function that associates with every id a label from $\mathcal{L}$ ;
$\blacksquare$

$\sigma,\tau:E\to N$ define source and target of an edge.

In real Cypher, $\lambda$ can assign sets of labels to a node, and in GQL it can assign sets of labels to edges as well, but since the separating example does not depend on these features (which can also be modeled by considering $2^{\mathcal{L}}$ as the new set of labels), we do not use them here.

A path in $G$ is an alternating sequence $n_{0}e_{1}n_{1}e_{2}\cdots e_{k}n_{k}$ , for $k\geq 0$ , of nodes and edges that starts and ends with a node and so that each edge $e_{i}$ connects $n_{i-1}$ to $n_{i}$ for $i\leq k$ . That is, either $e_{i}$ is a forward edge with $\sigma(e_{i})=n_{i-1}$ and $\tau(e_{i})=n_{i}$ , or a backward edgewith $\sigma(e_{i})=n_{i}$ and $\tau(e_{i})=n_{i-1}$ . If $k=0$ , the path consists of a single node $n_{0}$ . We explicitly write out paths as $\langle\!\langle n_{0},e_{1},n_{1},\cdots,e_{k},n_{k}\rangle\!\rangle$ . Two paths $p=\langle\!\langle n_{0},e_{0},\ldots,n_{k}\rangle\!\rangle$ and $p^{\prime}=\langle\!\langle n^{\prime}_{0},e^{\prime}_{0},\ldots,n^{\prime}_{j% }\rangle\!\rangle$ concatenate, if $n_{k}=n^{\prime}_{0}$ , in which case their concatenation $p\cdot p^{\prime}$ is defined as $\langle\!\langle n_{0},e_{0},\ldots,n_{k},e^{\prime}_{0},\ldots,n^{\prime}_{j}% \rangle\!\rangle$ .

Cypher Patterns.

We refine an abstraction of patterns of GQL from [4], omitting the GQL specific features, and adding Cypher patterns matching repeated edges of a given label. Let $\mathcal{V}$ be a countably infinite set of variables. Patterns are defined by

$\begin{array}[]{rclllll}\pi&:=&(x:\ell)&\mid&(x)\overset{y:\ell}{% \longrightarrow}(z)&\mid&(x)\overset{y:\ell}{\longleftarrow}(z)\ \mid\ (x)% \overset{\ell^{*}}{\longrightarrow}(z)\ \mid\ (x)\overset{\ell^{*}}{% \longleftarrow}(z)\\ &\mid&\pi_{1}\,\pi_{2}&\mid&\pi_{1}+\pi_{2}&\mid&\pi\langle\theta\rangle,\ \ % \ \ \ \ \ \ \ \ \ x,y,z\in\mathcal{V}\end{array}$

Let $\mathcal{V}(\pi)$ be the set of all variables mentioned in $\pi$ . The syntactic conditions are:

$\blacksquare$

conditions in $\pi\langle\theta\rangle$ are given by $\theta,\theta^{\prime}\ :=\ (x=x^{\prime})\mid\theta\vee\theta^{\prime}\mid% \theta\wedge\theta^{\prime}\mid\neg\theta$ ; all variables mentioned in $\theta$ must occur in $\mathcal{V}(\pi)$ .
$\blacksquare$

$\pi_{1}+\pi_{2}$ is only defined when $\mathcal{V}(\pi_{1})=\mathcal{V}({\pi_{2}})$ .

In Cypher variables and labels in edge/node patterns are optional, but we include them for simplicity. This does not affect the separating example in which we assume one fixed label for all nodes/edges; the use of variables does not affect expressibility of RPQs. Also, Cypher’s repetitions of labeled edges of the form $n . . m$ (between $n$ and $m$ ) or $n . .$ (at least $n$ ), or $. . m$ (at most $m$ ) are all expressible with concatenation, union, and Kleene star.

Semantics of Patterns.

The semantics $\left\llbracket\pi\right\rrbracket$ of a path pattern $\pi$ , with respect to a graph database $G$ , is a set of pairs $(p,\mu)$ where $p$ is a path and $\mu$ is a mapping $\mathcal{V}({\pi})\rightarrow N\cup E$ as defined in Fig. 1. Two partial mappings $\mu,\mu^{\prime}:\mathcal{V}\to N\cup E$ are joinable if $\mu(x)=\mu^{\prime}(x)$ for each shared $x$ . Their join $\mu\bowtie\mu^{\prime}$ is then unambiguously defined as the mapping that coincides with $\mu(x)$ for $x$ in the domain of $\mu$ and $\mu^{\prime}(x)$ for $x$ in the domain of $\mu^{\prime}$ . In the figure, we omit the standard conditions for $\mu\models\theta$ for Boolean connectives.

$\begin{array}[]{rcl}\left\llbracket(x)\right\rrbracket&:=&\left\{(\langle\!% \langle n\rangle\!\rangle,\{x\mapsto n\})\mid n\in N\right\}\\ \left\llbracket\overset{x}{\rightarrow}\right\rrbracket&:=&\left\{(\langle\!% \langle n_{1},e,n_{2}\rangle\!\rangle,\{x\mapsto e\})\ |\ e\in E,\ \sigma(e)=n% _{1},\ \tau(e)=n_{2}\right\}\\ \left\llbracket\overset{x}{\leftarrow}\right\rrbracket&:=&\left\{(\langle\!% \langle n_{2},e,n_{1}\rangle\!\rangle,\{x\mapsto e\})\ |\ e\in E,\ \sigma{(e)}% =n_{1},\ \tau{(e)}=n_{2}\right\}\\ \left\llbracket(x)\overset{:\ell*}{\longrightarrow}(z)\right\rrbracket&:=&\{(% \langle\!\langle n_{1},e_{1},n_{2},\ldots,e_{k-1},n_{k}\rangle\!\rangle,\{x% \mapsto n_{1},z\mapsto n_{k}\})\ \mid\\ &&e_{1},\ldots,e_{k-1}\in E,\sigma(e_{i})=n_{i},\tau(e_{i})=n_{i+1},\lambda(e_% {i})=\ell\text{ for all }i<k\}\\ \left\llbracket(x)\overset{:\ell*}{\longleftarrow}(z)\right\rrbracket&:=&\{(% \langle\!\langle n_{1},e_{1},n_{2},\ldots,e_{k-1},n_{k}\rangle\!\rangle,\{x% \mapsto n_{k},z\mapsto n_{1}\})\ \mid\\ &&e_{1},\ldots,e_{k-1}\in E,\sigma(e_{i})=n_{i+1},\tau(e_{i})=n_{i},\lambda(e_% {i})=\ell\text{ for all }i<k\}\\ \left\llbracket\pi_{1}+\pi_{2}\right\rrbracket&:=&\left\llbracket\pi_{1}\right% \rrbracket\cup\left\llbracket\pi_{2}\right\rrbracket\\ \left\llbracket{\pi_{1}\,\pi_{2}}\right\rrbracket&:=&\{({p_{1}\cdot p_{2}},\mu% _{1}\bowtie\mu_{2})\ |\ (p_{1},\mu_{1})\in\left\llbracket{\pi_{1}}\right% \rrbracket,\ (p_{2},\mu_{2})\in\left\llbracket{\pi_{2}}\right\rrbracket,\\ &&\mu_{1},\mu_{2}\text{ are joinable and }p_{1},p_{2}\text{ concatenate}\}\\ \left\llbracket\pi\langle\theta\rangle\right\rrbracket&:=&\left\{(p,\mu)\in% \left\llbracket\pi\right\rrbracket\ \mid\ \mu\models\theta\right\}\ \ \text{% where }\mu\models x=y\text{ iff }\mu(x)=\mu(y)\end{array}$

Figure 1: Semantics of Cypher patterns with respect to

G=\langle N,E,\lambda,\sigma,\tau\rangle

.

3 Cypher Vs. RPQs

Recall that an RPQ is a regular expression $q$ over the labels $\mathcal{L}$ . The result of $q$ in $G$ , written $q(G)$ , is the set of pairs of nodes $(n_{0},n_{k})$ such that there is a path $\langle\!\langle n_{0},e_{0},n_{1},e_{1},\ldots,e_{k-1},n_{k}\rangle\!\rangle$ with the word $\lambda(e_{0})\lambda(e_{1})\cdots\lambda(e_{k-1})$ being in the regular language of $q$ .

A Cypher pattern $\pi$ with two designated variables $x_{s},x_{t}\in\mathcal{V}(\pi)$ is said to express an RPQ $q$ in $G$ if $q(G)=\{(\mu(x_{s}),\mu(x_{t}))\mid(p,\mu)\in\left\llbracket\pi\right\rrbracket\}$ .

Theorem 1.

Cypher patterns, as defined above, cannot express all RPQs. In particular they cannot express the pattern testing for an even-length path of edges labeled $\ell$ .

In other words, the regular path query ${(\ell\ell})^{*}$ cannot be expressed in Cypher.

Proof.

Consider graphs $G_{n}$ with $N=\{v_{1},\ldots,v_{n}\}$ and $E=\{e_{1},\ldots,e_{n-1}\}$ so that $\sigma(e_{i})=v_{i}$ and $\tau(e_{i})=v_{i+1}$ for $i<n$ (i.e., directed paths), with each edge labelled $\ell$ . We can therefore assume that all edge labels used in patterns are $\ell$ (if not, such a pattern is not matched, and thus the entire subpattern in which it occurred cannot be matched, up to $+$ in the parse tree, and thus can be removed). We can also further assume that no variable is used as both a node variable and an edge variable (as this would falsify the pattern), nor any explicit equality between such variables is used in conditional patterns.

We now represent such graphs $G_{n}$ as first-order structures $S_{n}$ in the vocabulary $R,R^{*}$ with the universe $N$ , and relations interpreted as follows:

$\blacksquare$

$R=\{(v_{i},v_{i+1})\ \mid\ 1\leq i<n\}$ is the edge relation;
$\blacksquare$

$R^{*}=\{(v_{i},v_{j})\ \mid\ 1\leq i\leq j\leq n\}$ is the reflexive transitive closure of $R$ .

We next show how patterns are translated into first-order formulae over this vocabulary. We use $R$ for convenience, as it is definable from $R^{*}$ which is isomorphic to a linear order on $\{1,\ldots,n\}$ . We will then easily obtain the inexpressibility results since FO cannot define even cardinality of linear orders.

For the translation, with each pattern $\pi$ we associate two new variables $x^{s}_{\pi},x^{t}_{\pi}$ (intuitively, to be witnessed by the endpoints of patterns), and with each edge variable $z$ used in a pattern we associate two first-order variables $z^{s},z^{t}$ to be used in FO formulas (for source and target of edges). Then a pattern $\pi$ with node variables $y_{1},\ldots,y_{m}$ and edge variables $z_{1},\ldots,z_{k}$ (all distinct) is translated into an FO formula $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y_{1},\ldots,y_{m},z_{1}^{s},z_{1}^{t},% \ldots,z_{k}^{s},z_{k}^{t})\,.$ The condition on the translation is that for a path $p=\langle\!\langle u_{0},f_{0},u_{1},\ldots,f_{r-1},u_{r}\rangle\!\rangle$ we have

\begin{array}[]{cl}&(p,\mu)\in\left\llbracket\pi\right\rrbracket_{G_{n}}\\ \Leftrightarrow&S_{n}\models\alpha_{\pi}\big{(}u_{0},u_{r},\mu(y_{1}),\ldots,% \mu(y_{m}),\sigma(\mu(z_{1})),\tau(\mu(z_{1})),\ldots,\sigma(\mu(z_{k})),\tau(% \mu(z_{k}))\big{)}\end{array}

(1)

Now suppose the pattern $(\ell\ell)^{*}$ is definable in Cypher over graphs $G_{n}$ by a pattern $\pi$ as above. Then $\beta(x^{s}_{\pi},x^{t}_{\pi}):=\exists y_{1},\ldots,y_{m},z_{1}^{s},\ldots,z_% {k}^{t}\ \alpha_{\pi}$ is true for $v_{i},v_{j}$ iff the path between them is of even length and therefore the sentence $\gamma:=\exists s,t\ \big{(}\neg\exists s^{\prime}\ R(s^{\prime},s)\wedge\neg% \exists t^{\prime}\ R(t,t^{\prime})\wedge\beta(s,t)\big{)}$ states the path from $v_{1}$ to $v_{n}$ is of even length, which is impossible.

Next, to conclude the proof, we present the translation, recursively.

$\blacksquare$

If $\pi=(y)$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y):=x^{s}_{\pi}=x^{t}_{\pi}\wedge x^{t}_{% \pi}=y$ .
$\blacksquare$

If $\pi=(y_{1})\overset{z:\ell}{\rightarrow}(y_{2})$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y_{1},y_{2},z^{s},z^{t}):=x^{s}_{\pi}=y_{% 1}\wedge x^{t}_{\pi}=y_{2}\wedge z^{s}=y_{1}\wedge z^{t}=y_{2}\wedge R(y_{1},y% _{2})$ .
$\blacksquare$

If $\pi=(y_{1})\overset{z:\ell}{\leftarrow}(y_{2})$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y_{1},y_{2},z^{s},z^{t}):=x^{s}_{\pi}=y_{% 2}\wedge x^{t}_{\pi}=y_{1}\wedge z^{s}=y_{2}\wedge z^{t}=y_{1}\wedge R(y_{2},y% _{1})$ .
$\blacksquare$

If $\pi=(y_{1})\overset{:\ell^{*}}{\rightarrow}(y_{2})$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y_{1},y_{2}):=x^{s}_{\pi}=y_{1}\wedge x^{% t}_{\pi}=y_{2}\wedge R^{*}(y_{1},y_{2})$
$\blacksquare$

if $\pi=(y_{1})\overset{:\ell^{*}}{\leftarrow}(y_{2})$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},y_{1},y_{2}):=x^{s}_{\pi}=y_{2}\wedge x^{% t}_{\pi}=y_{1}\wedge R^{*}(y_{2},y_{1})$ .
$\blacksquare$

If $\pi=\pi_{1}\pi_{2}$ with $\pi_{1},\pi_{2}$ translated as $\alpha_{\pi_{1}}(x^{s}_{\pi_{1}},x^{t}_{\pi_{1}},\overline{v}_{1})$ and $\alpha_{\pi_{2}}(x^{s}_{\pi_{2}},x^{t}_{\pi_{2}},\overline{v}_{2})$ respectively (where $\overline{v}_{i}$ list variables in those formulae corresponding to node and edge variables in patterns), then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},\overline{v}_{1},\overline{v}_{2})$ is defined as

$\exists x^{s}_{\pi_{1}},x^{t}_{\pi_{1}},x^{s}_{\pi_{2}},x^{t}_{\pi_{2}}\ \Big{% (}\alpha_{\pi_{1}}(x^{s}_{\pi_{1}},x^{t}_{\pi_{1}},\overline{v}_{1})\wedge% \alpha_{\pi_{2}}(x^{s}_{\pi_{2}},x^{t}_{\pi_{2}},\overline{v}_{2})\wedge x^{s}% _{\pi}=x^{s}_{\pi_{1}}\wedge x^{t}_{\pi}=x^{t}_{\pi_{2}}\wedge x^{t}_{\pi_{1}}% =x^{s}_{\pi_{2}}\Big{)}$

where in $\overline{v}_{1},\overline{v}_{2}$ repeated variables are mentioned only once.
$\blacksquare$

If $\pi=\pi_{1}+\pi_{2}$ with $\pi_{1},\pi_{2}$ translated as $\alpha_{\pi_{1}}(x^{s}_{\pi_{1}},x^{t}_{\pi_{1}},\overline{v})$ and $\alpha_{\pi_{2}}(x^{s}_{\pi_{2}},x^{t}_{\pi_{2}},\overline{v})$ (note that variables must be the same as the schemas of $\pi_{1}$ and $\pi_{2}$ coincide), then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},\overline{v}):=\alpha_{\pi_{1}}(x^{s}_{% \pi},x^{t}_{\pi},\overline{v})\vee\alpha_{\pi_{2}}(x^{s}_{\pi},x^{t}_{\pi},% \overline{v})$ .
$\blacksquare$
If $\pi=\pi_{1}\langle\theta\rangle$ then $\alpha_{\pi}(x^{s}_{\pi},x^{t}_{\pi},\overline{v}):=\alpha_{\pi_{1}}(x^{s}_{% \pi},x^{t}_{\pi},\overline{v})\wedge\theta^{\prime}$ where $\theta^{\prime}$ is obtained from $\theta$ by the following transformations:
- –
  
  each condition $y_{i}=y_{j}$ stays;
- –
  
  each condition $z_{i}=z_{j}$ is replaced by $z_{i}^{s}=z_{j}^{s}\wedge z_{i}^{t}=z_{j}^{t}$ ;
- –
  
  these are propagated through the Boolean connectives.

It is straightforward to verify that these translations satisfy (1), completing the proof. $\hfill\blacktriangleleft$

The proof shows that on graphs $G_{n}$ , Cypher patterns fall far short of RPQs. The latter can express every regular property of languages in $\ell^{*}$ , or in other words test if $n$ belongs to a set which is a finite union of arithmetic progressions. For Cypher patterns, on the other hand, the first-order definability of a pattern $\pi$ in the theory of order implies the existence of the threshold $t$ such that either $(v_{1},v_{n})$ is selected by $\pi$ for all $n>t$ , or $(v_{1},v_{n})$ is not selected by $\pi$ for all $n>t$ .

4 Conclusions

With formal models of GQL, SQL/PGQ, and Cypher finally available, this note is an example of how research in database theory can affect the design of new languages. When it comes to graph languages, industrial developments are far ahead of academic research, creating opportunities for the academic community to develop tools to evaluate decisions already made and lay a solid foundation for new language features in upcoming editions of the standards.

References

[1] Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan L. Reutter, and Domagoj Vrgoč. Foundations of modern query languages for graph databases. ACM Comput. Surv., 50(5):68:1–68:40, 2017. doi:10.1145/3104031.
[2] Angela Bonifati, George H. L. Fletcher, Hannes Voigt, and Nikolay Yakovets. Querying Graphs. Morgan & Claypool Publishers, 2018. doi:10.2200/S00873ED1V01Y201808DTM051.
[3] Alin Deutsch, Nadime Francis, Alastair Green, Keith Hare, Bei Li, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Wim Martens, Jan Michels, Filip Murlak, Stefan Plantikow, Petra Selmer, Hannes Voigt, Oskar van Rest, Domagoj Vrgoč, Mingxi Wu, and Fred Zemke. Graph pattern matching in GQL and SQL/PGQ. In SIGMOD, pages 1–12. ACM, 2022. arXiv:2112.06217.
[4] Nadime Francis, Amélie Gheerbrant, Paolo Guagliardo, Leonid Libkin, Victor Marsault, Wim Martens, Filip Murlak, Liat Peterfreund, Alexandra Rogova, and Domagoj Vrgoc. GPC: A pattern calculus for property graphs. In Floris Geerts, Hung Q. Ngo, and Stavros Sintos, editors, Proceedings of the 42nd ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2023, Seattle, WA, USA, June 18-23, 2023, pages 241–250. ACM, 2023. doi:10.1145/3584372.3588662.
[5] Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, and Andrés Taylor. Cypher: An evolving query language for property graphs. In Proceedings of the 2018 International Conference on Management of Data, pages 1433–1445, New York, NY, USA, 2018. Association for Computing Machinery. doi:10.1145/3183713.3190657.

[bib.bib1] [1] Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan L. Reutter, and Domagoj Vrgoč. Foundations of modern query languages for graph databases. ACM Comput. Surv., 50(5):68:1–68:40, 2017. doi:10.1145/3104031.

[bib.bib2] [2] Angela Bonifati, George H. L. Fletcher, Hannes Voigt, and Nikolay Yakovets. Querying Graphs. Morgan & Claypool Publishers, 2018. doi:10.2200/S00873ED1V01Y201808DTM051.

[bib.bib3] [3] Alin Deutsch, Nadime Francis, Alastair Green, Keith Hare, Bei Li, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Wim Martens, Jan Michels, Filip Murlak, Stefan Plantikow, Petra Selmer, Hannes Voigt, Oskar van Rest, Domagoj Vrgoč, Mingxi Wu, and Fred Zemke. Graph pattern matching in GQL and SQL/PGQ. In SIGMOD, pages 1–12. ACM, 2022. arXiv:2112.06217.

[bib.bib4] [4] Nadime Francis, Amélie Gheerbrant, Paolo Guagliardo, Leonid Libkin, Victor Marsault, Wim Martens, Filip Murlak, Liat Peterfreund, Alexandra Rogova, and Domagoj Vrgoc. GPC: A pattern calculus for property graphs. In Floris Geerts, Hung Q. Ngo, and Stavros Sintos, editors, Proceedings of the 42nd ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2023, Seattle, WA, USA, June 18-23, 2023, pages 241–250. ACM, 2023. doi:10.1145/3584372.3588662.

[bib.bib5] [5] Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, and Andrés Taylor. Cypher: An evolving query language for property graphs. In Proceedings of the 2018 International Conference on Management of Data, pages 1433–1445, New York, NY, USA, 2018. Association for Computing Machinery. doi:10.1145/3183713.3190657.