Abstract 1 Introduction and our contribution 2 Preliminaries 3 Reduction to nc-rank and algorithms for semistability References

Algorithmic Aspects of Semistability of Quiver Representations

Yuni Iwamasa ORCID Graduate School of Informatics, Kyoto University, Japan Taihei Oki ORCID Institute for Chemical Reaction Design and Discovery (ICReDD), Hokkaido University, Sapporo, Japan
Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan
Tasuku Soma ORCID The Institute of Statistical Mathematics, Tokyo, Japan
Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan
Abstract

We study the semistability of quiver representations from an algorithmic perspective. We present efficient algorithms for several fundamental computational problems on the semistability of quiver representations: deciding the semistability and σ-semistability, finding the maximizers of King’s criterion, and computing the Harder–Narasimhan filtration. We also investigate a class of polyhedral cones defined by the linear system in King’s criterion, which we refer to as King cones. For rank-one representations, we demonstrate that these King cones can be encoded by submodular flow polytopes, enabling us to decide the σ-semistability in strongly polynomial time. Our approach employs submodularity in quiver representations, which may be of independent interest.

Keywords and phrases:
quivers, σ-semistability, King’s criterion, operator scaling, submodular flow
Category:
Track A: Algorithms, Complexity and Games
Funding:
Yuni Iwamasa: Supported by JSPS KAKENHI Grant Numbers JP22K17854, JP24K02901, Japan.
Taihei Oki: Supported by JST ERATO Grant Number JPMJER1903, JST CREST Grant Number JPMJCR24Q2, JST FOREST Grant Number JPMJFR232L, JSPS KAKENHI Grant Number JP22K17853, and Start-up Research Funds in ICReDD, Hokkaido University.
Tasuku Soma: Supported by JPSP KAKENHI Grant Number JP19K20212 and JST, PRESTO Grant Number JPMJPR24K5, Japan.
Copyright and License:
[Uncaptioned image] © Yuni Iwamasa, Taihei Oki, and Tasuku Soma; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Computing methodologies Algebraic algorithms
; Mathematics of computing Submodular optimization and polymatroids
Related Version:
Full Version: https://arxiv.org/abs/2407.06493
Acknowledgements:
The authors thank Hiroshi Hirai and Keiya Sakabe for their valuable comments on an earlier version of this paper. The last author thanks Cole Franks for bringing the submodularity of quiver representations to his attention. The authors thank an anonymous reviewer for pointing out the reference [35].
Funding:
This work was supported by JSPS KAKENHI Grant Number JP24K21315, Japan.
Editors:
Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

1 Introduction and our contribution

Quiver representation is a simple generalization of matrices that has led to surprisingly deep extensions of various results in linear algebra [14]. In this paper, we study the semistability of quiver representations, which is a central concept in the geometric invariant theory (GIT), from an algorithmic perspective. The semistability of quiver representations appears in operator scaling [25, 24, 17, 5, 19], Brascamp–Lieb (BL) inequality [23], Tyler’s M-estimator [20], and scatter estimation of structured normal models [1], which have attracted considerable attention in theoretical computer science owing to their connection to the noncommutative Edmonds’ problem, algebraic complexity theory, and submodular optimization [35, 31, 6, 26]. The goal of this paper is to provide efficient algorithms for various fundamental computational problems on the semistability of quiver representations. In the following, we describe the problems more formally and present our results.

1.1 Semistability of quiver representations

Here, we present the formal definition of a quiver representation. We follow the terminologies in [14, 6]. Let Q=(Q0,Q1) be a quiver with a vertex set Q0 and an arc set Q1. In this paper, we consider only acyclic quivers except otherwise stated. For each arc aQ1, we denote the tail and head of a by ta and ha, respectively. A representation V of Q consists of complex vector spaces V(i) for vertex iQ0 and linear maps V(a):V(ta)V(ha) for arc aQ1. A subrepresentation W of V is a representation of the same quiver such that W(i)V(i) for iQ0, and W(a)=V(a)|W(ta) and imW(a)W(ha) for aQ1. The vector of dimensions of V(i) is called the dimension vector of the representation, denoted by dim¯V. We call the vector space of all representations of Q with the dimension vector α the representation space of Q with dimension vector α, which we denote by Rep(Q,α). After fixing the dimension vector α and a basis of each V(i), we can represent V(a) as an α(ha)×α(ta) matrix. Therefore, the representation space can be identified as

Rep(Q,α)=aQ1Mat(α(ha),α(ta)),

where Mat(m,n) denotes the space of m×n complex matrices.

Fix a dimension vector α. Let

GL(Q,α)iQ0GL(α(i)),

where GL(n) denotes the general linear group of degree n. Then, GL(Q,α) acts on the representation space by a change of basis:

gV(ghaV(a)gta1)aQ1.

Note that this is a left action, i.e., (gh)V=g(hV) for g,hGL(Q,α). We say that a representation V is semistable under the GL(Q,α)-action if the orbit closure of V does not contain the origin, i.e.,

infgGL(Q,α)aQ1ghaV(a)gta1F2>0.

Otherwise, V is said to be unstable. The set of all unstable representations is called the null-cone of the GL(Q,α)-action. It is easy to see that any representation is unstable under the GL(Q,α)-action if Q is acyclic.111The readers may wonder whether we can check the semistability (under GL(Q,α)-action) of quiver representations of cyclic quivers. We will address this point later.

However, the semistability of quivers under subgroups of GL(Q,α) turns out to be more intricate. Let σQ0 be an integer vector on Q0, which we call a weight. Let χσ be the corresponding multiplicative character of GL(Q,α), i.e.,

χσ(g)=iQ0det(gi)σ(i).

Note that χσ is a one-dimensional representation of GL(Q,α); GL(Q,α) acts on by gxχσ(g)x. A representation V is said to be σ-semistable if the orbit closure of (V,1)Rep(Q,α) under the GL(Q,α) action does not contain the origin, i.e.,

infgGL(Q,α)(aQ1ghaV(a)gta1F2+|χσ(g)|2)>0.

It turns out that checking the σ-semistability of a quiver representation includes operator scaling (noncommutative rank computation) and the membership problem of the BL polytopes. We will see these examples in the following sections.

Our first result is a deterministic algorithm that, given a quiver representation V and weight σ, decides whether the representation is σ-semistable in time polynomial in the bit complexity of V and absolute values of the entries of σ. Let α(Q0)iQ0α(i).

Theorem 1.1 (informal version of Theorem 3.4).

Let Q be an acyclic quiver, V a representation of Q, and σ a weight. There is a deterministic algorithm that decides the σ-semistability of V in time polynomial in the size of Q, α(Q0), bit complexity of V, and absolute values of the entries of σ.

This improves the previous result [30] which runs in time polynomial in the number of paths in Q, which can be exponential in the size of Q. Furthermore, if the absolute value of the entries of σ is constant, our algorithm runs in polynomial time. This includes the known result for operator scaling [24].

1.2 King’s criterion

King [32] showed the following characterization of σ-semistability, which is known as King’s criterion. Let σ(α)iQ0σ(i)α(i) for a dimension vector α. Then, a representation V is σ-semistable if and only if σ(dim¯V)=0 and σ(dim¯W)0 for any subrepresentation W of V.

King’s criterion is a common generalization of the noncommutative rank (nc-rank) computation and membership problem of BL polytopes.

Example 1.2 (nc-rank).

Let Q be the generalized Kronecker quiver with m parallel arcs, α=(n,n), and σ=(1,1); see Figure 1. Any representation V of Q with the dimension vector α can be regarded as an n×n linear matrix A=a=1mxaV(a), where xa is an indeterminate. A subrepresentation W of V is determined by a pair of subspaces (W(1),W(2)) such that a=1mV(a)W(1)W(2). Then, King’s criterion reads that V is σ-semistable if and only if dimUdim(a=1mV(a)U)0 for any subspace Un, which is equivalent to that A is nc-nonsingular. More generally, the nc-rank of A is equal to the minimum of n+dimUdim(a=1mV(a)U) over all subspaces Un [15].

Example 1.3 (BL polytope).

Let Q be a star quiver with m leaves. We assume that Q0={0,1,,m} and 0 is the root; see Figure 1. Let α=(n,n1,,nm) and σ=(d,c1,,cm) for positive integers d,c1,,cm. A real representation V of Q with the dimension vector α can be regarded as a tuple of the matrices (B1,,Bm), where Bi is an ni×n matrix. Again, a subrepresentation W is an (m+1)-tuple of the subspaces (W(0),W(1),,W(m)) such that BiW(0)W(i) for i[m]. King’s criterion reads that V is σ-semistable if and only if dn=i=1mcini and dndimW(0)i=1mcinidim(BiW(0))0 for any subspace W(0). This is equivalent to that p=(c1/d,,cm/d) is in the BL polytope of linear operators B1,,Bm [3].

Figure 1: Generalized Kronecker quiver (left) and star quiver (right).

We study the following optimization problem: given a quiver representation V and weight σ, find a subrepresentation W of V that maximizes σ(dim¯W). In the case of the nc-rank, such a subrepresentation corresponds to a subspace U that maximizes dimUdim(a=1mAiU). Such a subspace is called a shrunk subspace and can be regarded as a certificate of the nc-rank [31, 19]. In the case of the BL polytopes, the problem corresponds to separation for the BL polytope [23].

King’s criterion can be regarded as maximizing a modular function over the modular lattice of subrepresentations. For any subrepresentations W1,W2 of V, define the subrepresentations W1+W2 and W1W2 as follows. For each iQ0,

(W1+W2)(i)W1(i)+W2(i),(W1W2)(i)W1(i)W2(i),

where the addition and intersection on the right-hand side are those of the vector spaces. Furthermore, the linear map of aQ1 in W1+W2 (resp. W1W2) is defined as the restriction of V(a) to (W1+W2)(ta) (resp. (W1W2)(ta)). Then, W1+W2 and W1W2 are indeed subrepresentations of V. Thus, the subrepresentations of V form a modular lattice. Furthermore, the function f(W)σ(dim¯W) is a modular function, i.e., for any subrepresentations W1,W2 of V,

f(W1)+f(W2)=f(W1+W2)+f(W1W2).

Thus, subrepresentations maximizing f form a sublattice, and there is a unique inclusion-wise minimum maximizer. Our second result is a deterministic algorithm to find such a maximizer of King’s criterion.

Theorem 1.4 (informal version of Theorem 3.5).

Let Q be an acyclic quiver, V a representation of Q, and σ a weight. There is a deterministic algorithm that finds the inclusion-wise minimum maximizer W of King’s criterion in time polynomial in the size of Q, α(Q0), bit complexity of V, and absolute values of the entries of σ.

King’s criterion was originally proved using the Hilbert-Mumford criterion (see, e.g., [14, Sections 9.6 and 9.8]), a fundamental result in the GIT.

1.3 Harder-Narasimhan filtration

We use the algorithm for finding the maximizers of King’s criterion to devise an algorithm for finding the Harder-Narasimhan (HN) filtration [27, 28] of a quiver representation. Roughly speaking, the HN-filtration decomposes a quiver representation into the direct sum of smaller representations.

More precisely, let σQ0 be a weight and τ+Q0 a strictly monotone weight, i.e., a nonnegative weight such that τ(dim¯W)>0 if W{0}. Here, {0} denotes the zero representation, which is the representation whose dimension vector is the zero vector. We define the slope of a quiver nonzero representation V as μ(V)=σ(dim¯V)/τ(dim¯V). We say that V is μ-semistable222It is also called (σ:τ)-semistability in the literature. if μ(W)μ(V) for any nonzero subrepresentation W of V. The HN-filtration theorem states that for any quiver representation V, there exists a unique filtration {0}=W0<W1<<Wk=V such that (i) μ(Wi/Wi1)>μ(Wi+1/Wi) for i[k1] and (ii) Wi/Wi1 is μ-semistable. Here, Wi<Wi+1 means that Wi is a subrepresentation of Wi+1 with WiWi+1, and Wi/Wi1 is a representation of Q such that (Wi/Wi1)(j) is the quotient space Wi(j)/Wi1(j) for jQ0 and (Wi/Wi1)(a) is the corresponding quotient linear map of Wi(a) for aQ1. We note that semistability with respect to a slope can be reduced to that for a weight.

Our third result is a deterministic algorithm for finding the HN-filtration.

Theorem 1.5 (informal).

Let Q be an acyclic quiver, V a representation of Q, and μ=σ/τ a slope. There is a deterministic algorithm that finds the HN-filtration of V with respect to μ in time polynomial in the size of Q, α(Q0), bit complexity of V, and absolute values of the entries of σ and τ.

This result improves a recent result [7] which runs in time polynomial in the number of paths in Q.

Recently, Hirai and Sakabe [29] introduced the coarse Dulmage-Mendelsohn (DM) decomposition of a linear matrix, generalizing the classic DM-decomposition of a bipartite graph. They showed that a natural gradient flow of operator scaling converges to the coarse DM-decomposition. However, their result did not provide an efficient algorithm to compute the coarse DM-decomposition because the gradient flow may take exponential time to converge. We show that the coarse DM-decomposition is indeed a special case of the HN-filtration for the generalized Kronecker quiver. Since the absolute values of the weights involved for this special case are polynomially bounded, our algorithm finds the coarse DM-decomposition in polynomial time.

1.4 King’s polyhedral cone, rank-one representations, and submodular flow

Motivated by King’s criterion, we investigate a polyhedral cone that is the set of σQ0 satisfying σ(dim¯V)=0 and σ(dim¯W)0 for any subrepresentation W of V. Since the number of distinct dim¯W is finite, the above linear system is also finite and hence defines a polyhedral cone. We call the polyhedral cone the King cone of a quiver representation V. Interestingly, the King cone is related to the cone of feasible flows in network-flow problems.

Let us first consider the easiest case. If V is a quiver representation with dimV(i)=1 for all iQ0, then King’s criterion characterizes the existence of a nonnegative flow φ on the support quiver with the boundary condition φ=σ. To state it more precisely, we introduce notation on flows. Let Q=(Q0,Q1) be a quiver (or a directed graph). For a vertex subset XQ0, let Out(X) denote the set of outgoing arcs from X, i.e., Out(X){a=(i,j):iX,jQ0X}. Similarly, let In(X) denote the set of incoming arcs to X. If X={i}, we abbreviate Out({i}) and In({i}) as Out(i) and In(i), respectively. If Out(X)=, then X is called a lower set of Q. For a flow φQ1 on Q, its boundary φQ0 is defined by φ(i)aOut(i)φ(a)aIn(i)φ(a) for iQ0.

Let us return to the σ-semistability of a representation V with dimV(i)=1 for all iQ0. In this case, V(a) for each arc aQ1, and a subrepresentation W of V can be identified with a vertex subset XQ0. By the definition of a subrepresentation, if iX and W(a)0 for a=(i,j)Q1, then jX. This implies that X is a lower set in the support quiver of V, namely, the subquiver of Q whose arcs are aQ1 with V(a)0. Therefore, King’s criterion is equivalent to the purely combinatorial condition that σ(Q0)=0 and σ(X)0 for each lower set X of Q, which characterizes the existence of a nonnegative flow φ on the support quiver with the boundary condition φ=σ by Gale’s theorem [22] (see, e.g., [33, Theorem 9.2]).

By generalizing the above observation, we show that if V is a rank-one representation of Q, i.e., V(a) is a rank-one matrix for each aQ1, then

  • King’s criterion can be rephrased as a purely combinatorial condition with respect to the linear matroids arising from the rank-one matrices V(a) of Q, and

  • the rephrased condition above can be further viewed as the feasibility condition of a network flow-type problem called submodular flow.

That is, the King cone is representable as the feasibility of a certain instance of the submodular flow problem. This enables us to decide the σ-semistability for rank-one representations in strongly polynomial time.

Theorem 1.6 (informal).

Let Q be an acyclic quiver, V a rank-one representation of Q, and σ a weight. Then, σ is in the King cone if and only if there is a feasible flow in the instance of submodular flow constructed from V, σ. Therefore, using standard submodular flow algorithms, we can decide the σ-semistability of rank-one representations in strongly polynomial time.

This theorem recovers the following well-known results when applied to the generalized Kronecker quiver and a star quiver.

  • A rank-one linear matrix k=1mxkvkfk is (nc-)nonsingular (where vk is a column vector and fk a row vector) if and only if the linear matroids of (fk:k[m]) and (vk:k[m]) have a common base [34].

  • If each linear operator Bi=fi is of rank-one for i[m] (where fi is a row vector), the BL polytope coincides with the base polytope of the linear matroid of (fi:i[m]) [2].

1.5 Semistability of general quivers

Thus far, we have considered the σ-semistability of acyclic quivers. As a complementary result, we show that the semistability of cyclic quivers under the GL(Q,α)-action can be efficiently reduced to noncommutative polynomial identity testing. In particular, we show that the polynomial can be represented by an algebraic branching program (ABP). This yields a deterministic algorithm for deciding the semistability of general quivers because noncommutative polynomial identity testing for ABP can be conducted in deterministic polynomial time [36]. Note that [6] devised another deterministic algorithm for the problem with their framework of noncommutative optimization, which is built upon deep results in various areas of mathematics. See also the discussion in related work.333After submitting the first version of this paper, an anonymous reviewer pointed out that this result for general quivers was sketched in [35]. See Theorem 10.8 and the last paragraph of Section 10.2 in [35]. At a very high level, our algorithm and Mulmuley’s results follow a similar strategy, although Mulmuley’s result requires several deep algebro-geometric backgrounds. We believe that our proof is more explicit and elementary, and hence, is worthy to be presented here for completeness.

1.6 Our techniques

In this subsection, we outline our techniques.

𝝈-semistability.

Our starting point is a reduction of general acyclic quivers to the generalized Kronecker quiver [13, 30]. We decompose the weight σ=σ+σ, where σ+(i)max{σ(i),0} and σ(i)max{σ(i),0}. Let Q0+ and Q0 be the sets of vertices i such that σ(i)>0 and σ(i)<0, respectively. Derksen and Makam [13] showed that the σ-semistability of a representation V with the dimension vector α is equivalent to the nc-nonsingularity of a partitioned linear matrix

A:iQ0+(α(i))σ+(i)iQ0(α(i))σ(i),

where the (s,p;t,q)-block (sQ0+, p[σ+(s)], tQ0, q[σ(t)]) of A is given by a linear matrix

P:st pathxP,p,qV(P).

Here, xP,p,q is an indeterminate and V(P) is the linear map corresponding to the path P, i.e., V(P)V(ak)V(a1) for P=(a1,,ak) as a sequence of arcs. However, because the number of indeterminates is exponential, applying the known nc-rank computation algorithms in a black box manner does not yield the desired time complexity.

Inspired by the above reduction, we introduce the following scaling problem for quiver representations. We define a scaling Vg,h of the quiver representation V by the block matrices g=(gt:tQ0) and h=(hs:sQ0+) as

Vg,h(a){ghaV(a)htaif aOut(Q0+)In(Q0),V(a)htaif aIn(Q0)Out(Q0+),ghaV(a)if aOut(Q0+)In(Q0),V(a)otherwise.

Furthermore, let (b+,b)iQ0+α(s)×tQ0α(t) be vectors such that b+(s)=σ+(s)N𝟏α(s) and b(t)=σ(t)N𝟏α(t), where Nσ+(α)=σ(α) and 𝟏 denotes the all-one vector. We say that V is approximately scalable (to the marginals (b+,b)) if for any ε>0, there exist block matrices g and h such that Vg,h satisfies

sQ0+P:st pathVg,h(P)Vg,h(P)Diag(b(t))tr <ε(tQ0),
tQ0P:st pathVg,h(P)Vg,h(P)Diag(b+(s))tr <ε(sQ0+),

where the norm is the trace norm.444The choice of the trace norm is not important here; we can use any unitary invariant norm. This is an instance of operator scaling with specified marginals [17, 5]. The crucial observation is that even though there may exist exponentially many st paths, the above matrix sum can be computed efficiently by exploiting the underlying quiver structure. Therefore, we can use a simple iterative algorithm in [5] to check V is approximately scalable for a fixed ε>0. We can show that it runs in O(ε2poly(|Q|,α(Q0),b)) time, where b is the bit complexity of V. Furthermore, we show that it is sufficient to consider ε=O(1/N) to decide the σ-semistability of V. This yields our algorithm for checking the σ-semistability of quiver representations.

Maximizers in King’s criterion.

We follow a similar approach to find a maximizer in King’s criterion. We use the above linear matrix of the reduction [13] directly and show that the shrunk subspaces of the above linear matrix correspond to the maximizers of King’s criterion. Then, we show that the necessary operations in the recent shrunk subspace algorithm [19] can be performed efficiently, enabling us to find the inclusion-wise minimum maximizer of King’s criterion efficiently. Note that the correspondence between the shrunk subspaces and maximizers of King’s criterion is shown in [30] using abstract algebra; we provide a more direct and elementary proof using submodularity.

HN-filtration.

Our HN-filtration algorithm is based on principal partitions of submodular systems [21]. For a slope μ=σ/τ, we consider a parametric modular function

fλ(W)λτ(dim¯W)σ(dim¯W)

on the subrepresentations W of V, where λ is a parameter. Let (λ) denote the modular lattice of the minimizers of fλ and let W(λ) and W+(λ) be the minimum and maximum minimizers of fλ, respectively. By the standard argument in principal partition, we show that W+(λ)W(λ) for λ>λ. Furthermore, there must be a finite set of λ such that (λ) consists of more than one element. We call such a value of λ a critical value. Let λ1>>λk be the critical values. Then, they induce the filtration

{0}=W(λ1)<W+(λ1)=W(λ2)<W+(λ2)==W(λk)<W+(λk)=V.

We show that this coincides with the HN-filtration. Each W(λ) and W+(λ) can be found by our algorithm for maximizers of King’s criterion for fixed λ. The possible candidates for critical values can be easily enumerated, enabling us to find the HN-filtration efficiently.

Strongly polynomial-time algorithm for rank-one representations.

When V is a rank-one representation, each rank-one matrix V(a) is representable as vafa for some nonzero vector vaV(ha) and nonzero dual vector faV(ta). Based on this representation, we assign each vertex iQ0 to two linear matroids 𝐌i+ and 𝐌i, where the first is generated by {fa:faOut(i)} and the second by {va:vaIn(i)}. Then, we can simulate a subrepresentation W of V as a lower set X of the directed graph D[V] constructed from V; its vertex set is the (disjoint) union of the ground sets {fa:faOut(i)} and {va:vaIn(i)} of the matroids 𝐌i+ and 𝐌i for iQ0; its arc set represents Q1 and the dependencies as “if W(ha)=W(tb) contains va then W(tb)kerfb; hence, W(hb) must contain vb for W to be a subrepresentation of V.” This enables us to rephrase King’s criterion as a combinatorial condition on the lower sets X of D[V] as

iQ0(σ+(i)(dimV(i)ri+({fa:faOut(i)}X))σ(i)ri({va:vaIn(i)}X))0,

where ri+ and ri denote the rank functions of 𝐌i+ and 𝐌i, respectively. We further rephrase the above combinatorial condition as the feasibility characterization of a certain instance of the submodular flow problem by Frank [16]. Thus, we can check the σ-semistability of a rank-one representation V by checking the feasibility of the instance generated by V of the submodular flow problem.

Semistability of general quivers.

For the semistability of general quivers, we use an invariant polynomial characterization of the null-cone. By the general theory of GIT, a representation V is semistable if and only if there exists a GL(Q,α)-invariant homogeneous polynomial p on the representation space Rep(Q,α) such that p(V)0. The Le Bruyn-Procesi theorem [4] stated that the ring of invariant polynomials is generated by polynomials in the form of

tr[V(ak)V(a2)V(a1)],

for a closed path555Here, a closed path means a sequence (a1,,ak) of arcs such that hal=tal+1 (l[k]), where ak+1a1. In graph theory, it is usually called a closed walk. In this paper, we follow the standard terminologies in quiver representation. (a1,a2,,ak) in Q with length k1. Furthermore, closed paths with length 1kα(Q0)2 generate the invariant ring, where α=dim¯V.

Therefore, we can decide the semistability by checking whether the above polynomial is nonzero at some vertex i and closed path C. The obstacle is that the number of closed paths can be exponential. To this end, we consider another polynomial in noncommutative indeterminate xa (aQ1) defined as

C: closed path starting at ixCtrV(C)

for each vertex iQ0, where xC=xakxa1 for C=(a1,,ak). Then, V is semistable if and only if this noncommutative polynomial is nonzero at some vertex i. Note that the noncommutativity is essential to distinguish closed paths with the same arc sets. For example, consider a quiver with a single vertex and two self-loops (say, a and b). Then, closed walks C=ababab and C=akbk of length 2k have the same number of a and b, but their trace trV(C) and trV(C) are different in general for k2. So if we use commutative indeterminates, multiple closed walks correspond to a single monomial, and we cannot decide whether trV(C)0 for some C or not by checking whether the polynomial is zero or not.

Yet, we need to show how to perform noncommutative polynomial identity testing for this polynomial in deterministic polynomial time. Using the underlying quiver structure, we can show that this polynomial can written as an ABP of polynomial size. Applying the algorithm of [36], we obtain our algorithm for the semistability of general quivers.

We remark that this is the only place where a nontrivial algebraic result from the GIT machinery is needed. The other algorithms and analysis can be understood with elementary linear algebra (assuming the known analysis of operator scaling algorithms, which involves some abstract algebra).

1.7 Related work

Existing studies on algorithms for quiver semistability have focused on bipartite quivers [11, 9, 10, 18]. We remark that semistability in bipartite quivers is essentially operator scaling with a block structure. In bipartite quivers, the number of paths is equal to the number of arcs; hence, a weak running time was sufficient in the previous studies. [30, 7] studied general acyclic quivers. They first used the reduction of [13] to the generalized Kronecker quiver and applied the nc-rank algorithm [31] in a black box manner. Hence, their algorithm runs in time polynomial in the number of paths, which can be exponential.

Several polyhedral cones associated with quiver representations have been studied in the literature [8, 37]. The moment cone of a quiver Q and a dimension vector α is the polyhedral cone generated by the highest weights of the representations of Q with the dimension vector α. The membership of the moment cone can be decided in strongly polynomial time for bipartite quivers [8] and even general acyclic quivers [37]. Another polyhedral cone is the conic hull of weight σQ0 such that there exists a nonzero semi-invariant polynomial in Rep(Q,α) with weight σ. The membership problem of this cone is called the generic semistability problem [8]. By definition, σ is in this cone if and only if there exists a generic σ-semistable representation of Q, hence the name. To the best of our knowledge, the generic semistability problem remains open for general acyclic quivers.

The semistability of quiver representations is a special case of semistability in the GIT. In the most abstract setting, GIT studies group actions on algebraic varieties. We say that a point in the variety is semistable if its orbit closure does not contain the origin. Bürgisser et al. [6] proposed a framework of noncommutative optimization to devise algorithms for GIT problems in the general setting. Although noncommutative optimization is a broad and general framework, it does not always provide efficient algorithms for all GIT problems. Currently, most of the known tractable problems originate from a family of operator scaling problems, which are also contained in the semistability of quiver representations. Furthermore, it is built upon deep results in various areas of mathematics, such as algebraic geometry, Lie algebra, and representation theory, rendering it difficult for non-experts to understand. Another conceptual contribution of this paper is identifying the semistability of quiver representations as a useful subclass of GIT. The semistability of quiver representations is rich enough to capture various interesting problems in the literature while also supporting the design efficient algorithms using elementary techniques.

1.8 Organization of this paper

The rest of this paper is organized as follows. Section 2 introduces the necessary background and notation of quiver representations, operator scaling, and noncommutative rank. Section 3 presents our algorithms for deciding the σ-semistability and finding maximizers of King’s criterion. Due to the space limitation, the other results are placed in the full version.

2 Preliminaries

We denote the set of nonnegative integers, rational, and real numbers by +, +, and +, respectively. We let [m,n]{m,m+1,,n1,n} for m,n with mn and [n][1,n]={1,,n} for a positive integer n. We denote the set of m×n complex matrices by Mat(m,n). We simply denote Mat(n,n) by Mat(n). The conjugate transpose of a matrix A is denoted by A. The subgroup of the upper triangular matrices in GL(n) (i.e., the Borel subgroup) is denoted by B(n). For two vector spaces U and V, we mean by UV that U is a subspace of V. Let S denote the vector space spanned by a multiset S of the vectors. For a vector bn, we denote by Diag(b) the n×n diagonal matrix such that the entries of b are on the diagonal.

2.1 Operator scaling, matrix space, and noncommutative rank

A linear map Φ:Mat(n)Mat(m) is said to be completely positive, or CP for short, if Φ(X)==1kAXA for some AMat(m,n). These A are called the Kraus operators of Φ. The dual map Φ:Mat(m)Mat(n) of Φ is defined by Φ(X)==1kAXA. For (g,h)GL(m)×GL(n), we define the scaling Φg,h of Φ by

Φg,h(X)gΦ(hXh)g==1k(gAh)X(gAh).

If m=n, the CP map is said to be square.

Let Φ:Mat(n)Mat(n) be a square CP map. Let 𝖽𝗌(Φ)Φ(I)IF2+Φ(I)IF2, where F denotes the Frobenius norm. Then, Φ is said to be approximately scalable if for any ε>0, there exists (g,h)GL(n)×GL(n) such that 𝖽𝗌(Φg,h)ε. The goal of the operator scaling problem is to decide whether a given CP map is approximately scalable or not.

Operator scaling is closely related to the noncommutative rank (nc-rank) of linear matrices. An m×n symbolic matrix A is called a linear matrix if A==1kxA for indeterminates x and matrices AMat(m,n). Sometimes, it is more convenient to see a linear matrix as a matrix space spanned by A1,,Ak. We denote the corresponding matrix space of a linear matrix A by 𝒜. For a subspace Un, let 𝒜U{Au:A𝒜,uU}, which is a subspace of m. The nc-rank of a linear matrix A (denoted by ncrankA) is defined by

ncrankAmin{n+dim𝒜UdimU:Un}.

A square linear matrix A is said to be nc-nonsingular if ncrankA=n. Informally speaking, ncrankA is the rank of A, where the indeterminates xi are pairwise noncommutative, i.e., xixjxjxi for ij. See [12, 15] for more details. A pair (L,R) of subspaces Lm and Rn is called an independent subspace if L𝒜R={0}. Over the complex field, (L,R) is independent if and only if tr(ΠLΦ(ΠR))=0, where ΠL denotes the orthogonal projection matrix onto L. Then,

ncrankA=m+nmax{dimL+dimR:(L,R) an independent subspace}.

An independent subspace (L,R) is said to be maximum if dimL+dimR is maximum. In particular, a square linear matrix A is nc-nonsingular if and only if dimL+dimRn for any independent subspace (L,R). Gurvits’ theorem [25] states that a square CP map Φ with the Kraus operator A ([k]) is approximately scalable if and only if the linear matrix =1kxA is nc-nonsingular.

2.2 Operator scaling with specified marginals

Operator scaling with specified marginals is a generalization of operator scaling. Let (b+,b)+n×+m be a pair of nonincreasing nonnegative vectors, which we call the target marginals. We say that a CP map Φ:Mat(n)Mat(m) is approximately scalable to the target marginals (b+,b) if there exist nonsingular upper triangular matrices (g,h)B(m)×B(n) such that

Φg,h(I)Diag(b)trε,Φg,h(I)Diag(b+)trε.

Such target marginals are said to be feasible. We define Δb++n as Δbj+=bj+bj+1+ for j[n], where we conventionally define bn+1+0. Similarly, we define Δb+m. Let Fj+=𝐞1,,𝐞j be the standard flag of n for j[n]. Similarly, we define Fi for i[m]. The following theorem characterizes the set of feasible marginals by a certain linear system.

Theorem 2.1 ([17, Theorem 18]).

Let Φ:Mat(n)Mat(m) be a CP map and (b+,b)+n×+m a pair of nonincreasing nonnegative vectors. Then, Φ is approximately scalable to the marginals (b+,b) if and only if j=1nbj+=i=1mbiB and

i=1mΔbidim(LFi)+j=1nΔbj+dim(RFj+)B

for any independent subspace (L,R) of Φ.

Let (b+,b) be a feasible marginal with rational entries. There is an efficient algorithm that finds a scaling of a given CP map Φ whose marginal is ε-close to (b+,b) [17, 5].

Theorem 2.2 (Theorem 1.13 in [5], specialized for operator scaling).

Let ε>0 be an accuracy parameter and Φ:Mat(n)Mat(m) a CP map with Gaussian integer Kraus operators. Let (b+,b)n×m be target marginals such that b1+bn+0, b1bm0, and j=1nbj+=i=1mbi=1. Then, Algorithm 1 finds upper triangular g,h such that Φg,h(I)Diag(b)trε and Φg,h(I)Diag(b+)trε in T=O(ε2(b+Nlog(N))) iterations, where b is the maximum bit length of the target marginals (b+,b), Nmax{m,n}, and is the smallest positive integer such that (b+,b) is an integer. Furthermore, each iteration can be executed in time O(N3).

Algorithm 1 Operator Sinkhorn iteration for specified marginals [17, 5].

3 Reduction to nc-rank and algorithms for semistability

In this section, we present our algorithms for deciding σ-semistability and finding maximizers of King’s criterion.

3.1 Reduction from semistability to nc-rank

Here, we recall the reduction of the σ-semistability of an acyclic quiver to nc-nonsingularity testing [13].

Let Q be an acyclic quiver and V a representation of Q with the dimension vector α. Let σQ0 be a weight.666We do not assume σ(α)=0 here because the reduction does not need it. This is useful for finding a maximizer in King’s criterion. Let Q0+ and Q0 be the sets of vertices i such that σ(i)>0 and σ(i)<0, respectively. Let σ+(i)max{σ(i),0} and σ(i)max{σ(i),0} for each iQ0. Note that σ=σ+σ. Let Nσ+(α)=σ(α). We define an N×N partitioned linear matrix A as follows. As a linear map,

A:sQ0+(α(s))σ+(s)tQ0(α(t))σ(t). (1)

The (s,p;t,q)-block (sQ0+, p[σ+(s)], tQ0, q[σ(t)]) of A is given by a linear matrix

P:st pathxP,p,qV(P),

where xP,p,q is an indeterminate and V(P) is the linear map corresponding to the path P, i.e., V(P)V(ak)V(a1) for P=(a1,,ak) as a sequence of arcs. The number of indeterminates in A is equal to

sQ0+tQ0σ+(s)σ(t)m(s,t),

where m(s,t) denotes the number of st paths in Q. Thus, the number of indeterminates may be exponential in general.

The following lemma connects the nc-rank of A with King’s criterion, which is shown in [30, Theorem 3.3] using abstract algebra.

Lemma 3.1.

The minimal maximizer of σ(dim¯W) for the subrepresentations W of V corresponds to the minimal shrunk subspace of A.

By the lemma, one can check the σ-semistability of a quiver representation by checking whether the corresponding linear matrix is nc-nonsingular or not. However, the naive reduction does not give a polynomial time algorithm, as the number of indeterminates in the linear matrix may be exponential.

3.2 Scaling algorithm for 𝝈-semistability

We present a scaling algorithm for deciding the σ-semistability. The idea is to reduce the problem to operator scaling with specified marginals.

Let us define a CP map ΦV corresponding to a quiver representation V. As a linear map,

ΦV:sQ0+Mat(α(s))tQ0Mat(α(t)).

Let X=(Xs:sQ0+) be an input block matrix. Then,

(ΦV(X))tsQ0+P:st pathV(P)XsV(P)

for tQ0. The dual map

ΦV:tQ0Mat(α(t))sQ0+Mat(α(s))

is given by

(ΦV(Y))stQ0P:st pathV(P)YtV(P)

for sQ0+. Analogous to the scaling of CP maps, we define a scaling Vg,h of the quiver representation V by block matrices g=(gt:tQ0) and h=(hs:sQ0+) as

Vg,h(a){ghaV(a)htaif aOut(Q0+)In(Q0),V(a)htaif aIn(Q0)Out(Q0+),ghaV(a)if aOut(Q0+)In(Q0),V(a)otherwise.

Here is the key lemma that relates the σ-semistability of a quiver representation to the feasibility of a specific marginal of ΦV.

Lemma 3.2.

Let V be a representation of an acyclic quiver Q with the dimension vector α and σ a weight with σ(α)=0. Let (b+,b) be the target marginals such that b+(s)=σ+(s)N𝟏α(s) and b(t)=σ(t)N𝟏α(t), where Nσ+(α)=σ(α). Then, ΦV is approximately scalable to the marginals (b+,b) if and only if V is σ-semistable.

To check the feasibility of the marginal, one can use the scaling algorithm for operator scaling with the specified marginals (Algorithm 1). This yields Algorithm 2.

Algorithm 2 Scaling algorithm for σ-semistability.

To run the algorithm, we first need to show how to compute the value of ΦV in polynomial time. Note that the naive computation of ΦV requires exponential time, as the number of terms in the sum is exponential. However, we can compute ΦV in polynomial time using the underlying quiver structure. We show an algorithm in Algorithm 3. The algorithm for the dual map is similar and therefore omitted.

Algorithm 3 Algorithm for computing ΦV(X).

Next, we show an upper bound of the accuracy parameter ε that is sufficient to decide the σ-semistability.

Lemma 3.3.

Let (b+,b) be as above. Let 0<ε16N. If there exist g,h such that (ΦV)g,h(I)Diag(b)trε and (ΦV)g,h(I)Diag(b+)trε, then V is σ-semistable.

Theorem 3.4.

Let V be a representation of an acyclic quiver Q with Gaussian integer entries and σ be a weight with σ(α)=0. Algorithm 2 correctly decides the σ-semistability of V in O(N2(b+dlog(Nd))) iterations, where N=σ+(α)=σ(α), d=max{α(Q0+),α(Q0)}, and b is the maximum bit length of σ. Each iteration can be executed in O(|Q1|αmax3+sQ0+α(s)3+tQ0α(t)3) time, where αmax=maxiQ0α(i).

Proof.

If V is σ-semistable, then ΦV is approximately scalable to the marginals (b+,b). By Theorem 2.1, the algorithm must find such a scaling within the stated iterations. Consequently, the algorithm outputs Yes. If V is not σ-semistable, then there is no scaling g,h such that (ΦV)g,h(I)Diag(b)trε and (ΦV)g,h(I)Diag(b+)trε for ε=1/6N by Lemma 3.3. Thus, the algorithm outputs No. This proves the correctness of the algorithm.

The number of iterations is immediate from the algorithm. The time complexity of each iteration is dominated by the computations of ΦV(I) and ΦV(I) and that of the block Cholesky decomposition. The former takes O(|Q1|αmax3) time and the latter takes O(tQ0α(t)3+sQ0+α(s)3) time.

3.3 Finding the extreme maximizer in King’s criterion

In this subsection, we extend the result from the previous section to find the extreme maximizer in King’s criterion. The idea is to use the shrunk subspace algorithm for the linear matrix (1).

Let

ΦVσ:iQ0+Mat(α(i))σ+(i)iQ0Mat(α(i))σ(i)

be a CP map that maps X=(Xs,p:sQ0+,p[σ+(s)]) to

(ΦVσ(X))t,q=sQ0+p[σ+(s)]P:st pathV(P)Xs,pV(P)

for tQ0 and q[σ(t)]. Let Jσ+=sQ0+σ+(s)Iα(s) and Jσ=tQ0σ(t)Iα(t). Note that

(ΦVσ(I))t,q=sQ0+σ+(s)P:st pathV(P)V(P)=ΦV(Jσ+)t,
((ΦVσ)(I))s,p=tQ0σ(t)P:st pathV(P)V(P)=ΦV(Jσ)s.

Thus, one can compute ΦVσ(I) and (ΦVσ)(I) in poly(|Q|,|α|,|σ|) time using Algorithm 3. By Lemma 3.1, the minimum shrunk subspace of ΦVσ corresponds to the minimum maximizer in King’s criterion. To find the minimum shrunk subspace of ΦVσ, one can simply use the algorithm of [19], which runs in poly(|Q|,|α|,|σ|,b) time, because one can compute ΦVσ(I) and (ΦVσ)(I) in poly(|Q|,|α|,|σ|,b) time.

The maximum maximizer can also be found by considering Φ instead of Φ. To see this, observe that the maximum shrunk subspace corresponds to the maximum independent subspace (L,R) such that dimL is the smallest. Since tr(ΠLΦ(ΠR))=tr(Φ(ΠL)ΠR), L is the minimum shrunk subspace of Φ.

Therefore, we obtain the following theorem.

Theorem 3.5.

For a quiver representation V of an acyclic quiver Q with the dimension vector α with Gaussian integer entries and a weight σ, the minimum and maximum maximizers of King’s criterion can be found in poly(|Q|,|α|,|σ|,b) time, where b denotes the bit complexity of V.

References

  • [1] Carlos Améndola, Kathlén Kohn, Philipp Reichenbach, and Anna Seigal. Invariant theory and scaling algorithms for maximum likelihood estimation. SIAM Journal on Applied Algebra and Geometry, 5(2):304–337, 2021. doi:10.1137/20M1328932.
  • [2] Franck Barthe. On a reverse form of the Brascamp-Lieb inequality. Inventiones Mathematicae, 134(2):335–361, 1998. doi:10.1007/s002220050267.
  • [3] Jonathan Bennett, Anthony Carbery, Michael Christ, and Terence Tao. The Brascamp-Lieb inequalities: Finiteness, structure and extremals. Geometric and Functional Analysis, 17(5):1343–1415, 2008. doi:10.1007/s00039-007-0619-6.
  • [4] Lieven Le Bruyn and Claudio Procesi. Semisimple representations of quivers. Transactions of the American Mathematical Society, 317(2):585, 1990. doi:10.2307/2001477.
  • [5] Peter Burgisser, Cole Franks, Ankit Garg, Rafael Oliveira, Michael Walter, and Avi Wigderson. Efficient algorithms for tensor scaling, quantum marginals, and moment polytopes. In Proceedings of the IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS), pages 883–897, 2018. doi:10.1109/FOCS.2018.00088.
  • [6] Peter Bürgisser, Cole Franks, Ankit Garg, Rafael Oliveira, Michael Walter, and Avi Wigderson. Towards a theory of non-commutative optimization: Geodesic 1st and 2nd order methods for moment maps and polytopes. In Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pages 845–861, 2019. doi:10.1109/FOCS.2019.00055.
  • [7] Chi-Yu Cheng. A deterministic algorithm for Harder–Narasimhan filtrations for representations of acyclic quivers. Algebra & Number Theory, 18(2):319–347, 2024. doi:10.2140/ant.2024.18.319.
  • [8] Calin Chindris, Brett Collins, and Daniel Kline. Membership in moment cones and quiver semi-invariants for bipartite quivers. arXiv, 2022.
  • [9] Calin Chindris and Harm Derksen. The capacity of quiver representations and Brascamp-Lieb constants. International Mathematics Research Notices, 2022(24):19399–19430, 2021. doi:10.1093/imrn/rnab064.
  • [10] Calin Chindris and Jasim Ismaeel. A quiver invariant theoretic approach to radial isotropy and the paulsen problem for matrix frames. SIAM Journal on Applied Algebra and Geometry, 6(4):536–562, 2022. doi:10.1137/21M141470X.
  • [11] Calin Chindris and Daniel Kline. Simultaneous robust subspace recovery and semi-stability of quiver representations. Journal of Algebra, 577:210–236, 2021. doi:10.1016/j.jalgebra.2021.03.005.
  • [12] P. M. Cohn. Skew Fields: Theory of General Division Rings. Encyclopedia of Mathematics and its Applications. Cambridge University Press, 1995.
  • [13] Harm Derksen and Visu Makam. Polynomial degree bounds for matrix semi-invariants. Advances in Mathematics, 310:44–63, 2017. doi:10.1016/j.aim.2017.01.018.
  • [14] Harm Derksen and Jerzy Weyman. An Introduction to Quiver Representations, volume 184. American Mathematical Soc., 2017.
  • [15] Marc Fortin and Christophe Reutenauer. Commutative/noncommutative rank of linear matrices and subspaces of matrices of low rank. Séminaire Lotharingien de Combinatoire [electronic only], 52:B52f–12, 2004.
  • [16] András Frank. Finding feasible vectors of Edmonds-Giles polyhedra. Journal of Combinatorial Theory, Series B, 36(3):221–239, 1984. doi:10.1016/0095-8956(84)90029-7.
  • [17] Cole Franks. Operator scaling with specified marginals. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 190–203, 2018. doi:10.1145/3188745.3188932.
  • [18] Cole Franks and Visu Makam. iPCA and stability of star quivers. arXiv, 2023. doi:10.48550/arXiv.2302.09658.
  • [19] Cole Franks, Tasuku Soma, and Michel X Goemans. Shrunk subspaces via operator sinkhorn iteration. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1655–1668, 2023. doi:10.1137/1.9781611977554.CH62.
  • [20] William Cole Franks and Ankur Moitra. Rigorous guarantees for Tyler’s M-estimator via quantum expansion. In Proceedings of the 33rd Conference on Learning Theory (COLT), volume 125, pages 1601–1632. PMLR, 2020. URL: http://proceedings.mlr.press/v125/franks20a.html.
  • [21] Satoru Fujishige. Theory of Principal Partitions Revisited, pages 127–162. Springer Berlin Heidelberg, 2009. doi:10.1007/978-3-540-76796-1_7.
  • [22] David Gale. A theorem on flows in networks. Pacific Journal of Mathematics, 7(2):1073–1082, 1957.
  • [23] Ankit Garg, Leonid Gurvits, Rafael Oliveira, and Avi Wigderson. Algorithmic and optimization aspects of Brascamp-Lieb inequalities, via operator scaling. Geometric and Functional Analysis, 28(1):100–145, 2018. doi:10.1007/s00039-018-0434-2.
  • [24] Ankit Garg, Leonid Gurvits, Rafael Oliveira, and Avi Wigderson. Operator scaling: Theory and applications. Foundations of Computational Mathematics, 2019. doi:10.1007/s10208-019-09417-z.
  • [25] Leonid Gurvits. Classical complexity and quantum entanglement. Journal of Computer and System Sciences, 69(3):448–484, 2004. doi:10.1016/j.jcss.2004.06.003.
  • [26] Masaki Hamada and Hiroshi Hirai. Computing the nc-rank via discrete convex optimization on CAT(0) spaces. SIAM Journal on Applied Algebra and Geometry, 5(3):455–478, 2021. doi:10.1137/20M138836X.
  • [27] G Harder and M S Narasimhan. On the cohomology groups of moduli spaces of vector bundles on curves. Mathematische Annalen, 212(3):215–248, 1975.
  • [28] Lutz Hille and José Antonio de la Peña. Stable representations of quivers. Journal of Pure and Applied Algebra, 172(2):205–224, 2002. doi:10.1016/S0022-4049(01)00167-0.
  • [29] Hiroshi Hirai and Keiya Sakabe. Gradient descent for unbounded convex functions on hadamard manifolds and its applications to scaling problems. In Proceedings of the IEEE 65th Annual Symposium on Foundations of Computer Science (FOCS), pages 2387–2402, 2024. doi:10.1109/FOCS61266.2024.00139.
  • [30] Alana Huszar. Non-commutative rank and semi-stability of quiver representations. arXiv, 2021. doi:10.48550/arXiv.2111.00039.
  • [31] Gábor Ivanyos, Youming Qiao, and K V Subrahmanyam. Constructive non-commutative rank computation is in deterministic polynomial time. Computational Complexity, 27(4):561–593, 2018. doi:10.1007/s00037-018-0165-7.
  • [32] Alastair D King. Moduli of representations of finite dimensional algebras. The Quarterly Journal of Mathematics, 45(4):515–530, 1994.
  • [33] Bernhard Korte and Jens Vygen. Combinatorial Optimization: Theory and Algorithms. Springer-Verlag, Berlin, 6 edition, 2018. doi:10.1007/978-3-662-56039-6.
  • [34] László Lovász. Singular spaces of matrices and their application in combinatorics. Bulletin of the Brazilian Mathematical Society, 20:87–99, 1989.
  • [35] Ketan Mulmuley. Geometric complexity theory V: Efficient algorithms for Noether normalization. Journal of the American Mathematical Society, 30(1):225–309, 2017.
  • [36] Ran Raz and Amir Shpilka. Deterministic polynomial identity testing in non-commutative models. Computational Complexity, 14(1):1–19, 2005. doi:10.1007/s00037-005-0188-8.
  • [37] Michèle Vergne and Michael Walter. Moment cone membership for quivers in strongly polynomial time. arXiv, 2023.