Skip to main content
Nathanaël Fijalkow

Value iteration for parity games

This post presents a generic value iteration algorithm for parity games parametrised by universal trees. As special cases this extends the small progress measure of Jurdziński and the succinct progress measure of Jurdziński and Lazić. The complexity of the algorithm is bounded by the size of the universal tree.

We use the notion of universal trees introduced in this post and the fundamental theorem proved in this post.

We refer to this paper for full details.

Progress measures and universal trees

We fix two parameters nnn and ddd, and TTT a universal tree.

Definition:
Consider a parity game G\mathcal{G}G with nnn vertices and priorities in [0,d−1][0,d-1][0,d−1]. A progress measure G\mathcal{G}G is a function μ:V→T∪{⊥}\mu : V \to T \cup \{\bot\}μ:V→T∪{⊥} such that
* if v∈VEv \in V_Ev∈VE​, then there exists (v,p,v’)∈E(v,p,v’) \in E(v,p,v’)∈E such that μ(v)⊲pμ(v’)\mu(v) \vartriangleleft_p \mu(v’)μ(v)⊲p​μ(v’)
* if v∈VAv \in V_Av∈VA​, then for all (v,p,v’)∈E(v,p,v’) \in E(v,p,v’)∈E we have μ(v)⊲pμ(v’)\mu(v) \vartriangleleft_p \mu(v’)μ(v)⊲p​μ(v’)

We can now state the main theorem underlying the value iteration algorithm.

Theorem:
For all parity games with nnn vertices and priorities in [0,d−1][0,d-1][0,d−1], there exists a progress measure such that μ(v)≠⊥\mu(v) \neq \botμ(v)​=⊥ if and only if Eve has a winning strategy from vvv.

Proof:
This is an easy corollary of the fundamental theorem combined with positional determinacy.

First, observe that for any progress measure μ\muμ, we can define a positional strategy σ\sigmaσ by defining σ(v)=(v,p,v’)\sigma(v) = (v,p,v’)σ(v)=(v,p,v’) such that μ(v)⊲pμ(v’)\mu(v) \vartriangleleft_p \mu(v’)μ(v)⊲p​μ(v’). Now reducing to the vertices such that μ(v)≠⊥\mu(v) \neq \botμ(v)​=⊥ and the moves prescribed by σ\sigmaσ we obtain a graph together with a progress measure for this graph. It follows that σ\sigmaσ is a winning strategy from all vertices such that μ(v)≠⊥\mu(v) \neq \botμ(v)​=⊥. Note that this observation holds for any progress measure μ\muμ.

Conversely, assume that Eve has a winning strategy from vvv, and let σ\sigmaσ be a positional winning strategy. Consider the graph G[σ]\mathcal{G}[\sigma]G[σ], it satisfies parity, hence by the fundamental theorem there exists a progress measure. Combined with the strategy σ\sigmaσ this yields a progress measure for the game G\mathcal{G}G.

The algorithm

The value iteration algorithm constructs the largest progress measure. It manipulates functions μ:V→T∪{⊥}\mu : V \to T \cup \{\bot\}μ:V→T∪{⊥}. The initial function assigns all vertices to the maximal (i.e., rightmost) leaf in TTT. At each step, the value iteration algorithm picks a vertex vvv, and checks whether the local conditions are satisfied by μ\muμ for vvv. If they are not, the value of μ(v)\mu(v)μ(v) is decreased by the minimal possible amount to satisfy them, giving rise to liftv(μ)\textrm{lift}_v(\mu)liftv​(μ). If this is not possible, μ(v)\mu(v)μ(v) is assigned ⊥\bot⊥ (losing).

Example of lifting v5v_5v5​: it is pushed to the left in order to satisfy v5⊲3v7v_5 \vartriangleleft_3 v_7v5​⊲3​v7​ and v5⊲2v1v_5 \vartriangleleft_2 v_1v5​⊲2​v1​.

The correctness follows from the two observations: let μ⋆\mu_{\star}μ⋆​ be a labeling as in the theorem above, for the order ⊲0\vartriangleleft_0⊲0​:
* the initial labeling is pointwise larger than μ⋆\mu_{\star}μ⋆​,
* if μ\muμ is pointwise larger than μ⋆\mu_{\star}μ⋆​, then so is liftv(μ)\textrm{lift}_v(\mu)liftv​(μ).

Complexity

The algorithm given above lifts a vertex vvv at most ∣T∣|T|∣T∣ many times, which is the number of leaves of TTT. Hence the total number of lifts is O(n∣T∣)O(n |T|)O(n∣T∣), hence so is (roughly) the complexity of the algorithm, provided lifts can be performed in reasonable time (which is the case when the universal tree is reasonably described).

Theorem: (Jurdziński and Lazić, 2017)
There exists a (n,h)(n,h)(n,h)-universal tree with nlog⁡(h)n^{\log(h)}nlog(h) leaves

This yields the succinct progress measure algorithm of Jurdziński and Lazić.

Corollary:
There exists a quasipolynomial time algorithm solving parity games

Parity games

Upper and lower bounds for universal trees

Read post
Parity games

The interplay between parity games and universal trees

Read post
Learning theory

Weighted automata and matrix factorisations

Read post
See all posts
See all posts in category

Nathanaël Fijalkow

351, cours de la Libération F-33405 Talence cedex, France

nathanael.fijalkow@gmail.com
How to find my office
made by superskrypt
  • Home
  • Curriculum Vitae
  • Publications
  • Research blog

Nathanaël Fijalkow

nathanael.fijalkow@gmail.com
made by superskrypt