Asynchronous Iterative Solution for State-Based Performance Metrics
Douglas V. de Jager Jeremy T. Bradley
Department of Computing Imperial College London 180 Queen’s Gate, London SW7 2AZ United Kingdom
{dvd03,jb}@doc.ic.ac.uk
ABSTRACT
Solution of large sparse fixed-point problems, Mx = x and Mx + b = x, may be seen as underpinning many important performance analysis calculations. These calculations include steady-state, passage-time and transient-time calculations in discrete-time Markov chains, continuous-time Markov chains and semi-Markov chains. In recent years, much work has been done to extend the application of asynchronous iterative fixed-point solution methods to many different contexts. This work has been motivated by the potential for faster solution, more efficient use of the communication channel and/or access to memory, and simplification of task management and programming. In this paper, we present theoretical developments which allow us to extend the application of asynchronous iterative solution methods to solve for the key performance metrics mentioned above— such that we may employ the full breadth of Chazan and Miranker’s classes of asynchronous iterations.
This type of solution allows each computational node to perform calculation iterations without having to synchronise frequently with other nodes and, thereby, to create communication bottlenecks.
2.
ASYNCHRONOUS ITERATIONS FOR LINEAR FIXED-POINT PROBLEMS
The theory of asynchronous iterations to solve linear fixedpoint problems was originally presented by Chazan and Miranker. [4] It may be outlined as follows. Consider the linear fixed-point problem: x = Ax + b, where A ∈ Cn×n , x, b ∈ Cn . (1)
Categories and Subject Descriptors
G.3 [Probability and Statistics]: Markov processes; G.1.3 [Numerical Linear Algebra]: [Eigenvalues and eigenvectors, sparse, structured and very large systems]
To solve for the unique fixed-point, x, we generate a convergent sequence of iterates from the corresponding class of asynchronous iterations. The class of asynchronous iterations is given by the class of sequences, , of column vectors in Cn recursively defined: (P (t−d(t,i,j)) n + bi if i ∈ U (t) (t+1) j=1 Aij xj xi := (2) (t) xi if i ∈ U (t) / where U , d are functions satisfying a set of conditions. Function U : N → P{1, . . . , n} gives the set of vector entries to be updated at each step (where P{1, . . . , n} denotes the power set of {1,. . . ,n}); function d : N×{1, . . . , n}×{1, . . . , n} → N gives the relative “age” of the entries used in the update of each vector entry at each step. These two functions satisfy the following two conditions: (CM1) Each vector entry, i (1 ≤ i ≤ n), features in an infinite number of update sets. (CM2) For each pair of vector entries, i, j (1 ≤ i, j ≤ n), we have that both (t − d(t, i, j)) → ∞ as t → ∞ and ∀t(d(t, i, j) ≤ t). Generalising minimally the fundamental result from [4], we get the following convergence guarantee for generated sequences. It allows us to know pre-generation whether sequences are convergent. Theorem 1. Consider the class of asynchronous iterations corresponding to equation (1). If ρ(|A|) < 1 , then every sequence of iterates within the class converges to the unique fixed-point, x, as t → ∞; if ρ(|A|) ≥ 1, then some non-convergent sequence of iterates exists within the class.
General Terms
Performance, algorithms, theory
Keywords
Asynchronous iterations, performance analysis, dominant eigenvectors, Perron–Frobenius, matrix-vector splitting
1.
INTRODUCTION
Extracting performance measures from performance models is a computationally intensive process. Typically a performance model will consist of a large Markov chain in excess of 109 states, and producing even a simple steady-state vector for a utilisation metric will require considerable amounts of computing resources. In this paper, we present theoretical developments which show that steady-state, passage-time and transient-time performance measures—when reformulated as linear fixed-point problems—are amenable to asynchronous iterative solution.
Copyright is held by the author/owner(s). SIGMETRICS’07, June 12–16, 2007, San Diego, California, USA. ACM 978-1-59593-639-4/07/0006.
In this theorem, |A| denotes the modulus matrix corresponding to A; ρ(|A|) denotes the spectral radius of |A|.
4.
ASYNCHRONOUS ITERATIONS FOR TRANSIENT AND PASSAGE TIMES
3.
ASYNCHRONOUS ITERATIONS FOR STEADY-STATE VECTORS
It can be shown that the calculation of passage-time and transient distributions in CTMCs and SMPs ([3] and [2]) is a fixed-point problem of the sort: x=U x+b (6)
It is a standard result that steady-state analysis of both continuous-time Markov chains (CTMCs) and semi-Markov chains (SMPs) can be expressed in terms of the steady-state analysis of their respective embedded discrete-time Markov chains (DTMCs). This has the familiar eigenvector form, x = Mx, where M ∈ Rn×n is a column-stochastic square matrix. This DTMC problem is of course a special case of the dominant eigenvector problem: ρ(M)x = Mx
n×n
(3)
where M ∈ C . Without loss of generality we may assume that ρ(M) = 1. If we are to employ the corresponding asynchronous iterations to solve for this vector, x, then we require a convergence guarantee for the iterations. Unfortunately, Theorem 1 is of little immediate help in the case of the dominant eigenvector problem, since from [5] we have: ρ(|M|) ≥ ρ(M) = 1 (4)
where the entries of U are derived from the Laplace transform of the sojourn time distribution transition matrix of the stochastic process. This is soluble by the method of asynchronous iterations as given by Theorem 1 if ρ(|U |) < 1. We need to prove that this inequality holds and we do so as follows. From [1], we know that ρ(|U |) < 1 if we can construct a matrix, M, from U by replacing some of the zeros in U with nonzeros, where the newly constructed M satisfies all three of the following: (PT 1) M ∞ ≤ 1; P (PT 2) ∃k j |Mkj | < 1; (PT 3) M is irreducible. If this matrix can be constructed then we can deduce—using the Perron–Frobenius theorem and spectral-radius continuity with respect to matrix entries—that: ρ(|U |) ≤ ρ(|M|) < 1 (7)
However, Theorem 1 may be employed indirectly. This is made possible by proof of the following theorem: Theorem 2. Let irreducible M ∈ Cn×n have ρ(|M|) = 1. Let V(K) ∈ Cn×n (K ⊆ {1, 2, . . . , n}) be defined such that (K) Vij := vi if j ∈ K and 0 if j ∈ K, where v ∈ Cn . Let / v and K be such that either 0 ≤ Re(Vij Re(Mij ) ≤
(K) Re(Vij ) (K)
) ≤ Re(Mij ) or
(K)
≤ 0, for every i, j pair; and anal= 0 for
ogously for the imaginary components. Let Vij some i, j pair. Then, the following holds true: (C1) ρ(|M − V(K) |) < 1;
and we are done. We can show that the only cases where we cannot construct such a matrix M are: if all the transitions in the SMP are immediate—in which case all passage times are 0 without calculation and the transient calculation is degenerate; or there is an absorbing strongly-connected subcomponent of the state-space, which comprises states all of which have immediate out-transitions. In the latter case, the SMP can be aggregated to remove the set of states in a way that has no effect on the passage-time or transient calculation. In summary, in all non-trivial cases, U has ρ(|U |) < 1, or a suitable U is easily found by state aggregation.
(C2) If 1 is an eigenvalue of M, then it is a simple eigenvalue of M and there is a corresponding right eigenvector, x, of M, which comprises no zero entries, and is the unique fixed-point of x = (M − V(K) )x + v. (5)
5.
REFERENCES
Beyond opening the door to unrestricted asynchronous iterative solution, Theorem 2 is significant for at least two reasons. It provides a complex extension of sorts to the Perron–Frobenius theorem: it informs us that if M comprises negative entries or entries with imaginary parts, and if there is some suitable corresponding dominant eigenvector, x, then this eigenvector, x, is necessarily unique (up to scalar multiples). Secondly, it offers the potential for improved sparsity patterns and/or reduced spectral radius for the principal matrix—and, thereby, the potential for accelerated convergence using traditional iterative techniques. To this end, an example application involves removing dangling node and teleportation entries from the principal PageRank matrix [6].
[1] Axelsson, O. Iterative solution methods. Cambridge University Press, New York, NY, USA, 1994. [2] Bradley, J. t., Dingle, N. J., Harrison, P. G., and Knottenbelt, W. J. Distributed computation of transient state distributions and passage time quantiles in large semi-Markov models. Future Generation Computer Systems 22, 7 (August 2006), 828–837. [3] Bradley, J. T., Dingle, N. J., Knottenbelt, W. J., and Wilson, H. J. Hypergraph-based parallel computation of passage time densities in large semi-Markov models. Journal of Linear Algebra and Applications 386 (July 2004), 311–334. [4] Chazan, D., and Miranker, W. L. Chaotic relaxation. Linear Algebra and Its Applications 2 (1969), 199–222. [5] Horn, R. A., and Johnson, C. R. Matrix analysis. Cambridge University Press, New York, 1986. [6] Sepandar D. Kamvar, Taher H. Haveliwala, C. D. M., and Golub, G. H. Exprapolation methods for accelerating PageRank computations. In Proceedings of the 12th Int. World Wide Web Conference (2003).