Applications to Sparse Random Graphs

Algorithm B.3 could easily be adapted to use adjacency-lists-based data structures [154], resulting in a faster execution and lower storage requirements for sparse graphs. We have implemented [274] a sparse-optimised version of Algorithm B.3 because the graph representations of the Markov chains of interest in the present work are sparse [201].

The algorithm for detaching a single intermediate node from an arbitrary graph stored in a sparse-optimised format is given in Algorithm B.4. Having chosen the node to be removed, $\gamma$ , all the neighbours $\beta \in Adj[\gamma]$ are analysed in turn, as follows. Lines 3-9 of Algorithm B.4 find node $\gamma$ in the adjacency list of node $\beta$ . If $\beta$ is not a sink, lines 11-34 are executed to modify the adjacency list of node $\beta$ : lines 13-14 delete node $\gamma$ from the adjacency list of $\beta$ , while lines 15-30 make all the neighbours $\alpha \in Adj[\gamma]\ominus{\beta}$ of node $\gamma$ the neighbours of $\beta$ . The symbol $\ominus$ denotes the union minus the intersection of two sets, otherwise known as the symmetric difference.If the edge $\beta \rightarrow \alpha$ already existed only the branching probability is changed (line 21). Otherwise, a new edge is created and the adjacency and branching probability lists are modified accordingly (line 26 and line 27, respectively). Finally, the branching probabilities of node $\beta$ are renormalised (lines 31-33) and the waiting time for node $\beta$ is increased (line 34).

Algorithm B.4 is invoked iteratively for every node that is neither a source nor a sink to yield a graph that is composed of source nodes and sink nodes only. Then the procedure described in Section 4.4 for disconnection of source nodes (lines 17-34 of Algorithm B.3) is applied to obtain the mean escape times for every source node. The sparse-optimised version of the second part of Algorithm B.3 is straightforward and is therefore omitted here for brevity.

The running time of Algorithm B.4 is $\mathcal{O}(d_c \sum_{i \in Adj[c]} d_i)$ , where is the degree of node . For the case when all the nodes in a graph have approximately the same degree, , the complexity is $\mathcal{O}(d^3)$ . Therefore, if there are intermediate nodes to be detached and is of the same order of magnitude as , the cost of detaching nodes is $\mathcal{O}(N^4)$ . The asymptotic bound is worse than that of Algorithm B.3 because of the searches through adjacency lists (lines 3-9 and lines 19-24). If is sufficiently small the algorithm based on adjacency lists is faster.

After each invocation of Algorithm B.4 the number of nodes is always decreased by one. The number of edges, however, can increase or decrease depending on the in- and out-degree of the node to be removed and the connectivity of its neighbours. If node $\gamma$ is not directly connected to any of the sinks, and the neighbours of node $\gamma$ are not connected to each other directly, the total number of edges is increased by $d_\gamma(3-d_\gamma)$ . Therefore, the number of edges decreases (by ) only when $d_\gamma \in \{1,2\}$ , and the number of edges does not change if the degree is . For $d_\gamma>3$ the number of edges increases by an amount that grows quadratically with $d_\gamma$ . The actual increase depends on how many connections already existed between the neighbours of $\gamma$ .

The order in which the intermediate nodes are detached does not change the final result and is unimportant if the graph is complete. For sparse graphs, however, the order can affect the running time significantly. If the degree distribution for successive graphs is sharp with the same average, , then the order in which the nodes are removed does not affect the complexity, which is $\mathcal{O}(d^3 N)$ . If the distributions are broad it is helpful to remove the nodes with smaller degrees first. A Fibonacci heap min-priority queue [276] was successfully used to achieve this result. The overhead for maintaining a heap is $d_\gamma$ increase-key operations (of $\mathcal{O}(\log (N))$ each) per execution of Algorithm B.4. Fortran and Python implementations of Algorithm B.4 algorithm are available online [274].

Random graphs provide an ideal testbed for the GT algorithm by providing control over the graph density. A random graph, , is obtained by starting with a set of nodes and adding edges between them at random [33]. In this work we used a random graph model where each edge is chosen independently with probability $\left<d\right>/(N-1)$ , where $\left <d\right >$ is the target value for the average degree.

**Figure:** Evolution of the distribution of degrees for random graphs of different expected degree, **$\left <d\right >=5,10,15$** , as labelled. This is a colour-coded projection of the probability mass function [277,278], , of the distribution of degrees as a function of the number of the detached intermediate nodes, . The straight line shows for complete graph **$K_{1000}$** . All four graphs contain a single source, 999 intermediate nodes and a single sink. The transformation was done using sparse-optimised version of Algorithm B.3 with Fibonacci-heap-based min-priority queue. It can be seen that as the intermediate nodes are detached the density of the graph that is being transformed grows. The expected degree of the initial graph determines how soon the maximum density will be reached.
$\begin{psfrags} \psfrag{d} [bc][bc]{$d$} \psfrag{n} [bc][bc]{$n$} \psfrag{... ...ne{\includegraphics[width=0.8\textheight]{markov/devolution.eps}} \end{psfrags}$

To investigate the dependence of the cost of the GT method on the number of nodes, , we have tested it on a series of random graphs for different values of and fixed average degree, $\left <d\right >$ . The results for three different values of $\left <d\right >$ are shown in Figure 4.11. The motivation for choosing $\left <d\right >$ from the interval $\left[3,5\right]$ was the fact that most of our stationary point databases have average connectivities for the local minima that fall into this range.

**Figure:** CPU time needed to transform a sparse random graph **$R_{2N}$** using the GT approach described in Section 4.4 as a function of the number of intermediate nodes, . **$R_{2N}$** is composed of a single source node, sink nodes and intermediate nodes. For each value of the data for three different values of the expected degree, **$\left <d\right >=3,4,5$** , is shown, as labelled. Solid lines are analytic fits of the form , where **$c=2.3\times 10^{-11}, 7.4\times 10^{-11}, 1.5\times 10^{-10}$** for **$\left <d\right >=3,4,5$** , respectively. CPU time is in seconds.
$\begin{psfrags} \psfrag{CPU time / s} [bc][bc]{CPU time / s} \psfrag{N} [bc]... ... [bc][bc]{$800$} \centerline{\includegraphics{markov/CPUofN.ps}} \end{psfrags}$

It can be seen from Figure 4.11 that for sparse random graphs the cost scales as $\mathcal{O}(N^4)$ with a small $\left <d\right >$ -dependent prefactor. The dependence of the computational complexity on $\left <d\right >$ is illustrated in Figure 4.12.

**Figure:** CPU time needed to transform a sparse random graph **$R_{2N}$** using the GT approach as a function of the expected degree, **$\left <d\right >$** . The data is shown for three graphs with , and , as labelled. **$R_{2N}$** is composed of a single source node, sink nodes and intermediate nodes.
$\begin{psfrags} \psfrag{CPU time / s} [bc][bc]{CPU time / s} \psfrag{<d>} [b... ... [br][br]{$600$} \centerline{\includegraphics{markov/CPUofd.ps}} \end{psfrags}$

From Figure 4.10 it is apparent that at some point during the execution of the GT algorithm the graph reaches its maximum possible density. Once the graph is close to complete it is no longer efficient to employ a sparse-optimised algorithm. The most efficient approach we have found for sparse graphs is to use the sparse-optimised GT algorithm until the graph is dense enough, and then switch to Algorithm B.3. We will refer to this approach as SDGT. The change of data structures constitutes a negligible fraction of the total execution time. Figure 4.13 depicts the dependence of the CPU time as a function of the switching parameter .

**Figure:** CPU time as a function of switching ratio shown for random graphs of different expected degree, **$\left <d\right >=5,10,15$** , as labelled. All three graphs contain a single source, 999 intermediate nodes and a single sink. The transformation was performed using the sparse-optimised version of Algorithm B.3 until the the ratio **$d_{c(i)}/n(i)$** became greater than . Then a partially transformed graph was converted into adjacency matrix format and the transformation was continued with Algorithm B.3. The optimal value of lies in the interval . Note the log $_{10}$ scale on both axes.
$\begin{psfrags} \psfrag{CPU time / s} [bc][bc]{$\log_{10}$(CPU time / s)} \p... ...rline{\includegraphics[width=.47\textheight]{markov/CPUofRs.eps}} \end{psfrags}$