# Reduction of calcium release site models via optimized state aggregation

- Yan Hao
^{1}Email author

**Received: **25 January 2016

**Accepted: **17 May 2016

**Published: **4 June 2016

## Abstract

### Background

Markov chain models of calcium release sites in living cells exhibit stochastic dynamics reminiscent of the experimentally observed phenomenon of calcium puffs and sparks. Such models often take the form of stochastic automata networks in which the transition probabilities for each of a large number of intercellular channel models depend on the local calcium concentration and thus the state of nearby channels. The state-space size in such compositionally defined calcium release site models increases exponentially as the number of channels increases, which is referred to as “state-space explosion”.

### Methods

In order to overcome the state-space explosion problem, we utilized the idea of “coarse graining” and implemented an automated procedure that reduces the state space by aggregating and lumping states of the full release site model. For a given state aggregation scheme, the transition rates between reduced states are chosen consistent with the conditional probability distribution among states within each group. A genetic algorithm-based approach is then applied to select the state aggregation schemes that lead to reduced models that approximate the observable behaviors of the full model.

### Results

The genetic algorithm-based approach is implemented in Matlab®; and applied to two different release site models. The approach found reduced models that approximate the full model in the number of open channels, spark statistics, and the jump probability matrix as a function of time.

### Conclusions

A novel automated genetic algorithm-based searching technique is implemented to find reduced calcium release site models that approximate observable behaviors of the full Markov chain models that possess intractable state-spaces. As compared to the full model, the reduced models produce quantitatively similar results using significantly less computational resources.

### Keywords

State space explosion Genetic algorithms Calcium signaling Stochastic automata network Set partition Coarse graining strategies## Background

### Coarse-graining methods as a model reduction strategy

Increasingly over the years, mathematical models and computer simulation have been used in the natural sciences and the social sciences. As more and more scientific discoveries from experiments are included, models and simulation approaches have been developed to be more and more accurate. Unfortunately, the corresponding computational cost also gets higher. When a problem to be modeled is large in scale or possesses multiple scales, the complexity and dimensionality of the model may increase to unmanageable levels of storage and computational capacity. How to reduce the computational cost without losing important properties of mathematical/computational models is therefore an important scientific question. Molecular dynamics (MD) simulation for example, is a type of *N*-body simulation that allows atoms and molecules to interact for a fixed period of time [1]. While the trajectories of atoms and molecules can be quite accurately determined by numerically solving Newton’s equations of motion for a system of interacting particles, the computations are highly expensive [2]. To reduce the computational cost for MD simulations, the field of coarse-grained modeling and simulation has been rapidly expanding. Instead of explicitly representing every atom of a system, one partitions the system into a number of groups of atoms, and then uses “pseudo-atoms” to represent each group. The coarse-grained modeling significantly reduces the time and computational storage requirement of MD simulations. Simulations of soft matter systems, polymer dynamics, protein folding and many other physical or biological systems that feature the spatiotemporal coupling between scales thus become possible [3, 4].

As another example, continuous time Markov chains have become an important modeling approach over the past a few decades. Because it well describes the systems that undergoes transitions from one state to another, it was used extensively in modeling ion channels in cell membranes [5, 6]. However, when the collaborative behavior of a cluster of these ion channels are of interest, the state space of the cluster model increases exponentially as the number of channels increases. This combinatorial state-space explosion causes some modeling approaches to become intractable. In this paper, we present a novel approach to reduce large Markov chain models that shares the spirit of coarse-graining methods: the state-space of the Markov chain model is partitioned into groups, then each group is represented by a “mega-state” so that the full model is compressed into a tractable size. The partition scheme is selected by a genetic algorithm-based approach to preserve selected features of the full model.

### Markov chain models of Ca^{2+} signaling

As a second messenger, calcium ions (Ca^{2+}) play an important role in many physiological activities. Signaling occurs when the cell is stimulated to release calcium ions (Ca^{2+}) from the endoplasmic/sarcoplasmic reticulum (ER/SR), the intracellular Ca^{2+} reservoir, or when Ca^{2+} enters the cell through plasma membrane ion channels [7]. The intracellular Ca^{2+} release which causes localized Ca^{2+} elevations known as puffs and sparks arises from concerted gating of clusters of inositol 1,4,5-trisphosphate receptors (IP3Rs) or ryanodine receptors (RyRs) on the surface of ER/SR [8–10]. For example, in cardiac myocyte excitation-contraction coupling (ECC), the cell membrane depolarizes causing L-type Ca^{2+} channels to open and the Ca^{2+} influx further activates RyRs located on the SR, known as Ca^{2+}-induced Ca^{2+} release (CICR) [11].

The spatial organization of IP3Rs and RyRs has been shown to be the basis of intracellular Ca^{2+} signaling activities that are observed via confocal microfluorimetry in cardiomyocytes, oocytes, and other cell types [10, 12, 13]. The spacing of receptor clusters was identified as a decisive parameter for the occurrence of collective behaviors [14, 15].

From the literature, the behavior of single IP3R/RyR channel gating is often modeled by continuous-time discrete-state Markov chains (CTMCs) [16, 17]. When Markov chain models of these channels are coupled via a Ca^{2+} microdomain in which the transition rates between the states of each channel become dependent on the states of other channels, the simulated Ca^{2+} channel clusters (release sites) may exhibit stochastic excitability that is reminiscent of Ca^{2+} puff/sparks [6, 14, 18]. However, the number of states possessed by these compositionally defined Ca^{2+} release site models increases exponentially as the number of channels increases. This combinatorial state-space explosion causes some modeling approaches to become intractable.

While the dynamics of any individual Ca^{2+} release site can theoretically be obtained by Monte Carlo simulation regardless of model complexity, in practice these simulations are prohibitively computationally intensive due to the large state spaces. Moreover, because cells usually possess a large number of release sites, compositionally defined Ca^{2+} release site models have often been excluded from multiscale whole cell simulations. On the other hand, many recently developed approaches that accelerate whole cell simulations, such as probability density and moment closure approaches [19, 20], require release site models to be as compact as possible while retaining the physiological realism of collective channel gating.

For these reasons, we developed several automated approaches based on fast/slow analysis [21]. To reduce Markov chain Ca^{2+} release site models where the rate constants in release site models are categorized as either fast or slow, groups of states that are connected by fast transitions are lumped so that the full model is compressed into a tractable size while the physiological gating and interaction properties of the channels are preserved. However, when the time scale separation between transition rates that is necessary for fast/slow analysis is absent, the manner in which the full model states should be partitioned and aggregated for optimal reduction is difficult to determine a priori. Naively enumerating all partitions for a Markov chain Ca^{2+} release site model and choosing the one with the smallest error is not possible because the number of valid partitions is too large. Actually, finding out how many ways one can divide a graph that possesses *n* vertices into *k* smaller components is know to be an NP-hard problem [22]. For example, a release site model composed of merely five three-state Ca^{2+} channels (15 states) can be partitioned in approximately 10^{10} distinct ways. In this paper we discuss the implementation of a genetic algorithm that is able to automatically and rapidly select partition schemes that reduce the corresponding Markov chain model to a tractable size while keeping the reduction error in control.

### Genetic algorithms

Developed in the 1970s by John Holland [23], genetic algorithms are widely used as computational schemes to find exact or approximate solutions for optimization and search problems. Genetic algorithms have been applied to various aspects of biological research, such as the profiling of gene expression in bacteria [24, 25] and phylogenetic analysis of proteins [26]. More recently, it has been used to identify parameters for cell-specific electrophysiology models [27]. Nevertheless, the application of genetic algorithms in the context of the automated reduction of Ca^{2+} release site models is novel. In our implementation, each individual in a population corresponds to a potential scheme for state aggregation. The program “evolves” the population by selecting the partitions that lead to reduced models that approximate the full model behavior.

Unlike the fast/slow analysis that assumes fixed ER/SR [ Ca^{2+}] and instantaneous coupling between the channels [21], we motivate a whole cell homeostasis formulation that takes both local and global Ca^{2+} signaling into consideration. Consequently the reduced models selected by the genetic algorithm must generate small errors for a wide range of ER/SR [ Ca^{2+}]. In the implementation of the genetic algorithm, a population of set partitions is randomly generated, where each partition corresponds to a potential scheme for state aggregation. The states of the full model is aggregated and reduced following each individual set partition scheme. Then the difference between the full model and the corresponding reduced model on the behaviors of interest, spark statistics for example, are measured and called the “error” of the reduced model. The error is used as the key factor for deciding the fitness of each individual partition scheme. Since partition schemes that produce low errors are preferred, the fitness is chosen to be a decreasing function of error. The program selects survivors from the current population in a manner that favors better fitness (i.e. low error), then each survivor will generate one child (a new set partition scheme) by mutation. The set partition population thus evolves toward the partitions that have better fitness and lead to reduced models that better approximate the full model.

The remainder of this paper is organized as follows. In the ‘Model’ section we motivate the network reduction process by partitioning a minimal whole cell model of Ca^{2+} homeostasis where both localized (subcellular) and global (cellular) aspects of Ca^{2+} signaling are modeled. In the ‘Methods’ section we introduce genetic algorithms and detail their implementation in the context of reducing a minimal whole cell model of Ca^{2+} homeostasis that features bidirectional local and global Ca^{2+} signaling. In the ‘Results’ section, we demonstrate that the reduced model approximates the full model with regard to several important steady-state responses observed in the minimal whole cell environment. To show that the reduction technique is applicable to more realistic Ca^{2+} release site models, we also present Ca^{2+} release site reduction results using a single channel model that includes both cytosolic and luminal Ca^{2+} regulation.

## Model

### A minimal whole cell model

^{2+}release site model reduction approach using a whole cell model of a quiescent cytosolic environment that takes Ca

^{2+}homeostasis into account (Fig. 1). Similar to previous work by Hartman and colleagues [28], this minimal whole cell model considers both local and global Ca

^{2+}responses to the stochastic gating of Ca

^{2+}channels. Further, release and reuptake fluxes are balanced in this model. Figure 1 shows the components and fluxes of the model. A large number of Ca

^{2+}release sites are coupled to the bulk cytosolic and ER/SR [ Ca

^{2+}]. Each Ca

^{2+}release site is composed of 10–30 Ca

^{2+}channels. In this formulation, release sites may experience different “domain” [ Ca

^{2+}], but all channels in a given release site experience the same local cytosolic and luminal [ Ca

^{2+}]. Consistent with prior work by Hinch and colleagues [29–31], when the number of open channels in a Ca

^{2+}release site changes, the local [ Ca

^{2+}] is assumed to rapidly reach a new equilibrium in the spatially restricted domain. The change in the balance of the leak and reuptake by the endo(sarco)plasmic reticulum Ca

^{2+}-ATPase (SERCA) pumps caused by this change in the domain [ Ca

^{2+}] will influence the bulk [ Ca

^{2+}] and further affect the puff/spark dynamics.

#### Steady-state of domain concentration

^{2+}] for the release sites, \(c_{cyt}^{d}\) and \(c_{er/sr}^{d}\), are coupled to each other via the release flux

*J*

_{ rel }when one or more channels open. As mentioned above, the domain [ Ca

^{2+}] associated with each release site is distinct, and all domains are coupled to the bulk cytosolic and luminal compartments via the fluxes

*J*

_{ cyt }and

*J*

_{ e r/s r }. Under these assumptions, the domain fluxes are given by:

where *ν*
_{
rel
} is the maximum release rate through a release site, *c*
_{
cyt
} and *c*
_{
e
r/s
r
} are the bulk cytosolic and ER/SR concentrations, and *γ*
_{
n
}=*n*/*N* is the fraction of open channels at an *N*-channel per release site. The rate constants *ν*
_{
cyt
} and *ν*
_{
e
r/s
r
} determine the time required for the decay and refilling of the cytosolic and luminal microdomains, respectively [32, 33].

^{2+}is fast compared to the stochastic gating of Ca

^{2+}channels, the domain fluxes associated with each release site must balance for any specific release site:

^{2+}] of any release site with

*n*channels open can be obtained by directly solving Eq. 4 as a function of the bulk cytosolic and luminal [ Ca

^{2+}] (

*c*

_{ cyt }and

*c*

_{ e r/s r }), that is,

Notice that for a release site with *N* channels, the number of open channels takes integer values from 0 to *N*. Consequently, there are *N*+1 pairs of cytosolic and luminal domain [ Ca^{2+}] values for any given value of the bulk concentration (*c*
_{
cyt
} and *c*
_{
e
r/s
r
}).

#### Concentration balance equations for the bulk cytosol and ER

^{2+}] are both influenced by the Ca

^{2+}fluxes to and from their associated microdomains, \(J^{n}_{cyt}\) and \(J^{n}_{er}\). The bulk concentrations also interact via a SERCA pump flux that takes the form:

*J*

_{ pm }is

where *k*
_{
pm
} is chosen large enough so that the bulk cytosolic [ Ca^{2+}] is “clamped” to the extracellular bath (*c*
_{
ext
}=0.1*μ* M).

^{2+}fluxes are defined, the concentration balance equations for the bulk cytosolic and ER compartments are given by:

*λ*

_{ e r/s r }=

*V*

_{ e r/s r }/

*V*

_{ cyt },

*V*

_{ cyt }and

*V*

_{ e r/s r }are the effective cytosolic and ER/SR volumes, i.e. taking Ca

^{2+}buffering into account. \(J_{cyt}^{T}\) and \(J_{er/sr}^{T}\) are the sums of fluxes over all release sites. Notice that under the fast domain Ca

^{2+}assumption, there are only

*N*+1 pairs of possible domain [ Ca

^{2+}] values and consequently \(J_{cyt}^{T}\) and \(J_{er/sr}^{T}\) can be expressed as

where *f*
_{
n
} is the fraction of release sites with *n* open channels.

### The Markov chain model of single channel gating

*C*(closed),

*O*(open) and

*R*(refractory), featuring both Ca

^{2+}activation and Ca

^{2+}inactivation. The transition diagram of this model is given by

*i*∈{

*a, b*}, are transition rates with units of reciprocal time. \(k_{i}^{+}\) is an association rate constant with units of conc

^{−η }time

^{−1}where

*η*is the cooperativity of Ca

^{2+}binding, and \(c_{cyt}^{d}\) is the domain [ Ca

^{2+}] experienced by the release site on the cytosol side. Under the assumption that the formation and collapse of local Ca

^{2+}is fast compared to channel gating, when the local Ca

^{2+}concentrations are specified, the transition-state diagram Eq. 15 defines a continuous time Markov chain with the corresponding infinitesimal generator matrix Q=(

*q*

_{ ij }) given by:

*i*to state

*j*, defined by

where *i*≠*j* and the diamonds (◇) on the diagonal entries are negative values leading to row sums of zero.

^{2+}channel can be calculated from its Q-matrix (Eq. 16). Importantly, the time evolution of the probability distribution over all three states of this model can be calculated by solving the ordinary differential equation (ODE) system:

*t*)=(

*π*

_{ C },

*π*

_{ O },

*π*

_{ R }) is a row vector indicating the probability of finding the channel in each state at time

*t*, given the initial condition π(0). Notice that the limiting probability distribution π

_{ s }of Markov chains (the steady state of Eq. 18) does not depend on the initial condition π(0), and can be obtained by solving

where **e** is a commensurate column vector of ones.

### Compositionally defined Ca ^{2+} release site models

^{2+}release site models that are used to demonstrate the implementation of the reduction approach involve

*N*identical Ca

^{2+}channels interacting via changes in local [ Ca

^{2+}] under the assumption of “instantaneous mean-field coupling”. The local [ Ca

^{2+}] experienced by the Ca

^{2+}regulatory site of each channel is assumed to depend only on the number of open channels at the Ca

^{2+}release site

*N*

_{ O }. Because identical channels coupled in this manner are indistinguishable, a release site composed of

*N*

*M*-state channels includes

*N*-channel release site has

*β*(

*N*,3)=(

*N*+2)(

*N*+1)/2 states. Each state can be written in the form of an ordered three-tuple (

*N*

_{ C },

*N*

_{ O },

*N*

_{ R }), where

*N*

_{ i }=

*k*,(

*i*∈{

*C, O, R*}) indicates

*k*channels are in state

*i*, and \(\sum _{i}N_{i} = N\). With this notation, the states of any release site form a well ordered set and can be conveniently ranked anti-lexicographically. Figure 2 enumerates all the states and illustrates the topology of the three-state single channel model (Fig.) and a minimal release site composed of two three-state channels under the mean-field coupling assumption (Fig. 2 b).

## Methods

### Reduction technique

Our basic strategy of reducing a Ca^{2+} release site model to a smaller model with pre-determined size \(\hat {b}\) includes three major steps: *Step 1* Partition the full model into \(\hat {b}\) groups. *Step 2* Lump the states within each group. *Step 3* Calculate proper transition rates between groups.

In previous work [21], *S*
*t*
*e*
*p* 1 was achieved automatically based on the separation of time scales, where states that are connected by fast transitions were aggregated. In this paper, we employ a genetic algorithm to search for partition schemes that generate small reduction errors for general Ca^{2+} release site models, especially those without time scale difference. In the future, a partition that divides the states of a release site model into \(\hat {b}\) groups will be referred to as a “\(\hat {b}\) -partition”. *S*
*t*
*e*
*p* 2 and *S*
*t*
*e*
*p* 3 in this reduction technique are carried on from [21].

Theoretically, this genetic algorithm-based technique can be used to reduce any Ca^{2+} release site models of size *β*(*N,M*) to any pre-determined size \(\hat {b}\) (\(\hat {b} < \beta ({N,M})\)). Furthermore, the partition scheme can be restricted by certain rules, such as “each group must be connected on the transition diagram” or “all states in each group must present the same number of open channels”, etc. Hereafter, we will demonstrate the implementation of the genetic algorithm using a minimal example where the 6-state release site model illustrated in Fig. 2
b is partitioned into 3 groups with the constraint “states in each group must be connected”. Results from reducing a release site that is composed of 10 three-state channels will be presented in the ‘Results’ section. Other partition restrictions can be implemented by restricting the reproduction procedure or modifying the objective function.

### Genetic algorithms

The program starts with initialization, where a number of “*individuals*” (solution candidates) are randomly generated to form an initial “*population*”. The size *N*
_{
p
} of the *population* is usually kept constant throughout the entire searching procedure. After initialization, this *population* goes through the evaluation procedure, where each *individual* is evaluated by the objective function and its *fitness* is assigned according to the objective function value. The program then checks whether the termination criteria are satisfied; usually either the desired objective function value is attained or a predetermined number of generations is reached. If none of the termination criteria is satisfied, the program will move on to the selection process, which is usually stochastic and designed so that the *individuals* with better *fitness* have a higher probability to be selected as compared to those who are less fit. Only a fraction of the current population (*N*
_{
s
}
*individual*s, where *N*
_{
s
}<*N*
_{
p
}) can survive and enter the reproduction process as the “*parent*” solutions. To generate each “*child*” (a new solution candidate), one or more *parents* are selected, recombined (crossover) and/or varied (mutate). The reproduction process continues till *N*
_{
p
}
*individual*s are generated and thus a new generation of *population* is formed. The new generation will then go through an evaluation process to have its *fitness* evaluated and the entire program continues until one or more of the termination criteria are satisfied.

#### Initialization

^{2+}release site models so that the resulting reduced models better approximate the full models. In this context, each

*individual*(\(\mathcal {I}_{i}\)) is a set partition scheme which divides the

*β*(

*N,M*) states of the full model into \(\hat {b}\) groups. To make physical sense, the requirement of the partition process is that each group must be internally connected, that is, there is a path from any state to any other state. The dashed line boxes and grey boxes in Fig. 2 give two samples valid three-partitions that divide the 6-state release site model into 3 groups:

In the initialization process, *N*
_{
p
} distinct three-partitions are randomly generated.

#### Evaluation

In the evaluation process each of the *N*
_{
p
}
*individuals* (partition schemes) must be applied to the full model and have its corresponding reduced model compared to the full model. The *fitness* of each *individual* is then assigned in a manner that favors those that produce less error.

^{2+}release site model is a \(\hat {b}\times \hat {b}\) matrix

*i*≠

*j*, and \(\hat {q}_{ii} = \sum _{j\neq i}-\hat {q}_{ij}\). \(\bar {\boldsymbol {\pi }}_{i}\) is the conditional probability distribution of the states within group

*i*:

and e
_{
i
} are the commensurate column vectors of ones.

**P**=

*e*

^{ t Q }). Assuming the full model has

*b*states, we write

_{ i }are column vectors of ones with lengths commensurate with Q

_{ ii }, and U is a \(\hat {b} \times b\) distributor matrix given by

*i*, is calculated from the stationary distribution of the full model. The transition probabilities of the reduced model and the full model agree with each other exactly in the limit. As shown in Fig. 5 a, the maximum difference of the transition probabilities falls below 10

^{−9}within 1 second.

\(\hat {\boldsymbol {E}}(t)\) is a *b*×*b* matrix and is cumbersome to use for evaluation. Consequently, we define \(E_{\max }(t) = \underset {ij}\max \left | \hat {\boldsymbol {E}}_{ij} (t)\right |\), the element of \(\hat {\boldsymbol {E}}(t)\) with largest absolute value at time *t*. Note that *E*
_{max} is a function of both time and *c*
_{
e
r/s
r
} because the transition rates of the full Ca^{2+} release site model are functions of the luminal [ Ca^{2+}]. Figure 5
a shows *E*
_{max}(*t*;*c*
_{
e
r/s
r
}) for the 6-state Ca^{2+} release site model (Fig. 2
b) reduced to a three-state model following the partition scheme given by \(\mathcal {I}_{1}\). As validated in [21], the reduced model better approximates the full model as *E*
_{max} gets smaller. The maximum transition error decreased significantly as the luminal [ Ca^{2+}] is dropped from 1100 *μ*M (dashed line) to 100 *μ*M (dot-dash-dot line), indicating reductions following partition scheme \(\mathcal {I}_{1}\) make better approximations of the full model at relatively low levels of luminal [ Ca^{2+}].

^{2+}] values at all times. Consequently, we define

the area under each curve in Fig. 5
a. Then, for any partition scheme \(\mathcal {I}\), the integrated error \(\mathcal {E}\) can be calculated as a function of *c*
_{
e
r/s
r
}, and the maximum \(\mathcal {E}(c_{er/sr})\) is selected as the global reduction error of scheme \(\mathcal {I}\). In Fig. 5
b the dashed and solid lines show the integrated error \(\mathcal {E}\) associated with \(\mathcal {I}_{1}\) and \(\mathcal {I}_{2}\), respectively, as a function of *c*
_{
e
r/s
r
} (100−2000 *μ*M).

*fitness*of a given partition scheme \(\mathcal {F}\) is defined by

As shown in Fig. 5
b, when the full model is partitioned and lumped following \(\mathcal {I}_{1}\) (dashed line), the maximum possible error (star) generated by the reduced model \(\mathcal {E}_{1}\) is approximately 210 times larger than the maximum error (dot) generated by using \(\mathcal {I}_{2}\). The *fitness* of \(\mathcal {I}_{1}\) is consequently 210 times less than the fitness of \(\mathcal {I}_{2}\).

#### Selection and reproduction

This section introduces how the genetic algorithm implementation forms the “*next generation*” from the current *population*. The conventional reproduction process in genetic algorithms usually consists of selection, crossover and mutation.

*parents*of the next generation of set partitions. The probability mass function (PMF), which indicates the probability \(\mathcal {P}_{i}\) for each “

*individual*” \(\mathcal {I}_{i}\) to be selected, is

that is, the probabilities of selection are proportional to the *fitness*, \(\mathcal {F}_{i}\). To generate each *child*, we start by randomly selecting a pair of *parents*, one at a time, from the current *population* following the corresponding PMF. For example, the probability of \(\mathcal {I}_{1}\) being selected is 210 times smaller than the probability that \(\mathcal {I}_{2}\) is selected.

*parents*are selected, a

*child*is generated through the crossover process, which takes the permutation of states from one

*parent*and the group sizes from the other

*parent*. For example, if the permutation of states is taken from \(\mathcal {I}_{2}\ (\{1\}, \{2\}, \{3, 4, 5, 6\})\) and the group sizes are taken from \(\mathcal {I}_{1}\ (\{1, 2\}, \{3, 5, 6\}, \{4\})\) then the

*child*would be

*child*is produced, with probability

*p*, a mutation process begins by randomly selecting and joining two groups of states in its partition scheme, and then randomly splitting this aggregated group of states into two new internally connected groups. For example, a possible mutation process of the

*child*could be joining the second and third group:

*child*may undergo multiple mutations of this kind. A geometric distribution is assigned to the number of mutations

*n*

_{ m }for each

*child*:

where *k*=1,2,3,… and *p*=0.8.

Note that the reproduction process generates one *child* at a time and continues until the number of *children* reaches *N*
_{
p
}. These *children* then forms a new generation of *individuals* and are sent to the evaluation process, starting a new iteration in the genetic algorithm. The genetic algorithm is executed until either a set partition has a *fitness*
\(\mathcal {F} \geq 1000\) (reduction error \(\mathcal {E}\) less than 0.1 *%* for all luminal [ Ca^{2+}] values) is found, or the maximum number of allowed iterations (2000) is reached.

## Results

In this section we first validate the genetic algorithm implemented in the previous section by showing that the algorithm converges and produces set partition schemes that generate small reduction errors. To demonstrate that the genetic algorithm can be applied to general Ca^{2+} channels, we use this approach to reduce a release site that is composed of several four-state channels (Fig. 7) under the constraint that each group of states must be connected in the state transition diagram. This four-state model features activation by cytosolic Ca^{2+} and luminal [ Ca^{2+}] regulation of the activation affinity. The reduced Ca^{2+} release site model is integrated into the whole cell model, and simulation results from the reduced and the full model are compared.

### Reducing Ca^{2+} release site models that are composed of three-state channels

^{2+}release site model that is composed of 10 three-state channels (66 states) to an 11-state model. The population size

*N*

_{ p }=10 and the reduction error \(\mathcal {E}\) was measured for 50 log-spaced

*c*

_{ e r/s r }values ranging from 100

*μ*M to 2000

*μ*M. Each column of stars represents the 10

*individuals*of a

*generation*. The black stars indicate the

*individual*(partition) that produces the smallest error in its

*generation*. The criterion that ends the program is defined as \(\mathcal {E} < 10^{-3}\) or 2000 generations are generated, whichever is satisfied first. In this specific reduction experiment, the program did generate 2000 generations and the minimum \(\mathcal {E}\) was 0.0043.

### Reducing Ca^{2+} release site models that are composed of four-state channels with luminal regulation

^{2+}release models can replace the full model in the whole cell Ca

^{2+}homeostasis model with good accuracy. To validate that the reduction procedure fits a wide variety of models, we introduce a four-state model (Fig. 7) which is activated by cytosolic Ca

^{2+}and the activation affinity is regulated by the luminal [ Ca

^{2+}].

The four-state Ca^{2+} channel model is assumed to have a regular or “unsensitized” mode (states *C*
_{
u
},*O*
_{
u
}) in which the activation dissociation constant (\(K_{a} = \sqrt {k_{a}^{-}/k_{a}^{+}}\)) is higher than the activation dissociation constant (\(K_{d} = \sqrt {k_{d}^{-}/k_{d}^{+}}\)) of the “sensitized” mode (*C*
_{
s
},*O*
_{
s
}). We also assume that the channel is more likely to be in the “sensitized” mode when the ER/SR [ Ca^{2+}] is high. \(k_{i}^{+}\left (c_{cyt}^{d}\right)^{2}\), \(k_{j}^{+}c_{er/sr}^{d}\) and \(k_{i}^{-}\), for *i,j*∈{*a*,...,*e*}, are transition rates with units of reciprocal time. \(k_{i}^{+}\) is an association rate constant with units of conc ^{−η
} time ^{−1}, where *η* is the cooperativity of Ca^{2+} binding while \(c_{cyt}^{d}\) and \(c_{er/sr}^{d}\) are the domain [ Ca^{2+}] experienced by the release site on the cytosol and ER/SR side respectively. Notice that we assume that the Ca^{2+} binding cooperativity (*η*=1) of the channel sensitization (luminal regulation) process is different from the binding cooperativity of the activation process (*η*=2).

An important motivation in using this four-state model is that luminal regulation of RyRs is observed in many experiments [37, 38] but the detailed mechanism is yet not clear. In this paper, we are interested in how the “sensitization” of the activation of each individual Ca^{2+} channel affects the cooperative gating of the Ca^{2+} release site. Consequently we experiment on different sensitized activation rates as well as the dissociation constant \(K_{c} = k_{c}^{-}/k_{c}^{+}\) (Fig. 7) of the sensitization process. On the other hand, the parameters of the regular or “unsensitized” Ca^{2+} activation were chosen to be consistent with the parameters in [28], where many puff/spark statistics of a group of 10 two-state Ca^{2+}-activated channels were studied.

^{2+}release sites is assumed to be large so that the distribution of release site states can be well approximated by π(

*t*) (solved from Eq. 18) instead of using Monte Carlo simulation. However, simply substituting

*π*

_{ n }for

*f*

_{ n }in Eqs. 13 and 14 will fail because the “fast domain” assumption is a singular limit of the ODE system. Consequently, instead of using Eqs. 11 and 12 we consider the total cytosolic (\(\hat {c}_{cyt}\)) and ER/SR [ Ca

^{2+}] (\(\hat {c}_{er/sr}\)), which are sums of the bulk and domain concentrations weighted by effective volume ratios,

^{2+}concentrations. The effective volume ratios in Eqs. 31 and 32 are given by

*V*

*cyt*

*d,T*and

*V*

*sr*

*d,T*are the effective volumes of the aggregated cytosolic and SR domains, respectively. The equations that balance \(\hat {c}_{cyt}\) and [ Ca

^{2+}] \(\hat {c}_{er/sr}\) are given by:

where *γ*
_{
n
}=*n*/*N*, \(c_{cyt}^{d,\,n}\) and \(c_{er/sr}^{d,\,n}\) are given by Eqs. 5, 6 and 7, and *π*
_{
n
} is the probability that a randomly sampled release site has *n* open channels, which can be found from π=(*π*
_{0},*π*
_{1},⋯,*π*
_{
N
}) by integrating Eq. 18.

^{2+}release site composed of 10 four-state RyRs (286 states, lines) and the corresponding reduced 34 state model (circles and crosses) using different values of the disassociation rate of sensitization

*K*

_{ C }. The filled circles and triangles show the results of a release site composed of 10 two-state RyRs. When the disassociation rate of sensitization

*K*

_{ C }is high enough, the sensitized states are rarely visited and consequently the four-state model results should approach the two-state model results. As shown in Fig. 8 the four-state model well approximates the two-state model when

*K*

_{ C }is approximately 1000

*μ*M.

*K*

_{ C }will decrease the bulk SR [ Ca

^{2+}] and the results calculated from the reduced Ca

^{2+}release site model is a close approximation to the full model. Figure 8 b shows the open probability of a single four-state channel (dashed line) as a function of

*K*

_{ C }. The solid line in Fig. 8 b shows the fraction of open channels,

*f*

_{ O }of the 10-channel release site, where

*K*

_{ C }decreases, both parameters increase, which indicates adding the sensitized states increases the open probability of the channels. The increased open probability further causes a lower steady state SR [ Ca

^{2+}], which is consistent with Fig. 8 a. In prior work by Nguyen and colleagues [6], a puff/spark

*Score*was defined as

from which the presence or absence of puff/spark can be assessed. This measure ranges between 0 and 1, and values that are larger than 0.2 indicate the presence of robust Ca^{2+} puffs/sparks. Figure 8
c, shows the *Score*s of the full model and the reduced model as a function of *K*
_{
C
}. The reduced model *Score*s give a close approximation to the full model results.

Notice that the *Score* values are above 0.35 for all *K*
_{
C
} values, indicating robust Ca^{2+} puffs/sparks present in both the full and reduced model. We further studied the mean duration of spontaneous Ca^{2+} puffs/sparks occurring as a function of *K*
_{
C
} in the whole cell formulation, shown in Fig. 8
d. We assume that a transition from *N*
_{
O
}=4 to *N*
_{
O
}=5 is considered to initialize a puff/spark and a transition from *N*
_{
O
}=1 to *N*
_{
O
}=0 (all channels closed) terminates the puffs/spark. The mean puffs/spark duration was calculated using the matrix analytic method described in [39]. As *K*
_{
C
} decreases, the channels are more likely to be sensitized, the puff/spark duration increases, indicating that the luminal regulation of the channel might lead to longer puffs/sparks, which is consistant with experimental observations [38, 40]. Compared to the Ca^{2+} release site model composed of 10 two-state channels (filled circle), the average puff/spark duration of a release site composed of the same number of four-state channels can be up to four times (when *K*
_{
C
}=1) longer. While in prior work [21], similar comparisons to a Ca^{2+} release site composed of Keizer-Levine model [41] and its corresponding reduced model gave good agreement, in this new study the reduced model tends to slightly underestimate the puff/spark durations.

## Conclusions and discussion

### A brief summary

We have implemented and validated a novel genetic algorithm-based searching technique to find reduced models that produce moderate errors for Ca^{2+} release site models that are compositionally defined from single channel Markov models. Given a full model and the designated size of the reduced model, this algorithm samples and evolves a population of set partitions, each corresponding to a potential scheme for state aggregation, that leads to the partitions that lead to reduced models which approximate the full model on the behaviors of interest. A Ca^{2+} release site composed of 10 four-state channels that are activated by the cytosolic Ca^{2+} and regulated by luminal Ca^{2+} is reduced by this technique and the steady state responses of the reduced model well approximate the full model in the minimal whole cell homeostasis environment (Fig. 8).

When a Ca^{2+} release site model is reduced, the resulting models are designated to have significantly fewer states, which is inevitably accompanied by losing some transition information. Different state aggregation schemes may preserve different information. A main benefit from using genetic algorithms is that the **evaluation** function is flexible enough to pick state aggregation schemes that maximize any information that is of specific interest to the user. In this report, for example, we are interested in how luminal regulation affects the spark behavior of the Ca^{2+} release site, and the **evaluation** function is consequently designed to assign higher *fitness* to the partitions which generate small errors in a wide range of ER/SR [ Ca^{2+}]. As another example, if the spark frequency is crucial in some study, we can conveniently edit the **e**
**v**
**a**
**l**
**u**
**a**
**t**
**i**
**o**
**n** function to calculate the spark frequency of each reduced model generated from partition \(\mathcal {I}\) and assign higher *fitness* to the ones that better approximate the full model spark frequency. When focusing on a single release site, behaviors of interest that could be implemented include, but not limited to: the number of open channels, transition probabilities among specific states, puff/spark amplitude, puff/spark durations, and inter-puff/spark-intervals. When considering Ca^{2+} diffusion and homeostasis, hybrid stochastic and deterministic simulations as used by Rückl and colleagues [42, 43] can be implemented in the **e**
**v**
**a**
**l**
**u**
**a**
**t**
**i**
**o**
**n** function such that reduced models approximate the [ Ca^{2+}] wave and oscillation statistics of the full model.

### Comparison to previous work

As compared to the fast/slow reduction technique [21], the genetic algorithm-based approach does not require time scale differences and allows users to choose the size of the reduced model freely. More importantly, the genetic algorithm-based approach often finds partition schemes that produce less error than the fast/slow technique. It is also important to note that this procedure performs better on models whose parameters are of the same scale. When time scale differences are present, like in the Keizer-Levine model and the De Young-Keizer model, because for every \(Individual\ \mathcal {I}\), we must reduce the full model following the aggregation scheme, calculating the reduction error by computing matrix exponentials. Thus, it is recommended to fine tune the genetic algorithm parameters to achieve faster convergence. The choice of crossover and/or mutation probabilities, selection techniques, and even population size per generation could affect the performance of genetic algorithms [44]. For example, when reducing a full model consisting of 10 four-state channels, if using 10 as the population size and 2000 generations are generated until the program terminates, the total time consumed is approximately 2000 times that of the fast/slow procedure. If the population size was increased to 200 individuals per generation, the genetic algorithm can find equally good reductions approximately 10 times faster. Fortunately, for any specific objective assigned, the reduction procedures need to execute very few times and the reduced release site models are potentially able to save significantly more time in the whole cell simulations. Furthermore, this genetic algorithm-based reduction technique can be hybridized with other stochastic optimization methods. For example, we implemented one version of the genetic algorithm with a simulated annealing twist, where the **crossover** procedure was skipped and every individual that survived selection would generate a child through mutation. This version converges faster as compared to the traditional genetic algorithm when applied to some release site models.

### Future work

As discussed in the ‘Background’ section, how to control computational cost of large mathematical/computational models has become an increasingly important research topic. The recent work by Cao and colleagues [45] that proposed a deterministic model of IP3Rs that qualitatively predicts some stochastic Ca^{2+} oscillation properties is very encouraging. This two-state model was constructed by reducing a 6-state stochastic IP3R model [46], where the six states were partitioned into two groups assuming time scale differences and the experimentally observed “two mode” property. Then each group was lumped to a single state according to the steady state probability distribution of the full model. Not only can this deterministic model quantitatively reproduce Ca^{2+} puffs and stochastic oscillations, but their model is also approximately 10 times faster than the comparable stochastic simulations. It would be very beneficial if we can find simple deterministic models that can replace other stochastic calcium channel models under certain circumstances. Since not all models possess time scale differences, we can potentially use the genetic algorithm approach to search for partition schemes of stochastic IP3R or RyR models that allow the reduced deterministic models to well approximate the stochastic behaviors of these channels.

Another important project for the near future is to search for common features in the partition schemes that produce small errors. Should the aggregated states in the reduction follow certain topological pattern or possess similar functional feature (having similar number of open/refractory channels for example) the model reduction approach can be significantly accelerated by using a biased initial population. So far, an interesting phenomenon observed while reducing Ca^{2+} release site models using the genetic algorithm based approach is that the state aggregation schemes which result in small reduction errors tend to be “heavy headed”. That is, these low-error partitions usually feature one large group that contains more than 50 % of the states while other groups contain significantly fewer (sometimes only one or two) states. Moreover, the states aggregated in the small groups are highly likely to be the states that are less often visited in the full model, and this phenomenon exists in all Ca^{2+} release site reduction procedures. This observation is a good explanation for the fact that the generator matrices associated with the reduced model \(\hat {\boldsymbol {Q}}\) tend to be ill-conditioned. This observation suggests that it may be possible to generate a biased initial population to accelerate the evolution procedure.

## Declarations

### Acknowledgements

This material is based upon the work supported by the National Science Foundation under Grant No. 0443843 and the provost office of Hobart and William Smith Colleges. The author would like to thank Prof. Gregory D. Smith for setting this project up. His advice and valuable suggestions have greatly shaped the frame of this work. The author also thank Associate Prof. Matthew S. Haner for proofreading and editing part of this paper. The author acknowledges stimulating discussions with H. Drew LaMar and Ryan Carpenter and the support from SciClone Computing Complex.

licensee Springer on behalf of EPJ. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## Authors’ Affiliations

## References

- Alder BJ, Wainwright TE. Studies in molecular dynamics. i. general method. J Chem Phys. 1959; 31(2):459.ADSMathSciNetView ArticleGoogle Scholar
- Plimpton SJ. Computational limits of classical molecular-dynamics simulations. Comput Mater Sci. 1995; 4:361–4.View ArticleGoogle Scholar
- Brini E, Algaer EA, Ganguly P, Li C, Rodríguez-Ropero F, van der Vegt NFA. Systematic coarse-graining methods for soft matter simulations - a review. Soft Matter. 2013; 7:2108–119.ADSView ArticleGoogle Scholar
- Saunders MG, Voth GA. Coarse-graining methods for computational biology. Annu Rev Biophys. 2013; 42(2):73–93.View ArticleGoogle Scholar
- Zheng J, Vankataramanan L, Sigwortha FJ. Hidden markov model analysis of intermediate gating steps associated with the pore gate of shaker potassium channels. J Gen Physiol. 2001; 118(5):547–64.View ArticleGoogle Scholar
- Nguyen V, Mathias R, Smith GD. A stochastic automata network descriptor for markov chain models of instantaneously-coupled intracellular Ca
^{2+}channels. Bull Math Biol. 2005; 67(3):393–432.MathSciNetView ArticleMATHGoogle Scholar - Clapham DE. Calcium signaling. Cell. 1995; 80(2):259–68.View ArticleGoogle Scholar
- Berridge MJ. Elementary and global aspects of calcium signalling. J Physiol (Lond). 1997; 499(Pt 2):291–306.View ArticleGoogle Scholar
- Cheng H, Lederer MR, Lederer WJ, Cannell MB. Ca
^{+}sparks and [Ca^{2+}]_{ i }waves in cardiac myocytes. Am J Physiol. 1996; 270(1 Pt 1):148–59.Google Scholar - Yao Y, Choi J, Parker I. Quantal puffs of intracellular Ca
^{2+}evoked by inositol trisphosphate in Xenopus oocytes. J Physiol. 1995; 482(Pt 3):533–3.View ArticleGoogle Scholar - Endo M. Calcium release from the sarcoplasmic reticulum. Phys Rev. 1977; 57(1):71–108.Google Scholar
- Cheng H, Lederer WJ, Cannell MB. Calcium sparks: elementary events underlying excitation-contraction coupling in heart muscle. Science. 1993; 262(5134):740–4.ADSView ArticleGoogle Scholar
- Parker I, Choi J, Yao Y. Elementary events of IP3-induced Ca
^{2+}liberation in Xenopus oocytes: hot spots, puffs and blips. Cell Calcium. 1996; 20(2):105–21.View ArticleGoogle Scholar - Shuai JW, Jung P. Optimal ion channel clustering for intracellular calcium signaling. Proc Natl Acad Sci USA. 2003; 100(2):506–10.ADSView ArticleGoogle Scholar
- Cannell MB, Cheng H, Lederer WJ. Spatial non-uniformities in [Ca
^{2+}]_{ i }during excitation-contraction coupling in cardiac myocytes. Biophys J. 1994; 67(5):1942–56.View ArticleGoogle Scholar - Colquhoun D, Hawkes A. A Q-matrix cookbook: how to write only one program to calculate the sigle-channel and macroscopic predictions for any kinetic mechanism In: Sakmann B, Neher E, editors. Single-Channel Recording. New York: Plenum Press: 1995. p. 589–633.Google Scholar
- Smith GD. Modeling the stochastic gating of ion channels In: Fall C, Marland E, Wagner J, Tyson J, editors. Computational Cell Biology. New York: Springer: 2002. p. 291–325.Google Scholar
- DeRemigio H, Smith GD. Calcium release site ultrastructure and the dynamics of puffs and sparks. Math Med Biol. 2008; 25(1):65–85.View ArticleMATHGoogle Scholar
- Williams GSB, Huertas MA, Sobie EA, Jafri MS, Smith GD. A probability density approach to modeling local control of Ca
^{2+}-induced Ca^{2+}release in cardiac myocytes. Biophys J. 2007; 92(7):2311–28.View ArticleGoogle Scholar - Williams GSB, Huertas MA, Sobie EA, Jafri MS, Smith GD. Moment closure for local control models of Ca
^{2+}-induced Ca^{2+}release in cardiac myocytes. Biophys J. 2008; 95(4):1689–703.View ArticleGoogle Scholar - Hao Y, Kemper P, Smith GD. Reduction of calcium release site models via fast/slow analysis and iterative aggregation/disaggregation. Chaos. 2009; 5(19):037107.ADSView ArticleMATHGoogle Scholar
- Feldmann AE. Fast balanced partitioning is hard even on grids and trees In: Rovan B, Sassone V, Widmayer P, editors. Mathematical Foundations of Computer Science 2012. Berlin Heidelberg: Springer: 2012. p. 372–82.Google Scholar
- Holland JH. Adaptation in Natural and Artificial Systems. Ann Arbor: The U. of Michigan Press; 1975.Google Scholar
- Gesú VD, Giancarlo R, Bosco GL, Raimondi A, Scaturro D. Genclust: a genetic algorithm for clustering gene expression data. BMC Bioinformatics. 2005; 6:289.View ArticleGoogle Scholar
- To C, Vohradsky J. A parallel genetic algorithm for single class pattern classification and its application for gene expression profiling in streptomyces coelicolor. BMC Genomics. 2007; 8:49.View ArticleGoogle Scholar
- Hill T, Lundgren A, Fredriksson R, Schioth H. Genetic algorithm for large-scale maximum parsimony phylogenetic analysis of proteins. Biochim Biophys Acta. 2005; 1725(1):19–29.View ArticleGoogle Scholar
- Groenendaal W, Ortega FA, Kherlopian AR, Zygmunt AC, Krogh-Madsen T, Christini DJ. Cell-specific cardiac electrophysiology models. PLoS Comput Biol. 2015; 11(4):1004242.View ArticleGoogle Scholar
- Hartman JA, Sobie EA, Smith GD. Calcium sparks and homeostasis in a minimal model of local and global calcium responses in quiescent ventricular myocytes. AJP: Heart Circ Physiol. 2010. doi:10.1152/ajpheart.00293.2010.
- Hinch R, Greenstein JL, Tanskanen AJ, Xu L, Winslow RL. A simplified local control model of calcium-induced calcium release in cardiac ventricular myocytes. Biophys J. 2004; 87(6):3723–6.View ArticleGoogle Scholar
- Hinch R, Greenstein JL, Winslow RL. Multi-scale models of local control of calcium induced calcium release. Prog Biophys Mol Biol. 2006; 90(1-3):136–50.View ArticleGoogle Scholar
- Greenstein JL, Hinch R, Winslow RL. Mechanisms of excitation-contraction coupling in an integrative model of the cardiac ventricular myocyte. Biophys J. 2006; 90(1):77–91.View ArticleGoogle Scholar
- Mazzag B, Tignanelli C, Smith GD. The effect of residual Ca
^{2+}on the stochastic gating of Ca^{2+}-regulated Ca^{2+}channels. J Theor Biol. 2005; 235(1):121–50.View ArticleGoogle Scholar - Huertas MA, Smith GD. The dynamics of luminal depletion and the stochastic gating of Ca
^{2+}-activated Ca^{2+}channels and release sites. J Theor Biol. 2007; 246(2):332–54.MathSciNetView ArticleGoogle Scholar - Davis L. Handbook Of Genetic Algorithms. New York: Van Nostrand Reingold; 1991.Google Scholar
- Michalewicz Z. Genetic Algorithms + Data Structures = Evolution Programs. New York: Springer; 1994.View ArticleMATHGoogle Scholar
- Nicola V. Lumping in markov reward processes. Technical report, RC14719, IBM Thomas Watson Research Centre, PO Box 704, Yorktown Heights, NY 10598;1998.Google Scholar
- Shannon TR, Wang F, Puglisi J, Weber C, Bers DM. A mathematical treatment of integrated Ca
^{2+}dynamics within the ventricular myocyte. Biophys J. 2004; 87(5):3351–71.View ArticleGoogle Scholar - Stevens SC, Terentyev D, Kalyanasundaram A, Periasamy M, Györke S. Intra-sarcoplasmic reticulum Ca
^{2+}oscillations are driven by dynamic regulation of ryanodine receptor function by luminal Ca^{2+}in cardiomyocytes. J Physiol (Lond). 2009; 587(20):4863–72. published in October 2009.View ArticleGoogle Scholar - Groff JR, Smith GD. Calcium-dependent inactivation and the dynamics of calcium puffs and sparks. J Theor Biol. 2008; 253(3):483–99.View ArticleGoogle Scholar
- Györke I, Györke S. Regulation of the cardiac ryanodine receptor channel by luminal Ca
^{2+}involves luminal Ca^{2+}sensing sites. Biophys J. 1998; 75(6):2801–10.View ArticleGoogle Scholar - Keizer J, Levine L. Ryanodine receptor adaptation and Ca
^{2+}(-)induced Ca^{2+}release-dependent Ca^{2+}oscillations. Biophys J. 1996; 71(6):3477–487.View ArticleGoogle Scholar - Rückl M, Parker I, Marchant JS, Nagaiah C, Johenning FW, Rüdiger S. Modulation of elementary calcium release mediates a transition from puffs to waves in an IP3R cluster model. PLoS Comput Biol. 2015; 11(1):1003965.View ArticleGoogle Scholar
- Rüdiger S, Shuai JW, Huisinga W, Nagaiah C, Warnecke G, Parker I, Falcke M. Hybrid stochastic and deterministic simulations of calcium blips. Biophys J. 2007; 93:1847–57.View ArticleGoogle Scholar
- Karafotias G, Hoogendoorn M, Eiben AE. Parameter control in evolutionary algorithms: Trends and challenges. IEEE Trans Evol. 2015; 19(2):167–87.View ArticleGoogle Scholar
- Cao P, Tan X, Donovan G, Sanderson MJ, Sneyd J. A deterministic model predicts the properties of stochastic calcium oscillations in airway smooth muscle cells. PLoS Comput Biol. 2014; 10(8):1003783.ADSView ArticleGoogle Scholar
- Cao P, Donovan G, Falcke M, Sneyd J. A stochastic model of calcium puffs based on single-channel data. Biophys J. 2013; 105:1133–42.View ArticleGoogle Scholar