The neuronal response at extended timescales: a linearized spiking input–output relation

Soudry, Daniel; Meir, Ron

doi:10.3389/fncom.2014.00029

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 02 April 2014
Volume 8 - 2014 | https://doi.org/10.3389/fncom.2014.00029

The neuronal response at extended timescales: a linearized spiking input–output relation

Daniel Soudry^*

Ron Meir

Laboratory for Network Biology Research, Department of Electrical Engineering, Technion, Haifa, Israel

Many biological systems are modulated by unknown slow processes. This can severely hinder analysis – especially in excitable neurons, which are highly non-linear and stochastic systems. We show the analysis simplifies considerably if the input matches the sparse “spiky” nature of the output. In this case, a linearized spiking Input–Output (I/O) relation can be derived semi-analytically, relating input spike trains to output spikes based on known biophysical properties. Using this I/O relation we obtain closed-form expressions for all second order statistics (input – internal state – output correlations and spectra), construct optimal linear estimators for the neuronal response and internal state and perform parameter identification. These results are guaranteed to hold, for a general stochastic biophysical neuron model, with only a few assumptions (mainly, timescale separation). We numerically test the resulting expressions for various models, and show that they hold well, even in cases where our assumptions fail to hold. In a companion paper we demonstrate how this approach enables us to fit a biophysical neuron model so it reproduces experimentally observed temporal firing statistics on days-long experiments.

1. Introduction

Neurons are modeled biophysically using Conductance-Based Models (CBMs). In CBMs, the membrane time constant and the timescales of fast channel kinetics determine the timescale of Action Potential (AP) generation in the neuron. These are typically around 1–20 ms. However, there are various modulating processes that affect the response on slower timescales. Many types of ion channels exist, and some change with a timescale as slow as 10 s (Channelpedia), and possibly even minutes (Toib et al., 1998). Additional new sub-cellular kinetic processes are being discovered at an explosive rate (Bean, 2007; Sjöström et al., 2008; Debanne et al., 2011). This variety is particularly large for very slow processes (Marom, 2010).

For example, ion channels are known to be regulated over the course of long timescales (Levitan, 1994; Staub et al., 1997; Jugloff, 2000; Monjaraz et al., 2000), which could cause changes in ion channel numbers, conductances and kinetics. Also, the ionic concentrations in the cell depend on the activity of the ionic pumps, which can be affected by the metabolism of the network (Silver et al., 1997; Kasischke et al., 2004). Finally, the cellular neurites (De Paola et al., 2006; Nishiyama et al., 2007) and even the spike initiation region (Grubb and Burrone, 2010) can shift their location with time. All these changes can have a large effect on excitability.

Therefore, current CBMs can be considered as strictly accurate only below a certain timescale, since they do not incorporate most of these slow processes. A main reason for this “neglect” is that such slow processes are not well characterized. This is especially problematic since neurons are excitable, so their dynamics is far from equilibrium, highly non-linear and contain feedback. Due to the large number of processes which are unknown or lacking known parameters, it would be hard to simulate or analyze such models. Therefore, it may be hard to quantitatively predict how adding and tuning slow processes in the model would affect the dynamics at longer timescales.

In order to allow CBMs with many slow process to be fitted and analyzed, it is desirable to have general expressions that describe their Input–Output (I/O) relation explicitly, based on biophysical parameters. In a previous paper (Soudry and Meir, 2012b), we found that this becomes possible if we use (experimentally relevant Elul and Adey, 1966; Kaplan et al., 1996; De Col et al., 2008; Gal et al., 2010; Goldwyn et al., 2012) sparse spike inputs, similar to the typical output of the neuron (Figures 1A,B). In this case, we derived semi-analytically¹ a discrete piecewise linear map describing the neuronal dynamics between stimulation spikes, for a general deterministic neuron model with a few assumptions (mainly, a timescale separation assumption). Based on this reduced map, we were able to derive expressions for the “mean” behavior of the neuron (e.g., firing modes, firing rate and mean latency).

FIGURE 1

Figure 1. Schematic summary. (A) Aim: find the I/O relation between inter-stimulus intervals (T_m) and Action Potential (AP) occurrences (Y_m) – for a general biophysical neuron model (Equations 1–3). (B) An AP “occurred” if the voltage V crossed a threshold V_th following the (sparse) stimulus, with T_m ≫ τ_AP. (C) Result: Biophysical neuron model reduced to a simple linear system with feedback (Equations 11, 12), and biophysically meaningful parameters (F,d,a and w).

In this paper, we find that stronger and more general analytical results can be obtained if we take into account the stochasticity of the neuron – arising from ion channel noise² (Neher and Sakmann, 1976; Hille, 2001). Due to the presence of this noise, the discrete map describing the neuronal dynamics is “smoothed out,” and can be linearized. This linearized map constitutes a concise description for the neuronal I/O (Equations 11, 12) based on biophysically meaningful parameters. This I/O is well described by an “engineering-style” block diagram with feedback (Figure 1C), where the input is the process of stimulation intervals and the output is the AP response (Figure 1A). Note that the response is affected both by internal noise and by the input. Beyond conceptual lucidity, such a linear I/O allows the utilization of well known statistical tools to derive all second order statistics, to construct linear optimal estimators and to perform parameter identification. These results hold numerically (Figure 3), even sometimes when our assumptions break down (Figure 4).

In our previous paper, Soudry and Meir (2012b), we used our results to model recent experiments (Gal et al., 2010) where synaptically isolated individual neurons, from rat cortical culture, were stimulated with extra-cellular sparse current pulses for an unprecedented duration of days. Our results enabled us to explain the “mean” response of these neurons. However, the second order-statistics in the experiment seem particularly puzzling. The neurons exhibited 1/f^α statistics (Keshner, 1982), responding in a complex and irregular manner from seconds to days. In a companion paper (Soudry and Meir, 2014), we demonstrate the utility of our new results. These results allow us to reproduce and analyze the origins of this 1/f^α on very long timescales.

2. Results

This section described our main results in outline. The details of each sub-section here appear in the corresponding sub-section in section 4. For readers who do not wish to go through the detailed derivations, the present section is self-contained. Readers who do wish to follow the mathematical derivations, should first read section 4, where, for convenience, each subsection (except for the last one) can be read independently. In our notation 〈·〉 is an ensemble average, $i ≜ \sqrt{- 1}$ , a non-capital boldfaced letter x ≜ (x₁, …, x_n)^⊤ is a column vector (where (·)^⊤ denotes transpose), and a boldfaced capital letter X is a matrix (with components X_mn).

2.1. Full Model

The voltage dynamics of an isopotential neuron are determined by ion channels, protein pores which change their conformations stochastically with voltage-dependent rates (Hille, 2001). At the population level, such dynamics are generically described by Fox and Lu (1994), Goldwyn et al. (2011), and Soudry and Meir (2012b) a CBM

\begin{matrix} \dot{V} = f (V, r, s, I (t)) & (1) \end{matrix}

\begin{matrix} \dot{r} = A_{r} (V) r + B_{r} (V, r) ξ_{r} & (2) \end{matrix}

\begin{matrix} \dot{s} = A_{s} (V) s + B_{s} (V, s) ξ_{s}, & (3) \end{matrix}

with voltage V, stimulation current I(t), rapid variables r (e.g., m, n, h in the Hodgkin–Huxley (HH) model Hodgkin and Huxley, 1952), slow “excitability” variables s (e.g., slow sodium inactivation Chandler and Meves, 1970), rate matrices A_r/s, white noise processes ξ_r/s (with zero mean and unit variance), and matrices B_r/s which can be written explicitly using the rates and ion channel numbers (Orio and Soudry, 2012) (D = BB^⊤ is the diffusion matrix Orio and Soudry, 2012). For simplicity, we assumed that r and s are not coupled directly, but this is non-essential (Contou-Carrere, 2011; Wainrib et al., 2011). The parameter space can be constrained (Soudry and Meir, 2012b), since we consider here only excitable, non-oscillatory neurons which do not fire spontaneously³ and which have a single resting state – as is common for cortical cells, e.g., Gal et al. (2010).

Since the components of r and s usually represent fractions, in some cases it is more convenient to use the normalization constraint (i.e., that fractions sum to one), and reduce the dimensions of r, s, and ξ_r/s. After this reduction, the form of Equations (1–3) changes to

\begin{matrix} \dot{V} = f (V, r, s, I (t)) & (4) \end{matrix}

\begin{matrix} \dot{r} = A_{r} (V) r - b_{r} (V) + B_{r} (V, r) ξ_{r}, & (5) \end{matrix}

\begin{matrix} \dot{s} = A_{s} (V) s - b_{s} (V) + B_{s} (V, s) ξ_{s}, & (6) \end{matrix}

where all the variables and parameters have been redefined (with their size decreased). Note that we have slightly abused notation by using the same symbols in Equations (4–6) and in Equations (1–3). The specific set of equations used will always be stated. We call Equations (4–6) the “compressed form” of the CBM.

Such biophysical neuronal models (either Equations 1–3 or 4–6) are generally complex and non-linear, containing many variables and unknown parameters (sometimes ranging in the hundreds Koch and Segev, 1989; Roth and Häusser, 2001), not all of which can be identified (Huys et al., 2006). Therefore, such models are notoriously difficult to tune, highly susceptible to over-fitting and computationally expensive (Migliore et al., 2006; Gerstner and Naud, 2009; Druckmann et al., 2011). Also, the high degree of non-linearity usually prevents exact mathematical analysis of such models at their full level of complexity (Ermentrout and Terman, 2010). However, much of the complexity in such models can be overcome under well defined and experimentally relevant settings (Elul and Adey, 1966; Kaplan et al., 1996; De Col et al., 2008; Gal et al., 2010; Goldwyn et al., 2012), if we use sparse inputs, similar in nature to the spikes commonly produced by the neuron.

2.2. Model Reduction

We consider a stimulation setting motivated by the experiments described in Gal et al. (2010) and further elaborated on in section 3. Specifically, suppose I(t) consists of a train of pulses arriving at times {t_m} (Figure 1A, top), so T_m = t_{m + 1} − t_m ≫ τ_AP with τ_AP being the timescale of an AP (Figure 1B). Our aim is to describe the AP occurrences Y_m, where Y_m = 1 if an AP occurred immediately after the m-th stimulation, and 0 otherwise (Figure 1A, bottom). Recall again that we assume the neuron does not generate APs unless stimulated (as observed in Gal et al., 2010).

In this section we “average out” Equations (1–3) using a semi-analytical method similar to that in Soudry and Meir (2012b). To do so, we need to integrate Equations (1–3) between t_m and t_{m + 1}. Since T_m ≫ τ_AP, the rapid AP generation dynamics of (V, r) relax to a steady state before t_{m + 1}. Therefore, the neuron AP “remembers” any history before t_m only through s_m = s(t_m). Given s_m, the response of the fast variables (V, r) to the m-th stimulation spike will determine the probability to generate an AP. This probability,

p_{AP} (s_{m}) ≜ P (Y_{m} | s_{m}) = 〈 Y_{m} | s_{m} 〉,

collapses all the relevant information from Equations (1, 2), and can be found numerically from the pulse response of Equations (1, 2) with s held fixed (section 4.2.4).

In order to integrate the remaining Equation (3), we define X₊, X₋ and X₀ to be the averages of a quantity X_s during an AP response, a failed AP response and rest, respectively ⁴. Also, we denote

\begin{matrix} \begin{array}{l} X (Y_{m}, T_{m}) \overset{Δ}{=} τ_{AP} T_{m}^{- 1} (Y_{m} X_{+} + (1 - Y_{m}) X_{-}) + \\ (1 - τ_{AP} T_{m}^{- 1}) X_{0}, \end{array} & (7) \end{matrix}

as the steady state mean value of X_s. For analytical simplicity we assume⁵ T_m ≪ τ_s. We obtain, to first order

\begin{matrix} s_{m + 1} = s_{m} + T_{m} A (Y_{m}, T_{m}) s_{m} + n_{m} . & (8) \end{matrix}

where n_m is a white noise process with zero mean and variance T_mD (Y_m, T_m). For the compressed form (Equations 4–6) we have instead

\begin{matrix} s_{m + 1} = s_{m} + T_{m} [A (Y_{m}, T_{m}) s_{m} - b (Y_{m}, T_{m})] + n_{m} . & (9) \end{matrix}

Note that such a simplified discrete time map, which describes the excitability dynamics of the neuron, has far fewer parameters than the full model, since it is written explicitly only using the averaged microscopic rates of s (through A and D), population sizes (through D), the probability to generate an AP given s, p_AP (s), and the relevant timescales. This effective model exposes the large degeneracy in the parameters of the full model and leads to significantly reduced simulation times and mathematical tractability. Notably, the dynamics of the state s_m (Equation 8) depends on the input T_m and the output Y_m – and this feedback affects all of our following results.

2.3. Linearization

In this section we exploit the intrinsic ion channel noise to linearize the neuronal dynamics, rendering it more tractable than the (less realistic) noiseless case (Soudry and Meir, 2012b). Suppose that the inter-stimulus intervals {T_m} have stationary statistics with mean T_* so that τ_AP ≪ T_m ≪ τ_s with high probability. Since s is slow and AP generation is rather noisy in this regime (Soudry and Meir, 2012b) (so p_AP (s_m) is slowly varying), we assume that a stable excitability fixed point s_* exists (Figure 2). Therefore, the perturbations ${\hat{s}}_{m} = s_{m} - s_{*}$ are small and we can linearize

p_{AP} (s_{m}) \approx p_{*} + w^{⊤} {\hat{s}}_{m} .

FIGURE 2

Figure 2. Schematic explanation of linearization. In a deterministic neuron, an AP will be generated in response to stimulation if and only if the neuronal excitability (here, s) is above a certain threshold (A). This generates discontinuous dynamics in the neuronal excitability (B), see Equations 7, 8). In a stochastic neuron, the response probability is a smooth function of s (C). In turn, this “smooths” the dynamics (D). Note that if the noise is sufficiently high (as is true in many cases, for biophysically realistic levels of noise), then this generates a stable fixed point s* – which gives the mean response probability p_*, and around which the dynamics can be linearized (yellow region).

Denoting X_* = X(p_*, T_*), the mean AP firing rate can be found self consistently from the location of the fixed point s_*,

\begin{matrix} 〈 Y_{m} 〉 = p_{*} = p_{AP} (s_{*}), & (10) \end{matrix}

where s_* depends on p_* through A_*s_* = 0, or s_* = A⁻¹_*b_* in the compressed form.

The perturbations ${\hat{s}}_{m} = s_{m} - s_{*}$ around the fixed point s_* are described by the linear system

\begin{matrix} {\hat{s}}_{m + 1} = F {\hat{s}}_{m} + d {\hat{T}}_{m} + a {\hat{Y}}_{m} + n_{m}, & (11) \end{matrix}

\begin{matrix} {\hat{Y}}_{m} = w^{⊤} {\hat{s}}_{m} + e_{m}, & (12) \end{matrix}

where ${\hat{T}}_{m} = T_{m} - T_{*}$ , ${\hat{Y}}_{m} = Y_{m} - 〈 Y_{m} 〉$ , F ≜ I + T_*A_*, 〈n_mn^⊤_m〉 = T_*D_*, e_m is a (non-Gaussian) white noise process, 〈e_m〉 = 〈e_mn_m〉 = 0, σ²_e ≜ 〈e²_m〉 = p_* (1 − p_*), d ≜ A₀s_* and a ≜ τ_AP (A₊ − A₋) s_*. If we use the compressed form instead, then these results remain valid, except we need to re-define d ≜ A₀s_* − b₀ and a ≜ τ_AP[(A₊ − A₋) s_* − (b₊ − b₋)].

The linear I/O for the fluctuations in Equations (11, 12), which contains feedback from the “output” ${\hat{Y}}_{m}$ to the state variable ${\hat{s}}_{m}$ (Figure 1C), can be very helpful mathematically and its parameters are directly related to biophysical quantities.

2.4. Linear Systems Analysis

Using standard tools, this formulation makes it now possible to construct optimal linear estimators for Y_m and s_m (Anderson and Moore, 1979), perform parameter identification (Lejung, 1999), and find all second order statistics in the system (Papoulis and Pillai, 1965; Gardiner, 2004), such as correlations or Power Spectral Densities (PSD). For example, for f ≪ T⁻¹_*, the PSD of the output is

\begin{matrix} \begin{array}{l} S_{Y} (f) = w^{⊤} H_{c} (- f) (D_{*} + T_{*}^{- 2} d d^{⊤} S_{T} (f)) H_{c}^{⊤} (f) w \\ + T_{*} σ_{e}^{2} {| 1 + T_{*}^{- 1} w^{⊤} H_{c} (f) a |}^{2} \end{array} & (13) \end{matrix}

where

\begin{matrix} H_{c} (f) \overset{Δ}{=} {(2 π f i - A_{*} - T_{*}^{- 1} a w^{⊤})}^{- 1} . & (14) \end{matrix}

Similarly, the PSD of the state variables is

\begin{matrix} S_{s} (f) = H_{c} (- f) (D_{*} + T_{*}^{- 1} a a^{⊤} σ_{e}^{2} + T_{*}^{- 2} d d^{⊤} S_{T} (f)) H_{c}^{⊤} (f), & (15) \end{matrix}

and the input–output cross-PSD is

\begin{matrix} S_{Y T} (f) = T_{*}^{- 1} w^{⊤} H_{c} (- f) d S_{T} (f) . & (16) \end{matrix}

Again, note the large degeneracy here – many different sets of parameters will generate the same PSD. Using similar methods, the PSDs of various response features, such as the AP latency or amplitude, can also be derived (Equation 124).

Finally, we note Equations (11) and (12) can be re-arranged as a direct I/O relation. First, we define the filters (transfer functions)

\begin{matrix} H^{ext} (f) ≜ T_{*}^{- 1} w^{⊤} H_{c} (f) d & (17) \end{matrix}

\begin{matrix} H^{int} (f) ≜ (T_{*}^{- 1} w^{⊤} H_{c} (f) K + 1) σ_{v} & (18) \end{matrix}

where K = a + FPwσ⁻²_v and σ²_v = w^⊤ Pw + σ²_e, with P being the solution of

\begin{matrix} P = F P F^{⊤} - {(w^{⊤} P w + σ_{e}^{2})}^{- 1} F P w w^{⊤} P F^{⊤} + T_{*} D_{*} . & (19) \end{matrix}

Using these filters, we obtain, in the frequency domain,

\begin{matrix} \hat{Y} (f) = H^{ext} (f) \hat{T} (f) + H^{int} (f) z (f), & (20) \end{matrix}

where $\hat{Y} (f), \hat{T} (f)$ and z(f) are the Fourier transforms of Y_m, ${\hat{T}}_{m}$ and z_m, respectively, with z_m being a white noise process with zero mean and unit variance. Notably, these transfer functions can be identified from the spiking input–output of the neuron ${{\hat{T}}_{m}, {\hat{Y}}_{m}}$ , without access to the underlying dynamics or biophysical parameters. Specifically, Equation (20) has the form of an ARMAx(M, M, M) model⁶ (Lejung, 1999) (recall M is the dimension of s), which can be estimated using standard tools (e.g., the system identification toolbox in Matlab).

2.5. Numerical Tests

As we argued so far, a main asset of the present approach is its applicability to a broad range of models of various degrees of complexity and realism. Recall that our three assumptions are

τ_AP ≪ T_m (temporally sparse input).
T_m ≪ τ_s (timescale separation).
A stable excitability fixed point s_* exists, (“noisy” neuron).

In this section we will demonstrate that our analytical approximations agree very well with the numerical solution of Equations (1–3), even in some cases where the assumptions 2 and 3 do not hold. Therefore, these assumptions are sufficient, but not necessary.

2.5.1. The HHS model

First, in Figure 3 we tested our results on the HH model with Slow sodium inactivation. This “HHS” model (Soudry and Meir, 2012b, and see section 4.5.1 for parameter values) augments the classic HH model (Hodgkin and Huxley, 1952) with an additional slow inactivation process of the sodium conductance (Chandler and Meves, 1970; Fleidervish et al., 1996). The HHS model includes the uncoupled stochastic Hodgkin–Huxley (HH) model equations (Fox and Lu, 1994), and is written in the compressed formulation (Equations 4–6)

\begin{matrix} \begin{array}{l} C \dot{V} = {\bar{g}}_{N a} s m^{3} h (E_{N a} - V) + {\bar{g}}_{K} n^{4} (E_{K} - V) \\ + {\bar{g}}_{L} (E_{L} - V) + I (t) \end{array} & (21) \end{matrix}

\begin{matrix} \begin{array}{l} \dot{r} = [α_{r} (V) (1 - r) - β_{r} (V) r] ϕ + \\ \sqrt{N^{- 1} ϕ (α_{r} (V) (1 - r) + β_{r} (V) r)} ξ_{r}, \end{array} & (22) \end{matrix}

for r = m, n and h, with the additional kinetic equation for slow sodium inactivation

\begin{matrix} \dot{s} = δ (V) (1 - s) - γ (V) s + \sqrt{N^{- 1} (δ (V) (1 - s) + γ (V) s)} ξ_{s}, & (23) \end{matrix}

where V is the membrane voltage, I(t) is the input current, m, n and h are ion channel “gating variables,” α_r(V), β_r(V), δ(V), and γ(V) are the voltage dependent kinetic rates of these gating variables, ϕ is an auxiliary dimensionless number, C is the membrane's capacitance, E_K, E_Na and E_L are ionic reversal potentials, g_K, g_Na and g_L are ionic conductances and N is the number of ion channels. Note that in this model τ_s is between 20 s (at rest) and 40 s (during an AP).

FIGURE 3

Figure 3. Comparing the mathematical results with the numerical simulation of the full model (Equations 1–3) for the stochastic HHS model (section 4.5.1). (A) Firing probability p_*(T⁻¹_*) (Equation 10) for different currents (I_stim = 7.5, 7.7, 7.9, 8.1, 8.3 μA from bottom to top). (B) The PSDs S_Y(f) and S_s(f). “Sim” is a simulation of the full model, “Map” is a (10⁴ faster) simulation of Equation (8) together with p_AP (s_m), while “Approx” refers to the analytical expressions (Equations 13–15). “Ident” is the PSD S_Y(f) of the linear system identified from the spiking data. Note the high/low-pass filter shapes of S_Y(f) and S_s(f), respectively. (C) Optimal linear estimation of $\hat{s}$ . (D) Amplitude and phase of the cross-spectrum S_YT(f) for Poisson stimulation (Equations 16). Note that the frequency range was cut due to spectral estimation noise (see Figure 8). Parameters: I₀ = 7.9 μA and T_* = 50 ms in (B–D), and also stimulation is periodical in (A–C). Note the low-pass filter shapes of S_YT(f).

In Figure 3A we show that through Equation (10) we can accurately calculate p_*, the mean probability to generate an AP (so p_*T⁻¹_* is the firing rate of the neuron). In Figure 3B we demonstrate both the analytical expression (Equations 13, 15), or a simulation of the reduced model (Equation 8), will give the PSDs S_Y (f) or S_s (f) of the full model (Equations 1–3). In Figure 3D we do the same for the analytical expression (Equation 16) of the Cross-PSD S_YT (f). In Figure 3C we show that we can construct a linear optimal filter for the internal state ${\hat{s}}_{m}$ , given ${{T_{k}}_{k = 0}^{m - 1}, {Y_{k}}_{k = 0}^{m - 1}}$ quite well, with low mean square error (section 4.4.4). Finally, back in Figure 3B, top, we infer the linear model parameters from the spike output using system identification tools [here, with ARMAx(1, 1, 1)], and present the PSD of the identified model (“Ident”). Since S_Y (f) = |H_int (f)|² (see Equation 111) for periodical input (in which ${\hat{T}}_{m} = 0$ ) this allows us to confirm that the linear model was identified. As can be seen, the identified filter matches well with that of the linear system.

2.5.2. Testing the limit of our assumptions

Next, we demonstrate that our analytical expressions hold also for various other models. Specifically, in the following scenarios: (1) when the kinetics of the neuron are extended to arbitrarily slow timescales, (2) when the assumptions 2 and 3 break down, (3) when the rapid and slow kinetics are coupled, (4) when “physiological” synaptic inputs are used. These results are presented in Figures 4, 5, with specific model parameters given in section 4.5.

FIGURE 4

Figure 4. Comparing mathematical results with full model simulation when the assumptions fail to hold. In the HHSIP model (HHS with potassium inactivation) we plot (A) p_*(T⁻¹_*) for different currents (I₀ = 7.5, 7.7, 7.9, 8.1, 8.3μA from bottom to top). (B) S_Y(f) for two values of T_*. As before, “Sim” is a simulation of the full model, “Approx” is the analytical approximation, and “Ident” is the PSD S_Y(f) of the linear system identified from the spiking data. Upper figure shows the case when T_* ≈ 0.5τ_s so the timescale separation assumption breaks down. In the lower figure the parameters are close to a Hopf bifurcation where a limit cycle is formed so the fixed point assumption breaks down, so the estimation of the limit cycle frequency component is less accurate. (C) The estimation of ${\hat{s}}_{1}$ for T⁻¹_* ≈ 30 Hz is even better than in the HHS case. Similarly to (A–C) we plot the results of the HHMSIP model (HHSIP with many additional slow sodium inactivation kinetics) in (D–F), which has considerably more noise in the slow kinetics, and so even larger fluctuations (which further invalidates the fixed point assumption). See section 4.5 for various model details.

FIGURE 5

Figure 5. Comparing mathematical results (green) with full model simulation (blue) for various models. (A) Coupled HHS (HHS coupled slow and rapid kinetics) (B) HHMS (HHS with many additional slow sodium inactivation kinetics) (C) HHSTM (HHS with a synapse) (D) Multiplicative HHMS (variant of HHMS). As before, “Sim” is a simulation of the full model, “Approx” is the analytical approximation, and “Ident” is the PSD S_Y(f) of the linear system identified from the spiking data. See section 4.5 for various model details.

First, we tested whether or not the model can be extended to arbitrarily slow timescales. We added to the HHS model four types of slow sodium inactivation processes with increasingly slower kinetics and smaller channel numbers. In the first case, those processes were added additively (as different currents), so s was replaced with ∑_i s_i in the voltage equation (Equation 21). This model was denoted “HHMS” (HH with Many Sodium slow inactivation processes, section 4.5.4). In the second case, those processes were added in a multiplicative manner (as different processes affecting the same channel, in the uncoupled approximation), so s was replaced with ∏_i s_i in the voltage equation (Equation 21). We denote this model as “Multiplicative HHMS” (section 4.5.5). In both cases, our analytical approximations seemed to hold quite well. For example, the approximated S_Y (f) (Equation 13) corresponded rather well with the numerical simulation of the full model (Figures 5B,D, respectively).

Next, to test the limits of our assumptions we extended the HHS model to the HHSIP model (from Soudry and Meir, 2012b, see section 4.5.6) and added a potassium inactivation current which had faster kinetics (so τ_s ≈ 5 Hz). So if T⁻¹_* = 10 Hz, we get T_* ≈ 0.5τ_s, so the timescale separation assumption 2 is not strictly valid here. Also, for certain parameter values we get a limit cycle in the dynamics of ${\hat{s}}_{m}$ , so the fixed point assumption 3 fails. However, it seems that our approximations still follow the numerical simulation of the full model: for p_* at various stimulation frequencies T⁻¹_* and currents I₀ (Figure 4A), for S_Y (f) at T⁻¹_* = 10 Hz when assumption 2 breaks down (Figure 4B, top), for S_Y (f) at T⁻¹_* = 30 Hz when assumption 3 breaks down (near a Hopf bifurcation) and a limit cycle begins to form (see Figure 4B, bottom), and for state estimation of ${\hat{s}}_{1}$ using a linear optimal filter, again at T⁻¹_* = 10 Hz (Figure 4C).

The only discrepancy seemed to appear in the limit cycle case, where the frequency of the limit cycle “sharpens” the peak in S_Y (f) (Figure 4B, bottom). This may suggest that, in this case, the perturbations of the system near the limit cycle could be linearized, and that the eigenvalues of that linearized system might be related to the eigenvalues of the linearized system around the (now unstable) fixed point s_*. More generally, the results so far indicate that even if our assumptions are inaccurate, it is possible that the resulting error will not accumulate and remain small – in comparison with the intrinsic noise in the model.

Next, to challenge the approximation even more, we added to the HHSIP model four types of sodium currents with increasingly slower kinetics and fewer channels, similarly to the HHMS model (so this is the “HHMSIP” model, section 4.5.7). This significantly increased the variance of the dynamic noise n_m, rendering the dynamics more “noisy.” These random fluctuations in s_m (Figure 4E) are of similar magnitude to the width of the threshold (non-saturated) region in p_AP (s_m) (see Figure 6). This renders the fixed point assumption 3 inaccurate, since now the linear approximation $p_{AP} (s_{m}) \approx p_{*} + w^{⊤} {\hat{s}}_{m}$ breaks down most of the time. However, even in this case, the approximations seem to hold quite well with simulations of the full neuronal model (Figures 4D–F).

FIGURE 6

Figure 6. Fitting of p_AP(s) = Φ((s − a)/b) in the HHS model. (A) Fitting of p_AP (s) for various values of I₀. (B) Fitting shows that a is linearly decreasing in I₀. (C) Fitting of p_AP (s) for various values of N. (D) Fitting shows that $b \propto 1 / \sqrt{N}$ .

In Figure 5A we used a coupled version of the HHS model (“coupled HHS” model, section 4.5.2), in which the equations for r and s in the full model are tangled together, and not separated as we assumed in Equations (2, 3). Even in this case, our approximations seemed to hold well.

Finally, in Figure 5C, we extend the HHS model so that the stimulations are not given directly, but through a synapse. We used the biophysical Tsodyks–Markram model (Tsodyks and Markram, 1997) of a synapse with short-term depression, with added stochasticity (“HHSTM” model, section 4.5.3). This also seemed to work well.

In all simulation we also added the PSD of the linear model identified from the spike output (“Ident.”), to show that it can be estimated reasonably well. Note that the performance at the lowest frequencies seems to be significantly worse when they contain relatively high power. This is not surprising since it is typically harder to estimate model parameters, when the data has such (1/f^α) PSD shape – which indicates long-term correlations (Beran, 1992).

3. Discussion

In this work we found that under a temporally sparse (“spike-like”) stimulation regime (Figures 1A,B) we can perform accurate semi-analytical linearization of the spiking input–output relation of a CBM (Figure 1C), while retaining biophysical interpretability of the parameters (e.g., Figure 7). This linearization considerably reduces model complexity and parameter degeneracy, and enables the use of standard analysis and estimation tools. Importantly, this method is rather general, since it can be applied to any stochastic CBM, with only a few assumptions.

FIGURE 7

Figure 7. The averaged kinetic rates. Left: The averaged rates demonstrated for three common kinetic rates γ(V) with sigmoidal shapes. Right: The voltage threshold of the sigmoid determines whether the process is sensitive to APs (the output), stimulation pulse (the input), or neither. Note that a similar classification of biophysical processes affecting excitability was previously suggested in Wallach (2012, Figure 3.1).

3.1. Connection to Previous Work

To the best of our knowledge, such results are novel, as no previous work examined analytically the response of general stochastic CBMs to temporally sparse input for extended durations. However, the connection between sparse inputs and slow timescales has been previously made. It was previously suggested (Linaro et al., 2011) that sparse inputs could be used to identify neuronal parameters in a network of integrate and fire neurons with spike frequency adaptation. Interestingly, using different methods we reach a qualitatively similar conclusion here, though not in a network setting, and for a different class of neuron models.

Additionally, in Soudry and Meir (2012b) we modeled neurons under periodical stimulation using deterministic CBMs with slow kinetics, which are completely uncoupled from each other, and slower than the stimulation rate. Using a reduction scheme similar in nature to that described here, we were able to describe the deterministic CBM's excitability and response using a discrete-time map – which “samples” the neuronal state at each stimulation. Analyzing this map, we obtained analytical results describing the neuronal activation modes, spike latency dynamics, mean firing rate and short-time firing patterns. Stochastic CBMs were then examined numerically, and were shown to lead to qualitatively different responses, which are more similar to the experimentally observed responses.

The current work, therefore, generalizes this previous work. Here, we analyze the general case of stochastic CBMs, under general sparse stimulation patterns and with coupled slow kinetic dynamics. Therefore, the framework in the previous work (Soudry and Meir, 2012b) could be considered as a special case of this work, in which there is an infinite number of ion channels (N → ∞, so B_r/s = D_r/s = 0), T_m = T_* (so ${\hat{T}}_{m} = 0$ ) and A_s (V) (the rate matrix) is a diagonal matrix. In the current work we similarly show that, in the generalized framework, the CBM's excitability and responses can be succinctly described using a discrete-time map. It is then straightforward to derive results paralleling those in Soudry and Meir (2012b) in this more general setting, such as the mean firing rate (Equation 10).

3.2. Theoretical Novelty

However, the main novelty lies in our additional results, that could not be derived in Soudry and Meir (2012b). Specifically, due to the presence of noise, we were able to linearize the map's dynamics, and derive an explicit input–output relation. Such a linearization became possible because we made the (unusual) choice that the “input” to the CBM consists of the time-intervals between stimulation pulses, while the “output” is a binary series indicating whether or not an AP happened immediately after a stimulation pulse. The linearized input–output relation can be expressed either in biophysically interpretable “state space” (Equations 11, 12 and Figure 1C), or as a sum of the filtered input and filtered noise (Equation 20). Note that the overall I/O includes the mean output (Equation 10) which is non-linear. However, the linear part of the response, allows the derivation of the power spectral densities (Equation 13), the construction of linear optimal estimators (e.g., Figure 3C) and blind identification of the (linearized) system parameters (“Ident.” in Figures 3–5).

Our results rely on three main assumptions. The temporal sparseness of the input τ_AP ≪ T_m insures that the slow variables s_m effectively represent the “neuronal state” alone (as V and r always relax to a steady state before the next stimulation is given). The additional assumption T_m ≪ τ_s allowed us to integrate the model dynamics and derive the reduced map (Equation 8) for the dynamics of s_m, which is linear in T_m. The last assumption is that the dynamics of s_m can be linearized around a stable fixed point s_*. This fixed point is generated due to the noisiness of the rapid variables (Figure 2), and the assumption T_m ≪ τ_s ensures that the stochastic fluctuations around s_* are small. We performed extensive numerical simulations (section 2.5) that indicate that our analytical results are accurate – sometimes even if our assumptions break down.

However, clearly there are cases, beyond our assumptions, in which are results cannot hold. For example, if ${\hat{T}}_{m}$ has very large fluctuations, then the response of the neuron cannot be completely linear, since $0 < {\hat{Y}}_{m} < 1$ . Such cases may require an extension of the formalism described here. There are many possible extensions which we did not pursue here. For example, one can extend the modeling framework (e.g., multi-compartment neurons), stimulation regime (e.g., heterogeneous pulse amplitudes), or the type of neurons modeled (e.g., bursting and spontaneously firing neurons). However, it seems that an important assumption, that cannot be easily removed, is that the input is temporally sparse (τ_AP ≪ T_m).

3.3. Practical Significance

Is such a sparse temporally stimulation regime “physiologically relevant” for the soma of a neuron? Currently, such question cannot be answered directly, since it is impossible to accurately measure all the current arriving to the soma from the dendrites under completely physiological conditions. However, there is some indirect evidence. Recent studies have shown that the distribution of synaptic efficacies in the cortex is log-normal (Song et al., 2005) – so a few synapses are very strong, while most are very weak. This indicates that the neuronal firing patterns might in fact be dominated by a small number of very strong synapses while the sum of the weak synapses sets the voltage baseline (Ikegaya et al., 2012). Such a possibility is supported by the fact that individual APs can trigger the complex network events in humans (Molnár et al., 2008; Komlósi et al., 2012). Also, in rats, individual cortical cells can elicit whisker movements in Brecht et al. (2004) and even modify the global brain state (Li et al., 2009). Taken together, these observations suggests that the above-threshold stimulation reaching the soma may be temporally sparse in some cases.

There are other obvious cases were our results are immediately applicable. First, in an axonal compartment, the relevant input current is indeed sparse – an AP spike train arriving from a previous compartment. Second, a direct pulse-like stimulation is used in cochlear implants (Goldwyn et al., 2012, and references therein). Lastly, such stimulation is used as an experimental probe (De Col et al., 2008; Gal et al., 2010; Gal and Marom, 2013). Specifically, since we now have a precise expression for the power spectral density of the response, we are now ready to use these analytical results in Soudry and Meir (2014) to reproduce the experimentally observed 1/f^α behavior of the neuron and explore its implications on its input–output relation.

4. Methods

In this section we provide the details of the results presented in the paper. Sections 4.1–4.5 here respectively correspond to Sections 2.1–2.5. The first four (theoretical) sections can be read independently of each other (except when we discuss the repeating “HHS model” example). The last section give the details of the numerical simulations.

4.1. Full Model (Biophysical Neuron Models)

As we explained in section 2.1, a general model for a biophysical isopotential neuron is given by the following equations

\begin{matrix} \dot{V} = f (V, r, s, I (t)), & (24) \end{matrix}

\begin{matrix} \dot{r} = A_{r} (V) r + B_{r} (V, r) ξ_{r}, & (25) \end{matrix}

\begin{matrix} \dot{s} = A_{s} (V) s + B_{s} (V, s) ξ_{s}, & (26) \end{matrix}

with voltage V, stimulation current I(t), rapid variables r (e.g., m, n, h in the Hodgkin–Huxley (HH) model Hodgkin and Huxley, 1952), slow variables s (e.g., slow sodium inactivation Chandler and Meves, 1970), rate matrices A_r/s, white noise processes ξ_r/s (with zero mean and unit variance), and matrices B_r/s which can be written explicitly using the rates and ion channel numbers (Orio and Soudry, 2012) (D = BB^⊤ is the diffusion matrix Gardiner, 2004; Orio and Soudry, 2012). In this section we give the specific forms of A_r/s and B_r/s, and their origin based on neuronal biophysics.

Microscopic origins

Such a model is commonly called a stochastic Conductance Based Model (CBM). In a non-stochastic CBM, the dynamics of the membrane voltage V (Equation 36) are deterministically determined by some general function of V, the stimulation current I(t), and some internal state variables r and s. In contrast, the dynamical equations for r and s here adhere to a specific Stochastic Differential Equation (SDE) form, since these variables describe the “population state” of all the ion channels in the neuron. We now explain the biophysical interpretation of those equations.

At the microscopic level, each ion channel has several states, and it switches between those states with voltage dependent rates (Hille, 2001). This is usually modeled using a Markov model framework (Colquhoun and Hawkes, 1981). Formally, suppose we index by c the different types of channels, c = 1, …, C. For each channel type c there exist N^(c) channels, where each channel of type c possesses K^(c) internal states. In the Markov framework, for each ion channel that resides in state i, the probability that the channel will be in state j after an infinitesimal time dt is given by

\begin{matrix} {\begin{array}{l} A_{i j}^{(c)} (V) d t, & if j \neq i \\ 1 - \sum_{j \neq i} A_{j i}^{(c)} (V) d t, & if j = i \end{array}, & (27) \end{matrix}

where A^(c) (V) is called the “rate matrix” for that channel type.

To facilitate mathematical analysis and efficient numerical simulation, we preferred to model stochastic CBMs using a compressed, SDE form. This method was initially suggested by Fox and Lu (1994), but their method suffered from several problems (Goldwyn et al., 2011). In a recent paper (Orio and Soudry, 2012) a more general method was derived, which had none of the previous problems, and was shown numerically to produce a very accurate approximation of the original Markov process description.

Derivation

According to Orio and Soudry (2012), if we define x^(c)_k to be the fraction of c-type channels in state k, and x^(c) to be a column vector composed of x^(c)_k, then

\begin{matrix} {\dot{x}}^{(c)} = A^{(c)} (V) x^{(c)} + B^{(c)} (V, x^{(c)}) ξ^{(c)}, & (28) \end{matrix}

where ξ^(c) is a white noise vector process – meaning it has zero mean and auto-covariance

〈 ξ^{(c)} (t) {(ξ^{(c)} (t^{'}))}^{⊤} 〉 = I δ_{c, c^{'}} δ (t - t^{'})

where I is the identity matrix, δ(t) is the Dirac delta function, and δ_{c, c′} = 1 if c = c′ and 0 otherwise. Furthermore, B^(c) is defined so that in Equation (28) each component of ξ^(c), which is associated with a transition pair i ⇋ j, is multiplied by $\sqrt{(A_{i j}^{(c)} x_{j}^{(c)} + A_{j i}^{(c)} x_{i}^{(c)}) / N^{(c)}}$ , and appears in the equation for ${\dot{x}}_{i}^{(c)}$ and ${\dot{x}}_{j}^{(c)}$ with opposite signs. Note that B^(c) is not necessarily square since it has K^(c) rows but the number of columns is equal to the number of transition pairs.

We now need to combine Equation (28) for all c to obtain Equations (1–3). For simplicity, assume now that all ion channels types can be classified as either “rapid” or “slow” (this assumption can be relaxed). In this case we can concatenate all vectors related to rapid channels $r ≜ {(x_{(1)}^{⊤}, \dots, x_{(R)}^{⊤})}^{⊤}$ and to slow channels $s ≜ {(x_{(R + 1)}^{⊤}, \dots, x_{(R + S)}^{⊤})}^{⊤}$ , where R + S = C. We similarly define ξ_r and ξ_s together with the block matrices

A_{r} ≜ (\begin{matrix} \begin{matrix} A^{(1)} \end{matrix} & 0 & \dots & 0 \\ 0 & A^{(2)} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & A^{(R)} \end{matrix}), A_{s} ≜ (\begin{matrix} A^{(R + 1)} & 0 & \dots & 0 \\ 0 & A^{(R + 2)} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & A^{(R + S)} \end{matrix})

and similarly for B_r and B_s. Note that A_r is square with size $\tilde{M} = \sum_{c = 1}^{R} K^{(c)}$ rows while A_s is square with size $\tilde{M} = \sum_{c = R + 1}^{R + S} K^{(c)}$ rows.

4.1.1. Compressed formulation

In some cases, it is more convenient to re-write Equations (1–3) in a compressed form (this is always possible)

\begin{matrix} \dot{V} = f (V, r, s, I (t)), & (29) \end{matrix}

\begin{matrix} \dot{r} = {\tilde{A}}_{r} (V) r - b_{r} (V) + {\tilde{B}}_{r} (V, r) ξ_{r}, & (30) \end{matrix}

\begin{matrix} \dot{s} = {\tilde{A}}_{s} (V) s - b_{s} (V) + {\tilde{B}}_{s} (V, s) ξ_{s}, & (31) \end{matrix}

where r, s, and ξ_r/s have been redefined (their dimension has decreased), as we will show next. First, we comment that the main disadvantage is of these equations is that they are less compact and the notation is somewhat more cumbersome. However, there are also several advantages to this approach: (1) The vectors and matrices are smaller, (2) The rate and diffusion matrices do not have “troublesome” zero eigenvalues and can be diagonal (which is analytically convenient), (3) Most CBMs are written using this form (e.g., the HH model), so it is easier to apply our results using this formalism.

Derivation

To derive these compressed equations, we use the fact x^(c)_k denote fractions, so ∑_kx^(c)_k = 1, for all c. We can use this constraint, together with the irreducibility of the underlying ion channel process, to reduce by one the dimensionality of Equation (28) (see Soudry and Meir, 2012a for further details). Defining I to be the identity function, J to be the I with it last row removed, e ≜ (0, 0, …, 1)^⊤, u ≜ (1, 1, …, 1)^⊤, G ≜ (I − eu^⊤) J^⊤, ${\tilde{A}}^{(c)} ≜ J A^{(c)} G$ , ${\tilde{B}}^{(c)} ≜ J B^{(c)}$ (with x^(c)_K^(c) replaced by 1 − x₁ − x₂ … −x_K^(c)−1) and b^(c) ≜ −JA^(c) e ( ${\tilde{A}}^{(c)}$ is invertible), we obtain the following equation for the reduced state vector y^(c) = Jx^(c) (which has only K^(c) − 1 states)

{\dot{y}}^{(c)} = {\tilde{A}}^{(c)} y^{(c)} - b + {\tilde{B}}^{(c)} ξ^{(c)} .

Again assuming that all channels can be classified as either “rapid” or “slow,” we concatenate all vectors related to rapid channels r ≜ (y^⊤₍₁₎, …, y^⊤_(R))^⊤ and to slow channels s ≜ (y^⊤_(R+1), …, y^⊤_(R+S))^⊤, where R + S = C. We obtain Equations (30, 31) by similarly defining b_r, b_s, ξ_r and ξ_s together with the block matrices

{\tilde{A}}_{r} ≜ (\begin{matrix} \begin{matrix} {\tilde{A}}^{(1)} & 0 & \dots & 0 \\ 0 & {\tilde{A}}^{(2)} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & {\tilde{A}}^{(R)} \end{matrix} \end{matrix}), {\tilde{A}}_{s} ≜ (\begin{matrix} {\tilde{A}}^{(R + 1)} & 0 & \dots & 0 \\ 0 & {\tilde{A}}^{(R + 2)} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & {\tilde{A}}^{(R + S)} \end{matrix}) ​,

and similarly for ${\tilde{B}}_{r}$ and ${\tilde{B}}_{s}$ . Note that ${\tilde{A}}_{r}$ is square with ${\tilde{M}}_{r} = \sum_{c = 1}^{R} K^{(c)} - R$ rows while ${\tilde{A}}_{s}$ is square with ${\tilde{M}}_{s} = \sum_{c = R + 1}^{R + S} K^{(c)} - S$ rows. Furthermore, it can be shown (Soudry and Meir, 2012a) that ${\tilde{A}}^{(c)}$ is a strictly stable matrix (all its eigenvalues are also eigenvalues of A^(c) except its zero eigenvalue, and so have a strictly negative real part), and ${\tilde{D}}^{(c)} ≜ {\tilde{B}}^{(c)} {\tilde{B}}^{(c) ⊤}$ is positive definite (so all its eigenvalues are real and strictly positive). Therefore, also ${\tilde{A}}_{r}$ and ${\tilde{A}}_{s}$ are both strictly stable and ${\tilde{D}}_{r}$ and ${\tilde{D}}_{s}$ are positive definite. Therefore, if V is held constant, 〈s〉 and 〈r〉 tend to $s_{\infty} = {\tilde{A}}_{s}^{- 1} b_{s}$ and $r_{\infty} = {\tilde{A}}_{r}^{- 1} b_{r}$ , respectively.

Example – the HHS model

The HHS model can be easily written using the compressed formulation. For example, comparing Equation (23) with Equation (31) we find that

\begin{matrix} A_{s} (V) = - γ (V) - δ (V) & (32) \end{matrix}

\begin{matrix} b_{s} (V) = - δ (V) & (33) \end{matrix}

\begin{matrix} B_{s} (V, s) = \sqrt{(δ (V) (1 - s) + γ (V) s) N_{s}^{- 1} ϕ} & (34) \end{matrix}

\begin{matrix} D_{s} (V, s) = (δ (V) (1 - s) + γ (V) s) N_{s}^{- 1} ϕ . & (35) \end{matrix}

Note that all the parameters are scalar in the HHS model, and so are not boldfaced, as in the general case.

4.2. Model Reduction

In this section we give additional technical details on section 2.2. Specifically, we show how, given sparse spike stimulation and a few assumptions, it is possible to derive a simple reduced dynamical system (Equation 8) from the full equations of a general biophysical model for an isopotential neuron (Equations 1–3),

\begin{matrix} \dot{V} = f (V, r, s, I (t)), & (36) \end{matrix}

\begin{matrix} \dot{r} = A_{r} (V) r + B_{r} (V, r) ξ_{r}, & (37) \end{matrix}

\begin{matrix} \dot{s} = A_{s} (V) s + B_{s} (V, s) ξ_{s} . & (38) \end{matrix}

For more details on how its parameters and variables map to microscopic biophysical quantities, see section 4.1.

4.2.1. The excitability constraint

As explained in section 2.1, we focus on models for excitable neurons describable by equations of the general form of Equations (36–38), rather than on arbitrary dynamical systems. This imposes some constraints on the parameters (Soudry and Meir, 2012b). Formally, recall that τ_AP and τ_s are the respective kinetic timescales of {V, r} and s, and that τ_AP < τ_s. Suppose we “freeze” the dynamics of s (so that effectively τ_s = ∞) and allow only V and r to evolve in time. We say the original model describes an excitable neuron, if the following conditions hold in this “semi-frozen” model:

If I(t) = 0, then for all initial conditions, V and r rapidly (within timescale τ_AP) relax to a constant and unique steady state (“rest”).
Assume that V and r are near rest, and a short stimulation pulse is given with duration t_stim ≤ τ_AP and amplitude I₀. For certain initial conditions and values of I₀, we get either a stereotypical “strong” response (“AP response”) or a stereotypical “weak” response in V (“no AP response”). Only for a very small set of initial conditions and values of I₀, do we get an “intermediate” response (“weak AP response”). By “stereotypical” we mean that the shape of response does not change much between trials or for different initial conditions in {V, r} (however, it can change with s).

Note that due to condition 1, such an excitable neuron is not oscillatory and does not spontaneously fire APs.

4.2.2. Problem formulation

Formally, suppose an excitable neuron receives a train of identical stimuli, so

I (t) = \sum_{m = - \infty}^{\infty} ⊓ (t - t_{m}),

where ⊓(x) is a pulse, of width t_w (so ⊓(x) = 0 for x outside [0, t_w]). We denote by {Y_m}^∞_m=−∞ the occurrence events of AP responses at times {t_m}^∞_m=−∞, i.e., immediately after each stimulation time t_m (Figure 1A),

\begin{matrix} Y_{m} ≜ {\begin{array}{l} \begin{array}{l} 1 & , & if an AP occurs \\ 0 & , & otherwise \end{array} \end{array} . & (39) \end{matrix}

Defining T_m ≜ t_{m + 1} − t_m, the inter-stimulus interval, and τ_AP as the upper timescale of an AP event (Figure 1B) we make the following assumption.

Assumption 1. (a) The stimulation pulse width is small, t_w < τ_AP. (b) The spike times {t_m}^∞_{m = 0} are temporally sparse, i.e., τ_AP ≪ T_m for every m (“no collisions”).

Our main objective here is to mathematically characterize the relation between {Y_m} and {T_m} under the most general conditions.

4.2.3. Derivations

We define the sampled quantities V_m ≜ V(t_m), r_m ≜ r(t_m), s_m ≜ s(t_m), x_m ≜ (V_m, r^⊤_m, s^⊤_m)^⊤ and the history set yes _m ≜ ${{x_{k}}_{k = - \infty}^{m}, {T_{k}}_{k = - \infty}^{m}, {Y_{k}}_{k = - \infty}^{m}}$ (note that yes _m ⊂ yes _{m + 1}). The Stochastic Differential Equation (SDE) description in Equations (36–38) implies that x_m is a “state vector” with the Markov property, namely it is a sufficient statistic on the history to determine the probability of generating an AP at each stimulation,

and, together with Y_m and T_m, its own dynamics

which implies the following causality relations

\begin{matrix} \begin{matrix} x_{0} & \overset{T_{0}}{\to} & x_{1} & \overset{T_{1}}{\to} & x_{2} & \dots & x_{m} \\ ↓ & ↗ & ↓ & ↗ & ↓ & ↓ \\ Y_{0} & Y_{1} & Y_{2} & \dots & Y_{m} \end{matrix} . & (42) \end{matrix}

This causality structure is reminiscent of the well known Hidden Markov Model (Rabiner, 1989), except that in the present case the output Y_m, affects the transition probability, and we have input T_m. Theoretically, if we knew the distributions in Equations (40) and (41), as well as the initial condition P (x₀), we could integrate and find an exact probabilistic I/O relation P({Y_k}^m_k=0|{T_k}^m_k=0. However, since it may be hard to find an expression for P(x_{m + 1}|x_m, T_m, Y_m) in general, we make a simplifying assumption.

Assumption 2. T_m ≪ τ_s for every m.

This assumption, together with Assumption 1 and the excitable nature of the CBM, renders the dynamics between stimulations relatively easy to understand. Specifically, between two consecutive stimulations, the fast variables (V(t), r(t)) follow stereotypically either the “AP response” (Y_m = 1) or the “no-AP response” (Y_m = 0), then equilibrate rapidly (within time τ_AP) to some quasi-stationary distribution q(V, r|s_m). Meanwhile, the slow variable s (t), starting from its initial condition at the time of the previous stimulation, changes slowly according to Equation (38), affected by the voltage trace of V(t) (through A_s (V)).

Summarizing this mathematically, we obtain the following approximations

\begin{matrix} P (Y_{m} | s_{m}) \approx \int P (Y_{m} | V, r, s_{m}) q (V, r | s_{m}) d V d r, & (43) \end{matrix}

\begin{matrix} \begin{array}{l} P (V_{m + 1}, r_{m + 1}, s_{m + 1} | s_{m}, T_{m}, Y_{m}) \approx q (V_{m + 1}, r_{m + 1} | s_{m + 1}) \\ P (s_{m + 1} | s_{m}, T_{m}, Y_{m}) . \end{array} & (44) \end{matrix}

Using these equations together with Equations (40) and (41), we obtain

Therefore, the “excitability” vector s_m can now replace the full state vector x_m = (V_m, r^⊤_m, s^⊤_m)^⊤ as the sufficient statistic that retains all relevant the information about the history of previous stimuli. Given the input {T_m}^∞_m=−∞, Equations (45) and (46) together completely specify a Markov process with the causality structure

\begin{matrix} \begin{matrix} \begin{matrix} s_{0} \end{matrix} & \overset{T_{0}}{\to} & s_{1} & \overset{T_{1}}{\to} & s_{2} & \dots & s_{m} \\ ↓ & ↗ & ↓ & ↗ & ↓ & ↓ \\ Y_{0} & Y_{1} & Y_{2} & \dots & Y_{m} \end{matrix} . & (47) \end{matrix}

Since the function p_AP (s) is not affected by the kinetics of s, it can be found by numerical simulation of a single AP using only Equations (36, 37), when s is held constant (see section 4.2.4). Now, instead of finding P(s_{m + 1}|s_m, Y_m, T_m) directly, we calculate the increments Δs_m ≜ s_{m + 1} − s_m by integration of the SDE in Equation (38) between t_m and t_{m + 1}. First, we integrate the “predictable” part of the increment

\begin{matrix} 〈 Δ s_{m} | s_{m}, T_{m}, Y_{m} 〉 = \int_{t_{m}}^{t_{m + 1}} A_{s} (V (t)) s (t) d t, & (48) \end{matrix}

\begin{matrix} \approx (\int_{t_{m}}^{t_{m + 1}} A_{s} (V (t)) d t) s_{m}, & (49) \end{matrix}

to first order, where 〈X|Y〉 denotes the conditional expectation of X given Y. Note that A_s ~ O(τ_s⁻¹), so second order corrections are of order O((T_mτ_s⁻¹)²). Due to assumption 2, we have T_mτ_s⁻¹ ≪ 1, so these corrections are negligible. Now,

\begin{array}{l} \int_{t_{m}}^{t_{m + 1}} A_{s} (V (t)) d t = τ_{AP} (\frac{1}{τ_{AP}} \int_{t_{m}}^{t_{m} + τ_{AP}} ​ A_{s} (V (t)) d t) \\ + ​ (T_{m} - τ_{AP}) ​ (\frac{1}{T_{m} - τ_{AP}} ​ \int_{t_{m} + τ_{AP}}^{t_{m + 1}} A_{s} (V (t)) d t) \\ = τ_{AP} (A_{+} (s_{m}) Y_{m} + A_{-} (s_{m}) (1 - Y_{m})) \\ + (T_{m} - τ_{AP}) A_{0} (s_{m}) \end{array}

where we defined

\begin{matrix} A_{0} (s_{m}) = \frac{1}{T_{m} - τ_{AP}} \int_{t_{m} + τ_{AP}}^{t_{m + 1}} A_{s} (V (t)) d t, & (50) \end{matrix}

\begin{matrix} A_{-} (s_{m}) = \frac{1}{τ_{AP}} \int_{t_{m}}^{t_{m} + τ_{AP}} A_{s} (V (t)) d t, if Y_{m} = 0, & (51) \end{matrix}

\begin{matrix} A_{+} (s_{m}) = \frac{1}{τ_{AP}} \int_{t_{m}}^{t_{m} + τ_{AP}} A_{s} (V (t)) d t, if Y_{m} = 1, & (52) \end{matrix}

which are the average rates during rest, during an AP response and during a no-AP response, receptively. Note a similar notation was also used in Soudry and Meir (2012b) (e.g., Equations 2.15, 2.16 there), where the +/ − /0 were replaced with H/M/L.

Next, we calculate the remaining part of the increment, which is the “innovation,”

n_{m} ≜ Δ s_{m} - 〈 Δ s_{m} | s_{m}, T_{m}, Y_{m} 〉 .

Obviously, 〈n_m|s_m, T_m, Y_m〉 = 0, and also

\begin{array}{l} 〈 n_{m} n_{m}^{⊤} | s_{m}, T_{m}, Y_{m} 〉 = 〈 (\int_{t_{m}}^{t_{m + 1}} B_{s} (V (t), s (t)) ξ_{s} (t) d t) \\ (\int_{t_{m}}^{t_{m + 1}} B_{s} (V (t), s (t^{'})) {ξ_{s} (t^{'}) d t^{'})}^{⊤} | s_{m}, T_{m}, Y_{m} 〉 \\ = \int_{t_{m}}^{t_{m + 1}} d t ​ \int_{t_{m}}^{t_{m + 1}} d t^{'} δ ​ (t - t^{'}) ​ B_{s} ​ (V (t), s ​ (t)) B_{s}^{⊤} (V (t^{'}), s (t^{'})) \\ = \int_{t_{m}}^{t_{m + 1}} B_{s} (V (t), s (t)) B_{s}^{⊤} (V (t^{'}), s (t)) d t \\ = \int_{t_{m}}^{t_{m + 1}} D_{s} (V (t), s (t)) d t \\ \approx \int_{t_{m}}^{t_{m + 1}} D_{s} (V (t), s_{m}) d t \end{array}

to first order. Note that D_s ~ O(τ_s⁻¹/N), where N = min_cN^(c)(N^(c) is the channel number of the c-type channel, as we defined in section 4.1), while Equation (53) has corrections of size O((T_mτ_s⁻¹/N)²). Since N ≥ 1 (usually N ≫ 1), and due to assumption 2, we have T_mτ_s⁻¹/N ≪ 1, so these corrections are also negligible. Now,

\begin{matrix} \begin{array}{l} \int_{t_{m}}^{t_{m + 1}} D_{s} (V (t), s_{m}) d t \\ = τ_{AP} (Y_{m} D_{+} (s_{m}) + (1 - Y_{m}) D_{-} (s_{m})) + (T_{m} - τ_{AP}) D_{0} (s_{m}) \end{array} & (53) \end{matrix}

where we defined

\begin{matrix} D_{0} (s_{m}) = \frac{1}{T_{m} - τ_{AP}} \int_{t_{m} + τ_{AP}}^{t_{m + 1}} D_{s} (V (t), s_{m}) d t & (54) \end{matrix}

\begin{matrix} D_{-} (s_{m}) = \frac{1}{τ_{AP}} \int_{t_{m}}^{t_{m} + τ_{AP}} D_{s} (V (t), s_{m}) d t, if Y_{m} = 0 & (55) \end{matrix}

\begin{matrix} D_{+} (s_{m}) = \frac{1}{τ_{AP}} \int_{t_{m}}^{t_{m} + τ_{AP}} D_{s} (V (t), s_{m}) d t, if Y_{m} = 1. & (56) \end{matrix}

Additionally, we note that A_±/0 (s_m) generally tend to be rather insensitive to changes in s_m. This is because the kinetic transition rates (which are used to construct A_s (V), as explained in section 4.1) tend to demonstrate this insensitivity when similarly averaged (see Figures 4B, 5 in Soudry and Meir, 2012b). The usual reasons behind this are (see appendix section B1 of Soudry and Meir, 2012b): (1) The common sigmoidal shape of the voltage dependency of the kinetic rates reduces their sensitivity to changes in the amplitude of the AP or the resting potential (2) The shape of the AP is relatively insensitive to s (3) The resting voltage is relatively insensitive to s. Therefore, in most cases we can approximate A_±/0 (s_m) to be constant for simplicity (though this not critical to our subsequent results), as we shall henceforth do.

Additionally, we note that, strictly speaking, the voltage trace during an AP and at rest are stochastic, and therefore, A₊, A₋, A₀, D₊, D₋ and D₀ are stochastic. However, there are two factors that render this stochasticity negligible. First, the sigmoidal shape of the kinetic rates implies that A(V) is rather insensitive to fluctuations in the voltage (Figure 4 in our Soudry and Meir, 2012b). Second, noise mainly plays a role in the timing of AP initiation, but does not much affect the AP shape above threshold (see AP voltage traces in Schneidman et al., 1998, p. 1687). Therefore, we shall approximate A₊, A₋, A₀, D₊, D₋ and D₀ to be deterministic.

In summary, defining

\begin{matrix} \begin{array}{l} A (Y_{m}, T_{m}) = τ_{AP} T_{m}^{- 1} (Y_{m} A_{+} + (1 - Y_{m}) A_{-}) + \\ (1 - τ_{AP} T_{m}^{- 1}) A_{0}, \end{array} & (57) \end{matrix}

and

\begin{matrix} \begin{array}{l} D (Y_{m}, T_{m}, s_{m}) = τ_{AP} T_{m}^{- 1} (Y_{m} D_{+} (s_{m}) + (1 - Y_{m}) D_{-} (s_{m})) \\ + (1 - τ_{AP} T_{m}^{- 1}) D_{0} (s_{m}) \end{array} & (58) \end{matrix}

we can write

\begin{matrix} Δ s_{m} = T_{m} A (Y_{m}, T_{m}) s_{m} + n_{m}, & (59) \end{matrix}

with〈n_m|s_m, T_m, Y_m〉 = 0 and

\begin{matrix} 〈 n_{m} n_{m}^{⊤} | s_{m}, T_{m}, Y_{m} 〉 = T_{m} D (Y_{m}, T_{m}, s_{m}) . & (60) \end{matrix}

These equations correspond to the result presented in Equation (8).

Finally, we note that the distribution of n_m given s_m, T_m, Y_m can be generally computed using the approach described in Orio and Soudry (2012). For example, it can be well approximated to have a normal distribution if channel numbers are sufficiently high and channel kinetics are not too slow (Orio and Soudry, 2012). In that case only knowledge of the variance (Equation 60) is sufficient to generate n_m. And so, using Equations (45), (59) and the full distribution of n_m, we can now simulate the neuronal response using a reduced model, more efficiently and concisely (with fewer parameters) than the full model (Equations 36, 38), since every time step is a stimulation event. The simulation time should shorten approximately by a factor of 〈T_m〉/dt, where dt is the full model simulation step. Note that the reduced model parameters, having been deduced from the full model itself, still retain a biophysical interpretation.

4.2.4. Calculation of p_AP (s)

We numerically calculated p_AP (s) by disabling all the slow kinetics in the model – i.e., we only use Equations (1, 2) in main text, while ṡ = 0. Then, for every value of s we simulated this “semi-frozen” model numerically by first allowing r to relax to a steady state and then giving a stimulation pulse with amplitude I₀. We repeat this procedures 200 times for each s, and calculate p_AP (s) as the fraction of simulations that produced an AP. A few comments are in order: (1) In some cases (e.g., the HHMS model) we can use a shortcut and calculate p_AP (s) based on previous results. For example, suppose we know the probability function $\tilde{p}$ _AP (s) for some model with a scalar s and we make the substitution s = h (s) where the components of s represent independent and uncoupled channel types (Orio and Soudry, 2012) – then p_AP (s) = $\tilde{p}$ _AP (h (s)) in the new model. (2) The timescale separation assumption τ_AP ≪ T_m ≪ τ_s implies that all the properties of the generated AP (amplitude, latency etc.) maintain similar causality relations with s_m as does Y_m, so we can find their distribution using the same simulation we used to find p_AP (s), similarly to the approach taken to compute L (s) in the deterministic setting (Soudry and Meir, 2012b). (3) Numerical results (Figure 6) suggest that we can generally write

\begin{matrix} p_{AP} (s) = Φ (E (s) / \sqrt{N_{r}}), & (61) \end{matrix}

where Φ is the cumulative distribution function of the normal distribution, E (s) is some “excitability function” (as defined in Soudry and Meir (2012b), so p_AP (s) = 0.5 on the threshold Θ = {s|E (s) = 0}), and N^−1/2_r, the “noisiness” of the rapid sub-system, directly affects the slope of p_AP (s) (Figure 6D, bottom). Also, as explained in Soudry and Meir (2012b), E (s) is usually monotonic in each component separately and increasing in I₀ (Figure 6C, top) – which could be considered as just another component of s which has zero rates.

4.2.5. Compressed formulation – reduction

We can perform a very similar model reduction and linearization using the compressed formalism presented in section 4.1.1. We just need to define (or re-define) A_±,0, b_±,0, D_±,0 (s_m), A (Y_m, T_m), b (Y_m, T_m) and D (Y_m, T_m, s_m) in the obvious way and repeat very similar derivations, arriving to

Δ s_{m} = T_{m} [A (Y_{m}, T_{m}) s_{m} - b (Y_{m}, T_{m})] + n_{m},

instead of Equation (59) (or Equation 8). Next, we demonstrate this for the HHS model.

4.2.6. Example – HHS model reduction

We derive the parameters of the HHS reduced map. Recall that the HHS model is based on the compressed formulation. Following the reduction technique described in the previous sections, we numerically find the average rates γ_±,0 and δ_±,0 (as in Equations (2.15, 2.16) of Soudry and Meir (2012b), where there we denoted H/M/L instead of +/ − /0 here), τ_AP and p_AP (s) (section 4.2.4).

From Equations (32, 35), we find,

\begin{matrix} A_{\pm, 0} = - γ_{\pm, 0} - δ_{\pm, 0} & (62) \end{matrix}

\begin{matrix} b_{\pm, 0} = - δ_{\pm, 0} & (63) \end{matrix}

\begin{matrix} D_{\pm, 0} (s) = N_{s}^{- 1} (δ_{\pm, 0} (1 - s) + γ_{\pm, 0} s) . & (64) \end{matrix}

and so A (Y_m, T_m) and D (Y_m, T_m, s_m) are defined as in Equations (57) and (58), and similarly

b ​ (Y_{m}, T_{m}) ​ = ​ τ_{AP} T_{m}^{- 1} ​ (Y_{m} b_{+} + (1 - Y_{m}) b_{-}) + ​ (1 - τ_{AP} T_{m}^{- 1}) b_{0} .

We give for example some specific values: if τ_AP = 15 ms, then in the range I₀ = 7.5−8.3 μA, we have δ_±,0 = 25.5−25.6 mHz, γ₊ = 22.9−22.1 mHz, γ₋ = 0.9−1.3μ Hz and γ₀ = 0.29−0.28μ Hz.

Recall that these averaged kinetic rates are determined by the shape of the voltage dependent rates (γ (V) and δ (V), see Equation 125) (Soudry and Meir, 2012b). The relative values of the averaged kinetic rates determine what kind of information can be stored in s (which retains the “memory” of the neuron between stimulation). We qualitatively demonstrate this in Figure 7 depicting the values of γ_±,0 for three different shapes of γ (V): when γ (V) is sigmoidal with high threshold, when it is sigmoidal with low threshold and when it is constant. These determine whether γ (V) is affected by the output (APs), the input (stimulation pulses) or neither. Therefore: (1) if γ (V) and δ (V) are independent of the voltage, then s cannot store any information on input or output. (2) if γ (V) or δ (V) have low voltage threshold, then s can directly store information on the input. (3) if γ (V) or δ (V) have high voltage threshold, then s can directly store information about the output. In the HHS model the inactivation rate γ has high threshold, while δ is voltage independent – therefore, s directly stores information on the output.

4.3. Linearization

In this section we present a more detailed account on how to arrive from the reduced model (mainly, Equation 8) to its linearized version (the results in Equations 11, 12).

First, we write the complete reduced model, using Equations (59), (60), and (45). The reduced model is a non-linear stochastic dynamic “state-space” system with T_m, the inter-stimulus interval lengths, serving as inputs, s_m representing the neuronal state, and Y_m the output. We have

\begin{matrix} Δ s_{m} = T_{m} A (Y_{m}, T_{m}) s_{m} + n_{m}, & (65) \end{matrix}

\begin{matrix} Y_{m} = p_{AP} (s_{m}) + e_{m}, & (66) \end{matrix}

where 〈n_mn^⊤_m|s_m, T_m, Y_m〉 = T_mD (Y_m, T_m, s_m),

\begin{array}{l} A (Y_{m}, T_{m}) = τ_{AP} T_{m}^{- 1} (Y_{m} A_{+} + (1 - Y_{m}) A_{-}) + \\ (1 - τ_{AP} T_{m}^{- 1}) A_{0}, \end{array}

and

\begin{array}{l} D (Y_{m}, T_{m}, s_{m}) = τ_{AP} T_{m}^{- 1} (Y_{m} D_{+} (s_{m}) + (1 - Y_{m}) D_{-} (s_{m})) + \\ (1 - τ_{AP} T_{m}^{- 1}) D_{0} (s_{m}), \end{array}

and we defined

\begin{matrix} e_{m} ≜ Y_{m} - p_{AP} (s_{m}) . & (67) \end{matrix}

Based on the causality structure in Equation (42), it is straightforward to prove that e_m and n_m are uncorrelated white noise processes – i.e., 〈e_m〉 = 0, 〈n_m〉 = 〈e_nn_m〉 = 0 and 〈n_mn^⊤_n〉 = 〈n_mn^⊤_m〉 δ_mn, 〈e_me_n〉 = 〈e²_m〉 δ_mn where δ_nm = 1 if n = m and 0 otherwise.

We now examine the case where {T_m} is a Wide Sense Stationary (WSS) process (i.e., the first and second order statistics of the process are invariant to time shifts), with mean T_*, so that the assumptions τ_AP ≪ T_m ≪ τ_s are fulfilled with high probability. In this case the processes {s_m} and {Y_m} are also WSS, with constant means 〈s_m〉 = s_* and 〈Y_m〉 = p_*. Also, it is straightforward to verify that $〈 {\hat{T}}_{m} n_{n} 〉 = 0$ , and $〈 {\hat{T}}_{m} e_{n} 〉 = 0$ .

In order to linearize the system in Equations (59–66) we denote ${\hat{T}}_{m} ≜ T_{m} - T_{*}$ , ${\hat{Y}}_{m} ≜ Y_{m} - p_{*}$ , ${\hat{s}}_{m} ≜ s_{m} - s_{*}$ , $w ≜ {\nabla p_{AP} |}_{s^{*}}$ . In order for this linearization to be accurate we require that ${\hat{s}}_{m}$ is “small enough.”

Assumption 3. With high probability $| {\hat{s}}_{m} | ≪ | s_{*} |$ (component-wise) and $| w^{⊤} {\hat{s}}_{m} | ≫ | {\hat{s}}_{m}^{⊤} ({\nabla \nabla p_{AP} |}_{s^{*}}) {\hat{s}}_{m} |$ .

This assumption essentially means that s_* = s_*(p_*, T_*) is a stable fixed point of the system (Equations 59–66), and stochastic fluctuations around it are small, compared to the size of the region {s|p_AP (s_m) ≠ 0, 1} (usually determined by the noise level of the rapid system {V, r}, see section 4.2.4). Note that the region is usually rather narrow (Figure 6) and therefore $| {\hat{s}}_{m} | ≪ | s_{*} |$ is often implied by this description. Given Assumption 3, we can approximate to first order

\begin{matrix} p_{AP} (s_{m}) \approx p_{*} + w^{⊤} {\hat{s}}_{m}, & (68) \end{matrix}

which allows us to linearize Equation (66). This essentially means that the components of ${\hat{s}}_{m}$ determine the neuronal response linearly, with the components of w serving as the effective weights (related to the relevant conductances in the original full neuron model).

Next, we wish to linearize Equation (59). Using our assumptions, we obtain to first order

\begin{matrix} \begin{array}{l} {\hat{s}}_{m + 1} \approx {\hat{s}}_{m} + A (p_{*}, T_{*}) (s_{*} + {\hat{s}}_{m}) \\ + A_{0} (s_{*} + {\hat{s}}_{m}) {\hat{T}}_{m} + τ_{AP} (A_{+} - A_{-}) (s_{*} + {\hat{s}}_{m}) {\hat{Y}}_{m} + n_{m} \end{array} & (69) \end{matrix}

Taking expectations and using Equations (66) and (68), we obtain

\begin{matrix} 0 = 〈 s_{m + 1} - s_{m} 〉 \approx T_{*} A (p_{*}, T_{*}) s_{*}, & (70) \end{matrix}

to zeroth order. Defining the solution of this equation is s_*(p_*, T_*) and we can find p_* implicitly from

\begin{matrix} p_{*} = p_{AP} (s_{*} (p_{*}, T_{*})) . & (71) \end{matrix}

We write the explicit solution of this equation as p_*(T_*). Next, using $| {\hat{s}}_{m} | ≪ | s_{*} |$ , Equation (70) and defining

\begin{matrix} F ≜ I + T_{*} A_{*} (p_{*}, T_{*}) & (72) \end{matrix}

\begin{matrix} d ≜ A_{0} s_{*} & (73) \end{matrix}

\begin{matrix} a ≜ τ_{AP} (A_{+} - A_{-}) s_{*} & (74) \end{matrix}

we can approximate Equation (69) as

\begin{matrix} {\hat{s}}_{m + 1} = F {\hat{s}}_{m} + d {\hat{T}}_{m} + a {\hat{Y}}_{m} + n_{m}, & (75) \end{matrix}

which, together with

\begin{matrix} {\hat{Y}}_{m} = w^{⊤} {\hat{s}}_{m} + e_{m}, & (76) \end{matrix}

yields a simple linear state space representation with ${\hat{T}}_{m}$ as the input, ${\hat{s}}_{m}$ as the state, ${\hat{Y}}_{m}$ as the output and two uncorrelated white noise sources with variances

\begin{matrix} Σ_{n} ≜ 〈 n_{m} n_{m}^{⊤} 〉 = T_{*} D_{*} (p_{*}, T_{*}, s_{*}), & (77) \end{matrix}

\begin{matrix} σ_{e}^{2} ≜ 〈 e_{m}^{2} 〉 \approx p_{*} - p_{*}^{2}, & (78) \end{matrix}

to first order.

4.3.1. Derivation of w

From Equation (61), we note that generally we can write

\begin{matrix} w = \nabla p_{AP} {(s)}_{s = s_{*}} = \frac{\nabla E (s_{*})}{\sqrt{2 π N_{r}}} \exp (- \frac{E^{2} (s_{*})}{2 N_{r}}), & (79) \end{matrix}

where in many cases the excitability function E (s) has the form E (s) = μ^⊤ s − θ, where the components of μ are proportional to the relevant conductances (Soudry and Meir, 2012b). Therefore, if

p_{*} = p_{AP} (s_{*}) = Φ (E (s) / \sqrt{N_{r}}) \to 0 or 1

then E (s) → ±∞, so in this case (assuming E (s_*) is not a particularly “pathological” function) we have

\begin{matrix} w \to 0. & (80) \end{matrix}

4.3.2. Compressed formulation – linearization

In the compressed formulation (introduced in sections 4.1.1 and 4.2.5), we can perform similar linearization by re-defining F ≜ I + T_*A (p_*, T_*), d ≜ A₀s_* − b₀, a ≜ τ_AP ((A₊ − A₋)s_* − (b₊ − b₋)), and repeat very similar derivations, where now we can write more explicitly

\begin{matrix} s_{*} = A^{- 1} (p_{*}, T_{*}) b (p_{*}, T_{*}), & (81) \end{matrix}

instead of Equation (70).

4.3.3. Example – HHS model linearization

Note again that all the parameters are scalar now, and so are not boldfaced, as in the general case. From Equations (71) and (81) we obtain s_* and p_* for a given T_*. Once s_* is known, from Equation (79) w can be obtained⁷. Next, we denote the average inactivation rate at steady state by

γ_{*} ≜ (p_{*} γ_{+} + (1 - p_{*}) γ_{-}) τ_{AP} T_{*}^{- 1} + (1 - τ_{AP} T_{*}^{- 1}) γ_{0},

and similarly for the recovery rate δ_*. And so, s_* = δ_*/(γ_* + δ_*), and

\begin{matrix} A_{*} = A_{*} (p_{*}, s_{*}) = - γ_{*} - δ_{*}, & (82) \end{matrix}

\begin{matrix} b_{*} = - δ_{*}, & (83) \end{matrix}

\begin{matrix} D_{*} = D_{*} (p_{*}, T_{*}, s_{*}) = N_{s}^{- 1} (δ_{*} γ_{*} / (γ_{*} + δ_{*})) . & (84) \end{matrix}

Denoting γ₁ ≜ γ₊ − γ₋ and similarly for δ₁, we obtain

\begin{matrix} F = 1 - T_{*} (γ_{*} + δ_{*}) & (85) \end{matrix}

\begin{matrix} a = τ_{AP} (γ_{*} δ_{1} - γ_{1} δ_{*}) / (γ_{*} + δ_{*}) & (86) \end{matrix}

\begin{matrix} d = (γ_{*} δ_{0} - γ_{0} δ_{*}) / (γ_{*} + δ_{*}) & (87) \end{matrix}

Finally, from Equations (77, 78) we find

\begin{matrix} Σ_{n} = T_{*} D_{*} & (88) \end{matrix}

\begin{matrix} σ_{e}^{2} = p_{*} - p_{*}^{2} . & (89) \end{matrix}

4.4. Linear Systems Analysis

In section 2.4 we describe the neuronal dynamics using a linear system for the fluctuations, as depicted in Figure 1. This linear description allows us to use standard engineering tools to analyze the system. In this section we provide an easy to follow description on how this was done, for those unfamiliar with these topics.

4.4.1. Second order statistics and linear systems

We start with a short reminder on some known results for stochastic processes (Papoulis and Pillai, 1965; Gardiner, 2004); these results are standard but are provided for completeness. These results will be used in later sections.

Assume {x_m} and {y_m} are two real-valued vector stochastic processes that are jointly wide-sense stationary (i.e., a simultaneous time shift of both processes will not change their first and second order statistics). We define the cross-covariance (recall that $\hat{x} = x - 〈 x 〉$ )

\begin{matrix} R_{x y} (k) ≜ 〈 {\hat{x}}_{m} {\hat{y}}_{m + k}^{⊤} 〉 \end{matrix}

and the Cross-Power Spectral Density (CPSD), given by its Fourier transform

Additionally, the auto-covariance is defined as R_x ≜ R_xx and the corresponding Power Spectral Density (PSD) as S_x ≜ S_xx. Also, note that R_yx(k) = R_xy^⊤(−k) and so S_yx(k) = S_xy^⊤(−ω).

Suppose now that {y_m} is generated from a process {x_m} using a linear system: i.e., if the Fourier transform x(ω) ≜ ∑^∞_{k = −∞} x_ke^−iωk exists, then in the frequency domain

y (ω) = H (ω) x (ω),

where H(ω) is a matrix-valued “transfer” function. Therefore, under some regularity conditions (allowing us to switch the order of integration end expectation),

\begin{matrix} \begin{array}{l} S_{x y} (ω) = \sum_{k = - \infty}^{\infty} 〈 {\hat{x}}_{m} {\hat{y}}_{m + k}^{⊤} 〉 e^{- i ω k} \\ = S_{x} (ω) H^{⊤} (ω) \end{array} & (90) \end{matrix}

And similarly

\begin{matrix} \begin{array}{l} S_{y} (ω) = \sum_{k = - \infty}^{\infty} 〈 {\hat{y}}_{m} {\hat{y}}_{m + k}^{⊤} 〉 e^{- i ω k} \\ = H (- ω) S_{x} (ω) H^{⊤} (ω) \end{array} & (91) \end{matrix}

where in the second equality here we used an almost identical derivation as for S_xy(ω).

Note that if instead

y (ω) = H_{x} (ω) x (ω) + H_{z} (ω) z (ω),

where x and z are two uncorrelated signals, then we can write

y (ω) = H (ω) v (ω),

where

H (ω) = [\begin{matrix} H_{x} (ω) & 0 \\ 0 & H_{z} (ω) \end{matrix}], v (ω) = [x (ω), z (ω)] .

Thus Equations (90) and (91), respectively give

\begin{matrix} S_{x y} (ω) = S_{x} (ω) H_{x}^{⊤} (ω), & (92) \end{matrix}

\begin{matrix} S_{y} (ω) = H_{x} (- ω) S_{x} (ω) H_{x}^{⊤} (ω) + H_{z} (- ω) S_{z} (ω) H_{z}^{T} (ω) ​ . & (93) \end{matrix}

4.4.2. The second order statistics of our system

Previously, we derived Equations (11, 12), which describe the neuronal dynamics using a linear system, written in “state-space” form

\begin{matrix} {\hat{s}}_{m + 1} = F {\hat{s}}_{m} + d {\hat{T}}_{m} + a {\hat{Y}}_{m} + n_{m}, & (94) \end{matrix}

\begin{matrix} {\hat{Y}}_{m} = w^{⊤} {\hat{s}}_{m} + e_{m} & (95) \end{matrix}

where n_m, e_m and ${\hat{T}}_{m}$ are uncorrelated, zero mean processes with the PSDs Σ_n ≜ T_*D (p_*, T_*, s_*), σ²_e = p_* (1 − p_*) and S_T(ω), respectively.

In order to apply Equations (92) and (93) to our system we first need to find the transfer function of the system. Applying the Fourier transform to Equations (94, 95) gives

\begin{matrix} e^{i ω} \hat{s} (ω) = F \hat{s} (ω) + d \hat{T} (ω) + a \hat{Y} (ω) + n (ω), & (96) \end{matrix}

\begin{matrix} \hat{Y} (ω) = w^{⊤} \hat{s} (ω) + e (ω) . & (97) \end{matrix}

Re-arranging terms, we obtain

\begin{matrix} \hat{s} (ω) = H_{c} (ω) (n (ω) + d \hat{T} (ω) + a e (ω)), & (98) \end{matrix}

\begin{matrix} \hat{Y} (ω) = w^{⊤} H_{c} (ω) ​ (n ​ (ω) + d \hat{T} (ω) + a e (ω)) + e (ω), & (99) \end{matrix}

where we denoted

H_{c} (ω) ≜ {(I e^{i ω} - F - a w^{⊤})}^{- 1} .

This gives the “closed loop” transfer functions of the system (including the effect of the feedback $\hat{Y} (ω)$ ). Next, combining Equations (98, 99) and Equations (92, 93) leads to explicit expressions for the PSDs and CPSDs.

\begin{matrix} S_{s T} (ω) = H_{c} (- ω) d S_{T} (ω) & (100) \end{matrix}

\begin{matrix} S_{s} (ω) = H_{c} (- ω) ​ (Σ_{n} + a a^{⊤} σ_{e}^{2} + d d^{⊤} S_{T} ​ (ω) ​) ​ H_{c}^{⊤} ​ (ω) ​, & (101) \end{matrix}

\begin{matrix} S_{Y T} (ω) = w^{⊤} H_{c} (- ω) d S_{T} (ω), & (102) \end{matrix}

\begin{matrix} \begin{array}{l} ​ ​ ​ S_{Y} (ω) = w^{⊤} H_{c} (- ω) (Σ_{n} + d d^{⊤} S_{T} (ω)) H_{c}^{⊤} (ω) w \\ + σ_{e}^{2} {| 1 + w^{⊤} H_{c} (- ω) a |}^{2} . \end{array} & (103) \end{matrix}

For low frequencies it is sometimes more convenient to use the “continuous-time” versions of the PSDs, S_xy(f) ≜ T_*S_xy(ω)_{ω = 2π fT_*} for f ≪ T⁻¹_*, which are approximated by

S_{s T} (f) = T_{*}^{- 1} H_{c} (- f) d S_{T} (f)

\begin{matrix} \begin{array}{l} S_{s} (f) = H_{c} (- f) (D (p_{*}, T_{*}, s_{*}) + T_{*}^{- 1} a a^{⊤} σ_{e}^{2} + T_{*}^{- 2} d d^{⊤} \\ S_{T} (f)) H_{c}^{⊤} (f), \end{array} & (104) \end{matrix}

\begin{matrix} S_{Y T} (f) = T_{*}^{- 1} w^{⊤} H_{c} (- f) d S_{T} (f), & (105) \end{matrix}

\begin{matrix} \begin{array}{l} S_{Y} (f) = w^{⊤} H_{c} (- f) (D (p_{*}, T_{*}, s_{*}) + T_{*}^{- 2} d d^{⊤} S_{T} (f)) \\ H_{c}^{⊤} (f) w \\ + T_{*} σ_{e}^{2} {| 1 + T_{*}^{- 1} w^{⊤} H_{c} (- f) a |}^{2} . \end{array} & (106) \end{matrix}

where

H_{c} (f) = {(2 π f i I - A (p_{*}, T_{*}) - T_{*}^{- 1} a w^{⊤})}^{- 1},

and we used the fact that F = I + T_*A (p_*, T_*) (Equation 72) and Σ_n = T_*D (p_*, T_*, s_*) (Equation 77).

Note that if the dimension of s is finite and there is no degeneracy, we can always write

\begin{matrix} S_{Y} (f) = c_{0} + \sum_{j = 1}^{M} \frac{c_{j}}{{(2 π f)}^{2} + λ_{j}^{2}}, & (107) \end{matrix}

where λ_i, the poles of S_Y(f), are determined solely by the poles of H_c(f) and S_T(f), while all the other parameters in Equation (106) affect only the constants c_j. Commonly, S_T(f) has no poles – for example, if S_T(f) is constant so T_m is a renewal process (e.g., the stimulation is periodic or Poisson). Therefore all poles of S_Y(f) (or the other PSDs) are determined by H_c(f), i.e., λ_j are the roots of the characteristic polynomial

\begin{matrix} | λ I - A (p_{*}, T_{*}) - T_{*}^{- 1} a w^{⊤} | = 0. & (108) \end{matrix}

4.4.3. Spectral factorization

Equations (96) and (97) can be re-arranged as a direct I/O relation, formulated, for convenience, in the frequency domain (this can be either f or ω – in the section we use ω for brevity of notation, and f in other places). Specifically, this relation is of the form

\begin{matrix} \hat{Y} (ω) = H^{ext} (ω) \hat{T} (ω) + H^{int} (ω) v (ω), & (109) \end{matrix}

so v_m = yes ⁻¹ (v(ω)) is a single scalar “noise” process with zero mean and PSD σ²_v (here yes ⁻¹ is the inverse Fourier transform). This v_m process combines the contributions of e_m and n_m, which are the noise processes in the original system (in Equations 96, 97). Such a description, as in Equation (109), describes concisely the contributions of the input and noise to the output (an ARMAx model Lejung, 1999). Using 92 and 93 we respectively find that

\begin{matrix} S_{Y T} (ω) = H^{ext} (- ω) S_{T} (ω) & (110) \end{matrix}

\begin{matrix} S_{Y} (ω) = {| H^{ext} (ω) |}^{2} S_{T} (ω) + {| H^{int} (ω) |}^{2} σ_{v}^{2} . & (111) \end{matrix}

Comparing Equation (102) with (110) we obtain

\begin{matrix} H^{ext} (ω) = w^{⊤} H_{c} (ω) d . & (112) \end{matrix}

Comparing Equation (103) with (111), while using Equation (112), will yield the equation

\begin{matrix} \begin{array}{l} {| H^{int} (ω) |}^{2} σ_{v}^{2} = w^{⊤} H_{c} (- ω) Σ_{n} H_{c}^{⊤} (ω) w + σ_{e}^{2} | 1 \\ {+ w^{⊤} H_{c} (- ω) a |}^{2} . \end{array} & (113) \end{matrix}

This is a “spectral factorization” problem (Anderson and Moore, 1979), with solution

\begin{matrix} H^{int} (ω) = w^{⊤} H_{c} (ω) K + 1, & (114) \end{matrix}

where

\begin{matrix} K = a + F P w σ_{v}^{- 2} & (115) \end{matrix}

and

\begin{matrix} σ_{v}^{2} = w^{⊤} P w + σ_{e}^{2}, & (116) \end{matrix}

with P the solution of

\begin{matrix} P = F P F^{⊤} - {(w^{⊤} P w + σ_{e}^{2})}^{- 1} F P w w^{⊤} P F^{⊤} + Σ_{n}, & (117) \end{matrix}

derived from the general discrete-time algebraic Riccati equation. This can be verified by substitution

\begin{array}{l} w^{⊤} H_{c} (- ω) Σ_{n} H_{c}^{⊤} (ω) w + σ_{e}^{2} {| 1 + w^{⊤} H_{c} (ω) a |}^{2} \\ - {| H^{int} (ω) |}^{2} σ_{v}^{2} \\ = w^{⊤} H_{c} (- ω) (P - F P F^{⊤} + σ_{v}^{- 2} F P w w^{⊤} P F^{⊤}) H_{c}^{⊤} (ω) w \\ + σ_{e}^{2} {| 1 + w^{⊤} H_{c} (ω) a |}^{2} \\ - {| w^{⊤} H_{c} (ω) (a + F P w σ_{v}^{- 2}) + 1 |}^{2} σ_{v}^{2} \\ \overset{(1)}{=} ​ [w^{⊤} H_{o} (- ω) ​ (P - F P F^{⊤} + σ_{v}^{- 2} F P w w^{⊤} P F^{⊤}) H_{o}^{⊤} (ω) w + σ_{e}^{2} \\ - {| w^{⊤} H_{o} (ω) F P w σ_{v}^{- 2} + 1 |}^{2} σ_{v}^{2}] {| 1 - w^{⊤} H_{o} (ω) a |}^{- 2} \\ = [w^{⊤} H_{o} (- ω) (P - F P F^{⊤}) H_{o}^{⊤} (ω) w - w^{⊤} P w \\ - w^{⊤} H_{o} (ω) F P w - w^{⊤} H_{o} (- ω) F P w] {| 1 - w^{⊤} H_{o} (ω) a |}^{- 2} \\ \overset{(2)}{=} [w^{⊤} (F H_{o} (ω) + I) P (F^{⊤} H_{o}^{⊤} (ω) + I) w + w^{⊤} H_{o} (- ω) F P F^{⊤} \\ H_{o}^{⊤} (ω) w - w^{⊤} P w \\ - w^{⊤} H_{o} (ω) F P w - w^{⊤} H_{o} (- ω) F P w] {| 1 - w^{⊤} H_{o} (ω) a |}^{- 2} \\ =  0 \end{array}

where in (1) we used the fact that w^⊤ H_c(ω) = w^⊤ H_o(ω) (1 − w^⊤ H_o(ω) a)⁻¹ from the Sherman–Morrison lemma, with H_o(ω) = (e^iω I − F)⁻¹ being the “open loop” version of H_c(ω) (i.e., if a was zero), and in (2) we used the fact that H_o(ω) = e^−iω(FH_o(ω) + I).

4.4.4. Optimal linear estimation of linear systems

Given that the neuronal dynamics are given by the linear system in Equations (96, 97), there are two different estimation problems one may be interested in. We may want to estimate, based on the history of the previous inputs and outputs ${{\hat{T}}_{k}, {\hat{Y}}_{k}}_{k = - \infty}^{m - 1}$ , either the parameters of the model (F, w, a, d, σ_e and Σ_n), or the variables in the model ( ${\hat{Y}}_{m}$ or ${\hat{s}}_{m}$ ). The first problem is generally termed a “system identification” problem (Lejung, 1999), while the second is a “filtering” (or prediction) problem (Anderson and Moore, 1979). Both are intimately related, and sometimes the solution of the second problem can yield a method of solving the first problem (e.g., section 3.3 in Anderson and Moore, 1979).

A relatively simple way to approach the second (filtering) problem involves the output decomposition we have found in section 4.4.3

\hat{Y} (ω) = w^{⊤} H_{c} (ω) d \hat{T} (ω) + (w^{⊤} H_{c} (ω) K + 1) v (ω) .

Using this decomposition we can now write a new state-space representation for the system in terms of new state variable ${\hat{z}}_{m}$ ,

\begin{array}{l} {\hat{z}}_{m + 1} = (F + a w^{⊤}) {\hat{z}}_{m} + d {\hat{T}}_{m} + K v_{m}, \\ {\hat{Y}}_{m} = w^{⊤} {\hat{z}}_{m} + v_{m}, \end{array}

which has the same output in the frequency domain (recall, from linear systems theory, that a single I/O relation can be generated by multiple state space realizations). This “innovation form” is particularly useful, since, given the entire history of the previous inputs and outputs $H_{m - 1} ≜ {{\hat{T}}_{k}, {\hat{Y}}_{k}}_{k = - \infty}^{m - 1}$ , we can recursively estimate the current state precisely (with zero error) (Anderson and Moore, 1979)

\begin{matrix} {\hat{z}}_{m} = (F + a w^{⊤}) {\hat{z}}_{m - 1} + d {\hat{T}}_{m - 1} + K ({\hat{Y}}_{m - 1} - w^{⊤} {\hat{z}}_{m - 1}) . & (118) \end{matrix}

Given this precise estimate of ${\hat{z}}_{m}$ , the best linear estimate of ${\hat{Y}}_{m}$ is simply

〈 {\hat{Y}}_{m} | H_{m - 1} 〉 = w^{⊤} {\hat{z}}_{m}

and the estimation error is simply

〈 {({\hat{Y}}_{m} - 〈 {\hat{Y}}_{m} | H_{m - 1} 〉)}^{2} 〉 = 〈 v_{m}^{2} 〉 = σ_{v}^{2} .

Since both the innovation form and the original form have the same second order statistics for the input–output, the optimal linear estimator (and its error) for ${\hat{Y}}_{m}$ in the original system would be the same. Moreover, one can show (Anderson and Moore, 1979) that Equation (118) will also give the optimal linear estimate of ${\hat{s}}_{m}$ in the original system, and with error P (Equation 117). This solution is the well-known “Kalman filter.”

4.4.5. Example – HHS model power spectral densities

Substituting the parameters for the linearized map (Equations 85–89) into the expressions for the power-spectral densities (Equations 104–106), gives

\begin{matrix} S_{Y} ​ (f) ​ = ​ \frac{w^{2} ​ (D_{*} + T_{*}^{- 2} d^{2} S_{T} ​ (f) ​) ​ + ​ T_{*} σ_{e}^{2} ​ (​ {(2 π f)}^{2} + A_{*}^{2})}{{(2 π f)}^{2} + {(A_{*} + T_{*}^{- 1} w a)}^{2}} & (119) \end{matrix}

\begin{matrix} S_{s} (f) = \frac{D_{*} + T_{*}^{- 1} a^{2} σ_{e}^{2} + T_{*}^{- 2} d^{2} S_{T} (f)}{{(2 π f)}^{2} + {(A_{*} + T_{*}^{- 1} w a)}^{2}} & (120) \end{matrix}

\begin{matrix} S_{Y T} (f) = \frac{T_{*}^{- 1} w d}{2 π f i - A_{*} - T_{*}^{- 1} w a} S_{T} (f) . & (121) \end{matrix}

Note that when S_T(f) ≡ 0 (i.e., periodical spike stimulus), S_Y(f) has the shape of high pass filter (Figure 3B, top). In contrast, S_s(f) (Figure 3B, bottom) and S_YT(f) both have the shape of a low pass filter (Figure 3D, top). From Equations (111) and (110) we know that S_Y(f) = |H^int(f)|²σ²_v and S_YT(f) = H^ext(f)S_T(f), respectively. Therefore, this indicates that H^int(f) and H^ext(f) are high pass and low pass filters, respectively.

4.4.6. Power spectral densities of response features

So far we have concentrated on the PSD of the response Y_m. However, it is easy to extend our formalism to derive the PSDs of different features of the AP, such as its latency or amplitude. We exemplify this on the latency. In Soudry and Meir (2012b) we showed (Figure 3) that for deterministic CBMs, the latency of the AP generated in response to the m-th stimulation can be written as a function of the excitability L_m = L (s_m). In a stochastic model, we have instead

L_{m} = {\begin{array}{l} \begin{array}{l} L (s_{m}) + ϕ_{m}, Y_{m} = 1 \\ not defined, Y_{m} = 0 \end{array} \end{array}

where ϕ_m is a zero mean, white noise process generated by the stochasticity of the rapid system. Since it is problematic to define the PSD of L_m if sometimes Y_m = 0, we focus on the case that p_* = 1, so we always have Y_m = 1. In this case, assuming again that the perturbations in ${\hat{s}}_{m}$ are small, we can linearize

L (s_{m}) \approx L (s_{*}) + l^{⊤} {\hat{s}}_{m}

where l = ∇L (s)_{s = s_*}, to obtain (using Equation 11)

\begin{matrix} {\hat{s}}_{m + 1} = F {\hat{s}}_{m} + d {\hat{T}}_{m} + n_{m}, & (122) \end{matrix}

\begin{matrix} {\hat{L}}_{m} = l^{⊤} {\hat{s}}_{m} + ϕ_{m} & (123) \end{matrix}

where he F = I + T_*A (1, T_*). Therefore, it is straightforward to show that the PSD of the latency is

\begin{matrix} S_{L} (f) = l^{⊤} S_{s} (f) l + T_{*} σ_{ϕ}^{2} & (124) \end{matrix}

where σ²_ϕ = 〈ϕ²_m〉. Note that if latency is a good indicator of excitability, i.e., L (s) changes similarly to p (s) so that l ∝ w, then S_L(f) = c₁S_Y(f) + c₂ for some constants c₁, c₂, when the input is periodic (T_m = T_*) and p_* → 1.

4.5. Numerical Tests

MATLAB (2010b) code is available on the ModelDB website, with accession number 144993. In all the numerical simulations of the full stochastic Biophysical neuron model we used Equations (1–3) in main text. We used first order Euler–Maruyama integration with a time step of dt = 5 μs (quantitative results were verified also at dt = 0.5 μs). Each stimulation pulse was given as a square pulse with a width of t_stim = 0.5 ms and amplitude I₀ (which were respectively named t₀ and I₀ in Soudry and Meir, 2012b). The results are not affected qualitatively by our choice of a square pulse shape. We define an AP to have occurred if, after the stimulation pulse was given, the measured voltage has crossed some threshold V_th (we use V_th = −10 mV in all cases). In all cases where direct stimulation is given, unless stated otherwise, we used periodic stimulation with I₀ = 7.9 μA and T_* = 50 ms. Note that for the parameter values used, no APs are spontaneously generated, consistently with experimental results (Gal et al., 2010).

The PSDs were estimated using the Welch method and averaged over eight windows, unless 1/f behavior was observed, in which case we used a single window instead, since long term correlations may generate bias if averaging is used (Beran, 1994). Numerical estimation of the cross-PSD is more problematic. When estimating cross-spectra, estimation noise level can be quite high (proportional to the inverse coherence, according to Bendat and Piersol, 2000, p. 321). To estimate the level of estimation noise, we estimate the cross-spectrum with the input randomly shuffled (Figure 8). Since in this case there is no input–output correlation, this new estimate is pure noise. Finally, as suggested by the reviewer, we smoothed the resulting PSD (or cross-PSD) in all figures (except in Figure 8, where we aimed to show the level of estimation noise). To achieve uniform variance with low bias, we divided the spectrum into 30 logarithmically spaced segments from f = 10⁻³ Hz to the maximal frequency (T_*/2). In each segment n the PSD (or cross-PSD) was smoothed using a window of size n.

FIGURE 8

Figure 8. Estimation noise in the cross-power spectral density. To estimate the level of this noise in Figure 3D, we added $S_{Y \tilde{T}} (f)$ where ${{\tilde{T}}_{m}}$ is a shuffled version of {T_m}. Only when the estimated S_YT(f) is above $S_{Y \tilde{T}} (f)$ , is its estimation valid. Therefore, in Figure 3D we show only this region (left of dashed black line), where estimation is valid.

Next, we describe the models used Figures 3–5 and provide their parameter values. These models have either been studied in the literature or are extensions of such models, which are meant to explore the limit for the validity of our analytic approximations. In all cases where direct stimulation is given, unless stated otherwise, we use periodic stimulation with I₀ = 7.9 μA and T_* = 50 ms. Notice the form of the models is given in the (more popular) compressed formalism (section 4.1.1), which employs the normalization of state occupation probability to reduce the dimensionality of equations of Equations (2, 3) in the main text.

4.5.1. The HHS model

The HHS model combines the Hodgkin–Huxley equations (Hodgkin and Huxley, 1952) with slow sodium inactivation (Chandler and Meves, 1970; Fleidervish et al., 1996). The model equations (Soudry and Meir, 2012b), which employ the uncoupled stochastic noise approximation, are

\begin{array}{l} C \dot{V} = {\bar{g}}_{N a} s m^{3} h (E_{N a} - V) + {\bar{g}}_{K} n^{4} (E_{K} - V) \\ + {\bar{g}}_{L} (E_{L} - V) + I (t) \\ \dot{m} = ϕ [α_{m} (V) (1 - m) - β_{m} (V) m] \\ + \sqrt{N_{m}^{- 1} ϕ (α_{m} (V) (1 - m) + β_{m} (V) m)} ξ_{m} \\ \dot{n} = ϕ [α_{n} (V) (1 - n) - β_{n} (V) n] \\ + \sqrt{N_{m}^{- 1} ϕ (α_{n} (V) (1 - n) + β_{h} (V) n)} ξ_{n} \\ \dot{h} = ϕ [α_{h} (V) (1 - h) - β_{h} (V) h] \\ + \sqrt{N_{h}^{- 1} ϕ (α_{h} (V) (1 - h) + β_{h} (V) h)} ξ_{h} \\ \dot{s} = δ ​ (V) ​ (1 - s) - γ ​ (V) s + \sqrt{N_{s}^{- 1} ​ (δ (V) (1 - s) + γ (V) s)} ξ_{s} . \end{array}

Most of the parameters are given their original values (as in Hodgkin and Huxley, 1952; Fleidervish et al., 1996):

\begin{array}{l} V_{N a} = 50 mV, V_{K} = - 77 mV, V_{L} = - 54 mV, \\ {\bar{g}}_{N a} = 120 {(k Ω \cdot c m^{2})}^{- 1}, {\bar{g}}_{K} = 36 {(k Ω \cdot c m^{2})}^{- 1}, g_{L} = 0.3 {(k Ω \cdot c m^{2})}^{- 1}, \\ α_{n} (V) = \frac{0.01 (V + 55)}{1 - e^{- 0.1 \cdot (V + 55)}} kHz, β_{n} (V) = 0.125 \cdot e^{- (V + 65) / 80} kHz, \\ α_{m} (V) = \frac{0.1 (V + 40)}{1 - e^{- 0.1 \cdot (V + 40)}} kHz, β_{m} (V) = 4 \cdot e^{- (V + 65) / 18} kHz, \\ α_{h} (V) = 0.07 \cdot e^{- (V + 65) / 20} kHz, β_{h} (V) = {(e^{- 0.1 \cdot (V + 35)} + 1)}^{- 1} kHz, \end{array}

where in all the rate functions V is used in units of mV. In order to obtain the specific spike shape and the latency transients observed in cortical neurons, some of the parameters were modified to

\begin{matrix} \begin{array}{l} C_{m} = 0.5 μ F / {cm}^{2}, ϕ = 2 \\ γ ​ (V) = 0.51 \cdot ​ {(e^{- 0.3 \cdot (V + 17)} + 1)}^{- 1} Hz, δ ​ (V) = 0.05 e^{- (V + 85) / 30} Hz . \end{array} & (125) \end{matrix}

We emphasize that these specific choices do not affect any of our general arguments, but were chosen for consistency with experimental results (Gal et al., 2010). Estimates of channel number vary greatly (Soudry and Meir, 2012b). For simplicity, we chose N = N_n = N_h = N_m = N_s, and unless stated otherwise, we chose, by default N = 10⁶, as in Soudry and Meir (2012b). Note that the HHS model is the same model presented in the paper with M = 1, ϕ_s,1 = 1, N_s,1 = N, N_{r, j} = N and ϕ_r = ϕ.

4.5.2. The coupled HHS model

The coupled version of the HHS model uses the same parameters as the uncoupled version, and a similar voltage equation

\begin{array}{l} C \dot{V} = {\bar{g}}_{N a} s_{0} m_{0} h_{0} (E_{N a} - V) + {\bar{g}}_{K} n_{0} (E_{K} - V) \\ + {\bar{g}}_{L} (E_{L} - V) + I (t) \end{array}

where the variables n₀ and s₀m₀h₀ describe the respective fraction of potassium and sodium channels residing in the “open” state. To obtain the coupled model equations, we need to assume something about the structure of the ion channels. The original assumption by Hodgkin and Huxley was that the channel subunits (e.g., m, n and h) are independent. Over the years, it became apparent that this assumption is inaccurate, and the sodium channel kinetic subunits are, in fact, not independent (Ulbricht, 2005). However, it is not yet clear how the slow sodium inactivation is coupled to the rapid channel kinetics (e.g., Menon et al., 2009; Milescu et al., 2010), so we nevertheless used the original naive HH model assumption that the subunits are independent. In that case the potassium channel structure is given by (for brevity, the voltage dependence on the rates is henceforth ignored for this model)

n_{0} \begin{matrix} \begin{array}{l} 4 α_{n} \\ ⇌ \\ β_{n} \end{array} \end{matrix} n_{1} \begin{matrix} \begin{array}{l} 3 α_{n} \\ ⇌ \\ 2 β_{n} \end{array} \end{matrix} n_{2} \begin{matrix} \begin{array}{l} 2 α_{n} \\ ⇌ \\ 3 β_{n} \end{array} \end{matrix} n_{3} \begin{matrix} \begin{array}{l} α_{n} \\ ⇌ \\ 4 β_{n} \end{array} \end{matrix} n_{4}

while for the sodium channel it is described by

In this diagram, transition rates indicated between two boxed regions, imply that the same rates are used between all corresponding states in boxed regions. The corresponding 32 SDEs are derived using the method described in Orio and Soudry (2012) (or 30 equations if we use the compressed formalism). In this model we used I₀ = 8.3 μA.

4.5.3. The HHSTM model

In order to investigate the effect of a more “physiological” stimulation, we changed the HHS model and added synapses. We used the popular Tsodyks–Markram model for the effect of a synapse with short-term-depression on the somatic voltage (the model first appeared in Tsodyks and Markram (1997) and was slightly corrected in Tsodyks et al. (1998)). In the model x, y and z are the fractions of resources in the recovered, active and inactive states respectively, interacting through the system

\begin{matrix} \begin{matrix} \begin{array}{l} \begin{matrix} y \end{matrix} \\ \begin{array}{l} ↗ & ↘ \end{array} \\ \begin{array}{l} x & \leftarrow & z \end{array} \end{array} & . \end{matrix} & (126) \end{matrix}

Here the z → x rate is τ⁻¹_rec, the x → y rate is τ⁻¹_in, and the x → y rate is U_SEδ(t − t_sp), where δ(·) is the Dirac delta function, and t_sp is the pre-synaptic spike arrival time. The post-synaptic current is given by I_s(t) = A_SEy(t) where A_SE is a parameter. Additionally, we added noise to the model using the coupled SDE method (Orio and Soudry, 2012), assuming that the diagram in Equation (126), with the corresponding rates, hint at the underlying Markov kinetic structure, with N = 10⁶. As in Figure 1B of Tsodyks and Markram (1997), we used τ_in = 3 ms, τ_rec = 800 ms and U_SE = 0.67. Additionally, we set A_SE = 70 μA to obtain an AP response in our model.

4.5.4. The HHMS model

The HHMS model consists of many sodium currents, each with a different slow kinetic variable. The equations are identical to the HHS model, except that g_Nas is replaced by g_NaM⁻¹ ∑ ^M_i=1 s_i, where s₁ has the same equation as s in the HHS model, and for i > 2,

\begin{array}{l} {\dot{s}}_{i} = [δ (V) (1 - s_{i}) - γ (V) s_{i}] ϕ_{s, i} \\ + \sqrt{(δ (V) (1 - s_{i}) + γ (V) s_{i}) N_{s, i}^{- 1} ϕ_{s . i}} ξ_{s, i}, \end{array}

with ϕ_{s, i} = ϵⁱ and N_{s, i} = N₀ϵ^iη, where γ (V) and δ (V) are taken from the HHS model. Unless mentioned otherwise, we chose as default ϵ = 0.2, η = 1.5, M = 5, and N₀ = N as in Figure 5.

4.5.5. The multiplicative hhms model

The Multiplicative HHMS model is identical to the HHMS model with η = 1, except that ${\bar{g}}_{N a} M^{- 1} \sum_{i = 1}^{M} s_{i}$ is replaced with ${\bar{g}}_{N a} \prod_{i = 1}^{M} s_{i}$ .

4.5.6. The HHSIP model

The HHSIP model equations (Soudry and Meir, 2012b) are identical to the HHS model equations, except that s is renamed to s₁ and an Inactivating Potassium current was added to the voltage equation, where

I_{K} = {\bar{g}}_{M} n^{4} s_{2} (E_{K} - V),

with g_M = 0.05g_K and

\begin{array}{l} {\dot{s}}_{2} = δ_{2} (V) (1 - s_{2}) - γ_{2} (V) s_{2} \\ + \sqrt{N_{s_{2}}^{- 1} (δ (V) (1 - s_{2}) + γ (V) s_{2})} ξ_{s, 2}, \end{array}

where N_s₂ = N and

\begin{array}{l} δ_{2} (V) = \frac{3.3 e^{(V + 35) / 15} + e^{- (V + 35) / 20}}{1 + e^{- (V + 35) / 10}} Hz, γ_{2} (V) \\ = \frac{3.3 e^{(V + 35) / 15} + e^{- (V + 35) / 20}}{1 + e^{(V + 35) / 10}} Hz . \end{array}

Again, in all the rate functions V is used in mV units. In this model we used I₀ = 8.3 μA and T_* = 33 ms.

4.5.7. The HHMSIP model

The HHMSIP model combines HHSIP and HHMS. Its equations are identical to the HHMS model with η = 2, except they also contain the I_K current from the HHSIP model. In this model we used I₀ = 8.3 μA and T_* = 33 ms, unless otherwise specified.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors are grateful to O. Barak, N. Brenner, Y. Elhanati, A. Gal, T. Knafo, Y. Kafri, S. Marom, and J. Schiller for insightful discussions and for reviewing parts of this manuscript. The research was partially funded by the Technion V.P.R. fund and by the Intel Collaborative Research Institute for Computational Intelligence (ICRI-CI).

Footnotes

1. ^A semi-analytic derivation is an analytic derivation in which some terms are obtained by relatively simple numerics. See 2.2 for our implementation.

2. ^We demonstrated that such noise should strongly affect the neuronal response to sparse stimulation (Soudry and Meir, 2012b).

3. ^I.e., if ∀t:I(t) = 0, then the probability that a neuron will fire is negligible – on any relevant finite time interval (e.g., minutes or days).

4. ^E.g., as in Equations (50–52). Note also a similar notation was also used in Soudry and Meir (2012b) (e.g., Equations 2.15, 2.16), where we used H/M/L instead of +/−/0.

5. ^Later we shall demonstrate numerically that this is not a necessary condition.

6. ^Note a more general Box–Jenkins model is not required, since the poles of H^ext(f) and H^int(f) are identical (assuming no pole-zero cancelation).

7. ^Also, as explained in section 4.3.1, we approximately have w ∝ g_Na, from Equation (21).

References

Anderson, B. D. O., and Moore, J. B. (1979). Optimal Filtering, Vol. 11. Englewood Cliffs, NJ: Prentice Hall.

Bean, B. P. (2007). The action potential in mammalian central neurons. Nat. Rev. Neurosci. 8, 451–465. doi: 10.1038/nrn2148

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bendat, J. S., and Piersol, A. G. (2000). Random Data Analysis and Measurement Procedures, Vol. 11, 3rd edn. New York, NY: Wiley.

Beran, J. (1992). A goodness-of-fit test for time series with long range dependence. J. R. Stat. Soc. Ser. B 54, 749–760.

Beran, J. (1994). Statistics for Long-Memory Processes. New York, NY: Chapman & Hall.

Brecht, M., Schneider, M., Sakmann, B., and Margrie, T. W. (2004). Whisker movements evoked by stimulation of single pyramidal cells in rat motor cortex. Nature 427, 704–710. doi: 10.1038/nature02266

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chandler, W. K., and Meves, H. (1970). Slow changes in membrane permeability and long-lasting action potentials in axons perfused with fluoride solutions. J. Physiol. 211, 707–728.

Pubmed Abstract | Pubmed Full Text

Channelpedia. Available online at: http://channelpedia.epfl.ch/

Colquhoun, D., and Hawkes, A. G. (1981). On the stochastic properties of single ion channels. Proc. R. Soc. Lond. Ser. B Biol. Sci. 211, 205–235. doi: 10.1098/rspb.1981.0003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Contou-Carrere, M. N. (2011). Model reduction of multi-scale chemical Langevin equations. Syst. Control Lett. 60, 75–86. doi: 10.1016/j.sysconle.2010.10.011

CrossRef Full Text

De Col, R., Messlinger, K., and Carr, R. W. (2008). Conduction velocity is regulated by sodium channel inactivation in unmyelinated axons innervating the rat cranial meninges. J. Physiol. 586, 1089–1103. doi: 10.1113/jphysiol.2007.145383

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

De Paola, V., Holtmaat, A., Knott, G., Song, S., Wilbrecht, L., Caroni, P., et al. (2006). Cell type-specific structural plasticity of axonal branches and boutons in the adult neocortex. Neuron 49, 861–875. doi: 10.1016/j.neuron.2006.02.017

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Debanne, D., Campanac, E., Bialowas, A., and Carlier, E. (2011). Axon physiology. Physiol. Rev. 91, 555–602. doi: 10.1152/physrev.00048.2009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Druckmann, S., Berger, T. K., Schürmann, F., Hill, S., Markram, H., and Segev, I. (2011). Effective stimuli for constructing reliable neuron models. PLoS Comput. Biol. 7:e1002133. doi: 10.1371/journal.pcbi.1002133

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Elul, R., and Adey, W. R. (1966). Instability of firing threshold and “remote” activation in cortical neurons. Nature 212, 1424–1425. doi: 10.1038/2121424a0

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ermentrout, B., and Terman, D. (2010). Mathematical Foundations of Neuroscience, Vol. 35. New York, NY: Springer. doi: 10.1007/978-0-387-87708-2

CrossRef Full Text

Fleidervish, I. A., Friedman, A., and Gutnick, M. J. (1996). Slow inactivation of Na+ current and slow cumulative spike adaptation in mouse and guinea-pig neocortical neurones in slices. J. Physiol. 493, 83–97.

Pubmed Abstract | Pubmed Full Text

Fox, R. F., and Lu, Y. N. (1994). Emergent collective behavior in large numbers of globally coupled independently stochastic ion channels. Phys. Rev. E 49, 3421–3431. doi: 10.1103/PhysRevE.49.3421

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gal, A., Eytan, D., Wallach, A., Sandler, M., Schiller, J., and Marom, S. (2010). Dynamics of excitability over extended timescales in cultured cortical neurons. J. Neurosci. 30, 16332–16342. doi: 10.1523/JNEUROSCI.4859-10.2010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gal, A., and Marom, S. (2013). Entrainment of the intrinsic dynamics of single isolated neurons by natural-like input. J. Neurosci. 33, 7912–7918. doi: 10.1523/JNEUROSCI.3763-12.2013

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gardiner, C. W. (2004). Handbook of Stochastic Methods, 3rd edn. Berlin: Springer-Verlag. doi: 10.1007/978-3-662-05389-8

CrossRef Full Text

Gerstner, W., and Naud, R. (2009). How good are neuron models? Science 326, 379–380. doi: 10.1126/science.1181936

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Goldwyn, J. H., Imennov, N. S., Famulare, M., and Shea-Brown, E. (2011). Stochastic differential equation models for ion channel noise in Hodgkin-Huxley neurons. Phys. Rev. E 83, 041908. doi: 10.1103/PhysRevE.83.041908

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Goldwyn, J. H., Rubinstein, J. T., and Shea-Brown, E. (2012). A point process framework for modeling electrical stimulation of the auditory nerve. J. Neurophysiol. 108, 1430–1452. doi: 10.1152/jn.00095.2012

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Grubb, M. S., and Burrone, J. (2010). Activity-dependent relocation of the axon initial segment fine-tunes neuronal excitability. Nature 465, 1070–1074. doi: 10.1038/nature09160

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hille, B. (2001). Ion Channels of Excitable Membranes, 3rd edn. Sunderland, MA: Sinauer Associates.

Hodgkin, A. L., and Huxley, A. F. (1952). A quantitative description of membrane current and its application to conduction and excitation in nerve. J. Physiol. 117, 500.

Pubmed Abstract | Pubmed Full Text

Huys, Q. J. M., Ahrens, M. B., and Paninski, L. (2006). Efficient estimation of detailed single-neuron models. J. Neurophysiol. 96, 872. doi: 10.1152/jn.00079.2006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ikegaya, Y, Sasaki, T., Ishikawa, D., Honma, N., Tao, K., Takahashi, N., et al. (2012). Interpyramid spike transmission stabilizes the sparseness of recurrent network activity. Cereb. Cortex 23, 293–304. doi: 10.1093/cercor/bhs006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Jugloff, D. G. M. (2000). Internalization of the Kv1.4 potassium channel is suppressed by clustering interactions with PSD-95. J. Biol. Chem. 275, 1357–1364. doi: 10.1074/jbc.275.2.1357

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kaplan, D. T., Clay, J. R., Manning, T., Glass, L., Guevara, M. R., and Shrier, A. (1996). Subthreshold dynamics in periodically stimulated squid giant axons. Phys. Rev. Lett. 76, 4074–4077. doi: 10.1103/PhysRevLett.76.4074

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kasischke, K. A., Vishwasrao, H. D. D., Fisher, P. J., Zipfel, W. R., and Webb, W. W. (2004). Neural activity triggers neuronal oxidative metabolism followed by astrocytic glycolysis. Science (New York, N.Y.) 305, 99–103. doi: 10.1126/science.1096485

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keshner, M. S. (1982). 1/f noise. Proc. IEEE 70, 212–218. doi: 10.1109/PROC.1982.12282

CrossRef Full Text

Koch, C., and Segev, I. (1989). Methods in Neuronal Modeling: From Ions to Networks, Vol. 484, 2nd edn. Cambridge: MIT Press.

Komlósi, G., Molnár, G., Rózsa, M., Oláh, S., Barzó, P., and Tamás, G. (2012). Fluoxetine (prozac) and serotonin act on excitatory synaptic transmission to suppress single layer 2/3 pyramidal neuron-triggered cell assemblies in the human prefrontal cortex. J. Neurosci. 32, 16369–16378. doi: 10.1523/JNEUROSCI.2618-12.2012

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lejung, L. (1999). System Identification: Theory for the User. Upper Saddle River, NJ: PTR Prentice Hall.

Levitan, I. B. (1994). Modulation of ion channels by protein phosphorylation and dephosphorylation. Annu. Rev. Physiol. 11, 193–212. doi: 10.1146/annurev.ph.56.030194.001205

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Li, C. Y. T., Poo, M. M., and Y D. (2009). Burst spiking of a single cortical neuron modifies global brain state. Science 324, 643–646. doi: 10.1126/science.1169957

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Linaro, D., Storace, M., and Mattia, M. (2011). Inferring network dynamics and neuron properties from population recordings. Front. Comput. Neurosci. 5:43. doi: 10.3389/fncom.2011.00043

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Marom, S. (2010). Neural timescales or lack thereof. Prog. Neurobiol. 90, 16–28. doi: 10.1016/j.pneurobio.2009.10.003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Menon, V., Spruston, N., and Kath, W. L. (2009). A state-mutating genetic algorithm to design ion-channel models. Proc. Natl. Acad. Sci. U.S.A. 106, 16829–16834. doi: 10.1073/pnas.0903766106

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Migliore, M., Cannia, C., Lytton, W. W., Markram, H., and Hines, M. L. (2006). Parallel network simulations with NEURON. J. Comput. Neurosci. 21, 119–129. doi: 10.1007/s10827-006-7949-5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Milescu, L. S., Yamanishi, T., Ptak, K., and Smith, J. C. (2010). Kinetic properties and functional dynamics of sodium channels during repetitive spiking in a slow pacemaker neuron. J. Neurosci. 30, 12113–12127. doi: 10.1523/JNEUROSCI.0445-10.2010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Molnár, G., Oláh, S., Komlósi, G., Füle, M., Szabadics, J., Varga, C., et al. (2008). Complex events initiated by individual spikes in the human cerebral cortex. PLoS Biol. 6:e222. doi: 10.1371/journal.pbio.0060222

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Monjaraz, E., Navarrete, A., Lopez-Santiago, L. F., Vega, A. V., and Cota, G. (2000). L-type calcium channel activity regulates sodium channel levels in rat pituitary GH3 cells. J. Physiol. 523, 45–55. doi: 10.1111/j.1469-7793.2000.00045.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Neher, E., and Sakmann, B. (1976). Single-channel currents recorded from membrane of denervated frog muscle fibres. Nature 260, 799–802. doi: 10.1038/260799a0

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Nishiyama, H., Fukaya, M., and Watanabe, M. (2007). Axonal motility and its modulation by activity are branch-type specific in the intact adult cerebellum. Neuron 56, 472–487. doi: 10.1016/j.neuron.2007.09.010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Orio, P., and Soudry, D. (2012). Simple, fast and accurate implementation of the diffusion approximation algorithm for stochastic ion channels with multiple states. PLoS ONE 7:e36670. doi: 10.1371/journal.pone.0036670

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Papoulis, A., and Pillai, S. U. (1965). Probability, Random Variables, and Stochastic Processes. New York, NY: McGraw-Hill.

Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286. doi: 10.1109/5.18626

CrossRef Full Text

Roth, A., and Häusser, M. (2001). Compartmental models of rat cerebellar Purkinje cells based on simultaneous somatic and dendritic patch-clamp recordings. J. Physiol. 535, 445–472. doi: 10.1111/j.1469-7793.2001.00445.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Schneidman, E., Freedman, B., and Segev, I. (1998). Ion channel stochasticity may be critical in determining the reliability and precision of spike timing. Neural Comput. 10, 1679–1703. doi: 10.1162/089976698300017089

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Silver, I. A., Deas, J., and Erecinska, M. (1997). Ion homeostasis in brain cells: differences in intracellular ion responses to energy limitation between cultured neurons and glial cells. Neuroscience 78, 589–601. doi: 10.1016/S0306-4522(96)00600-8

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sjöström, P. J. J., Rancz, E. A. A., Roth, A., and Häusser, M. (2008). Dendritic excitability and synaptic plasticity. Physiol. Rev. 88, 769–840. doi: 10.1152/physrev.00016.2007

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Song, S., Sjöström, P. J. J., Reigl, M., Nelson, S. B., and Chklovskii, D. B. (2005). Highly nonrandom features of synaptic connectivity in local cortical circuits. PLoS Biol. 3:e68. doi: 10.1371/journal.pbio.0030068

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Soudry, D., and Meir, R. (2012a). An exact reduction of the master equation to a strictly stable system with an explicit expression for the stationary distribution. arXiv:1207.4436.

Soudry, D., and Meir, R. (2012b). Conductance-based neuron models and the slow dynamics of excitability. Front. Comput. Neurosci. 6:4. doi: 10.3389/fncom.2012.00004

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Soudry, D., and Meir, R. (2014). The neuronal response at extended timescales: long term correlations without long memory. Front. Comput. Neurosci. 8:35. doi: 10.3389/fncom.2014.00035

CrossRef Full Text

Staub, O., Gautschi, I., Ishikawa, T., Breitschopf, K., Ciechanover, A., Schild, L., et al. (1997). Regulation of stability and function of the epithelial Na+ channel (ENaC) by ubiquitination. EMBO J. 16, 6325–6336. doi: 10.1093/emboj/16.21.6325

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Toib, A., Lyakhov, V., and Marom, S. (1998). Interaction between duration of activity and time course of recovery from slow inactivation in mammalian brain Na+ channels. J. Neurosci. 18, 1893–1903.

Pubmed Abstract | Pubmed Full Text

Tsodyks, M., and Markram, H. (1997). The neural code between neocortical pyramidal neurons depends on neurotransmitter release probability. Proc. Natl. Acad. Sci. U.S.A. 94, 719–723. doi: 10.1073/pnas.94.2.719

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tsodyks, M., Pawelzik, K., and Markram, H. (1998). Neural networks with dynamic synapses. Neural Comput. 10, 821–835. doi: 10.1162/089976698300017502

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ulbricht, W. (2005). Sodium channel inactivation: molecular determinants and modulation. Physiol. Rev. 85, 1271–1301. doi: 10.1152/physrev.00024.2004

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wainrib, G., Thieullen, M., and Pakdaman, K. (2011). Reduction of stochastic conductance-based neuron models with time-scales separation. J. Comput. Neurosci. 32, 327–346. doi: 10.1007/s10827-011-0355-7

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wallach, A. (2012). The Response Clamp : A Control Based Approach for the Study of Neural Systems; Method and Applications. Ph.D. thesis, Technion.

Keywords: conductance based neuron models, noise, ion channels, adaptation, power spectral density, linear response, system identification, analytical methods

Citation: Soudry D and Meir R (2014) The neuronal response at extended timescales: a linearized spiking input–output relation. Front. Comput. Neurosci. 8:29. doi: 10.3389/fncom.2014.00029

Received: 20 December 2013; Accepted: 24 February 2014;
Published online: 02 April 2014.

Edited by:

David Hansel, University of Paris, France

Reviewed by:

Maurizio Mattia, Istituto Superiore di Sanità, Italy
Joaquín J. Torres, University of Granada, Spain

Copyright © 2014 Soudry and Meir. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Daniel Soudry, Department of Statistics, Center for Theoretical Neuroscience, Columbia University, 1255 Amsterdam Avenue, New York, NY 10027, USA e-mail: daniel.soudry@gmail.com

ORIGINAL RESEARCH article

The neuronal response at extended timescales: a linearized spiking input–output relation

1. Introduction

2. Results

2.1. Full Model

2.2. Model Reduction

2.3. Linearization

2.4. Linear Systems Analysis

2.5. Numerical Tests

2.5.1. The HHS model

2.5.2. Testing the limit of our assumptions

3. Discussion

3.1. Connection to Previous Work

3.2. Theoretical Novelty

3.3. Practical Significance

4. Methods

4.1. Full Model (Biophysical Neuron Models)

Microscopic origins

Derivation

4.1.1. Compressed formulation

Derivation

Example – the HHS model

4.2. Model Reduction

4.2.1. The excitability constraint

4.2.2. Problem formulation

4.2.3. Derivations

4.2.4. Calculation of pAP (s)

4.2.5. Compressed formulation – reduction

4.2.6. Example – HHS model reduction

4.3. Linearization

4.3.1. Derivation of w

4.3.2. Compressed formulation – linearization

4.3.3. Example – HHS model linearization

4.4. Linear Systems Analysis

4.4.1. Second order statistics and linear systems

4.4.2. The second order statistics of our system

4.4.3. Spectral factorization

4.4.4. Optimal linear estimation of linear systems

4.4.5. Example – HHS model power spectral densities

4.4.6. Power spectral densities of response features

4.5. Numerical Tests

4.5.1. The HHS model

4.5.2. The coupled HHS model

4.5.3. The HHSTM model

4.5.4. The HHMS model

4.5.5. The multiplicative hhms model

4.5.6. The HHSIP model

4.5.7. The HHMSIP model

Conflict of Interest Statement

Acknowledgments

Footnotes

References

People also looked at

4.2.4. Calculation of p_AP (s)