Small RNA driven feed-forward loop: critical role of sRNA in noise filtering
Abstract
Gene regulatory networks are often partitioned into different types of recurring network motifs. A feed-forward loop (FFL) is a common motif in which an upstream regulator is a protein, typically a transcription factor, that regulates the expression of the target protein in two ways - first, directly by regulating the mRNA levels of the target protein and second, indirectly via an intermediate molecule that in turn regulates the target protein level. Investigations on two variants of FFL - purely transcriptional FFL (tFFL) and sRNA-mediated FFL (smFFL) reveal several advantages of using such motifs. Here, we study a distinct sRNA-driven FFL (sFFL) that was discovered recently in Salmonella enterica: The distinction being the upstream regulator here is not a protein but an sRNA that translationally activates the target protein expression directly; and also indirectly via regulation of the transcriptional activator of the target protein. This variant, i.e. sFFL has not been subjected to rigorous analysis. We, therefore, set out to understand two aspects. First is a quantitative comparison of the regulatory response of sFFL with tFFL and smFFL using a differential equation framework. Since the process of gene expression is inherently stochastic, the second objective is to find how noise in gene expression affects the functionality of the sFFL. We find that unlike for tFFL and smFFL, the response of sFFL is stronger and faster: the change in target protein concentration is rapid and depends critically on the initial concentration of sRNA. Further, our analysis based on generating function approach and stochastic simulations leads to a non-trivial prediction that an optimal noise filtration can be attained depending on the synthesis rate of the upstream sRNA and the degradation rate of the intermediate transcriptional activator. A comparison with a simpler process involving only translational activation by sRNA indicates that the design of sFFL is crucial for optimal noise filtration. These observations prompt us to conclude that sFFL has distinct advantages where the master regulator, sRNA, plays a critical role not only in driving a rapid and strong response, but also a reliable response that depends critically on its concentration.
1 Introduction
Small non-coding RNAs (small RNAs/micro RNAs) and proteins are two important regulators of gene expression at different levels. While a major role of protein regulators is to activate or repress gene expression at the transcriptional level by binding to DNA, the small RNAs (sRNA) often bind to their target mRNAs through sequence complementarity and regulate translation. Experimental studies indicate that sRNAs might regulate gene expression in different ways - by translational repression or activation, or by regulating the mRNA stability storz ; gottesman2 ; papenfort1 ; altuvia ; shimoni ; hwa ; bharat . Since sRNAs are small in size and do not further code for a protein, it is believed that sRNA-mediated regulation can lead to quick response with less energetic cost compared to protein-mediated regulation. Several recent studies show that the gene expression often involves dual strategies combining both protein- and sRNA-mediated regulation.shimoni ; bose2
The process of gene regulation is, in general, non-linear and involves complex networks consisting of a number of genes, proteins, and sRNAs, which are themselves extensively regulated by proteins, sRNAs etc. Despite the complexity, it is possible to break down these complex regulatory networks into smaller sub-networks that function as basic building blocks of the bigger network. The sub-networks, typically known as network motifs, can then be analysed and based on the dynamics of individual motifs one may attempt to understand the dynamics of the entire complex network. The network motifs often have recurrent occurrences and typically certain specific types of network motifs, such as feed-back loops, feed-forward loops (FFL), are over-represented. It is believed that these frequently occurring sub-networks are naturally chosen over others as they provide distinct evolutionary advantages such as speeding up the response time, dampening of noise, etc.alon1 ; mangan ; shen . FFLs involving transcriptional regulators have been studied extensively in the past alon2 . In a transcriptional FFL (referred as tFFL from now on), a protein regulator X transcriptionally activates protein Z directly and also indirectly through the transcriptional activation of an intermediate protein regulator Y which in turn activates Z transcriptionally. An FFL where X positively (or negatively) regulates Z both directly and indirectly, is known as the coherent FFL. FFLs where the direct and indirect paths have opposite regulatory effects are referred as incoherent FFLs. Further, coherent (or incoherent) FFLs can be of different types depending on the nature of interactions (activating or inhibiting) in the individual paths mangan . FFLs involving purely transcriptional regulation are found, for example, in L-arabinose utilisation system schleif . In addition to tFFL, there are other types of FFLs where Y is an sRNA regulating the translation or the stability of the target mRNA (of Z). Such FFLs will be referred in the following as sRNA-mediated FFL (smFFL) from hereon. Another interesting variation is an FFL, wherein the upstream regulator (for example, X, as introduced above) is an sRNA that drives the entire FFL. sRNA, in general, may activate or repress translation or the stability of downstream mRNAs. We shall refer to such FFL in the following as sRNA-driven feed-forward loop (sFFL). The comparison between the two variants, sFFL and smFFL, is presented in figures (1A) and (1B), respectively. Through mathematical and computational modelling, a significant progress has been made in understanding the role of FFL in gene regulation. For example, mathematical modelling suggests that a coherent tFFL can reject transient activation in X and respond only to persistent activation of X mangan . Similar studies also elucidate diverse beneficial features of smFFL like a quick to response to sudden changes in the input signal which is advantageous under transient stress conditions shimoni , the ability to dampen the fluctuations under different contexts osella ; caselle ; caselle2 ; samsonov ; marinari etc.. In this work, using different types of modelling schemes, we study two different aspects of the sFFL; one associated with the response of the system under different types of input signal, the other associated with the noise characteristics of the network.
The sFFL of current interest has a significant role in horizontal gene transfer in Salmonella enterica papenfort2 . As figure (1A) and (1C) show, the FFL is driven by the sRNA, RprA, which is one of the three sRNAs (others being DsrA and ArcZ) that activate the translation of mRNA that codes for the alternative sigma-factor, gottesman3 ; gottesman4 . In this FFL, RprA sRNA drives the regulation in the synthesis of RicI protein through two parallel pathways, both involving translational activation by RprA. RprA binds to mRNA through base-pairing and opens up a translation-inhibitory structure in the -untranslated region (5’-UTR) of the mRNA to facilitate ribosome binding and thereby promote translation papenfort2 . being a transcriptional activator for RicI mRNA up-regulates RicI mRNA synthesis. Finally, RprA, by opening the translation-inhibitory structure of its other target, RicI mRNA, leads to an increased synthesis of RicI protein. Thus, the FFL involves an AND gate mechanism where both and RprA sRNA are essential for the up-regulation of RicI protein. Different types of stress conditions are expected to activate production. The AND gate mechanism, through the involvement of both RprA and , ensures that not every stress condition activating synthesis leads to over-expression of RicI protein.
The inter-cellular transmission of plasmid through bacterial conjugation has an important role in microbial evolution and survival alberts . It is believed that this RprA driven FFL is crucial for regulation in the transfer of plasmid pSLT which encodes several virulence genes in Salmonella. Plasmid transfer via bacterial conjugation is a complex, energy-intensive process consisting of several proteins and RNAs functioning in concert. An important part of this process is the formation of pilus which establishes a contact between the donor and the receptor cells and enables the transfer of genetic material. Although such processes are found to be beneficial in terms of adaptation in the changing environment, due to significant fitness cost, such processes are usually tightly regulated inside the cells papenfort3 . It has been experimentally found that the synthesis of RicI protein is up-regulated in Salmonella treated with bile salt which is a bactericidal agent. Bile salt disrupts the bacterial cell membrane and under such membrane damaging activities, RicI protein interferes with transmembrane assembly that, under a normal condition, leads to pilus formation. Thus, RicI protein provides an extra protection to the bacterial cell by inhibiting energy expensive processes associated with pSLT transfer papenfort2 .
One of the major aims of mathematical modelling is to understand, in general, how a network processes or differentiates different types input signals such as sustained signal, transient signal or oscillatory signal. These studies reveal, for example, the response or shut down time scales of the network under different types of input signals mangan or sometimes more drastic response of the network depending on the details of the input signal lingchong . Such temporal behaviour can be found out by solving the differential equations that describe how the densities of various regulatory components change with time. The differential equation framework essentially captures the time variation of the average concentrations of various regulatory molecules. The gene expression is, however, inherently noisy due to the probabilistic nature of various biochemical processes involved and the noise may lead to significant fluctuations in concentrations of the regulatory molecules raj . These fluctuations are especially relevant when the number of regulatory molecules is small in which case the noise may cause significant deviations in the number of molecules from the respective average values. An important question in this case is that how, despite the inherent stochasticity, the network controls the target protein level reliably. Interestingly, it has been found that there are specific network motifs such as negative feed-back loop or coherent transcriptional FFL that can filter or dampen fluctuations in comparison to many other network motifs alon1 ; bose1 . In this context, one of the pertinent questions is how the gene expression noise contributes to the target protein fluctuation in sFFL.
The aim of the present work is two-fold. The first objective is to use the differential equation description to quantitatively understand how the response of sFFL differs from the other types of FFLs, such as tFFL or smFFL. Interestingly, we find that the response of sFFL to an input signal is, in general, strong and rapid compared to tFFL or smFFL. Furthermore, it is possible to find approximate mathematical solutions for various concentrations by solving a system of coupled, non-linear equations under specific types of input signals. We show that the mathematical solutions agree reasonably well with exact numerical solutions of the differential equations. Such mathematical solutions describe how the concentrations depend on various interactions, thereby reducing the need of further analysis on parameter variation. The second objective of the work is to explore the noise processing characteristics of the sFFL and identify interactions that might be crucial in minimizing fluctuations in the target protein concentration. We address this question by solving the stochastic model analytically using the master equation based generating function approach vankampen . This analysis leads to an interesting observation that the present loop can effectively filter the noise and this noise filtering ability depends crucially on the synthesis rate of the upstream sRNA regulator and the degradation rate of the transcriptional activator of the target protein. In particular, we find a range of values of these parameters over which the noise attenuation becomes optimal. We verify our predictions through stochastic simulations based on Gillespie algorithm (GA), which yields exact results on fluctuations in target protein concentration gillespie1 ; gillespie . The simulation results agree well with mathematical predictions. These observations prompt us to conclude that the master regulator sRNA, not only leads to strong and rapid response through target protein synthesis, but it also plays a significant role in minimizing fluctuations in target protein concentrations.


(C)
2 Results and discussions
2.1 Model and the deterministic description
The sFFL of our interest is shown in figure (1A). This can be compared with the smFFL in figure (1B). The change in concentrations of various regulators such as proteins, mRNAs and sRNAs in sFFL with time is described through the following differential equations
(1) | |||
(2) |
The notations used for various concentrations in these equations are displayed in table 1. In general, the synthesis and degradation rates are denoted by and , respectively. Here, is the Hill function representing transcriptional activation in the synthesis of RicI mRNA by protein and is the ratio of activation and deactivation rates of RicI gene. , also known as the activation threshold, corresponds to the special value of protein concentration at which the transcription rate is the same as of its maximum value.
Protein or mRNA or sRNA | Short form used in the text | Mathematical notation for concentrations |
RprA sRNA | RprA | [Ra] |
mRNA | ||
protein | ||
RicI mRNA | RicIm | |
RicI protein | RicIp |
2.2 Approach to the steady state
The steady-state concentrations of various regulators can be obtained by equating the time derivatives of various concentrations in equations (1) and (2) to zero and solving the resulting algebraic equations. Figure (2) shows a comparison as how different concentrations change with time as they approach the respective steady-state values. For enabling a meaningful comparison, we have chosen the same synthesis and degradation rates for all the regulatory components shen . Unless mentioned otherwise, for all the figures in the following, we assume negligible initial concentrations for all the regulators. As the figure shows, the increase in RicI protein concentration is slower compared to that of protein initially (see also mangan ; papenfort2 ). This lag in RicI production is due to the fact that the transcriptional activation of RicI gene requires production of sufficient amount of transcriptional activator, . The amount of delay in RicI production is, of course, strongly dependent on various biochemical parameters such as (the activation threshold for RicI mRNA transcription), degradation and synthesis rates of various regulatory components.

2.3 Temporal Solution
In order to understand the functionality of the present network motif in comparison with other motifs such as tFFL and smFFL, we solve the differential equations numerically for these three types of motifs. The differential equations describing the dynamics of the tFFL and smFFL are shown in appendix A. For sFFL, we consider differential equations in (1) and (2). Figures (3) and (4) show a comparison as how the target protein concentration approaches the steady state with time in three different cases. In order to have a meaningful comparison, we consider the same parameter values for all the three motifs. While for figure (3), the initial concentration of the upstream, master regulator is low, in case of figure (4) the initial concentration of the upstream regulator is relatively high. Irrespective of the initial condition, the response of the sFFL is the fastest among all the three motifs. The difference in the regulatory patterns seems to originate from the fundamental difference in the mechanism of transcritptional and translational activations. The transcriptional activation that happens through binding to the DNA is expected to cause a delayed response since sufficient concentration of transcriptional activator is required for crossing the activation threshold. In addition to this, due to the saturation kinetics, the transcriptional activation also reaches a saturation value that is independent of the concentration of the transcriptional activator. This results in a delayed response in tFFL that functions through transcriptional activation at three different stages. While in smFFL, the transcription interaction is required for activation of two different genes, in sFFL, only RicI gene is transcriptionally activated. It appears that the fast response of sFFL is linked to a reduced number of transcriptional activation steps associated with the loop. The difference due to the mode of regulation appears more prominently in figure (4) where an increased initial concentration in the upstream regulator leads to a faster and stronger response in sFFL as compared to other FFLs. The peak (non-monotonicity) in the response curve, present in case of smFFL, becomes more prominent in case of sFFL. It might be that for sFFL, the high initial concentration of RprA and the direct interaction between RprA and RicI mRNA lead to a significant translational up-regulation in RicI protein synthesis even when the initial RicI mRNA concentration is low. The sFFL is activated when the cell is exposed to stress due to membrane-damaging activities of bile-salt. RprA, along with which activates RicI transcription, increases the expression of RicI protein. RicI protein localizes at the cytoplasmic membrane and, together with other proteins, blocks the conjugation machinery required for pSLT transfer thereby providing protection to the cell by inhibiting energy-expensive conjugation process. The rapid increase in RicI concentration especially when the initial RprA concentration is high, is possibly crucial for cell’s survival under stress due to bactericidal agents such as bile salt.

(A)

(B)

(A)
(B)
We have, further, obtained approximate mathematical solutions (see appendix B for details) for the target protein concentration in case of sFFL for two different cases; (i) with a constant pool of upstream sRNA regulators and (ii) with sRNA concentrations changing due to its synthesis and degradation processes. Mathematical solutions of such coupled, non-linear equations can be obtained only over restricted parameter space. We show that the mathematical solutions agree reasonably well with exact numerical solutions of the differential equations in (1) and (2) and display a rapid growth in the target protein concentrations for large initial concentration of the upstream sRNA regulator as seen in figure (4).
2.4 Noise processing characteristics
Here we focus on fluctuations in the target protein level in the sFFL of present interest. Using a general notation, we introduce , , , and as the number of sRNA (RprA), mRNA, RicI mRNA, protein and RicI protein molecules, respectively. This set of numbers represents a state of the system and denotes the probability that the system is in a given state specified by these numbers at time . Our fluctuation analysis is based on the master equation which is a differential equation that describes how this probability changes with time osella . The probability changes with time due to various processes such as transcription, translation and degradation of different types of molecules as per the details of the network. The master equation that takes into account all these processes is
(3) | |||||
Here and , in general, represent synthesis and degradation rates, respectively, of different types of molecules. denotes the transcription rate under transcriptional activation by and it is represented by the Hill function as , where , as before, denotes the activation threshold. In order to proceed further, we approximate the Hill function about the average density at the steady state. Under such approximation, we have
(4) |
Next, we introduce the moment generating function
(5) |
with . Various derivatives of the generating function are related to average quantities as and where denotes the mean number of molecules of the th species. Denoting as , one may find the fluctuation in the target protein concentration as . Using (3), it can be seen that the moment generating function satisfies the following equation
(6) | |||||
where . In the steady state (), one may obtain successive moments by taking successive derivatives of equation (6) and then substituting in the resulting equations. In order to obtain the target protein fluctuations, we need to find the first and second moments such as and . The evaluation of these moments becomes complex since an equation for a moment, in general, involves the higher order moments. In the present case, we are able to obtain the target protein fluctuation by evaluating up to fourth moment and ignoring the contributions of the higher order moments (see appendix C for details). The coefficient of variation for the target protein, , has been plotted in figure (5A). The figure shows a minimum indicating an optimal attenuation of fluctuations in the target protein level as is changed. Similar optimal noise filtration can also be seen with respect to , the degradation rate of the transcriptional activator, . Figure (6) shows a 3-dimensional picture of optimum noise filtration as and are changed. These figures show that an optimal noise attenuation is seen only over an intermediate range of values of and and the minimum disappears as one increases the values of these parameters. Further, with similar parameter values, no optimal filtration of noise can be found with respect to the synthesis rates of and (see, for example, figure (5B)).
For understanding the rationale behind such noise processing characteristics of the sFFL, we compare these results with the case where gene regulation involves only translational activation by sRNAs (see figure (11A) in appendix (D)). This is equivalent to considering only the direct pathway in the sFFL through which RprA translationally activates RicI mRNA. This is also how RprA up-regulates translation in the present sFFL while transcripts are synthesized in the cell under various stress conditions. Various moments for such a minimal motif can be obtained in a straightforward way from the details presented in appendix C for the sFFL. The coefficients of variation for the target protein of such a minimal motif and the sFFL are shown in figure (11B) in appendix D. The figure clearly shows that the noise processing characteristics of the simple motif with only translational activation by sRNA are significantly different from that of sFFL considered here. In particular, no optimal noise filtration can be found in case of the minimal motif. Further, for the parameter values chosen here, near the region of optimal attenuation of noise, the coefficient of variation of sFFL becomes approximately of that of the simple case of gene regulation through translational activation. Despite having additional regulatory molecules and hence, associated fluctuations, the sFFL seems to be less noisy as compared to the simple sRNA regulation scheme discussed here. Further, it appears that the topology of the sFFL and the non-linear interactions play a crucial role in noise filtration characteristics.

(A)
(B)



(A)
(B)
The results from the above mathematical analysis are supported by stochastic simulations discussed below. Stochastic simulations based on Gillespie algorithm (GA) is an exact method that allows us to incorporate the probabilistic features in the gene expression in a systematic way gillespie ; gillespie1 . In GA, we take into account different types of molecules such as protein, sRNA, mRNA molecules associated with the regulatory network and the relevant interactions between them. The number of each species of molecules at a given instant of time is denoted by where denotes a specific species of molecules given that there are totally different species of molecules. GA is based on two assumptions; (1) the time interval between two successive reactions is a random variable obeying Poisson Distribution and (2) the specific reaction that occurs at a given instant of time is selected randomly. In the simulation, this is executed by drawing two random numbers at each simulation step; one random number, , is used to decide the time interval between two successive reactions and the other random number, , is used to decide which reaction would occur in the next time step. Denoting a specific reaction by , where , one may define a probability which indicates the probability of occurrence of the th reaction in the time interval and . Here is expressed as the product of two factors; one is the reaction rate and the other factor is the number of possible type reactions. Thus at each simulation step, the time interval before the next reaction is determined and depending on the value of and the various reaction rates , the next reaction is chosen. Subsequently, based on which reaction has occurred, the values of appropriate are updated. We allow the system to evolve over simulation steps and keep a record of the number of sRNA, mRNA, target protein number after every simulation steps leaving about initial steps. The details of the reactions considered and the corresponding rate constants are presented in appendix (E). The coefficients of variation for the target protein is shown in figures (7A) and (7B). Consistent with the mathematical analysis (figure (5A)), figure (7A) also shows an optimal noise attenuation with , the sRNA synthesis rate. As figure (5B), figure (7B) shows no minimum in the coefficient of variation as i.e. the synthesis rate of protein is changed.
The noise processing characteristics of different types of FFLs have been studied in the past. Using a Langevin description, it was shown that among various types of tFFLs, the coherent tFFL with all three activating regulatory interactions, was the least noisy among other kinds of coherent tFFL bose1 . In particular, it was shown that the activating nature of all regulatory interactions is responsible for such maximal noise reduction. Recently, noise processing characteristics have also been studied in detail for incoherent micro RNA (miRNA) mediated FFL osella ; caselle ; caselle2 ; marinari . Unlike Fig. 1B, in such miRNA mediated FFL, the miRNA represses the target protein synthesis by binding to the mRNA of the target protein and degrading it subsequently. Since sRNA/miRNA mediated target repression is a common scenario in gene regulation, there have been efforts in the past to understand the fundamental differences between the miRNA/sRNA mediated and TF mediated gene repression hwa . Currently, it is well established that, in general, sRNA-mediated repression reduces fluctuations in the target protein level as compared to TF mediated repression. Physically, this happens since in case of TF mediated repression, occasional transcriptional leakage is amplified due to translation causing large fluctuations in the protein concentration. In contrary to this, in case of sRNA-mediated repression of the target mRNA, the mRNAs are rarely translated leading to a relatively smooth gene expression. This feature of reducing fluctuations in the target protein level is also found in incoherent miRNA mediated FFL osella . Here, while the fluctuations in the top-tier transcription factor introduces similar variations in both miRNA and mRNA levels, the miRNA due to its repression activity reduces the target protein fluctuations against the transcription factor fluctuations. This leads to a significant noise attenuation in miRNA mediated FFL in comparison to gene expression regulated by transcription factor alone. Additionally, it was also shown in osella that the noise-filtration in incoherent miRNA mediated FFL becomes optimal over a range of miRNA mediated repression strength, miRNA concentration and the transcription factor concentration. In contrary to the earlier work, the sFFL considered here has all the regulatory interactions activating in nature. Interestingly, however, this motif also shows a significant noise attenuation at the target protein level as compared to a much simpler scenario where the gene regulation involves only translational activation by sRNA. From this comparison, it appears that such noise processing behavior of the sFFL is a result of a complex interplay of the non-linear interactions present in the two branches of the sFFL. It would be of considerable interest to explore if, in the realistic scenario, the parameter values are tuned in a specific way that leads to maximum noise attenuation in the target protein level. We believe that the mathematical results for the fluctuations determined here can be used for exploring such possibilities.
3 Conclusion
In this paper, we have studied an sRNA-driven feed-forward loop (sFFL) through mathematical and computational modelling. Here the upstream, master regulator is an sRNA, RprA, which activates the synthesis of the final, target protein, RicI, using two parallel pathways; one, directly through translational activation of the RicI mRNA and the other, via translational activation of , which, in turn, functions as a transcriptional activator of RicI protein. Thus unlike other FFLs involving sRNAs as regulators at an intermediate level and proteins as the upstream regulator, here the upstream regulator is an sRNA while the intermediate regulator is a protein . This kind of an FFL was found only recently; and it is believed that such an FFL plays an important role in horizontal gene transfer in S. enterica. During horizontal gene transfer, bacterial conjugation happens through pilus formation, which is an energy-expensive process. In the presence of bactericidal agents, the RicI interferes with pilus formation and thereby inhibits horizontal gene transfer. This is one way to protect the bacteria during stress. Hence, a quick response to stress necessitates a rapid increase in the levels of RicI. Our work presented here, allows us to rationalize how RprA driven FFL could be a productive strategy to achieve this control to rapidly interfere with pilus formation. In principle, this would be general to any sRNA-driven FFL.
While extensive studies to understand the general characteristics of FFLs, such as purely transcriptional FFL (tFFL) or sRNA-mediated feed-forward loop (smFFL) were carried out previously, FFL driven by sRNA as the top-tier regulator, being a new variant, has not been subject of extensive investigations. To the best of our knowledge, the present work is the first, detailed modelling-based analysis of an sFFL. A major finding of this analysis is that an sFFL is capable of producing a strong and rapid response in terms of enhancing target protein levels, compared to that by tFFL or by smFFL. By comparison, we believe, this would be a generic feature that is linked due to the number of transcriptional activation steps present in sFFL as against tFFL or smFFL. While sFFL involves transcriptional activation of only one gene, for tFFL and smFFl two or more genes are transcriptionally activated. The saturation kinetics and the delay in response due to the presence of the activation threshold appear to be the reasons for delayed and weak response in case of transcriptional activation. This difference in the mode of regulation is captured by the mathematical equations for the respective FFLs and such generic features can be compared meaningfully, when the same parameter values are used for all three cases. A high sensitivity to the initial concentration of the upstream regulator, i.e. sRNA, and therefore a rapid initial response in the target protein level is an additional characteristic of sFFL. We have obtained explicit mathematical solutions describing how the concentrations of various regulatory molecules change with time. These solutions clearly demonstrate the sensitivity of the target protein concentration to the initial conditions. This phenomenon may be rationalized as follows in the context of RprA driven inhibition of pilus formation in S. enterica. This sFFL is triggered during membrane-damaging activities of the bactericidal agent, bile salt, in order to arrest pilus formation by increasing RicI level rapidly. Hence, a sensitive and rapid response to the initial-conditions (concentration of sRNA) might be a necessary strategy for the cell to respond to stress. These insights can also be tested experimentally. A plasmid vector can be designed in a manner that the expression of RprA under the control of arabinose, can lead to the synthesis of and RicI protein. In order to examine the levels of these proteins, reporter genes such as lacZ, GFP and RFP can be fused with the genes coding for and RicI resulting in the synthesis of fluorescent labeled proteins. Such a design would allow a direct validation of the model proposed here.
Furthermore, the RprA driven FFL is expected to function reliably despite the presence of noise in gene expression. This implies that the FFL should ideally filter out the noise such that the target protein (RicI) concentration does not fluctuate significantly from the average concentration level. Keeping this in mind, we analysed the noise characteristics of the sFFL through master equation based modelling and stochastic simulations. We find that the network indeed filters out the noise and this aspect depends significantly on the synthesis rate of the upstream regulatory sRNA and the degradation rate of the transcriptional activator of the target protein. More specifically, an optimal attenuation of noise can be achieved by varying the synthesis rate of sRNA or by varying the degradation rate of the transcriptional activator. In order to have a quantitative comparison of the noise filtering ability of the present loop with a simple sRNA-driven mode of regulation, we consider gene expression regulated through translational activation by sRNAs alone. We find that, in this case, the coefficient of variation in the target protein number is significantly different from that of sFFL and, in particular, no optimal noise attenuation can be found. It appears that an optimal noise attenuation in sFFL is a result of a complex interplay of the non-linear interactions present in the two branches of sFFL. The results from this stochastic analysis can also be verified experimentally by extending the experimental design mentioned before suitably and quantitating the amount of mRNA and protein of RicI as a function of RprA concentration. Overall, the present work suggests that the function of sFFL is critically governed by the sRNA - not only in generating a speedy and strong response but also in producing a reliable response by regulating the gene expression noise. The prediction about the critical role of sRNA in noise filtering raises new questions, such as, how the concentration of the top-tier sRNA is regulated in the cell in order to achieve optimal noise filtration through the FFL. Besides, this being a general model, the insights obtained from the present study will be applicable to other sRNA-driven coherent FFLs that might be discovered in the future and also for designing artificial networks for optimal regulation.
Conflict of interest: Authors declare no conflict of interest.
Acknowledgement ST and SM thank DBT, India for financial support through grant no. BT/PR16861/BRB/10/1475/2016.
APPENDIX
Appendix A Purely transcriptional feed-forward loop and sRNA-mediated feed-forward loop
In figures (3) and (4) of the main text, we have shown how the target protein concentration changes with time for tFFL and smFFLs. In the following, we present the differential equations that describe the dynamics of the tFFL and smFFL. A schematic representation of the tFFL network is shown in figure (8). The parameters, and , in general, represent the synthesis and degradation rates, respectively, of various regulatory molecules. Parameters denoted by in Hill functions are, in general, related to the activation threshold.
A.1 Transcriptional feed-forward loop (tFFL)
The following equations describe the tFFL.
(7) | |||||
(8) | |||||
(9) | |||||
(10) | |||||
(11) |
, , , , and represent the concentrations of the upstream transcriptional activator, -mRNA, -protein, -mRNA, and -protein, respectively. transcriptionally activates the synthesis of -mRNA; and are both required for transcriptional activation of .

A.2 sRNA-mediated feed-forward loop (smFFL)
Equations describing smFFL are
(12) | |||||
(13) | |||||
(14) | |||||
(15) |
Here, [x], [s], [m], and [p] denote the concentrations of the upstream transcriptional activator, sRNA, target protein mRNA and target protein, respectively. translationally activates sRNA synthesis as well as the synthesis of the target protein transcripts. Finally, the sRNA translationally activates the synthesis of the target protein, .
Appendix B Details on Temporal Solutions
Here we find the response of the network under persistent or time-varying input signals. We assume two possible scenarios as a response to stress; (a) a constant pool of sRNA, RprA, (b) the concentration of sRNA, RprA, varies due to its synthesis and degradation. Under these conditions, we explicitly find the solutions of the differential equations that describe how the concentrations of various regulators change with time.
B.1 Under a constant pool of sRNA
Assuming a constant solution for RprA as , and using a solution for , we find
(16) |
where and are the integration constants. Considering negligible initial concentration of mRNA and protein, we find and . Here and . The concentration of RicI mRNA can be found in a similar way starting with . The solution for can be written as
(17) |
where is the integration constant, and . For the integration, we have used . turns out to be negative upon substituting the expression for in the definition of . While obtaining (17), we have assumed that the degradation constant of RicI mRNA, is twice smaller than the degradation constants of mRNA and protein and , respectively and . The final solution for RicIm is
(18) |
Assuming RicIp has a solution of the form , we have the following differential equation satisfied by
(19) |
where , and . Solving this equation, we finally find the solution for RicIp as
(20) | |||||
where is the integration constant.
In order to obtain (20), we have assumed twice smaller than . Considering initial concentrations of all the regulatory molecules to be negligible, we have compared numerical and mathematical solutions for RicIm and RicIp in figure (9). The numerical solutions provide an exact picture as how various concentrations vary with time. Based on and is approximately twice smaller than or , we have obtained an approximate form of the integrand in equation (17). Due to this approximation, the mathematical solutions deviate slightly from the exact, numerical solutions. Since the deviations are small, we assume that (18) and (20) provide a reasonable quantitative description of time-variation in the target protein concentrations.

(A)

(B)
B.2 Under time-varying concentration of sRNA
Here, we find analytical solutions of the coupled, non-linear equations in (1) and (2) of the main text. Unlike the previous section, Rpra concentration is non-constant and it is governed by the equation
(21) |
The solutions for and are as follows.
(22) |
With these equations, the equation for protein is as follows
(23) |
The solution for is
(24) |
where various constants are as given below.
(25) | |||
(26) |
with and being the initial concentrations of Rpra and . The equation for RicIm can be solved by assuming a solution of the form with satisfying the equation
(27) |
The solution for is
(28) | |||||
where .
In order to have explicit mathematical solutions for different concentrations, we assume and . Under these conditions, the integration for can be done exactly. We find
(29) | |||||
when . The solution for RicI mRNA is thus
(30) | |||||
for . To find , we numerically integrate the equation
(31) |
using the mathematical solution for in (30). The change in various concentrations are shown in figure (10).

(A)

(B)

(C)

(D)
Appendix C Moments
In the following, we show our results for first, second, and third moments derived using the generating function formulation described in sub-section (2.4) of the main text.
(32) | |||||
(33) | |||||
(34) | |||||
(35) | |||||
(36) | |||||
(37) | |||||
(38) | |||||
(39) | |||||
(40) | |||||
(41) | |||||
(42) | |||||
(43) | |||||
(44) | |||||
(45) | |||||
(46) | |||||
(47) |
(48) | |||||
(49) | |||||
(50) | |||||
(51) | |||||
(52) | |||||
(53) | |||||
(54) | |||||
(55) | |||||
(56) | |||||
(57) | |||||
(58) | |||||
(59) | |||||
(60) | |||||
(61) | |||||
(62) | |||||
(63) | |||||
(64) | |||||
(65) |
Appendix D Translational activation by an sRNA
Here we consider a simple mechanism where the gene expression only involves translational activation by sRNA. In other words, sRNAs bind to the mRNAs and activate translation (see Fig. (11A)). The differential equations describing variations of different concentrations with time are given below.
(66) | |||
(67) | |||
(68) |
where , , and denote concentrations of sRNA, mRNA, and protein, respectively. The stochastic analysis can be performed in a similar way as described in the main text. In this case, the number of moments required to find the coefficient of variation is smaller compared to that of sFFL and one requires the moments , , , , , , , , , , , and to find the coefficient of variation. The coefficient of variation for the target protein has been plotted in figure (11B).

(A)
(B)
Appendix E Reaction Scheme
Here, we list different biochemical reactions considered in stochastic simulations. The values of various rate constants used for figure (7A) are mentioned inside the brackets.
(70) | |||
(71) | |||
(72) | |||
(73) | |||
(74) | |||
(75) | |||
(76) | |||
(77) | |||
(78) | |||
(79) | |||
(80) | |||
(81) | |||
(82) | |||
(83) | |||
(84) |
Here, and denote the inactivated and activated form of the gene synthesising . For figure (7B), we choose .
References
- (1) Storz G, Altuvia S, Wassarman KM 2005 An abundance of RNA regulators. Annu. Rev. Biochem. 74 199-217
- (2) Gottesman S 2005 Micros for microbes: non-coding regulatory RNAs in bacteria Trends Genet. 21 399-404
- (3) Papenfort K and Vanderpool C K 2015 Target activation by regulatory RNAs in bacteria FEMS Microbiol. Rev. 39 362-78
- (4) Altuvia S 2004 Regulatory Small RNAs: the Key to Coordinating Global Regulatory Circuits J. Bacteriol. 186 6679-80
- (5) Shimoni Y, Friedlander G, Hetzroni G, Niv G, Altuvia S, Biham O and Margalit H 2007 Regulation of gene expression by small non-coding RNAs: a quantitative view Mol. Syst. Biol. 3 1-9
- (6) Levine E, Zhang Z, Kuhlman T and Hwa T 2007 Quantitative Characteristics of Gene Regulation by Small RNA PLoS Biol. 6 e5
- (7) Iyengar B R, Pillai B, Venkatesh K V and Gadgil C J 2017 Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs Mol. BioSyst. 13 1235-45
- (8) M Pal, S Ghosh and I Bose 2016 Functional characteristics of gene expression motifs with single and dual strategies of regulation Biomed. Phys. Eng. Express 2 025009
- (9) Alon U S 2006 An introduction to systems biology: design principles of biological circuits (London, UK: Chapman and Hall/CRC)
- (10) Mangan S and Alon U 2003 Structure and function of the feed-forward loop network motif Proc. Natl. Acad. Sci. U S A. 100 11980-5
- (11) Shen-Orr S S, Milo R, Mangan S and Alon U 2002 Network motifs in the transcriptional regulation network of Escherichia coli Nat. Genet. 31 64-8
- (12) Alon U 2007 Network motifs: theory and experimental approaches Nat. Rev. Genet. 8 450-61
- (13) Schleif R 2000 Regulation of the L-arabinose operon of Escherichia coli Trends Genet. 16 559-65
- (14) Osella M, Bosia C, Cora D and Caselle M 2011 The Role of Incoherent MicroRNA-Mediated Feed-forward Loops in Noise Buffering PLoS Comput Biol. 7 e1001101
- (15) Riba A, Bosia C, Baroudi M E, Ollino L, Caselle M 2014 A Combination of Transcriptional and MicroRNA Regulation Improves the Stability of the Relative Concentrations of Target Genes PLoS Comput Biol. 10 e1003490
- (16) Cora D, Re A, Caselle M, and Bussolino F 2017 MicroRNA-mediated regulatory circuits: outlook and perspectives Phys. Biol. 14 045001-11
- (17) Duk M, Samsonova M and Samsonov A 2014 Dynamics of miRNA driven feed-forward loop depends upon miRNA action mechanisms BMC Genomics. 15 S9
- (18) Grigolon Silvia, Di Patti Francesca, De Martino Andrea and Marinari Enzo 2016 Noise Processing by MicroRNA-Mediated Circuits: the Incoherent Feed-Forward Loop, Revisited Heliyon 2 e00095 (revisited)
- (19) Papenfort K, Espinosa E, Casadesus J and Vogel J 2015 Small RNA-based feed-forward loop with AND-gate logic regulates extrachromosomal DNA transfer in Salmonella Proc. Natl. Acad. Sci. U S A. 112 E4772-81
- (20) Majdalani N, Hernandez D and Gottesman S 2002 Regulation and mode of action of the second small RNA activator of RpoS translation, RprA Mol. Microbiol. 46 813-26
- (21) Majdalani N, Chen S, Murrow J, John K S and Gottesman S 2004 Regulation of RpoS by a novel small RNA: the characterization of RprAMol. Microbiol. 39 1382-94
- (22) Alberts B, Johnson A, Lewis J, Raff M, Roberts K and Walter P 2008 Molecular biology of the cell (New York: Garland Science Taylor and Francis Group)
- (23) Frohlich K S and Papenfort K 2016 Interplay of regulatory RNAs and mobile genetic elements in enteric pathogens Mol. Microbiol. 101 701-13
- (24) Zhang C, Tsoi R, Wu F and You L 2016 Processong oscillatory signals by incoherent feedforward loops PLoS Comput Biol. 12 e1005101.
- (25) Raj A and Oudenaarden A van 2008 Nature, nurture, or chance: Stochastic gene expression and its consequences Cell 135 216-26
- (26) Ghosh B, Karmakar R and Bose I 2005 Noise characteristics of feed forward loops Phys. Biol. 2 36-45
- (27) Kampen N G van 2007 Stochastic processes in physics and chemistry (Amsterdam: Elsevier)
- (28) Gillespie D T 1976 A general method for numerically simulating the stochastic time evolution of coupled chemical reactions J. Comput. Phys. 22 403-34
- (29) Gillespie D T 1977 Exact stochastic simulation of coupled chemical reactions J. Phys. Chem. 81 2340-61