Metabolic function optimization

The significance of this result is that, instead of searching for the optimal control among piecewise continuous functions taking values between 0 and 1, one only has to consider functions that take the extreme values 0 and 1. Furthermore, in the case study considered, the optimum is shown to have only one switch between 0 and 1. The search for the optimum therefore reduces to finding the switching instant, t_reg, that leads to the maximum final yield. Given the structure of the metabolic network, this is intuitive: the optimum is achieved by first devoting all cell resources to population growth and, after t_reg, redirecting them to the desired production. If t_reg is too small, the desired production rate is high for a longer time, but the cell population to which it applies is small. If t_reg is too large, there are many cells available for production, but they act only during a short time interval. Hence, there is an optimal value for t_reg.

As mentioned in the Background section, a major problem is the high level of uncertainty in the knowledge about metabolic network dynamics. We therefore consider different optimization algorithms that assume various degrees of information about the system to be optimized. The first is direct optimization. It assumes complete knowledge of the system and is included to establish a benchmark against which the other methods can be compared. The other two methods are variants of a bi-level algorithm designed to accommodate missing information on the network kinetics. The two variants differ in the type of inner optimization: Geometric Programming in one case and Linear Programming in the other. Both lead to good approximations of the optimal control, with a slight advantage for the one relying on Geometric Programming.
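The trade-off in the choice of t_reg described above can be illustrated with a deliberately simple toy model. The sketch below is not the prototype network: the two-phase dynamics, the growth rate `mu`, the production constant `k` and the initial biomass `x0` are all assumptions made for illustration, and the grid search merely stands in for whatever optimizer is actually used.

```python
import math


def final_yield(t_reg, t_final=10.0, mu=0.5, k=1.0, x0=0.1):
    """Final product of a toy model with a single 0 -> 1 switch at t_reg.

    Assumed toy dynamics (NOT the paper's network):
      u = 0 on [0, t_reg):        biomass grows,      dX/dt = mu * X
      u = 1 on [t_reg, t_final]:  biomass is frozen,  dP/dt = k * X
    """
    biomass_at_switch = x0 * math.exp(mu * t_reg)
    return k * biomass_at_switch * (t_final - t_reg)


def best_switch(t_final=10.0, mu=0.5, steps=10_000):
    """Grid search for the switching instant that maximizes the final yield."""
    candidates = [t_final * i / steps for i in range(steps + 1)]
    return max(candidates, key=lambda t: final_yield(t, t_final=t_final, mu=mu))


t_star = best_switch()
print(f"best t_reg ~ {t_star:.3f}")
print(f"yield when switching too early (t_reg = 2):   {final_yield(2.0):.3f}")
print(f"yield at the best switching instant:          {final_yield(t_star):.3f}")
print(f"yield when switching too late  (t_reg = 9.9): {final_yield(9.9):.3f}")
```

For this toy model the optimum can also be found by hand: setting the derivative of exp(mu * t_reg) * (t_final - t_reg) to zero gives t_reg = t_final - 1/mu, which the grid search should reproduce to within the grid resolution.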
The optimization strategies were tested on a prototype network that is a modified version of one previously suggested in [16]. This network was chosen because it is widely used as a test benchmark for optimization algorithms. A graphical representation of the network is shown in Figure 1; it is associated with a set of ordinary differential equations.

Figure 1. Prototype network. The circles correspond to metabolites and the arrows to fluxes, with the reaction rates indicated.

Assume that x_3 represents a precursor of the cellular objective, such as growth, and x_5 the desired product. If u(t) is biased towards the v_2 branch, x_3 is formed but there is little or no production of x_5. If u(t) is biased towards the v_3 branch, the production of x_5 is limited by the low concentration of x_3, since there is a feedforward interaction. Thus, there is an optimal profile for u(t) that maximizes the concentration of x_5 at the final time t_final. In the framework of S-systems [16], the prototype network is described by a model in which β_i are the rate constants and g_ij and h_ij are the kinetic orders. Table 1 shows the list of parameters. Direct optimization uses model (2) with the set of parameters from Table 1.

Figure 2. Results of the simulation using direct optimization. The final product concentration is shown as a function of t_reg. The dotted line marks the value of t_reg that gives the maximum yield.

It is clear from Figure 2 that there is an optimal value of the regulation time t_reg that maximizes the yield of x_5. If u(t) switches from 0 to 1 before this optimal instant, the biomass formed will not be enough to maximize x_5(t_final). On the other hand, if u(t) switches from 0 to 1 after it, there will be more biomass but not enough time to produce the maximum possible amount of x_5.

Figure 3. Comparison of three u(t) profiles. Three time profiles for the control function u(t) (above) and the corresponding product yield (below).
The solid line is the optimal t_reg obtained by direct optimization.

The bi-level optimization was used to test all the possible values of t_reg. Comparing Figure 4 with Figure 2 shows that the profiles remain similar: the final product yield, x_5(t_final), increases with t_reg until the optimal value is reached and then starts decreasing.

Figure 4. Result of the optimization using the inner optimization with Geometric Programming (left) and Linear Programming (right). The profiles of the production of x_5 remain similar to those of the simulation using direct optimization.

As shown mathematically in the Methods section, the optimal control function is either 0 or 1, provided that the dynamics depend linearly on the control and the cost to be optimized has only a final term. In this case the dependency of the Hamiltonian function on u is linear, as given by (8) below. Figure 5 shows a plot of ϕ(λ(t), x(t)) obtained with a near-optimal control function u(t). This function changes sign at the optimal control switching instant t_reg. Thus, the optimal control is attained at the extremes of the allowed interval and, furthermore, a single switch from 0 to 1 is enough to achieve it.

For a class of networks in which the yield of the product that favors cell population growth (the "natural" product) competes with the yield of the desired product, and in which the manipulated variable affects the fluxes linearly, it has been shown that the optimal control assumes only extreme values. While implementing this optimal control poses no challenge for in silico metabolic networks, real metabolic networks require complex bioengineering skills. Gene knockout manipulations are not adequate for this kind of control problem because of the long time scale associated with these techniques.
The manipulation of specific enzyme levels, controlled by modulating the expression of the corresponding genes using promoter systems and inducers, is a possible solution to this kind of control problem [14]. A bi-level optimization strategy that maximizes the natural product in the inner level by manipulating the fluxes leads to a good approximation of the optimal solution, with the advantage of not requiring full knowledge of the network model. Real networks are extremely complex and exhibit relations between metabolites that are not always expected or fully understood. This emphasizes the need for good in silico models and for determining exactly which branches should be modified when optimizing a network. Although the example network is very simple, it proved useful for testing the optimization strategies; a more complex network should be used to confirm that the strategy scales to larger networks.

The solution of the optimization problem is obtained using different approaches. Before this, Pontryagin's Maximum Principle is invoked to establish a particular form of the optimal control function for the class of problems at hand. The control function is then optimized in order to obtain a maximum yield of biomass at the end of the run time t_final. Three methods, assuming various levels of information about the network, are considered for this purpose. The first method, direct optimization, is used as a benchmark against which the results of the other methods are compared. The last two methods rely on bi-level optimization and illustrate a possible solution to the optimization problem when the information about the network is incomplete.

The first method, direct optimization, is used mainly as a benchmark for the methods that follow.
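The form of the optimal control that Pontryagin's Maximum Principle establishes for this class of problems can be sketched for a generic control-affine system. The functions f, g and ψ below are placeholders introduced for illustration, not the expressions of the prototype network, and the display is a standard textbook argument rather than a reproduction of the paper's own equations.

```latex
\begin{align}
\dot{x}(t) &= f\bigl(x(t)\bigr) + g\bigl(x(t)\bigr)\,u(t),
  \qquad 0 \le u(t) \le 1, \\
J &= \psi\bigl(x(t_{\mathrm{final}})\bigr)
  \qquad \text{(final cost only, to be maximized)}, \\
H(\lambda, x, u) &= \lambda^{T} f(x)
  + \underbrace{\lambda^{T} g(x)}_{\phi(\lambda,\,x)}\; u, \\
\dot{\lambda}(t) &= -\frac{\partial H}{\partial x},
  \qquad
  \lambda(t_{\mathrm{final}}) =
  \left.\frac{\partial \psi}{\partial x}\right|_{x(t_{\mathrm{final}})}, \\
u^{*}(t) &=
  \begin{cases}
    1, & \phi\bigl(\lambda(t), x(t)\bigr) > 0, \\
    0, & \phi\bigl(\lambda(t), x(t)\bigr) < 0.
  \end{cases}
\end{align}
```

Because H is linear in u, maximizing it over the interval [0, 1] always selects an endpoint, and the sign of the switching function ϕ decides which one. A single sign change of ϕ from negative to positive then corresponds to a single switch of u from 0 to 1. The sketch leaves aside singular arcs, where ϕ vanishes over a whole time interval.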
Direct optimization assumes that all the information about the network kinetics is known, so the system of differential equations described in (2) is used. The value of t_reg that results in the maximum product yield is then determined by solving a simple optimization problem. The optimization was tested with two MATLAB functions: fmincon, from the standard Optimization Toolbox, which finds the minimum of a constrained nonlinear multivariable function, and simannealingSB, from the Systems Biology Toolbox [17], which performs simulated annealing optimization.

The bi-level optimization algorithm was structured so as to accommodate missing information on the network kinetics. The boxed metabolites and fluxes in Figure 1 are a part of the network that might not be fully described in terms of kinetics. In this approach the missing kinetic information is replaced by stoichiometric data, and flux balance analysis is used to obtain the proper flux distribution. An inner optimization then determines the fluxes during the batch time. The first step of the inner optimization process is to define the initial conditions of the input x_1 and the outputs x_3 and x_5. A valid distribution for the fluxes v_1, v_2, v_3 and v_4 is then obtained. The network is then integrated over one time interval, during which the function u(t) and the values of v_1, v_2, v_3 and v_4 are kept constant. The time interval for the integration was defined to be 1 second. The inner optimization process yields the product x_5(t_final) for a given u(t), taking into account a valid approximation of the network dynamics over the simulation time. The detailed flowchart of the inner optimization is shown in Figure 6. The bi-level optimization algorithm is represented schematically in Figure 7. In the first implementation of the bi-level optimization algorithm, the dynamics of the boxed metabolites in Figure 1 are used but, following the algorithm structure, steady state is assumed.
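The two-level structure described above (an inner step that picks a valid flux distribution, held constant over each integration interval, and an outer search over t_reg) can be sketched on a hypothetical three-flux branch. Everything specific here is an assumption made for illustration: the stoichiometry v1 = v2 + v3, the uptake bound `k_uptake * x3`, the parameter values, and the closed-form solution that replaces a call to a Linear Programming solver. It is not the network of Figure 1 nor the algorithm of Figures 6 and 7.

```python
def inner_fluxes(u, x3, k_uptake=0.4):
    """Inner level: closed-form solution of a tiny flux-balance LP (toy).

    maximize   v2                      (flux towards the natural product x3)
    subject to v1 = v2 + v3            (steady state at the branch point)
               v3 = u * v1             (share diverted by the control u)
               0 <= v1 <= k_uptake*x3  (assumed uptake capacity)

    For u < 1 the objective grows with v1, so the upper bound is active.
    For u = 1 every feasible v1 gives v2 = 0; taking v1 at its bound is a
    tie-break chosen for this sketch.
    """
    v1 = k_uptake * x3
    v3 = u * v1
    v2 = v1 - v3
    return v1, v2, v3


def simulate(t_reg, n_steps=20, dt=1.0, x3_0=0.1):
    """Integrate with fluxes and control held constant over each interval dt."""
    x3, x5 = x3_0, 0.0
    for step in range(n_steps):
        u = 0.0 if step * dt < t_reg else 1.0  # single 0 -> 1 switch at t_reg
        _, v2, v3 = inner_fluxes(u, x3)
        x3 += v2 * dt  # natural product (growth precursor)
        x5 += v3 * dt  # desired product
    return x5


def outer_search(n_steps=20, dt=1.0):
    """Outer level: test every admissible switching instant and keep the best."""
    candidates = [i * dt for i in range(n_steps + 1)]
    return max(candidates, key=lambda t: simulate(t, n_steps=n_steps, dt=dt))


best_t_reg = outer_search()
print("best t_reg:", best_t_reg, "final x5:", round(simulate(best_t_reg), 3))
```

Even in this toy setting the final yield rises with t_reg up to an interior optimum and then falls, which is the qualitative shape reported for the bi-level results. In a realistic network the closed-form inner step would be replaced by a proper Linear or Geometric Programming solve.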