 Review
 Open Access
 Published:
Climbing the mountain: experimental design for the efficient optimization of stem cell bioprocessing
Journal of Biological Engineering volume 11, Article number: 35 (2017)
Abstract
“To consult the statistician after an experiment is finished is often merely to ask him to conduct a post mortem examination. He can perhaps say what the experiment died of.” – R.A. Fisher
While this idea is relevant across research scales, its importance becomes critical when dealing with the inherently large, complex and expensive process of preparing material for cellbased therapies (CBTs). Effective and economically viable CBTs will depend on the establishment of optimized protocols for the production of the necessary cell types. Our ability to do this will depend in turn on the capacity to efficiently search through a multidimensional problem space of possible protocols in a timely and costeffective manner. In this review we discuss approaches to, and illustrate examples of the application of statistical design of experiments to stem cell bioprocess optimization.
Background
Stem cells are capable of both replenishing their own numbers, and giving rise to one (unipotent stem cells) or more (multipotent, pluripotent or totipotent stem cells) other cell types. As such, bioprocesses that produce these cells costeffectively, in quantity and with the desired properties, are foundational to efforts to bring tissue engineering and regenerative medicine to the clinic.
Once basic research has provided proof of concept for specific cellbased therapies (CBTs), applied research into the conversion of benchscale protocols into optimized bioprocesses comes to the fore. Promising early clinical trials to treat retinal degenerative diseases with embryonic stem cell (ESC)derived retinal pigmented epithelium have showed encouraging results [1, 2] and have in turn led to further trials that attempt to use CBTs to treat these diseases (reviewed in [3]). Insulinsecreting betalike cells derived from from ESCs are also undergoing phase I/II clinical trials to evaluate their efficacy as a CBT for Type 1 diabetes (trial ID NCT02239354). There are, however, a number of challenges that must be overcome before CBTs can become generally available. Biological, technical and economical factors that need to be addressed have all been expertly reviewed elsewhere [4–7]. These factors ought to be kept in mind even at the earliest stages of stem cell research to facilitate translation towards technically and economically viable CBTs. Two critical but oftenoverlooked metrics for a given stem cell bioprocess are yield, the quantity of output cells of the desired type produced, and sensitivity, the robustness of the process in the face of minor variations in input variables.
Protocol yield – cell production per input cell, per mL of growth medium, per unit cost, etc – is not widely reported in the stem cell literature, but forms an essential step in the understanding of process efficiency. Where the term efficiency is encountered, it is often conflated with the purity of the output population. This is a critical metric in its own right, particularly when as few as 1 in 4000 undifferentiated pluripotent stem cell (PSCs) can lead to teratoma formation [8] for example, but should be distinguished from process efficiency. Monitoring and process refinement around yield can enable dramatic improvements, once this point is recognized [9]. When considering the magnitude of cells required for the replacement of celldense organs, estimated to be upwards of 10^{9} cells per patient per treatment [10], the importance of yield to process viability becomes clear. Given a doubling time of approximately one week during early human fetal development [11], a 90day protocol beginning with one million input cells should theoretically generate in excess of 7^{9} progeny, assuming continuous replication in the absence of cell death. While this example demonstrates that the quantities of material required for CBT are attainable in principle, it must also focus attention on opportunities for improvement in processes that fall short of these numbers. To have impact beyond the laboratory, stem cell bioprocesses will require yield optimization across a broad array of input parameters.
In turn, sensitivity directly impacts process reproducibility, currently a major concern in scientific publishing [12]. Cases of scientific fraud notwithstanding, it is likely that for the majority of processes that might be considered poorly reproducible, they exist in a highly sensitive region where small variations in one of potentially many process inputs (e.g. bioactive cytokine concentration, oxygen tension) can lead to drastic changes in output (Fig. 1). Where simple publication of an unreliable protocol can have negative reputational effects and lead to lost time and resources, attempts to translate such a protocol to the clinic can have farreaching impacts on both patient health, and the financial viability of the organization responsible. Understanding to which inputs the process is most sensitive is essential for both good science, and the robust and reliable production of cells for therapeutic applications.
A review by Placzek et al. details many of the design principles required to translate stem cell bioprocessing into viable commercial products. Considerations toward process components such as cells and scaffolds, and process requirements including automation, characterization, harvesting and storage are detailed thoroughly [13]. The complexity of stem cell bioprocessing requires the examination of these multiple components that must be controlled to arrive at the correct state of the cell at the end of the process. Given this, it is important that careful thought be given to the design of experiments used to understand stem cell bioprocessing systems. Statisticians have been giving serious thought to such issues for many decades, developing a field of research known as design of experiments (DOE) or experimental design [14].
DOE methods cover a range of activities that relate to the logical choice of experiments with which to explore a system or test hypotheses about a system. In this review we highlight some important concepts of experimental design, and show how incorporating DOE techniques into stem cell bioprocessing can help answer fundamental questions about stem cell biology and facilitate the translation of basic and proofofconcept research in stem cell bioprocessing.
Design of experiments
Background
In a basic research setting, experiments are commonly planned in an informal, ‘intuitive’ manner. Traditional experimentation in stem cell biology, as elsewhere, has typically been conducted using a onefactoratatime (OFAT) approach. Under such an approach, attempts are made to hold every factor (variable) constant except for the target of investigation as this one factor is varied and the resulting output measured. This method can elucidate important biological ‘main effects’, but important effects from interactions between factors end up as part of the error term. Additionally, the complexity of stem cell bioprocessing requires the examination of numerous input variables that must be controlled to arrive at the correct state of the cell at the end of the process. While many investigations into optimized stem cell bioprocessing have used the OFAT method to substantially improve both purity and yield [9, 15–21], the involvement of multiple inputs (e.g., signaling pathways, oxygenation, duration of individual steps and the overall process, shear effects) means that understanding the interactions between factors will be necessary to optimize increasingly complex protocols.
Consider the optimization of two variables in a stem cell bioprocess as shown graphically in Fig. 2. An OFAT approach would take us first in the direction of one axis, and then once optimized along this axis, perpendicular in the direction of the other. If we have luck on our side, and begin our exploration in a sensible place, we can arrive at the global maximum, thus finding settings of the two input variables tailored to optimizing our output variable. However, more likely, at the end of the experimental process we would find ourselves to be in fact at a local maximum or pseudooptimum (as in Fig. 2 a). A better solution to finding the optimum could be achieved by considering a more thoughtful two factor experiment, or factorial design (Fig. 2 b). Such an approach, as well as leading to a better estimate of the optimum, also allows interactions between important variables in the culture to be estimated. A more rigorous process of determining where to place these experimental points and how to analyze the response is discussed below.
Response surface methodology
In many situations experimental outputs can be noisy, and there may be many inputs of interest. In such cases, statisticallybased experimental planning can result in much more informative data, in the sense that the selection of data points can be tuned to maximize information content relevant to the research questions of interest. The typical framework in which the DOE problem is set consists of k factors that are believed to have the potential to influence a given process output, y. Typically, each factor is assigned a small integer number of levels, l (e.g, {0,1} for l=2, or {1, 0, 1} for l=3). The choice of experimental design then depends upon which among the many possible designs optimizes some criteria quantifying the amount of information that can be expected. This criterion is often based upon the precision or accuracy of the input variable estimates or predictions that can be made from the fitted model about the output variable.
We first consider the relationship between the output y, and each of our factors x _{1},x _{2},...,x _{ k }. In stem cell bioprocesses, the exact nature of this relationship is most often unknown. Instead, we generate an appropriate model of the system wherein we attempt to describe the output, or response, of the system based on potentially influential factors. This ‘response surface’ model is usually a firstorder (linear) or secondorder (quadratic) polynomial, and is generally based on continuous inputs such as temperature, serum concentration, levels of cytokines, and so on. Each variable is usually ‘coded’ so as to vary over the same range (e.g., {1,0,1}) with mean zero and the same standard deviation [22]. The appropriate experimental design and matched analysis together consitute response surface methodology (RSM).
Sequential experimentation
One of the most important characteristics of RSM is the ability to design and analyze experiments sequentially. Initially, the experimenter will have ideas about which factors likely influence the response. An early stage screening experiment can verify the role of each factor and eliminate unimportant ones. This has the effect of reducing the number of factors for future experiments to limit the number of required experimental runs. Similarly, the fitted model is used to determine if the collected data lie near to an ideal response or at some distance from it. This allows for an investigation of the problem space and identification of where subsequent regions of experimentation should take place. At this stage, widely spread data points aid in developing an overview of the process space (Fig. 2 b). The final round of experimentation takes place around the true optimum and is designed to generate a model that more accurately represents the true function within a reduced problem space (Fig. 3).
Modeling
Each iteration of the experimentation serves to improve our model of the process. Beginning with a screening experiment, the important inputs can be determined and we thus have the building blocks for the model. Mathematical modeling of biological systems maximizes the information available from limited experimental data, and can help answer complex outstanding biological questions and understand nonintuitive behaviour [23–25]. As mentioned, it is important that the experimental data points are carefully collected. In order to take advantage of the statistical analyses implicit in RSM, experimental runs need to be conducted to produce a model that has strong predictive capabilities.
Experimental designs
Factorial designs
In a factorial design, each experimental run consists of a combination of levels for each factor. A full factorial design requires each combination of each factor at every level to be run, resulting in l ^{k} experimental runs (often 2^{k} or 3^{k}). However, such designs can become very large in size. If we have two threelevel factors, the full factorial design consists of nine experimental runs. As we increase the number of threelevel factors, the full factorial requirement increases to 27, 81, 243, 729, 2187, etc. runs (Fig. 4).
A fractional factorial experiment makes use of a subset of these runs, l ^{k−p}, where p is the size of the fraction of the full factorial. Fractional factorial designs can be used to investigate the most important aspects of the design space with considerably less effort and cost than would be required for a full factorial experiment. In general, we choose a fractional factorial design where some of the high order interactions are assumed to be negligible, but we can still estimate main effects and lower order interactions. Provided the same signaling pathway is not targeted by multiple variables, we would not commonly expect third, fourth or higherorder interactions between the variables to significantly affect biological changes [26]. Instead, by modeling first and secondorder interactions, we capture the most critical components of the bioprocess.
Central composite designs
Moving from full or fractional factorial designs we begin to encounter fivelevel experimental designs commonly refered to as BoxWilson, or central composite designs (CCDs) [27]. These designs allow for the efficient estimation of second degree polynomial and quadratic responses [27]. Central composite designs attempt to balance the design, through the use of coded variables, to achieve rotability. By removing directional bias in the design, rotable designs predict output values with the same precision at all factor levels a constant distance from the centre of the design. These designs possess a high level of orthogonality, which means that each coefficient estimate is independent from one another [27]. Starting with a fractional factorial design, CCDs extend the range of each variable through socalled ‘star points’ that allow for the estimation of curvature. Therefore, CCDs are a fivelevel design, { −α,1, 0, 1, α}. Two important classes of CCD with regards to stem cell bioprocessing are those designs that limit the experimental space to known regions rather than extending α (star points) potentially outside of realistic ranges (e.g. negative cytokine concentrations). These are known as central composite inscribed (CCI; whereas the original designs were circumscribed) and facecentred (CCF) designs. Examples of CCD, CCI and CCF designs for two and three factors are shown in Fig. 4. Importantly, in all types of CCDs, the uncertainty of the model predictions increases markedly as factor levels approach the upper and lower ends of the ranges investigated [28]. This highlights the advantage of sequential experimentation to recentre the design and generate a more accurate model around the suspected optimum.
Advanced experimental designs
With continuing increases in computer power, more complex designs for nonstandard scenarios and models can also be produced. In the designs described above, the number of runs used is generally constrained by mathematical considerations. For example, in a fivefactor, twolevel factorial scenario, the full factorial design consists of 32 runs. It is trivial to construct half fraction factorial designs of 16 runs, or quarter fraction designs of eight runs. However, it is not easy to construct a design of say 15 runs using such methods. However, in socalled optimal design, an optimality criterion is selected, usually based upon the precision of the parameter estimates or model output. The computer is then used to carry out a search of possible designs for a set number of runs chosen by the user. This can be computationally intensive, but allows the user a much greater deal of flexibility in setting their design parameters. For example, any set number of runs can be chosen according to logistical constraints of the process or system being examined, and in situations where various factor level combinations are infeasible, irregular design spaces, which do not include such factor level combinations, can be constructed.
Further, when we wish to fit nonlinear/polynomial models (e.g., theoretically derived growth curves for biological processes) to our experimental data, an added complication to the design problem is that the optimal design will now depend upon the parameters of the underlying model. This poses a circular problem since we are wishing to construct a design to estimate the parameters of the underlying model, but we need to know the parameters of the underlying model in order to find the optimal design. A typical approach to such problems is to use Bayesian optimal design (e.g., [29]), in which a prior distribution has to be placed on the model parameters, expressing the user’s belief and uncertainty about the parameters before the data was observed. Such approaches can be carried out in a sequential manner so that at subsequent iterations of the design and analysis process, we can hone in on the salient regions of the design space and improve upon the quality of the fitted model.
Design of experiments and stem cell bioprocessing
Stem cell growth and expansion
Given the ability of DOE approaches to model complex behaviour, many aspects of stem cell bioprocessing would benefit from the application of these techniques. Although the adoption of DOE into stem cell bioprocessing has been limited, its use has started to expand in recent years. Of particular note are those investigations looking at stem cell production.
An early investigation into the 10day in vitro expansion of haematopoeietic stem and progenitor cells (HSCs/HPCs) isolated from adult mouse bone marrow used a twolevel full factorial design to screen the effects of cytokines, and the incubation temperature [30]. Following this initial screen, a more detailed analysis of interactive effects on the desired cell population was undertaken using response surface methodology [30]. This was used to develop an empirical model describing HSC repopulation, colony formation, and total cell expansions as a function of three cytokine concentrations. Each of the fractional factorial designs was composed of 16 experimental units plus four replicated points (center points), to obtain an independent estimate of the intrinsic variability (pure error) in the data [30]. Synergistic interactions between interleukin11 and flt3 ligand on total cell production was also detected, as was a negative thirdorder interaction between all three cytokines. These negative interactions reflect the fact that the combined effect on total cell and colonyforming cell production was less than the sum of their individual effects [30]. This study extended other single factor studies and identified important interactions in a complex multiple interacting cytokine culture system.
With the goal of defining the operating space for economic passaging of human ESCs, a threelevel, threefactor (i.e., 3^{3}) BoxBehnken experimental design was applied to evaluate the effects of seeding density, media volume and media exchange time [31]. Experimental data were subsequently used to model twoprocess responses: ESC expansion performance at the second passage and at harvest (24 h later) [31]. The authors found that lackoffit tests were not significant, indicating that additional variation in the residuals could not be removed with a better model [31]. Initially, three BoxBehnken RSM cell culture experiments, incorporating the chosen factors at softwarespecified design levels, were conducted over 36, 48 and 60h passage periods, although analysis of the models with a 48 and 60h passage period did not provide outcomes that met critical optimization criteria [31]. Interestingly, they applied mathematical multipleresponse optimization routine (desirability analysis) to visualize the region where both responses were simultaneously within optimization criteria [31]. While the authors of this paper acknowledged the use of T25 flasks during their ESC culture, they support the use of this method as a direct stepup to automated T175 processes, as the cells were passaged using a singlecell method amenable to automation.
It is indeed of critical importance to be able to automate the process, as traditional planar culture is labourintensive and will make CBTs unrealistically time consuming and expensive. Thomas et al. used an automated system combined with a full factorial design to optimize media concentrations for the expansion of human MSCs. Their use of a full factorial was necessitated by a need to avoid confounding interactions with main effects [32]. An alternative approach could have been an initial fractional factorial experiment, to identify those factors most important in the expansion of this cell population, before switching to a more refined, composite design that would permit investigation of both interactions and quadratic effects in the system. Nonetheless, this proved to be an interesting study that examined key components necessary in the expansion of MSCs including cell seeding density, serum percentage, media volume per flask, and culture time [32]. Interestingly, they found that seeding density and serum level had negative interactions, yet high levels of one or the other improved cell growth. The use of automation and robotic culture allowed for improved randomization of runs and removed many sources of variation from human processing of each flask.
While automated planar culture may prove sufficient for CBT development, particularly relating to monolayer tissues such as the retinal pigmented epithelium, the production of large numbers of stem cells has largely been left to stirred suspension bioreactors. Their capacity for empirical scale up, compared to other systems, and the ability to precisely regulate the culture environment in real time makes them ideal candidates for DOE applications. Because of variations in impeller design and the precise geometries of each bioreactor, little consistency is found between published protocols for the expansion of stem cells using bioreactor technologies. Hunt et al. undertook a full factorial design (3^{2}) to investigate the effects of inoculation density and agitation rate on the production of human ESCs. It was found that the interaction of these two factors had a significant effect on growth rate, and to a lesser extent the maximum density [33]. Interestingly, higher inoculation densities negatively affected the fold increase [33]. While this study was limited in its scope, it revealed important interacting effects that may not have been uncovered using a typical OFAT approach. In both planar cultures and stirred suspension bioreactor systems, DOE can be applied early on to understand the process and this may subsequently advise for or against one particular system. When a particular production system is chosen, further application of DOE will allow for optimization of the bioprocess depending on the specific outputs desired.
Biomaterials
Most often, experimental design has been applied to biotechnologies that have considerable chemical and engineering components. For instance, Zhou et al. used several designs to optimize the degradation of gelatinPEG composite hydrogels [34]. After first screening factors with a PlackettBurman design, these same factors were used in a BoxBehnken central composite design to understand the interaction between them and generate response surfaces for systematic optimization [34]. While they did analyze the survival of MSCs seeded onto these hydrogels, only the degradation rate was used as an output parameter. With the model established, it would have been interesting to include viability of MSCs seeded as a response output to better understand the design space. Nih et al. also used a DOE approach to create a complex in vitro matrix environment with varying peptide motifs and growth factors [35]. Neural precursor cells derived from iPSCs were encapsulated in hydrogels and exposed to combinations of brainderived neurotrophic factor (BDNF) and BMP4 using in vitro neural cell survival as an output before the optimized gels were tested in vivo in an induced stroke mouse model [35]. As a brief data communication, there was little discussion of the effects of using DOE to generate a hydrogel, although heparin modification of the hydrogel interacted with the concentrations of growth factors, showing that low BDNF and low BMP4 was beneficial when heparin was bound as opposed to high BDNF in nonheparin conditions [35].
A more thorough investigation of hydrogel formulation was demonstrated using modular selfassembling peptide ligands to generate synthetic extracellular matrices (ECMs) [36]. Jung et al. exploited the modularity of the system to undertake factorial experiments and RSM, and avoid the compositional drift that occurs when changing the concentrations of one molecule without affecting the concentration of others. They first began by testing each ligand alone to determine independent effects on endothelial growth. This was followed by a factorial design to identify interactions between ligands before using a CCI design to optimize their formulation [36]. At each stage of experimentation, the design space was shifted towards the perceived optimum. This study elegantly demonstrated a sequential experimentation strategy that was able to significantly improve cell growth on their optimized synthetic ECM upwards of 30% over their preoptimized formula [36]. Interactions between nearly all ligands was found to be significant, with the strength of the effect of one ligand dependant on the concentration of another [36], lending more weight to the desirability of avoiding OFAT approaches to optimize biomaterial formulations.
Stem cell differentiation
Whereas most multifactorial studies look at stem cell expansion and survival, Chang and Zandstra, and Glaser et al. have showed that models of the differentiation process can also be fitted and optimized using DOE techniques.
Directing the differentiation of ESCs towards a definitive endodermal fate, two rounds of experiments using factors from the literature were conducted [37]. These were: glucose, insulin, basic fibroblast growth factor (bFGF), epidermal growth factor (EGF) and retinoic acid (RA), and the output of the system was measured in terms of the percentage of cytokeratin8 and hepatocyte nuclear factor3 β doublepositive cells obtained after thirteen days [37]. After identifying the most important factors in a twolevel, fivefactor factorial experiment (2^{5}), the authors conducted a refined threelevel, twofactor factorial experiment (2^{3}) to identify synergistic and quadratic effects of RA and EGF, holding the other factors fixed. As this study’s purpose was to identify a quantitative screening technology, differentiation protocols were not further optimized [37]. This study, did nevertheless reveal interesting interactions between these factors that had varying effects on each of the different outputs, namely total cells, total endoderm cells and the percentage of endoderm cells with RA and the interaction between glucose and RA negatively impacting all three processes [37].
Using their previously published chemically defined protocol for generating endothelial cells from ESCs, Glaser et al. included a number of factors in their optimization: time, cell seeding density, matrix substrates and cytokines [25]. They used a two stage differentiation protocol to direct endothelial cell fate, first generating mesodermal vascular progenitor cells (VPCs) before final endothelial cell (EC) differentiation, each run as a full factorial experiment and assessed by the expression of Flk1/KDR ^{+} VPCs (mouse and human marker, respectively) and VEcadherin ^{+} ECs [25]. Fibronectin and seeding at 10,000 cells/cm^{2} was shown to generate the greatest number of VPCs in both human and mouse ESCs. Interestingly, this group also assessed the importance of time in differentiating pluripotent cells and found that induction of Flk1/KDR occurred within a short time window before receding [25]. Lower seeding of mouse VPCs (500010,000 cells/cm^{2}) on fibronectin with high concentrations of bFGF (50 ng/ml) resulted in up to 95% ECs, whereas human VPCs generated ECs at a rate of 57% when seeded on gelatin with considerably lower bFGF (10 ng/ml). While vascular endothelial growth factor was shown to be statistically unimportant at all stages of EC differentiation, significant interaction effects between seeding density or bFGF concentrations and culture matrix were observed [25]. Followup experiments using the generated modelbased predictions were not tested directly, but rather lined up with the closest experimental run to determine the optimal conditions for generation of ECs. However, this investigation did provide a considerably larger set of variables to be optimized for directing stem cell differentiation.
Conclusions
A major strength of DOE methodology – and RSM in particular – lies in the ability to build on carefully designed experiments in a sequential manner. In stem cell bioprocessing, these sequential experiments can lead to the construction of an empirical model that can elucidate fundamental processes related to cell biology as well as provide a foundation from which future experiments and translational research can take place. Generating mathematical models of the process with carefully planned experiments maximizes information about the system.
As detailed above, models of a given system are of great value to understanding the nature of stem cell biology, and have revealed important insights that can be missed with traditional OFAT methods of experimentation that are less able to study interactive effects between various growth parameters [30]. When applied to the complex systems of stem cell biology, DOE provides an important tool to unravel important interactions. Equally important in science more generally is the ability for experiments to be replicated. Understanding the design space, the importance of specific parameters on the outcome and how robust the entire process is, provides guidance on the reproducibility of the system. Adoption of DOE techniques to help model the system inherently provides a means to test sensitivity and an understanding of how reproducible a given result is likely to be. This in turn will facilitate the translation of fundamental research into viable CBTs. Industrial processes, including the production of cells as therapies, will require robust operating parameters to deal with inevitable variation in batches of input cells, for example. Understanding the system’s sensitivity, or pressure points, is necessary to engineer safeguards preventing failure during production runs.
Continued research into stem cell bioprocesses will greatly benefit from the application of DOE methods. There are, however, still challenges with its implementation in a high throughput manner, particularly with regard to identifying suitable cell outputs, such as marker expression or functional assays. Traditional assessment of cell behaviour by immunostaining, for example, are generally considered unsuitable for large scale screens. However, recent advances in highcontent screening have begun to make this a viable analytical method [37, 38]. Development of biosensors and ’omics technologies and their integration into stem cell bioprocessing pipelines will help to overcome these challenges. Coupled with real time monitoring of bioreactor cultures and automation of routine cell culturing procedures, it should soon be possible to screen large numbers of inputs to generate robust stem cell bioprocesses built on DOE methodology. The use of DOE in other bioprocessing fields such as the production of enzymes and other proteins has continued to grow [39]. As CBTs move towards the clinic, the incorporation of DOE into stem cell bioprocessing will provide a stable foundation upon which therapeutic applications may confidently by constructed.
Abbreviations
 bFGF:

Basic fibroblast growth factor
 CBT:

Cell based therapy
 CCD:

Central composite design
 CCF:

Central composite facecentred
 CCI:

Central composite inscribed
 DOE:

Design of experiments
 EC:

Endothelial cell
 ECM:

Extracellular matrix
 EGF:

Epidermal growth factor
 ESC:

Embryonic stem cell
 HPC:

Haematopoeietic progenitor cell
 HSC:

Haematopoeietic stem cell
 OFAT:

Onefactoratatime
 PSC:

Pluripotent stem cell
 RSM:

Response surface methodology
 RA:

Retinoic acid
 VPC:

Vascular progenitor cell
References
Song WK, Park KM, Kim HJ, Lee JH, Choi J, Chong SY, Shim SH, Del Priore LV, Lanza R. Treatment of macular degeneration using embryonic stem cellderived retinal pigment epithelium: Preliminary results in asian patients. Stem Cell Rep. 2015. doi:10.1016/j.stemcr.2015.04.005.
Schwartz SD, Hubschman JP, Heilwell G, FrancoCardenas V, Pan CK, Ostrick RM, Mickunas E, Gay R, Klimanskaya I, Lanza R. Embryonic stem cell trials for macular degeneration: a preliminary report. Lancet. 2012; 379(9817):713–20. doi:10.1016/S01406736(12)600282.
Zarbin M. Cellbased therapy for degenerative retinal disease. Trends Mol Med. 2016; 22(2):115–34. doi:10.1016/j.molmed.2015.12.007.
Jenkins MJ, Farid SS. Human pluripotent stem cellderived products: advances towards robust, scalable and costeffective manufacturing strategies. Biotechnol J. 2015; 10(1):83–95. doi:10.1002/biot.201400348.
French A, Bravery C, Smith J, Chandra A, Archibald P, Gold JD, Artzi N, Kim HW, Barker RW, Meissner A, Wu JC, Knowles JC, Williams D, GarcíaCardeña G, Sipp D, Oh S, Loring JF, Rao MS, Reeve B, Wall I, Carr AJ, Bure K, Stacey G, Karp JM, Snyder EY, Brindley DA. Enabling consistency in pluripotent stem cellderived products for research and development and clinical applications through material standards. Stem Cells Transl Med. 2015; 4(3):217–3. doi:10.5966/sctm.20140233.
Campbell A, Brieva T, Raviv L, Rowley J, Niss K, Brandwein H, Oh S, Karnieli O. Concise review: Process development considerations for cell therapy. Stem Cells Transl Med. 2015; 4(10):1155–63. doi:10.5966/sctm.20140294.
Kirouac DC, Zandstra PW. The systematic production of cells for cell therapies. Cell Stem Cell. 2008; 3(4):369–81. doi:10.1016/j.stem.2008.09.001.
Hentze H, Soong PL, Wang ST, Phillips BW, Putti TC, Dunn NR. Teratoma formation by human embryonic stem cells: evaluation of essential parameters for future safety studies. Stem Cell Res. 2009; 2(3):198–210. doi:10.1016/j.scr.2009.02.002.
Ungrin MD, Clarke G, Yin T, Niebrugge S, Nostro MC, Sarangi F, Wood G, Keller G, Zandstra PW. Rational bioprocess design for human pluripotent stem cell expansion and endoderm differentiation based on cellular dynamics. Biotechnol Bioeng. 2012; 109(4):853–66. doi:10.1002/bit.24375.
Mason C, Dunnill P. Quantities of cells used for regenerative medicine and some implications for clinicians and bioprocessors. Regen Med. 2009; 4(2):153–7. doi:10.2217/17460751.4.2.153.
Widdowson EM, Crabb DE, Milner RD. Cellular development of some human organs before birth. Arch Dis Child. 1972; 47(254):652–5.
Munafò MR, Nosek BA, Bishop DVM, Button KS, Chambers CD, Percie du Sert N, Simonsohn U, Wagenmakers EJ, Ware JJ, Ioannidis JPA. A manifesto for reproducible science. Nat Hum Behav. 2017; 1:0021.
Placzek MR, Chung IM, Macedo HM, Ismail S, Mortera Blanco T, Lim M, Cha JM, Fauzi I, Kang Y, Yeo DCL, Ma CYJ, Polak JM, Panoskaltsis N, Mantalaris A. Stem cell bioprocessing: fundamentals and principles. J R Soc Interface. 2009; 6(32):209–32. doi:10.1098/rsif.2008.0442.
Lawson J. Design and Analysis of Experiments with R. Boca Raton: CRC Press, Taylor & Francis Group; 2015.
Burridge PW, Matsa E, Shukla P, Lin ZC, Churko JM, Ebert AD, Lan F, Diecke S, Huber B, Mordwinkin NM, Plews JR, Abilez OJ, Cui B, Gold JD, Wu JC. Chemically defined generation of human cardiomyocytes. Nat Methods. 2014; 11(8):855–60. doi:10.1038/nmeth.2999.
Lian X, Bao X, AlAhmad A, Liu J, Wu Y, Dong W, Dunn KK, Shusta EV, Palecek SP. Efficient differentiation of human pluripotent stem cells to endothelial progenitors via smallmolecule activation of wnt signaling. Stem Cell Rep. 2014; 3(5):804–16. doi:10.1016/j.stemcr.2014.09.005.
Idelson M, Alper R, Obolensky A, BenShushan E, Hemo I, YachimovichCohen N, Khaner H, Smith Y, Wiser O, Gropp M, Cohen MA, EvenRam S, BermanZaken Y, Matzrafi L, Rechavi G, Banin E, Reubinoff B. Directed differentiation of human embryonic stem cells into functional retinal pigment epithelium cells. Cell Stem Cell. 2009; 5(4):396–408. doi:10.1016/j.stem.2009.07.002.
Maruotti J, Sripathi SR, Bharti K, Fuller J, Wahlin KJ, Ranganathan V, Sluch VM, Berlinicke CA, Davis J, Kim C, Zhao L, Wan J, Qian J, Corneo B, Temple S, Dubey R, Olenyuk BZ, Bhutto I, Lutty GA, Zack DJ. Smallmoleculedirected, efficient generation of retinal pigment epithelium from human pluripotent stem cells. Proc Natl Acad Sci U S A. 2015. doi:10.1073/pnas.1422818112.
Diekmann U, Lenzen S, Naujok O. A reliable and efficient protocol for human pluripotent stem cell differentiation into the definitive endoderm based on dispersed single cells. Stem Cells Dev. 2015; 24(2):190–204. doi:10.1089/scd.2014.0143.
KaufmanFrancis K, Goh HN, Kojima Y, Studdert JB, Jones V, Power MD, Wilkie E, Teber E, Loebel DAF, Tam PPL. Differential response of epiblast stem cells to nodal and activin signalling: a paradigm of early endoderm development in the embryo.Philos Trans R Soc Lond B Biol Sci. 2014;369(1657). doi:10.1098/rstb.2013.0550.
Gadue P, Huber TL, Paddison PJ, Keller GM. Wnt and tgfbeta signaling are required for the induction of an in vitro model of primitive streak formation using embryonic stem cells. Proc Natl Acad Sci U S A. 2006; 103(45):16806–11. doi:10.1073/pnas.0603916103.
Myers R. Response surface methodology: process and product optimization using designed experiments. Hoboken: Wiley; 2016.
McConnell MJ, MacMillan HR, Chun J. Mathematical modeling supports substantial mouse neural progenitor cell death. Neural Dev. 2009; 4:28. doi:10.1186/17498104428.
White DE, Sylvester JB, Levario TJ, Lu H, Streelman JT, McDevitt TC, Kemp ML. Quantitative multivariate analysis of dynamic multicellular morphogenic trajectories. Integr Biol. 2015. doi:10.1039/C5IB00072F.
Glaser DE, Turner WS, Madfis N, Wong L, Zamora J, White N, Reyes S, Burns AB, Gopinathan A, McCloskey KE. Multifactorial optimizations for directing endothelial fate from stem cells. PLoS ONE. 2016; 11(12):0166663. doi:10.1371/journal.pone.0166663.
Macke JH, Opper M, Bethge M. Common input explains higherorder correlations and entropy in a simple model of neural population activity. Phys Rev Lett. 2011; 106:208102. doi:10.1103/PhysRevLett.106.208102.
Box GEP, Wilson KB. On the experimental attainment of optimum conditions. J R Stat Soc Series B (Methodological). 1951; 13(1):1–45.
Box GEP, Draper NR. Empirical Modelbuilding and Response Surfaces. New York: Wiley; 1987. https://lccn.loc.gov/86004064.
Chaloner K, Verdinelli I. Bayesian experimental design: a review. Stat Sci. 1995; 10(3):273–304. doi:10.1214/ss/1177009939.
Audet J, Miller CL, Eaves CJ, Piret JM. Common and distinct features of cytokine effects on hematopoietic stem and progenitor cells revealed by doseresponse surface analysis. Biotechnol Bioeng. 2002; 80(4):393–404. doi:10.1002/bit.10399.
Ratcliffe E, Hourd P, GuijarroLeach J, Rayment E, Williams DJ, Thomas RJ. Application of response surface methodology to maximize the productivity of scalable automated human embryonic stem cell manufacture. Regen Med. 2013; 8(1):39–48. doi:10.2217/rme.12.109.
Thomas RJ, Hourd PC, Williams DJ. Application of process quality engineering techniques to improve the understanding of the in vitro processing of stem cells for therapeutic use. J Biotechnol. 2008; 136(34):148–55. doi:10.1016/j.jbiotec.2008.06.009.
Hunt MM, Meng G, Rancourt DE, Gates ID, Kallos MS. Factorial experimental design for the culture of human embryonic stem cells as aggregates in stirred suspension bioreactors reveals the potential for interaction effects between bioprocess parameters. Tissue Eng Part C Methods. 2014; 20(1):76–89. doi:10.1089/ten.tec.2013.0040.
Zhou N, Liu C, Lv S, Sun D, Qiao Q, Zhang R, Liu Y, Xiao J, Sun G. Degradation prediction model and stem cell growth of gelatinpeg composite hydrogel. J Biomed Mater Res A. 2016; 104(12):3149–56. doi:10.1002/jbm.a.35847.
Nih LR, Moshayedi P, Llorente IL, Berg AR, Cinkornpumin J, Lowry WE, Segura T, Carmichael ST. Engineered ha hydrogel for stem cell transplantation in the brain: Biocompatibility data using a design of experiment approach. Data Brief. 2017; 10:202–9. doi:10.1016/j.dib.2016.11.069.
Jung JP, Moyano JV, Collier JH. Multifactorial optimization of endothelial cell growth using modular synthetic extracellular matrices. Integr Biol (Camb). 2011; 3(3):185–96. doi:10.1039/c0ib00112k.
Chang KH, Zandstra PW. Quantitative screening of embryonic stem cell differentiation: Endoderm formation as a model. Biotech Bioeng. 2004; 88(3):287–98. doi:10.1002/bit.20242.
Kumar N, Richter J, Cutts J, Bush KT, Trujillo C, Nigam SK, Gaasterland T, Brafman D, Willert K. Generation of an expandable intermediate mesoderm restricted progenitor cell line from human pluripotent stem cells. Elife. 2015;4. doi:10.7554/eLife.08413.
Mandenius CF, Brundin A. Bioprocess optimization using designofexperiments methodology. Biotechnol Prog. 2008; 24(6):1191–203. doi:10.1002/btpr.67.
Acknowledgements
Not applicable.
Funding
DT is supported by funds from the National Science and Engineering Research Council (NSERC) and the Stem Cell Network (SCN). The funding bodies supported DT during the writing of this manuscript but were not otherwise involved in its development.
Availability of data and materials
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Author information
Authors and Affiliations
Contributions
All authors collaboratively wrote and edited the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
No ethics approval required.
Consent for publication
All authors consent to the publication of this work.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Toms, D., Deardon, R. & Ungrin, M. Climbing the mountain: experimental design for the efficient optimization of stem cell bioprocessing. J Biol Eng 11, 35 (2017). https://doi.org/10.1186/s130360170078z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s130360170078z
Keywords
 Design of experiments
 DOE
 Stem cell
 Bioprocessing