# Lipase-Catalyzed Baeyer-Villiger Oxidation of Cellulose-Derived Levoglucosenone into (*S*)-γ-Hydroxymethyl-α,β-Butenolide: Optimization by Response Surface Methodology

^{1}Chaire Agro-Biotechnologies Industrielles, AgroParisTech, Reims, France^{2}UMR GENIAL, AgroParisTech, Institut National de la Recherche Agronomique, Université Paris-Saclay, Massy, France^{3}UMR 1318 IJPB, AgroParisTech, Institut National de la Recherche Agronomique, Université Paris-Saclay, Versailles, France^{4}UMR 782 GMPA, AgroParisTech, Institut National de la Recherche Agronomique, Université Paris-Saclay, Thiverval-Grignon, France

Cellulose-derived levoglucosenone (**LGO**) has been efficiently converted into pure (*S*)-γ-hydroxymethyl-α,β-butenolide (**HBO**), a chemical platform suited for the synthesis of drugs, flavors and antiviral agents. This process involves two-steps: a lipase-catalyzed Baeyer-Villiger oxidation of **LGO** followed by an acid hydrolysis of the reaction mixture to provide pure **HBO**. Response surface methodology (RSM), based on central composite face-centered (CCF) design, was employed to evaluate the factors effecting the enzyme-catalyzed reaction: pka of solid buffer (7.2–9.6), **LGO** concentration (0.5–1 M) and enzyme loading (55–285 PLU.mmol^{-1}). Enzyme loading and pka of solid buffer were found to be important factors to the reaction efficiency (as measured by the conversion of **LGO**) while only the later had significant effects on the enzyme recyclability (as measured by the enzyme residual activity). **LGO** concentration influences both responses by its interaction with the enzyme loading and pka of solid buffer. The optimal conditions which allow to convert at least 80% of **LGO** in 2 h at 40°C and reuse the enzyme for a subsequent cycle were found to be: solid buffer pka = 7.5, **[LGO]** = 0.50 M and 113 PLU.mmol^{-1} for the lipase. A good agreement between experimental and predicted values was obtained and the model validity confirmed (*p* < 0.05). Alternative optimal conditions were explored using *Monte Carlo* simulations for risk analysis, being estimated the experimental region where the **LGO** conversion higher than 80% is fulfilled at a specific risk of failure.

## 1. Introduction

Driving forces for the global trend of using clean renewable sources in the production of valuable chemical are the inevitable decline of fossil fuels and a more demanding legislation regarding the disposal of industrial wastes. In this context, lignocellulosic biomass is envisaged as an interesting source to produce highly valuable synthons due to its low cost and high availability. Thermochemical processing, phosphoric acid-catalyzed pyrolysis in particular, is the simplest way to efficiently convert lignocellulosic biomass into its degradation products (Huber et al., 2006; Babu, 2008). However, the use of strong acids is one of the drawbacks of such process, mainly associated to its inefficient recovery and environmental concerns related to water and air pollution (Wei et al., 2014). Furacell Process developed by Circa is claimed to be an easily scalable catalytic thermochemical process with no harmful effluents which converts renewable cellulose into a very simple mixture of easily separable products, comprising levoglucosenone (**LGO**), char and water reusable in the process (Court et al., 2012).

**LGO** is a multifunctional C6-monomer suited for organic synthesis (Miftakhov et al., 1994; Sarotti et al., 2012). Indeed, **LGO** is a chiral synthon that can be used in the synthesis of a wide variety of biologically active compounds such as pharmaceutical ingredients, agrochemicals, polymers, or specialty chemicals (Miftakhov et al., 1994; Budarin et al., 2011). Among them, (*S*)-γ-hydroxymethyl-α,β-butenolide (aka **HBO**) is certainly the most interesting since it is a precursor of many drugs (Enders et al., 2002), flavors (Kawakami et al., 1990), and antiviral agents (Flores et al., 2011).

Kawakami (Kawakami et al., 1990) and Paris (Paris et al., 2013) reported the most efficient methods for **HBO** synthesis, being obtained high overall yields (ca. 80–90%, Figure 1).

**Figure 1. Baeyer-Villiger oxidation of LGO into HBO and FBO following Kawakami (Kawakami et al., 1990), Paris (Paris et al., 2013), and Flourat's (Flourat et al., 2014) procedures**.

In both methods, **HBO** is produced through a Baeyer-Villiger oxidation of **LGO** followed by an acid hydrolysis to convert formate lactone (**FBO**), a reaction by-product, into **HBO**. Kawakami*et al*. (Kawakami et al., 1990) used peracids (such as peracetic acid or *m*-chloroperbenzoic acid) and Me_{2}S during 48 h followed by an HCl-mediated hydrolysis to provide pure **HBO**. Paris et al. (2013) developed a method to produce **HBO** in only 4 h using metal-based zeolites as catalysts to oxidize **LGO** and Amberlyst-15 as acid resins to promote the subsequent acid hydrolysis. However, the complexity and high cost inherent to zeolites synthesis may compromise the technical and economic viability of such process (Perot and Guisnet, 1990).

The use of lipases as biocatalyst seemed to be a promising greener alternative, providing, in addition, a cost-efficient transformation. In a recent publication (Flourat et al., 2014), we reported an efficient chemo-enzymatic process for the production of **HBO** with high yields (> 80%). The first step involved a Baeyer-Villiger oxidation of **LGO** mediated by a commercial immobilized lipase from *Candida antarctica* (CAL-B, Novozyme^{®} 435), under the presence of a solid buffer and using ethyl acetate and hydrogen peroxide as acyl donor and oxidant, respectively (Figure 2). After 2 h of reaction, the resulting mixture of **FBO** and **HBO** was hydrolyzed under acid conditions, using Amberlyst-15, to provide pure **HBO**.

**Figure 2. Lipase-mediated Baeyer-Villeger oxidation of LGO into HBO and FBO using AcOEt as an acyl donor and H _{2}O_{2} as an oxidant**.

Response surface methodology (RSM) is an effective tool for optimizing a range of processes (Montgomery, 2008) and evaluate the interactions of multiple parameters, being established in addition a prediction model based on statistics (Montgomery, 2008). RSM based on central composite face-centered (CCF) design was employed to found the conditions which allow to maximize simultaneously the conversion of **LGO** and the enzyme residual activity. The present work focuses on the study of reaction parameters (solid buffer pka, **LGO** concentration and enzyme loading) that may affect the conversion of **LGO** into **HBO** and the enzyme recyclability (measured by the enzyme residual activity) in order to establish their relationships and, if possible, further optimize the conversion obtained in an earlier authors' publication (Flourat et al., 2014); 83%. The temperature was set at 40°C to minimize the **LGO** degradation (reactant specie due to its acetal moiety) and to avoid the risk of explosion linked to the *in-situ* formation of peracetic acid. The RSM model allowed explore alternative optimal conditions and using *Monte Carlo* simulations the risk of failure could be determined.

## 2. Experimental

### 2.1. Reagents

Cellulose-derived levoglucosenone (**LGO**) was kindly provided by CIRCA Group (Knoxfield Victoria, Australia). Novozyme^{®} 435 (CAL-B, lot no. SLBF9301 V, 9120 PLU g^{-1}), hydrogen peroxide (50% w/w), biological solid buffers (MOPS, TAPS, CAPSO, HEPES and their sodium salt), lauric acid (99%) and 1-propanol were purchased from Sigma-Aldrich. Ethyl acetate (analytical grade), hexane (ACS reagent) and acetonitrile (HPLC grade) were purchased from Thermofisher Scientific. Ultra-pure laboratory grade water (MilliQ, 18.2 megaOhms, 25°C) was employed for HPLC analysis.

### 2.2. Baeyer-Villiger Oxidation of (-)-Levoglucosenone (LGO)

Lipase-mediated Baeyer-Villiger oxidation of **LGO** is schematically represented in Figure 2. Reactions were carried out in sealed erlenmeyers (50 mL) to avoid solvent evaporation. **LGO** (250 mg, 1 equiv.) was dissolved in different volumes of ethyl acetate (C = 0.5–1 M). Biological solid buffers along with their sodium form were added (20 mg mL^{−1} each) in order to set the pka (7.5 < pka < 9.6) and as consequence control the protonation state of the enzyme. It should be noted that unlike the organic soluble buffers, each buffer pair will set a fixed value of the relevant ionization parameter (Partridge et al., 2001). Different amounts of lipase (CAL-B) were added accordingly to those indicated in Table 2, followed by the addition of hydrogen peroxide (1.2 equiv., 50% w/w) at once. The mixture was incubated at 40°C and stirred using an orbital shaker (ThermoScientific, MaxiQ400) at 250 rpm for 2 h. Samples (10 μ*L*) were collected and diluted with 1.5 mL of acetonitrile for **LGO** quantification by HPLC analysis. The conversion of **LGO** was defined as the molar ratio of the remaining **LGO** and the **LGO** added in the beginning.

### 2.3. Determination of Enzyme Residual Activity

After reaction, the enzyme was recovered by filtration, washed with ethyl acetate (10 mL) and hexane (10 mL), dried in an oven at 40°C for 1 h, and then kept in dessicator overnight and under vacuum. Three enzyme samples (ca. 14 mg each) were collected, weighed in a 20 mL vial and kept at 60°C. A solvent free equimolar solution of lauric acid and 1-propanol with 3% w/w of water was prepared and incubated at 60°C. After 1 h, 5 g of the above solution were added to the enzyme and stirred in an incubating orbital minishaker (VWR) at 400 rpm and 60°C for 15 min. Samples (2 μ*L*) were collected, weighed in a GC-vial and diluted with 1 mL of hexane. Lauric acid conversion was determined by GC-MS. Unit definition: 1 PLU = 1 μ*mol* of 1-propyl laurate formed per gram of enzyme per minute at 60°C.

### 2.4. GC-MS Method

Lauric acid (used to determine the enzyme residual activity) was quantified by a GC-MS system which consisted of an Agilent GC 5975 coupled with MS 7890 in electron impact mode with electron energy set at 70 eV and a mass range at *m/z* (30–350 amu). A HP5-MS capillary column (Agilent, 30 m × 0.25 mm, 0.25 μ*m*) was used for chromatographic separation. Injection was performed at 280°C in split mode (40:1), being injected 1 μ*L* of each sample. The oven temperature program was the following: from 60°C held for 1 min, then rinse until 325°C at 20°C/min with a 5 min hold. Hydrogen flow rate was set at 1.2 mL/min. The mass detector was set as follows: source and quad temperatures at 230 and 150°C, respectively. A calibration curve was performed each time, with pure lauric acid (0.2–4.5 mg/mL) in hexane. Typical retention time for lauric acid and 1-propyl laurate were 7.22 min and 7.92 min, respectively.

### 2.5. HPLC Method

(-)-Levoglucosenone (**LGO**) was quantified by HPLC (Thermofisher Ultimate 3000) on a Syncronis aQ column (250 × 4.6 mm, 5 μ*m*, Thermoscientific) with Milli-Q water (solvent A) and acetonitrile (solvent B) as mobile phase. The flow rate and temperature were set at 0.8 mL/min and 30°C, respectively. The gradient elution was as follows: isocratic at 85% A (0–5 min), from 85 to 90% A (5–10 min), isocratic at 90% A (10–15 min), from 90% A to 85% A (15–20 min). Adequate detection was obtained with a diode array detector (DAD) set at 220 nm. Injection volume in HPLC injector was set at 10 μ*L*. Samples for HPLC analysis were prepared by diluting 10 μ*L* of the reaction mixture in 1.5 mL of acetonitrile. The peaks of **LGO** were identified and quantified using a standard curve prepared in acetonitrile. Typical retention time for **HBO**, **FBO** and **LGO** were 3.73, 3.87, and 8.40 min, respectively.

### 2.6. Experimental Design and Statistical Analysis

RSM, based on a 3-factor-3-level CCF design, was employed to determine the parameters affecting the Baeyer-Villiger oxidation of **LGO** as well as to found the optimal set of conditions. CCF is a good choice from a practical point of view, even inducing some correlations between the quadratic terms. These correlations will induce a slight increase in the confidence intervals but the model will still be able to estimate the quadratic effects.

A short reaction time of 2 h was set in order to minimize the residence time of the enzyme and limit the contact with the inhibitors as well as to minize the residence time of **LGO** inside the reactor and, thus, minimizing **LGO** degradation. The temperature was set at 40°C for the same reason but also to minimize the risk of explosion linked to the presence of peracetic acid (intermediary specie).

Table 1 presents the independent variables (*x*_{i}), levels and experimental design in terms of uncoded and coded (transformation of each studied real value into coordinates inside a scale with dimensionless values). The variation ranges of each variable were fixed taking into account specific constraints:

- Following the manufacture (Novozymes, 2015) the optimal pka for the lipase CAL B lies between 5 and 9. However, the pka buffer cannot be acid (<7) in order to avoid the degradation of **LGO** (acetal moieties reactive under such conditions). Therefore, the effect of pka in the enzyme protonation state was assessed with MOPS (pka = 7.2) and CAPSO (pka = 9.6), using TAPS (pka = 8.4) as a central point.

- A lower solvent volume to obtain higher **LGO** concentrations implies the presence of inhibitors (peracetic acid, acetic acid and formic acid) at higher concentrations. Moreover, the use of high **LGO** concentrations (> 1 M) leads to a decrease in the **LGO** conversion and (mainly) in the enzyme activity (Flourat et al., 2014). Instead, a low concentration of substrates implies the use of large quantities of solvents, which may compromise the economic sustainability of a process. Thus, **LGO** concentration was varied between 0.5 and 1 M.

- Enzymes must be used at catalytic quantities to assure the economic sustainability of the process. Thus, the enzyme loading was varied between 55 and 285 PLU.mmol^{−1}, which corresponds to 2 and 10 % (w/w), respectively.

Table 2 presents the runs set by a CCF design and the respective experimental responses obtained for **LGO** conversion (*Y*_{1}) and enzyme residual activity (*Y*_{2}). CCF design consists of 8 factorial points, 6 axial points (two axial points on the axis of each design variable at a distance 1 from the design point) and 3 central points (Eriksson et al., 2008), making a total of 17 runs. Duplicates in each point were performed in order to obtain a more precise model (total of 34 runs). To avoid bias, all runs were performed in a totally random order. Additional runs (intermediary points) were performed to validate the model and later added to the experimental data for model refining.

In order to find a suitable approximation for the true functional relationship between independent variables and the response surface, a second-order polynomial Equation (**1**) was used, being expressed as:

where *Y*_{i} represents the response *i* (**LGO** conversion and enzyme residual activity in this study), *x*_{i} are the coded independent variables, β_{0} is a constant coefficient, and β_{ki}, β_{kii}, and β_{kij} are the linear, quadratic, and interaction coefficients, respectively.

The variable levels *X*_{i} were scaled and centered (coded) according to the equation below such that *X*_{0} corresponded to the central value:

where *x*_{i} is the dimensionless value of an independent variable, *X*_{i} is the real value of an independent variable, *X*_{0} is the real value of an independent variable at the center point, and Δ*X*_{i} is the step change.

Modde v.10.1 sofware (Umetrics AB, Sweden) was used to generate the CCF design and analyze experimental data by RSM. Regression coefficients were determined by multiple linear regression (MLR). The significant parameters in the model were found by analysis of their *p*-value. The model validation was based on the variance (ANOVA) for each response, namely, by the analysis of *R*^{2}, *Q*^{2}, and *lack of fit* (*LOF*) test. *R*^{2} measures how well the regression model fits the experimental data, *Q*^{2} shows an estimate of the future prediction precision, and, *LOF* assess whether the models error is comparable to the replicate error.

## 3. Results and Discussion

### 3.1. Preliminary Study

In our previous study (Flourat et al., 2014) we reported the ability of commercial immobilized *Candida antarctica* lipase (Novozyme^{®} 435) to oxidize **LGO** (Figure 2). High conversions of **LGO** (> 83%) were obtained after 2 h, using a low enzyme loading (113 PLU.mmol^{−1}) and a **LGO** concentration of 0.75 M. Such mild conditions allowed to reuse the enzyme for a second reaction cycle without a significant loss of enzyme activity.

The optimization was conducted using the One Variable At a Time (OVAT) method, being optimized the temperature, **LGO** concentration, enzyme loading, nature of buffer (solid/liquid and pka) as well as the nature and quantity of the oxidizing agent.

Results showed that at low temperatures (<40°C) the final **LGO** conversion is compromised, being, although, similar at 40 and 60°C (after 8 h of reaction). However, there are other reasons to remain at 40°C than an obvious energy saving: the **LGO** degradation (due to its acetal moiety) as well as the risk of explosion linked to the presence of peracetic acid (intermediary specie). For all this reasons, the temperature was set at 40°C in this work even having to ignore interactions with other variables.

Moreover, it was showed that a low-water media (resulted in using solid buffers) leaded to higher conversion of **LGO** probably due to a higher stability of the enzyme and inhibition of secondary reactions. Unfortunately in this reaction, the water activity cannot be actively controlled due to high water content of commercial hydrogen peroxide solution (50% w/w).

As summary, the previous study (Flourat et al., 2014) suggested that only three variables had significant effect on the **LGO** conversion. These variables were the buffer pka (*x*_{1}), the enzyme loading (*x*_{2}) and the **LGO** concentration (*x*_{3}). The level values of variables were chosen in such a way that their limits were as wide as possible and that contains the optimal values determined using the OVAT method (Table 1). The levels of the three other variables with a small effect on the oxidation reaction were fixed as follow: temperature: 40°C, reaction time: 2 h, nature and quantity of the oxidizing agent: hydrogen peroxide (H_{2}O_{2}) at 1.2 equiv. (relative to **LGO**).

### 3.2. Assessing the Design Orthogonality

The condition number is a parameter that assesses the sphericity of the design, thus, the orthogonality. Formally, the condition number is the ratio of the largest and the smallest singular values of the X-matrix, that is, the matrix of the factors extended with higher order terms. As a thumb rule for an optimization (Eriksson et al., 2008), the condition number should be lower than 8. The condition number obtained in our model was 4.1 for **LGO** conversion and 1.6 for the enzyme activity. Such low values demonstrate well the orthogonality of the selected design, thus, the adequacy of the design selected.

As can be seen in Table 2, additional runs were performed. Such procedure allowed improving the model quality at an exploratory level, however, without compromising the model orthogonality.

### 3.3. Optimization by Design of Experiments (DOE)

The analysis of experimental data through DOE consists of four primary stages. The first stage, *evaluation of raw data*, focuses on identifying regularities and peculiarities in the experimental data. The second stage, *regression analysis*, involves the calculation of the model linking the variables and response(s) together, and is followed by a third stage, the *model interpretation*. Finally, in the fourth stage, *use of regression model*, the model obtained is used to predict the optimal experimental conditions to maximize/minimize the response(s).

#### 3.3.1. Testing the Normality for Raw Data Evaluation

In regression analysis, it is advantageous if data of a response are normally distributed. This improves the efficiency of data analysis, and enhances model validity and inferential reliability. It is not recommended to apply regression analysis to a response with heavy tails as originally observed for **LGO** conversion (*Y*_{1}) and enzyme residual activity (*Y*_{2}) responses (Figure S1), since that would correspond to assigning the extreme measurement an undue influence in the modeling (Eriksson et al., 2008). Normality can be assessed to some extent by obtaining skewness and kurtosis values. Skewness is a measure of the asymmetry of the probability distribution of a random variable about its mean. Kurtosis is a measure of the “peakedness” of the probability distribution of a random variable. In other words, kurtosis indicates how tall and sharp the central peak is, relative to that of a standard bell curve. As a general rule of thumb the skewness and kutosis values should range between −0.5 and 0.5 in a normal distribution (Montgomery, 2008). If these values are not included within such range, a transformation of the response should be performed (see Supplementary Material for more details). Table 3 compares these values for both responses before and after negative logarithmic transformation.

**Table 3. Skewness and kurtosis values for LGO conversion (Y_{1}) and enzyme residual activity (Y_{2}) responses before and after a negative logarithmic transformation**.

As shown in Table 3, skewness and kutosis values are within the range after response transformation. In this transformation, each measured value is subtracted from the maximum value (100, for variables expressed in percentages), and then the negative logarithm is applied (see Supplementary Material for more details, including other tools for measure the raw data normality).

#### 3.3.2. Regression Analysis

The next stage consists of fitting the second-order polynomial Equation (1) to the experimental data (Table 2) and determining the significant coefficients for each response.

Designs with a low condition number mean having low correlations among the terms in the model. As can be seen in Table 4, significant correlations are only observed between variables and responses (values highlighted in bold). An exception for the correlations between quadratic terms, as a direct consequence of the design selection (CCF). However, it should be highlighted that these correlations induce a slight increase in the confidence intervals but the model will still be able to estimate the quadratic effects.

The standard approach for selecting significant coefficients for each response is based on their *p*-value (Eriksson et al., 2008; Montgomery, 2008): a *p*-value lower than 0.05 means that the coefficient is significant (Tables 5, 6, values highlighted in bold). The coefficients centered and scaled (Coeff. SC, referring to the coded −1 to +1 unit) as well as their standard error (Std. Err.), *p*-value and confidence interval at 95% (CI) are listed in the Tables 5, 6 for the **LGO** conversion and the enzyme residual activity models, respectively.

**Table 5. Model coefficients [centered and scaled (SC)], their standard error (Std. Err.), p-value and confidence interval at 95% (CI) for the LGO conversion model**.

**Table 6. Model coefficients [centered and scaled (SC)], their standard error (Std. Err.), p-value and confidence interval at 95% (CI) for the enzyme residual activity model**.

For the model of **LGO**, following the *p*-value analysis of each coefficient, **LGO** concentration (*x*_{3}), the respective quadratic term (*x*_{3} * *x*_{3}), the interaction between pka, the enzyme loading (*x*_{1} * *x*_{2}) as well as the interaction between enzyme loading and **LGO** concentration (*x*_{2} * *x*_{3}) are not significant terms. For the model of enzyme residual activity, the enzyme loading (*x*_{2}), the **LGO** concentration (*x*_{3}), all quadratic terms as well as the interaction between enzyme loading and **LGO** concentration (*x*_{2} * *x*_{3}) are not significant terms. As result, the fitted model for each response, expressed in coded variables (scaled and centered), may be represented by the following equations (see in Supplementary Material the model equations in uncoded units, as well as their 3D graphical representation, Figure S6):

Analysis of variance (ANOVA, Tables 7, 8) indicates that second-order polynomial model is adequate to represent the actual relationships between the responses (**LGO** conversion, *Y*_{1}, and enzyme residual activity, *Y*_{2}) and the significant variables (*p* < 0.05). Satisfactory coefficient of determination (*R*^{2} > 0.5, using *least squares* method) and coefficient of cross-validation (*Q*^{2} > 0.5, using the leave-one-out cross-validation) were obtained, showing the *goodness of fit* and the *goodness of prediction*, respectively. Moreover, *F*-tests performed in ANOVA assessing the significance of the regression model (*p* < 0.05) and the *lack of fit* (*p* > 0.05) showed statistical significance of both models and a similar magnitude of replicate errors (*no lack of fit*).

**Table 7. Analysis of variance (ANOVA) for the quadratic polynomial model fitted to LGO conversion, Y_{1}**.

**Table 8. Analysis of variance (ANOVA) for the quadratic polynomial model fitted to enzyme residual activity, Y_{2}**.

#### 3.3.3. Model Interpretation

Model interpretation plays an important role in DOE. Model coefficients given by the Equations (**3**) and (**5**) are unscaled in order to enable the direct application of equations without calculating the corresponding code level of each variable. However, when scaled and centered (referring to the coded −1 to +1 unit), the coefficients values are useful for model interpretation (see the values in the Table 5, 6).

Figure 3 shows the graphical representation of the coefficient scaled and centered of the models of **LGO** conversion and enzyme residual activity, where positive values indicate a synergistic effect on the response while the negative indicate an antagonistic effect.

**Figure 3. Regression coefficient of (A) LGO conversion model and (B) enzyme residual activity model**. Model coefficients are scaled and centered.

As can be observed, enzyme loading (*x*_{2}) is the most significant variable affecting positively the conversion of **LGO** (*Y*_{1}), being observed as well a negative quadratic effect mainly by the solid buffer pka (*x*_{1}) and enzyme loading (*x*_{2}). **LGO** concentration (*x*_{3}) affects the conversion of **LGO** only due to its interaction with the pka variable (*x*_{1}*x*_{3}). These results were expected, since increasing the enzyme loading more enzyme units are available to convert the substrate, thus, higher is the conversion of **LGO**. Of course, there are economic restraints in using high enzyme loading which should be taken into account in the optimization. On the other side, the effect of pka (equivalent to the pH in aqueous solutions) on the enzyme activity is widely reported in literature.

The model of the enzyme residual activity (*Y*_{2}) is more complex. The solid buffer pka (*x*_{1}) proved to have a significant negative effect as well as interacts with the enzyme loading and the **LGO** concentration (*x*_{1}*x*_{2} and *x*_{1}*x*_{3}). The complexity of the enzyme residual activity and existence of interactions is understandable and expected when taking into account all possible mechanisms of activation and inactivation of an enzyme.

The relationships between variables and responses can be better understood by examining the contour plots (Figure 4) generated from the models predicting the **LGO** conversion and enzyme residual activity responses (Equations 3 and 5, respectively). The contour plots can provide the contour lines of the independent variables (*x*_{i}) that have the same response value *Y*_{i}.

Contour plot of **LGO** conversion (Figure 4A) shows that high values (> 90%) can be reached at higher enzyme loading (175 PLU.mmol^{-1}) and pka's (>8.4). However, these conditions corresponds to the lowest enzyme residual activity (Figure 4B). The maximal value for such response (> 85%) was observed at low pka (MOPS, 7.2), low enzyme loading (<100 PLU.mmol^{-1}). Therefore, a trade-off between responses should be placed in order to attain the optimal conditions.

#### 3.3.4. Use of Regression Model for Attaining Optimal Conditions

The optimal conditions were determined using a Nelder-Mead Simplex algorithm. This method computes the variable values (*x*_{1}, *x*_{2}, and *x*_{3}) that minimizes simultaneously the normalized distance to target values of responses and DPMO (Defects Per Million Opportunities outside specifications). DPMO gives information about robustness to small disturbances introduced by the precision specified for the factors. The optimal conditions to obtain at least 80% of **LGO** conversion (*Y*_{1}) and enzyme residual activity (*Y*_{2}) were found to be: pka = 7.5 (HEPES), **LGO** concentration = 0.50 M and enzyme loading = 194 PLU.mmol^{-1}. The advantage of using HEPES as solid buffer, in contrast to MOPS (pka = 7.2), is its easier recovery for reuse in a consecutive batch, since it does not form a gel.

Figure 5 shows the *Sweet Spot* plot when setting 80% as the minimum value for both responses. As can be observed, the criteria (area highlighted in green) were exclusively met at low pka (HEPES, pka = 7.5) and **LGO** concentration (<0.57 M).

**Figure 5. 4D Sweet Spot plot**. Green color indicates the “sweet spot,” where both responses are at least 80%; blue indicates the area where the criteria fails for one of the responses and white indicates the area where none of the responses are within the selected range.

Alternative optimal conditions can be explored using a *design space* (*DS*) plot. This plot uses Monte Carlo simulations for risk analysis, estimating the volume in the experimental design region where it can be expected that all specifications are fulfilled at a specific risk level. Figure 6 shows the *DS* plot using as specification a high **LGO** conversion (*Y*_{1} > 80%). However, it should be taken into account that the lower enzyme residual activity, the greater **LGO** conversion (>80%). Three points were identified using this plot. Point A is the optimum identified by a Nelder-Mead Simplex algorithm, Point *B* uses a lower enzyme loading to convert at least 80% of **LGO** (both with a risk level between 1 and 2%) and Point *C* illustrates a condition where no risk is taken to obtain 80% as minimum value for both responses, in spite of using a high enzyme loading.

**Figure 6. Design space plot**. Contours indicate the risk of failure (%) in the specifications fulfilling (**LGO** conversion > 80%, pka_{HEPES} = 7.5). Green color indicates the area where the risk of failure is lower (< 1%), while the red indicates a higher risk of failure (> 2%). Point *A*- optimum identified by a Nelder-Mead Simplex algorithm (194 PLU.mmol^{-1} and **[LGO]** = 0.50 M), Point *B* - optimum identified in an earlier publication (Flourat et al., 2014) (113 PLU.mmol^{-1} and **[LGO]** = 0.75 M); Point *C* - point with a zero risk level (227 PLU.mmol^{-1} and **[LGO]** = 0.75 M).

The advantage of using the conditions expressed by the point *B* is the low enzyme loading (113 PLU.mmol^{-1}) necessary to obtain an acceptable **LGO** conversion (80%), which represents an economic saving and increases the viability of the process at industrial scale. The point *B* coincides with the optimal point identified by the authors in an earlier publication (Flourat et al., 2014), however, using the OVAT method.

### 3.4. External Model Validation

The internal validity of the predicting model was assessed by the Q^{2} coefficient, obtained by leave-one-out cross-validation. Points *A*, *B*, and *C* were used for an additional external validation.

As shown in Table 9, the experimental responses are in agreement to the ones predicted by the model, thus, the model validity can be inferred within the region (ranges of variables) studied in this work.

### 3.5. Enzyme Recyclability and HBO Production

The possibility of reusing the enzyme under point *B* conditions at multigram scale was shown by the authors in an early publication (Flourat et al., 2014), being obtained high conversion of **LGO** in the first two cycles (> 80% per cycle). Scale up of this process was performed with 10 g of **LGO** per oxidation cycle. Under such conditions, conversions of **LGO** were comparable to those observed at the 500 mg scale (1st cycle: 83 and 81% respectively). The reaction mixture was then combined and subjected to the acid hydrolysis. Finally, after concentration to dryness, the crude mixture was easily purified by column chromatography and provided pure HBO in 67% yield (overall yield for the two 2-h oxidation cycles). ^{1}H and ^{13}C NMR spectras can be found in our early publication (Flourat et al., 2014).

## 4. Conclusion

RSM has proven to be adequate for the optimization of the enzymatic Baeyer-Villiger oxidation of **LGO**, providing, in addition, a better understanding of the individual and mutual effects of buffer pka, enzyme loading and **LGO** concentration on the overall reaction efficiency (measured as conversion of **LGO**) as well as on the enzyme recyclability (measured as enzyme residual activity). Enzyme loading and solid buffer pka were found to be important variables to the **LGO** conversion while for the enzyme residual activity, only the pka and all their interactions were significant. **LGO** concentration influences both responses by their interaction with the enzyme loading and solid buffer pka.

An antagonist effect of the variables on both responses was observed, thus, being necessary to establish a compromise to attain the optimal conditions. Such conditions were found to be: solid buffer pka = 7.5, **[LGO]** = 0.75 M and 113 PLU.mmol^{-1} for the lipase. Under these conditions, a high conversion (> 80%) was obtained in two consecutive batch at multigram scale.

The statistical models obtained by RSM for each response, represented by Equations (**3**) and (**5**), enable the prediction of **LGO** conversion and enzyme residual activity, respectively, at different conditions of pka, enzyme loading and **LGO** concentration. The validity of the model was confirmed (*p* < 0.05), being observed a good agreement between experimental and predicted values.

## Author Contributions

FA conceived and supervised the research project. AT performed the experimental design and its statistical analysis. AF and AP carried-out all the enzymatic reaction and quantified LGO by HPLC. AT done all the enzymatic residual activity tests, while FB quantified by GC-MS. AT drafted the manuscript, all authors were involved in revising it; FA supervised its preparation. All authors have approved and are accountable for the final version of the manuscript.

## Funding

The authors are grateful to Région Champagne-Ardenne, Conseil Général de la Marne and Reims Métropole for financial support.

## Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Acknowledgments

The authors are grateful to Circa Group for providing industrial grade levoglucosenone.

## Supplementary Material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fchem.2016.00016

## References

Babu, B. V. (2008). Biomass pyrolysis: a state-of-the-art review. *Biofuels Bioprod. Bioref.* 2, 393–414. doi: 10.1002/bbb.92

Budarin, V. L., Shuttleworth, P. S., Dodson, J. R., Hunt, A. J., Lanigan, B., Marriott, R., et al. (2011). Use of green chemical technologies in an integrated biorefinery. *Energy Environ. Sci.* 4, 471–479. doi: 10.1039/C0EE00184H

Court, G., Lawrence, C., Raverty, W., and Duncan, A. (2012). *Method for Converting Lignocellulosic Materials into Useful Chemicals.* EP Patent Application EP20100793418.

Enders, D., Lausberg, V., Del Signore, G., and Berner, O. M. (2002). A general approach to the asymmetric synthesis of lignans: (−)-Methyl piperitol, (−)-sesamin, (−)-aschantin, (+)-yatein, (+)-dihydroclusin, (+)-burseran, and (−)-isostegane. *Synthesis* 33, 515–522. doi: 10.1055/s-2002-20967

Eriksson, L., Johansson, E., Kettaneh-Wold, N., Wikströn, C., and Wold, S. (2008). *Design of Experiments: Principles and Applications, 3rd Edn.* Umeå: Umetrics Academy.

Flores, R., Rustullet, A., Alibés, R., Álvarez Larena, A., de March, P., Figueredo, M., et al. (2011). Synthesis of purine nucleosides built on a 3-oxabicyclo[3.2.0]heptane scaffold. *J. Org. Chem.* 76, 5369–5383. doi: 10.1021/jo200775x

Flourat, A. L., Peru, A. A. M., Teixeira, A. R. S., Brunissen, F., and Allais, F. (2014). Chemo-enzymatic synthesis of key intermediates (s)-γ-hydroxymethyl-α, β-butenolide and (s)-γ-hydroxymethyl-γ-butyrolactone via lipase-mediated Baeyer-Villiger oxidation of levoglucosenone. *Green Chem.* 17, 404–412. doi: 10.1039/C4GC01231C

Huber, G. W., Iborra, S., and Corma, A. (2006). Synthesis of transportation fuels from biomass: chemistry, catalysts, and engineering. *Chem. Rev.* 106, 4044–4098. doi: 10.1021/cr068360d

Kawakami, H., Ebata, T., Koseki, K., Matsumoto, K., Matsushita, H., Naoi, Y., et al. (1990). Stereoselectivities in the coupling reaction between silytated pyrimidine bases and 1-halo-2,3-dideoxyribose. *Heterocycles* 31, 2041–2054. doi: 10.3987/COM-90-5563

Miftakhov, M. S., Valeev, F. A., and Gaisina, I. N. (1994). Levoglucosenone: the properties, reactions, and use in fine organic synthesis. *Russ. Chem. Rev.* 63, 869. doi: 10.1070/RC1994v063n10ABEH000123

Novozymes (2015). http://www.novozymes.com

Paris, C., Moliner, M., and Corma, A. (2013). Metal-containing zeolites as efficient catalysts for the transformation of highly valuable chiral biomass-derived products. *Green Chem.* 15, 2101–2109. doi: 10.1039/c3gc40267c

Partridge, J., Harper, N., Moore, B. D., and Halling, P. J. (2001). “Control of acid-base conditions in low-water media,” in *Enzymes in Nonaqueous Solvents*, Vol. 15 of *Methods in Biotechnology*, eds E. Vulfson, P. Halling, and H. Holland (Totowa, NJ: Humana Press), 227–234.

Perot, G., and Guisnet, M. (1990). Advantages and disadvantages of zeolites as catalysts in organic chemistry. *J. Mol. Catal.* 61, 173–196. doi: 10.1016/0304-5102(90)85154-A

Sarotti, A. M., Zanardi, M. M., Spanevello, R. A., and Suarez, A. G. (2012). Recent applications of levoglucosenone as chiral synthon. *Curr. Org. Synth.* 9, 439–459. doi: 10.2174/157017912802651401

Keywords: response surface methodology, reaction optimization, Bayer-Villiger bio-oxidation, lipase, enzymatic reaction, levoglucosenone

Citation: Teixeira ARS, Flourat AL, Peru AAM, Brunissen F and Allais F (2016) Lipase-Catalyzed Baeyer-Villiger Oxidation of Cellulose-Derived Levoglucosenone into (*S*)-γ-Hydroxymethyl-α,β-Butenolide: Optimization by Response Surface Methodology. *Front. Chem.* 4:16. doi: 10.3389/fchem.2016.00016

Received: 23 February 2016; Accepted: 29 March 2016;

Published: 19 April 2016.

Edited by:

Warwick Douglas Raverty, Monash University, AustraliaReviewed by:

George Kokotos, University of Athens, GreeceVictor Sans Sangorrin, University of Nottingham, UK

Copyright © 2016 Teixeira, Flourat, Peru, Brunissen and Allais. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Andreia R. S. Teixeira, andreia.teixeira@agroparistech.fr