19
1 Prediction of metabolic pathways to produce naringenin from the organisms involved in the fermentation of cocoa and synthesis of value- added compounds Lina Suárez Medina 1 , Andrés González Barrios 2 Master's Degree in Chemical Engineering, Chemical Engineering Department, Universidad de los Andes 1. [email protected], 2 [email protected] 2019-05-30 ___________________________________________________________________________________________________________ Abstract Due to the growing interest in beneficial health products, such as functional foods, dietary supplements, and nutraceuticals, various techniques and methods have been developed to obtain products with nutritional and pharmaceutical components that contribute to consumer health. It is known that cocoa beans contain high levels of polyphenols, which are mostly present in the form of flavonoids, recognized for their antioxidant capacity and health benefits. The fermentation of cocoa is a critical process in the synthesis of the organoleptic and nutritional characteristics of chocolate, being derived from polyphenols such as flavonoids. Based on this, the interest in knowing the metabolic pathways for the generation of flavonoids and added value novel products from fermentation, aiming to give an added value to the derivatives of chocolate and why not, to the design of new products with high health benefits. In order to predict a set of alternative metabolic pathways to produce the flavonoid precursor naringenin from the amino acids L-tyrosine and L- phenylalanine, the OptStoic algorithm was performed. Two metabolic pathways were obtained for naringenin flavonoid production, one per amino acid, encoded by the genes PAL / TAL, 4CL, CHI, CHS from L-tyrosine and the C4H gene additional to the previous ones from L-phenylalanine. The FBA was performed to analyze the metabolic capacities of E. coli to produce naringenin by incorporating the non-native path predicted using the strain E. coli ATCC 8739, obtaining a growth of 0.8616 mmol/gDW -1 *h -1 and flux of naringenin production of 0.1231 mmol/gDW -1 *h -1 Finally, an OptKnock analysis was carried out with which the set of knockout reactions was obtained to improve flavonoid production composed of CITL and MALS, generating a biomass growth of 0.887 mmol/gDW -1 *h -1 and a naringenin production coupled to the growth of 0.1267 mmol/gDW -1 *h -1 . Since naringenin is a commercially available compound, the chemical synthesis of its apiforol derivative was carried out and its antioxidant activity was checked through the FRAP method. Keywords: Metabolic pathways, nutraceutical, cocoa fermentation, algorithm, naringenin, apiforol. _________________________________________________________________________________________________________________________ Introduction Theobroma cacao is a plant native from South America 1 From this plant, the raw material for the production of chocolate is obtained, which is composed of the seeds inside the cocoa fruits 2 . A crucial process for the correct formation of the precursors of cocoa flavor, amino acids, and the short-chain peptides, which favor the biochemical changes within the grains, is fermentation 24 . The overall result of this process has a profound impact on the color and flavor potential of the beans 5 , being key in the determines final cocoa quality 6 . Currently, the offer of added valued products focuses on improving human health, has encouraged the food industry to research and generate beneficial foods, exalting their properties nutritional and pharmaceutical, as is the case of functional foods, dietetic supplements, and nutraceutical 7 . Among the nutraceuticals are antioxidants such as vitamins C and E, phenolic compounds such as flavonoids, saponins, fatty acids, phospholipids in others 8 . From this point of view, cocoa is very important, due to its high content of polyphenols, which are related to the organoleptic characteristics and the nutritional value of the final product. Previously, high amounts of flavonoids have been found in cocoa, mainly (−)-epicatechin (EC), (+)catechin (C), and their dimers procyanidin B2 (PB2) and B1 (PB1), which have been reported effects on many important biological functions in vitro and in vivo, due to their antioxidant, anti-free radical and antiplatelet impacts. Furthermore, they are able to modulate the immune response, anti- inflammatory, anti-diabetic and anti-carcinogenic properties 9 . An interesting compound found in the Theobroma cacao genome reported in the KEGG database is the naringenin flavonoid. Naringenin has been reported as a natural compound that has many effects on human health, as agent normolipidemic, inhibitor and activator of important enzymes in the oxidation and synthesis fatty acids (Goldwasser, Cohen, Yang, Balaguer, Yarmush, 2010). It also has been shown effects in decreasing Hepatitis C virus (HCV) in sick rats 11 . This has antioxidant, anti-inflammatory, antiproliferative and antimutagenic properties, and it is suggested as an active chemoprotector 12 , it fights liver, coronary and obesity diseases and serves as a treatment for the dengue virus 13 . Naringenin is commercially available in several forms, as a nutritional supplement, however, the compound derived from this, flavan-4-ol

Prediction of metabolic pathways to produce naringenin

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

1

Prediction of metabolic pathways to produce naringenin from the

organisms involved in the fermentation of cocoa and synthesis of value-

added compounds

Lina Suárez Medina1, Andrés González Barrios2

Master's Degree in Chemical Engineering, Chemical Engineering Department, Universidad de los Andes [email protected], [email protected]

2019-05-30

___________________________________________________________________________________________________________

Abstract

Due to the growing interest in beneficial health products, such as functional foods, dietary supplements, and nutraceuticals, various

techniques and methods have been developed to obtain products with nutritional and pharmaceutical components that contribute to

consumer health. It is known that cocoa beans contain high levels of polyphenols, which are mostly present in the form of flavonoids,

recognized for their antioxidant capacity and health benefits. The fermentation of cocoa is a critical process in the synthesis of the

organoleptic and nutritional characteristics of chocolate, being derived from polyphenols such as flavonoids. Based on this, the interest

in knowing the metabolic pathways for the generation of flavonoids and added value novel products from fermentation, aiming to give

an added value to the derivatives of chocolate and why not, to the design of new products with high health benefits. In order to predict

a set of alternative metabolic pathways to produce the flavonoid precursor naringenin from the amino acids L-tyrosine and L-

phenylalanine, the OptStoic algorithm was performed. Two metabolic pathways were obtained for naringenin flavonoid production,

one per amino acid, encoded by the genes PAL / TAL, 4CL, CHI, CHS from L-tyrosine and the C4H gene additional to the previous

ones from L-phenylalanine. The FBA was performed to analyze the metabolic capacities of E. coli to produce naringenin by

incorporating the non-native path predicted using the strain E. coli ATCC 8739, obtaining a growth of 0.8616 mmol/gDW-1*h-1 and flux

of naringenin production of 0.1231 mmol/gDW-1*h-1 Finally, an OptKnock analysis was carried out with which the set of knockout

reactions was obtained to improve flavonoid production composed of CITL and MALS, generating a biomass growth of 0.887

mmol/gDW-1*h-1 and a naringenin production coupled to the growth of 0.1267 mmol/gDW-1*h-1. Since naringenin is a commercially

available compound, the chemical synthesis of its apiforol derivative was carried out and its antioxidant activity was checked through

the FRAP method.

Keywords: Metabolic pathways, nutraceutical, cocoa fermentation, algorithm, naringenin, apiforol. _________________________________________________________________________________________________________________________

Introduction Theobroma cacao is a plant native from South

America1 From this plant, the raw material for the

production of chocolate is obtained, which is composed

of the seeds inside the cocoa fruits2. A crucial process for

the correct formation of the precursors of cocoa flavor,

amino acids, and the short-chain peptides, which favor

the biochemical changes within the grains, is

fermentation2–4. The overall result of this process has a

profound impact on the color and flavor potential of the

beans5, being key in the determines final cocoa quality6.

Currently, the offer of added valued products

focuses on improving human health, has encouraged the

food industry to research and generate beneficial foods,

exalting their properties nutritional and pharmaceutical,

as is the case of functional foods, dietetic supplements,

and nutraceutical7. Among the nutraceuticals are

antioxidants such as vitamins C and E, phenolic

compounds such as flavonoids, saponins, fatty acids,

phospholipids in others8. From this point of view, cocoa

is very important, due to its high content of polyphenols,

which are related to the organoleptic characteristics and

the nutritional value of the final product. Previously,

high amounts of flavonoids have been found in cocoa,

mainly (−)-epicatechin (EC), (+)–catechin (C), and their

dimers procyanidin B2 (PB2) and B1 (PB1), which have

been reported effects on many important biological

functions in vitro and in vivo, due to their antioxidant,

anti-free radical and antiplatelet impacts. Furthermore,

they are able to modulate the immune response, anti-

inflammatory, anti-diabetic and anti-carcinogenic

properties9.

An interesting compound found in the

Theobroma cacao genome reported in the KEGG

database is the naringenin flavonoid. Naringenin has

been reported as a natural compound that has many

effects on human health, as agent normolipidemic,

inhibitor and activator of important enzymes in the

oxidation and synthesis fatty acids (Goldwasser, Cohen,

Yang, Balaguer, Yarmush, 2010). It also has been

shown effects in decreasing Hepatitis C virus (HCV) in

sick rats11. This has antioxidant, anti-inflammatory,

antiproliferative and antimutagenic properties, and it is

suggested as an active chemoprotector12, it fights liver,

coronary and obesity diseases and serves as a treatment

for the dengue virus 13. Naringenin is commercially

available in several forms, as a nutritional supplement,

however, the compound derived from this, flavan-4-ol

2

apiforol, obtained by the action of the enzyme flavanone

4-reductase, is a compound that is currently not

commercialized and which, in turn, it has been reported

as beneficial for health. This compound can be obtained

through a reduction reaction with sodium borohydride,

where the ketone group from naringenin is reduced to

alcohol group forming apiforol. Also, apiforol showed a

significant α-glucosidase inhibitory activity so that it can

be considered as a potent antidiabetic agent as it

exhibited a reduction in the postprandial hyperglycemia,

by inhibiting the digestive enzymes and enhancing the

uptake of glucose by the cells14. For this reason, apiforol

is a promising and high impact compound.

With advances in DNA sequencing and

recombinant DNA technology, many biosynthetic

pathways responsible to produce valuable compounds

have been elucidated, offering the opportunity for

functional integration of biosynthetic pathways into

suitable microorganisms, such as Escherichia coli or

Saccharomyces cerevisiae. Through metabolic

engineering, we can modify the metabolic profiles

according to the purposes of individual production and

heterologous expression of the complete pathways of

plants to confer the ability to synthesize natural products

derived from plants, as is the case of naringenin and the

apiforol15. The central structure of the flavonoids was

derived directly from the amino acids L-phenylalanine

and L-tyrosine, through the elimination of a non-

oxidative amino group by a phenylalanine amino-lyase

(PAL) or a tyrosine-amino-lyase (TAL) respectively,

generating access to various flavonoid precursors15. This

is the case of the flavonoid precursor, naringenin.

Recently, an economic process has been

developed without the use of precursor amino acids for

the microbial production of naringenin from glucose16.

This was achieved through the introduction of a

heterologous pathway, consisting of chalcone synthase

(CHS), tyrosine-amino-lyase (TAL) maximized by

codons, 4-coumaroyl-coa (4CL) and chalcone

isomerase (CHI), in E. Coli strains designed to produce

L-tyrosine. These strains were able to produce 29 mg/L

of naringenin from glucose and 84 mg/l of naringenin

from glucose when the enzymes involved in fatty acid

biosynthesis were further suppressed.

Based on technological advances and the need to

predict and design new metabolic pathways, in order to

obtain new useful products for industry and human

development, several studies of design and development

of bioinformatics tools have been developed with the

help of the systems biology, with the aim of creating

algorithms capable of predicting metabolic pathways

based on their topology (length analysis), the metabolic

efficiency of the host organism, energy costs, genetic

loads17, COnstraint-Based Reconstruction and Analysis

(COBRA)18, making it possible to approach the design

of pathways from the quantitative and functional point

of view, obtaining results of cellular metabolisms that

have successfully predicted the production of several

compounds18. In addition, the use of synthetic tools from

systems biology allowed the regulation of genes, the

assembly of heterologous pathways and the temporal

control of gene expression.

At present, several in silico mathematical

methods have been constructed to design and predict

synthetic metabolic pathways, such as GEN-Path

(Genome-Scale-Model Predictor Pathway)18 and

FindPath17. These use Flux Balance Analysis (FBA),

steady-state, databases with information on compounds,

objective functions of biomass growth and

thermodynamic restrictions. On the other hand, OptStoic

is a conversion algorithm for overall stoichiometry and

intermediate metabolic reactions, which is based on two

steps: 1, identification of optimal general stoichiometry

of a general reaction formulated for prediction, 2,

identification of the reaction pathways that conform to

the overall stoichiometry identified in the first step19.

In this work we want to obtain de novo metabolic

pathways, enzymes and associated genes, in the process

of fermentation of cocoa seeds, for the production of the

naringenin flavonoid precursor from the amino acids L-

phenylalanine and L-tyrosine and synthesize the apiforol

compound from the naringenin.

1. Methods and materials

Obtaining metabolic pathways for naringenin

production. OptStioc was used to identify optimal

overall stoichiometry and MinFlux was used to predict

metabolic pathways adjusted to overall stoichiometry

identified by Optstoic. OptSotic formulation is a MILP

(Mixed Integer Linear Programming) or IP (Integer

Programming Problem) problem, depending on whether

the stoichiometric coefficients are declared as real or

integer variables, respectively. This formulation allows

identifies the overall stoichiometry of conversion from a

reagent A (in this case L-tyrosine and L-phenylalanine)

to a product B (naringenin), maximizing a desired yield

objective subject to overall constraints of elemental and

charge balance and thermodynamic feasibility, allowing

combinations between cofactors (co-reactive D and co-

product C), from a metabolite database and their

respective stoichiometric coefficients in the general

conversion (b, c, d).

Overall conversion equation:

𝒂𝑨 + 𝒄𝑪 → 𝒃𝑩 + 𝒅𝑫 (1)

3

Where A, B, C, and D, are the metabolites of the

reaction and a, b, c, and d, are the stoichiometric

coefficients of the metabolites, which must be positive

and not null.

The overall optimization formulation is:

𝑴𝒂𝒙𝒊𝒎𝒊𝒛𝒆 𝒁𝒑 = ∑𝑪𝒊

−𝑪𝑨𝒊∈𝑷𝑫𝑻

(2)

Subject to:

𝒏𝑩𝒒𝒃 + 𝒏𝑫𝒒𝒅 − 𝒏𝑨𝒒𝒂 − 𝒏𝑪𝒒𝒄 = 𝟎, ∀ 𝐪 ∈ 𝐐 (3)

𝒆𝑩𝒃 + 𝒆𝑫𝒅 + 𝒆𝑪𝒅 + 𝒆𝑪𝒄 = 𝟎, (4)

∆𝑮𝑩𝒇

𝒃 + ∆𝑮𝑫𝒇

𝒅 − ∆𝑮𝑪𝒇

𝒄 − ∆𝑮𝑨𝒇

𝒂 ≤ −∆𝑮𝒎𝒊𝒏 (5)

𝒉(𝒂, 𝒃, 𝒄, 𝒅) = 𝟏 (6)

𝒂, 𝒃, 𝒄, 𝒅 ∈ 𝑹+/𝒁 (7)

Where i = 1... N in the set I of metabolites present

in the database and q ϵ Q is the set of elements

found in the metabolites, 𝑛𝑖𝑞 is the number of atoms of

element q in metabolite i, 𝑒𝑖 is the charge of metabolite

i, ∆𝐺𝑖𝑓

is the transformed free energy of formation (at 25

°C and 0.1 M ionic strength) of metabolite i, ∆𝐺𝑔𝑒𝑛𝑒𝑟𝑎𝑙

must be less to zero and y ∆𝐺𝑚𝑖𝑛 is the extra energy

change amount available for the stoichiometry and h is

stoichiometry of the base metabolite, which is set at a

value of 1. 𝑍𝑝 is the yield with respect to the reagent,

𝑃𝐷𝑇 is a subset of target products and 𝐶𝑖 is the

stoichiometric coefficient of metabolite i.

To run OptStoic, two metabolites databases were

constructed, one with the metabolites related with

metabolic reactions of the organisms of cocoa seeds

fermentation and other with the metabolites belong to

KEGG database, which includes net charges and

reversibility (∆𝐺𝑖𝑓) for each metabolite from Pactric20

and Metacyc21, CheBi22 and PubChem23 databases. Once

the stoichiometry and the cofactors involved in the

overall reaction have been obtained, the metabolic

pathways to contain these metabolites are identified by

minimizing the total metabolic flux through minFlux19

formulation.

Set of minimum reactions that can match the

desired overall stoichiometry is formulated with the

following linear programming problem (LP), which

minimizes the total metabolic flux of the chosen

reactions. The objective of this optimization problem is

to minimize the reaction flux, which approximates the

total enzyme load imposed on the organism. This

problem can be described mathematically as:

𝑴𝒊𝒏𝒊𝒎𝒊𝒛𝒆 (𝒙𝒋) ∑ 𝒙𝒋

𝒋 ∈ 𝑱

(𝒎𝒊𝒏𝑭𝒍𝒖𝒙) (8)

Subject to:

∑ 𝑺𝒊𝒋𝒗𝒋

𝒋 ∈ 𝑱

= 𝟎, ∀ 𝒊 ∈ 𝑰 (9)

𝒗𝑨𝒆𝒙 = −𝒂, 𝒗𝑪

𝒆𝒙 = −𝒄, 𝒗𝑩𝒆𝒙 = −𝒃, 𝒗𝑫

𝒆𝒙

= −𝒅 (10)

𝑳𝑩𝒋 ≤ 𝒗𝒋 ≤ 𝑼𝑩𝒋 , ∀ 𝒋 ∈ 𝑱 (11)

𝒙𝒋 ≥ 𝒗𝒋, ∀ 𝒋 ∈ 𝑱 (12)

𝒙𝒋 ≥ −𝒗𝒋, ∀ 𝒋 ∈ 𝑱 (13)

𝑣𝑗 , 𝑥𝑗 ∈ 𝑅

𝜺𝒖𝒍 ≤ 𝑺𝒍 ≤ 𝑴𝒖𝒍, ∀ 𝒍 ∈ 𝑳 (14)

∑ (𝟏 − 𝒖𝒍)

𝒍|𝒖𝒍𝒌=𝟏

≥ 𝟏, ∀ 𝒌 = 𝟏, … , 𝑲 (15)

𝒗𝒋 ≥ 𝑳𝑩𝒋𝒖𝒋, ∀ 𝒋|∆𝑮𝒋𝟎 ≤ 𝟎 (161)

𝒗𝒋 ≤ 𝑼𝑩𝒋𝒖𝒋, ∀ 𝒋|∆𝑮𝒋𝟎 ≥ 𝟎 (17)

∑ 𝒖𝒋

𝒋

≤ 𝒑𝒕 (18)

𝒖𝒋 = {𝟏, 𝒊𝒇 𝒔𝒊𝒈𝒏(𝒗𝒋) = 𝒔𝒊𝒈𝒏(∆𝑮𝒋

𝟎)

𝟎, 𝒐𝒕𝒉𝒆𝒓𝒘𝒊𝒔𝒆 (19)

Where 𝑖, 𝑗 are metabolites and reactions,

respectively, 𝑣 is the flux dual variable of each reaction

(Include inside and exchange reactions), 𝑆𝑖𝑗:

Stoichiometry matrix that describes the coefficient of

metabolite i in reaction j, 𝑣𝐴𝑒𝑥, 𝑣𝐶

𝑒𝑥, 𝑣𝐵𝑒𝑥, 𝑣𝐷

𝑒𝑥 are the

exchange fluxes of the participating metabolites in the

global reaction, which are equal to their stoichiometric

coefficients, 𝐿𝐵𝑗 𝑎𝑛𝑑 𝑈𝐵𝑗 are lower and upper bounds

on the flux of each reaction j, 𝑥𝑖 is the sum of the absolute

values of the reaction fluxes, 𝑣𝑗 is reaction fluxes vector,

𝜀 is a small positive value (𝜀 = 10−5) that ensures that

𝑆𝑙 is nonzero when 𝑢𝑙 = 1. Binary variables 𝑢𝑙 denote

the presence of nonzero basic variable l in a previously

identified solution k, ∆𝐺𝑗0

is the free energy change of the

reaction under standard conditions, 𝑝𝑡 is a parameter that

keeps the total number of reactions with a positive

standard change in free energy to a minimum and 𝑢𝑗 is a

binary variable that selects the flows of the reactions.

To run minFlux, two inputs sets were

constructed, the first with the metabolic information of

the species reported in the literature (see Table 1¡Error!

No se encuentra el origen de la referencia.) for the

4

fermentation process of cocoa seeds, including the

information Theobroma cacao plant. In these inputs,

species without a metabolic model were not used.

Additionally, the reactions involved in glycolysis, the

pentose phosphate pathway, the phenylalanine, tyrosine,

and tryptophan biosynthesis and the secondary

metabolism of the Theobroma cacao plant were

considered. Moreover, the second inputs all the

metabolic information of the KEGG database (Kyoto

Encyclopedia of Genes and Genomes)24–26 were

included.

Integration of metabolic pathways predicted

on an E. coli in silico model. Once the predicted

pathways were obtained, these were mapped using the

Escher web tool27 and a Flux Balance Analysis (FBA)

was carried out. FBA was performed to analyze the

metabolic capabilities of E. coli to produce naringenin

by incorporation of nonnative pathways predicting

through OptStoic. This algorithm was used following the

mathematical formulation reported by Savinell and

Palsson 199228. The most attractive feature of this tool is

its ability to make quantitative predictions in a network,

without the need for detailed kinetic information for its

development, only the stoichiometry of the reactions, a

defined objective biological function (production of

biomass, synthesis of metabolites, redox potential, etc.)

and growth conditions (availability of substrates)29. For

FBA, the model of the organism E. coli ATCC 8739 was

used and the execution was performed under aerobic

conditions and with a glucose uptake of -10 mmol*gDW-

1*h-1 as a carbon source. FBA can be described as follow:

𝒎𝒂𝒙𝒊𝒎𝒊𝒛𝒆 𝒁 = 𝑪𝑻𝒗𝒋 (20)

Subject to:

𝒅𝑪

𝒅𝒕= 𝑺𝒊,𝒋 ∙ 𝒗𝒋 = 𝟎 (21)

𝑳𝑩𝒋 ≤ 𝒗𝒋 ≤ 𝑼𝑩𝒋 (22)

Where 𝐶𝑇is transposed vector of the

concentrations for each species.

Improving the naringenin production in E.

coli through OptKnock analysis. Due the fact that

cellular metabolism is prepared, through natural

selection, for the maximum responsiveness to the

history of selective pressures rather than for the

overproduction of specific chemical compounds, so that

the yield of these products on several organisms are

often far below their theoretical maximums, as is the

case of production of plants products in bacterial, for

example, naringenin production. With the aim of

improving the reaction flux for naringenin production in

E. coli, an OptKnock30 analysis was carried out in

COBRA toolbox. OptKnock suggests gene knockout

strategies for biochemical overproduction while

recognizing that metabolic flux distributions are

governed by internal cellular objectives30. This tool

identifies and eliminates metabolic reactions that are

capable of decoupling cell growth from chemical

production. Involving optimization at two levels, which

considers two optimal strategists competing with each

other (i.e. the cell target and chemical production) by

identifying multiple combinations of gene deletion. In

this case, biomass formation was used as a cell target and

naringenin production as chemical production.

OptKnock can be described as:

𝒎𝒂𝒙𝒊𝒎𝒊𝒛𝒆(𝒚𝒊) 𝒗𝒄𝒉𝒆𝒎𝒊𝒄𝒂𝒍 (23)

Subject to:

𝒗𝒃𝒊𝒐𝒎𝒂𝒔𝒔 = 𝒗𝒂𝒕𝒑−𝒎𝒂𝒊𝒏 . 𝝁𝒂𝒕𝒑 + 𝒗𝒃𝒊𝒐𝒎𝒂𝒔𝒔𝒐𝒃𝒋

. 𝝁𝒃𝒊𝒐𝒎𝒂𝒔𝒔

+ 𝒗𝒈𝒍𝒄−𝒖𝒑𝒕𝒂𝒌𝒆. 𝒈𝒍𝒄 (24)

Subject to:

∑ 𝑺𝒊𝒋𝒗𝒋 = 𝟎

𝑴

𝒋=𝟏

∀𝒊 ∈ 𝑵 (25)

𝒗𝒑𝒕𝒔 + 𝒗𝒈𝒍𝒌 = 𝒗𝒈𝒍𝒄−𝒖𝒑𝒕𝒂𝒌𝒆 (26)

𝒗𝒂𝒕𝒑 ≥ 𝒗𝒂𝒕𝒑−𝒎𝒂𝒊𝒏 (27)

∑ 𝜸𝒊𝒔𝒕𝒐𝒊𝑺𝒊,𝒈𝒍𝒌

𝑵

𝒊=𝟏

+ 𝒖𝒈𝒍𝒌 + 𝒈𝒍𝒄 = 𝟎 (28)

∑ 𝜸𝒊𝒔𝒕𝒐𝒊𝑺𝒊,𝒑𝒕𝒔

𝑵

𝒊=𝟏

+ 𝒖𝒑𝒕𝒔 + 𝒈𝒍𝒄 = 𝟎 (29)

∑ 𝜸𝒊𝒔𝒕𝒐𝒊𝑺𝒊,𝒃𝒊𝒐𝒎𝒂𝒔𝒔

𝑵

𝒊=𝟏

+ 𝒖𝒃𝒊𝒐𝒎𝒂𝒔𝒔 = 𝟏 (30)

∑ 𝜸𝒊𝒔𝒕𝒐𝒊𝑺𝒊,𝒋

𝑵

𝒊=𝟏

+ 𝒖𝒋 = 𝟎, ∀𝒋 ∈ 𝝁, 𝒋 ≠ 𝒈𝒌𝒍, 𝒑𝒕𝒔, 𝒃𝒊𝒐𝒎𝒂𝒔𝒔 (31)

∑(𝟏 − 𝒚𝒋)

𝒋∈𝑴

≤ 𝑲 (32)

𝒗𝒃𝒊𝒐𝒎𝒂𝒔𝒔 ≥ 𝒗𝒃𝒊𝒐𝒎𝒂𝒔𝒔𝒐𝒃𝒋

(33)

𝒖𝒋𝒎𝒊𝒏. (𝟏 − 𝒚𝒋) ≤ 𝒖𝒋 ≤ 𝒖𝒋

𝒎𝒂𝒙. (𝟏 − 𝒚𝒋), ∀𝒋 ∈ 𝝁𝒓𝒆𝒗 𝒚 𝒋

∉ 𝝁𝒔𝒆𝒄𝒓 (34)

𝒖𝒋𝒎𝒊𝒏. (𝟏 − 𝒚𝒋) ≤ 𝒖𝒋, ∀𝒋 ∈ 𝝁𝒓𝒆𝒗 𝒚 𝝁𝒔𝒆𝒄𝒓 (35)

𝒖𝒋 ≤ 𝒖𝒋𝒎𝒂𝒙. (𝟏 − 𝒚𝒋), ∀𝒋 ∈ 𝝁𝒊𝒓𝒓𝒆𝒗 𝒚 𝒋 ∉ 𝝁𝒔𝒆𝒄𝒓 (36)

𝒖𝒋 ∈ 𝑹, ∀𝒋 ∈ 𝝁𝒊𝒓𝒓𝒆𝒗 𝒚 𝝁𝒔𝒆𝒄𝒓 (37)

𝒗𝒋𝒎𝒊𝒏. 𝒚𝒋 ≤ 𝒗𝒋 ≤ 𝒗𝒋

𝒎𝒂𝒙. 𝒚𝒋, ∀ 𝒋 ∈ 𝝁 (38)

5

𝒚𝒋 = {𝟎, 𝟏}, ∀ 𝒋 ∈ 𝝁 (39)

𝜸𝒋 = {𝟎, 𝟏}, ∀𝒋 ∈ 𝑵 (40)

Where 𝑣𝑔𝑙𝑐−𝑢𝑝𝑡𝑎𝑘𝑒 is glucose basic uptake,

𝑣𝑎𝑡𝑝−𝑚𝑎𝑖𝑛 is the maintenance requirement of ATP not

associated with growth, 𝑣𝑏𝑖𝑜𝑚𝑎𝑠𝑠𝑜𝑏𝑗

is a minimal level of

biomass production, 𝑣𝑝𝑡𝑠 is glucose uptake through the

phosphotransferase system, 𝑣𝑔𝑙𝑘 is glucose uptake

through the glucokinase system, 𝑦 is a binary variable (1

if the flux of the reaction is active or 0 otherwise), 𝜇 is

the transport reaction set (reversible (𝑣𝑟𝑒𝑣) and

irreversible (𝑣𝑖𝑟𝑟𝑒𝑣) of metabolites only secreted (𝑣𝑠𝑒𝑐𝑟),

𝑁 is the metabolites set, 𝐾 is the number of knockouts

allowed, 𝛾 is a dual variable associated with the

stoichiometric constraints, 𝑔𝑙𝑐 is a dual variable dual

associate with the glucose uptake, 𝑢 is a dual variable

associated with any other constraints in its

corresponding flux.

Possibles added-value compounds from

naringenin. Because naringenin is a product with

several commercial presentations as a nutritional

supplement, it is necessary to investigate novel products

for the pharmaceutical and food industry. Based on this,

two criteria were used to select those products derived

from naringenin 1. The chemical compound must be

beneficial for human health and 2. This, in turn, may not

have commercial products available.

Obtaining apiforol as a novel product and

measuring its antioxidant capacity. Apiforol is a

derivative of naringenin, which has been reported with

antidiabetic properties14. In order to obtain apiforol from

naringenin, a chemical synthesis was performed.

Naringenin (20 mg) was dissolved in 400 µl of 100%

ethanol and reduced with sodium borohydride (10 mg).

After 45 minutes, the reduction product was acidified

with 10% (v/v) acetic acid (200 µl) in ethanol (1800 µl)

to turn off the borohydride and reduce the pH of the

solution. Later, this was evaporated at a temperature of

35 ° C and the residue containing apiforol was stored at

room temperature in the dark. The presence of flavan-4-

ol was confirmed by the formation of a red pigment in

the presence of 30% (v/v) hydrochloric acid in ethanol31.

With the aim to check if indeed apiforol had health

benefices an antioxidant capacity assay was carried out.

Antioxidant capacity of apiforol was measured through

FRAP (ferric reducing/antioxidant power) method,

which is based on an increase in the absorbance at 593

nm due to formation of TPTZ-Fe(II)32. This method

evaluates the antioxidant capacity of a sample according

to its ability to reduce ferric iron Fe(III) present in a

complex with 2,4,6-tri (2-pyridyl) -s-triazine (TPTZ) up

to the ferrous form Fe(II)32. The reaction mixture FRAP

was prepared by mixing 25 mL of 0.3 mol/L sodium

acetate buffer solution (pH 3.6), 2.5 mL of 10 mmol/L

TPTZ and 2.5 mL of 20 mmol/L FeCl3. Reaction mixture

was incubated at 37°C for 30 min. Different

concentrations of apiforol (0.1, 0.2, 0.3 mg/mL) were

mixed with reaction mixture, as follows: 3 ml prepared

FRAP reagent, 100 µL of sample was then added, along

with 300 µl H2O. Absorbance readings were taken after

0.5 s and every 15 s thereafter during the monitoring

period at 593 nm. This assay was carried out in the T75

UV-VIS Spectrophotometer

2. Results

On the consulted literature, 71 species of

organisms were found (See Table 1¡Error! No se

encuentra el origen de la referencia.), 15 (21.1%)

yeasts (Y), 6 (8.5%) acidic acid bacteria (AAB), 22

(30.9%) lactic acid bacteria (LAB) and 28 (39.5%)

organisms not classified in the three previous categories

and in which the Enterobacteriaceae are found, such as

Erwinia and Klebsiella2. Also, families of infectious

viruses for the bacteria Myoviridae and Siphoviridae are

reported33. From these organisms, only 44 organisms

were found with metabolic models in specialized

databases such as Metacyc and Patric (information

associated with EC Number see Table 1. On average, the

species have about 1066 reactions and 1164 metabolites.

OptStoic formulation of the overall reaction.

OptStoic was run with the information from the KEGG

database, a total of 10,195 reactions and 18,120

metabolites were obtained. The Gibbs free energy

change (ΔGf) and the net charge of each metabolite were

extracted from the Metacyc and chEBI databases.

Because OptStoic generates the stoichiometric

coefficients that satisfy the optimum stoichiometry for

the general reaction, two global reactions were proposed

from two flavonoid precursors, with the aim of finding

which of these reactions satisfies the thermodynamic

constraints while this is feasible for production of the

target product. To produce the naringenin flavonoid

precursor, two reactants were chosen, the amino acids L-

phenylalanine and L-tyrosine.

Naringenin production from L-phenylalanine.

𝑳_𝒑𝒉𝒆𝒏𝒚𝒍𝒂𝒍𝒂𝒏𝒊𝒏𝒆 → 𝑵𝒂𝒓𝒊𝒏𝒈𝒆𝒏𝒊𝒏 (41)

Naringenin production from L-tyrosine.

𝑳_𝒕𝒚𝒓𝒐𝒔𝒊𝒏𝒆 → 𝑵𝒂𝒓𝒊𝒏𝒈𝒆𝒏𝒊𝒏 (42)

To make these reactions chemically viable, the

compounds that make up the LB growth medium were

added to the algorithm (See Table 2).

6

Table 1. Microorganisms reported in the literature for the fermentation of cocoa seeds in samples from Cameroon, Ghana, Cote d'Ivoire, Brazil, Malaysia, India, Mexico, and Cuba.

Organisms Metabolic model Source Ref. Organisms Metabolic model Source Ref.

Yeasts (Y) Pediococcus acidilactici Pediococcus acidilactici MA18/5M Metacyc 40

Hanseniaspora uvarum --- 35, 36, 34, 37 Lactobacillus pentosus Lactobacillus pentosus KCA1 Metacyc 38

Hanseniaspora opuntiae --- 34, 38, 39 Leuconostoc

pseudomesenteroides Leuconostoc pseudomesenteroides 4882 Metacyc 38

Hanseniaspora guilliermondii --- 35, 36 Leuconostoc mesenteroides Leuconostoc mesenteroides cremoris ATCC

8293 Metacyc 34

Saccharomyces cerevisiae Saccharomyces cerevisiae S288c Metacyc 40, 38, 41, 36, 37, 34, 42, 39 Lactococcus lactis Lactococcus lactis lactis IL1403 Metacyc 34

Saccharomycodes ludwigii --- 43 Enterococcus camelliae --- 2

Candida magnolia --- 40 Weissella paramesenteroides Weissella paramesenteroides ATCC 33313 Metacyc 38, 2, 46, 47, 48

Candida carpophila --- 39 Enterococcus casseliflavus Enterococcus casseliflavus ATCC 12755 Metacyc 2, 47 Candida orthopsilosis --- 39 Others

kudriavzevii (Issatchenkia orientalis) --- 39, 36 Zymomonas mobilis Zymomonas mobilis mobilis ATCC 10988 Metacyc 34

Pichia manshurica --- 39 Staphylococcus saprophyticus Staphylococcus saprophyticus saprophyticus

ATCC 15305 Metacyc 37, 2, 42

Meyerozyma caribbica --- 39 Paenibacillus lautus --- 40

Yamadazyma mexicana --- 39 Paenibacillus illinoisensis --- 40

Pichia kluyveri --- 36, 37 Paenibacillus chibensis --- 40

Kodamaea ohmeri --- 39 Oenococcus oeni Oenococcus oeni ATCC BAA-1163 Metacyc 34

Lachancea thermotolerans --- 34, 43 Lysinobacillus boronitolerans --- 2

Acetic acid bacteria (AAB) Lysinibacillus sphaericus Lysinibacillus sphaericus C3-41 Metacyc 40

Acetobacter pasteurianus Acetobacter pasteurianus 386B Metacyc 34, 35, 38, 2, 40, 44 Lysinibacillus fusiformis Lysinibacillus fusiformis Metacyc 40

Acetobacter syzygii --- 2 Klebsiella pneumoniae Klebsiella pneumoniae KCTC 2242 Metacyc 2, 49

Acetobacter senegalensis Acetobacter senegalensis 108B Metacyc 38, 2, 44 Erwinia tasmaniensis Erwinia tasmaniensis Et1/99 Metacyc 34

Acetobacter sicerae --- 2 Corynebacterium variabile Corynebacterium variabile DSM 44702 Metacyc 2

Acetobacter tropicalis Acetobacter tropicalis NBRC 101654 Metacyc 37, 2, 44, 42 Bacillus subtilis Bacillus subtilis168 Metacyc 37, 34, 42, 50, 51

Gluconobacter Oxidans --- 40 Bacillus safensis --- 2

Lactic acid bacteria (LAB) Bacillus altitudinis --- 40

Lactobacillus fermentum Lactobacillus fermentum ATCC 14931 Metacyc 34, 36, 37, 38, 2, 40, 45 Arthrobacter solii --- 2

Lactobacillus brevis Lactobacillus brevis ATCC 367 Metacyc 34 Acinetobacter radioresistens Acinetobacter radioresistens SK82 Metacyc 2

Lactobacillus rhamnosus Lactobacillus rhamnosus ATCC 21052 Metacyc 36, 34, 35 Acinetobacter guillouiae Acinetobacter guillouiae CIP 63.46 Patric 2

Lactobacillus plantarum

Lactobacillus plantarum plantarum

ATCC 14917 = JCM 1149 = CGMCC

1.2437

Metacyc 34, 35, 37, 38, 2, 40, 42, 45, 41 Fructobacillus pseudoficulneus Fructobacillus pseudoficulneus DSM 15468 Metacyc 34

Lactobacillus plantarum /

paraplantarum Lactobacillus paraplantarum L-ZS9 Metacyc 2 Bacillus pumilus18 Bacillus pumilus ATCC 7061 Metacyc 34, 52

Lactobacillus casei Lactobacillus casei LOCK919 Metacyc 36, 34, 35 Bacillus pseudomycoides Bacillus pseudomycoides DSM 12442 Patric 40

Lactobacillus nagelii Lactobacillus nagelii DSM 13675 Metacyc 2 Bacillus patagoniensis --- 40

Lactobacillus vaginalis Lactobacillus vaginalis DSM 5837 =

ATCC 49540 Metacyc 2 Bacillus mojavensis Bacillus mojavensis RO-H-1 = KCTC 3706 Metacyc 40

Lactobacillus ghanensis Lactobacillus ghanensis DSM 18630 Metacyc 2 Bacillus megaterium Bacillus megaterium QM B1551 Patric 37, 34, 42, 51

Lactobacillus coryniformis Lactobacillus coryniformis coryniformis

KCTC 3167 = DSM 20001 Metacyc 40 Bacillus circulans Bacillus circulans NBRC 13626 Patric 2

Lactobacillus curvatus --- 40 Bacillus cereus Bacillus cereus ATCC 10876 Metacyc 34, 50

Lactobacillus mali Lactobacillus mali KCTC 3596 = DSM

20444 Metacyc 40 Bacillus clausii Bacillus clausii KSM-K16 Metacyc 2

Lactobacillus sakei Lactobacillus sakei sakei 23K Metacyc 40 Bacillus amyloliquefaciens Bacillus amyloliquefaciens DSM 7 Metacyc 51

Lactobacillus hilgardii Lactobacillus hilgardii DSM 20176 =

ATCC 8290 Metacyc 40

7

Identify optimal overall stoichiometry. The stoichiometric coefficients obtained for the reactions are the

following:

Naringenin production from L-tyrosine:

𝟏𝟐 𝒕𝒚𝒓𝒐𝒔𝒊𝒏𝒆 + 𝟏𝟒 𝑶𝟐 → 𝟓 𝒏𝒂𝒓𝒊𝒏𝒈𝒆𝒏𝒊𝒏𝒆 + 𝟑 𝒂𝒄𝒆𝒕𝒂𝒕𝒆 + 𝟑 𝒂𝒓𝒈𝒊𝒏𝒊𝒏𝒆 + 𝟗 𝑯𝟐𝑶 + 𝟗 𝑪𝑶𝟐 (43)

𝟏𝟐 𝑪𝟗𝑯𝟏𝟏𝑯𝑶𝟑 + 𝟏𝟒 𝑶𝟐 → 𝟓 𝑪𝟏𝟓𝑯𝟏𝟐𝑶𝟓 + 𝟑 𝑪𝟐𝑯𝟒𝑶𝟐 + 𝟑 𝑪𝟔𝑯𝟏𝟒𝑵𝟒𝑶𝟐 + 𝟗 𝑯𝟐𝑶 + 𝟗 𝑪𝑶𝟐 (44)

Naringenin production from L-phenylalanine:

𝟏𝟐 𝒑𝒉𝒆𝒏𝒚𝒍𝒂𝒍𝒂𝒏𝒊𝒏𝒆 + 𝟗 𝑶𝟐 + 𝟔 𝑪𝑶𝟐 → 𝟔 𝒏𝒂𝒓𝒊𝒏𝒈𝒆𝒏𝒊𝒏𝒆 + 𝟏𝟐 𝒈𝒍𝒚𝒄𝒊𝒏𝒆 (45)

𝟏𝟐 𝑪𝟗𝑯𝟏𝟏𝑵𝑶𝟐 + 𝟗 𝑶𝟐 + 𝟔 𝑪𝑶𝟐 → 𝟔 𝑪𝟏𝟓𝑯𝟏𝟐𝑶𝟓 + 𝟏𝟐 𝑪𝟐𝑯𝟓𝑵𝑶𝟐 (46)

These results were obtained for the input with

total KEGG information, while for the input with

information from organisms of the fermentation process

only the production of naringenin from tyrosine was

obtained. From the above equations, it can be observed

that in the production of naringenin from phenylalanine

and tyrosine, is necessary the participation of arginine

and glycine, respectively. Allowing the stoichiometric

balance of these reactions. Also, the production of

naringenin from tyrosine had an in silico yield of 41%,

which means that to produce 1 molecule of naringenin

2.4 molecules of tyrosine are necessary, and a ∆𝑟𝐺 of -

113.69 kcal. The production of naringenin from

phenylalanine had an in silico yield of 50%, which

means that to produce 1 molecule of naringenin 2

molecules of phenylalanine are necessary, and a ∆𝑟𝐺 of

-50.4 kcal.

Two pathways to naringenin produce from

phenylalanine and tyrosine were predicted. In this

step, the metabolites glycans, polysaccharides composed

of monosaccharides of a single type34, were not taken

into account, so the number of reactions was reduced to

10,195 from 10,668 for the KEGG database and for the

organisms database the reduction was 47,959 reactions

to 39,990. Figure 1 and Figure 2 shows the metabolic

pathways for the production of naringenin from

phenylalanine and tyrosine obtained based on the

stoichiometry predicted before, which have been

reported in previous studies35,36.

A solution for the overall reaction phenylalanine-

naringenin was found in the input with information from

organisms of the fermentation process, where the

OstStoic algorithm generated a five-step path (reactions:

R00697, R02255, R08815, R01613, and R02446, see

Table 3), completely complying with the stoichiometry

of the reaction.

Table 2. LB metabolites composition

Chemical formula22

Name22,25 ΔGf (kcal/mol)22 Charge23

O2 Oxygen 3.9197 0 CO2 Carbon dioxide -92.26 0 H2O Water -56.687 0 NH3 Ammonium -18.97 1 C9H11NO2 L-phenylalanine -51.05 0 C9H11NO3 L-tyrosine -90.53 0 C15H12O5 Naringenin -113.81 0 C2H3O2 Acetate -88.29 -1 C3H7NO2 L-alanine -87.92 0 C6H12N2O4S2 Cystine -159.61 0 C6H9N3O2 Histidine -45.63 0 C6H15N2O2 Lysine -84.04 1 C5H8NO2 L-Proline -62.54 -1 C11H12N2O2 L-Tryptophan -25 0 C6H15N4O2 Arginine -73.77 1 C5H8NO4 L-Glutamate -164.13 -1 C6H13NO2 L-Isoleucine -84.87 0 C5H11NSO2 L-Methionine -75.91 0 C3H7NO3 L-Serine -124.15 0 C4H6NO4 L-Aspartate -165.75 -1 C2H5NO2 Glycine -87.73 0 C3H7NO2 L-Leucine -84.87 0 C4H9NO3 L-Threonine -124.34 0 C5H11NO2 L-Valine -86.49 0 C3H7NSO2 Thiamine 52.18 1 C17H20N4O6 Riboflavin -110.72 0 C9H16NO5 Pantothenate -188.54 -1 C8H11NO3 Pyridoxine -67.43 0 C10H15N2SO3 Biotin -109.02 -1 C19H21N7O6 Tetrahydrofolate -113.97 -2 C6H4NO2 Niacin -36.89 -1

8

For the overall reaction of tyrosine-naringenin,

no solution was obtained from this input, because no

fermentation organism has a reaction capable of turning

tyrosine into p-coumaric acid. From the input of

information of the KEGG database, a solution was found

for each overall reaction. For the phenylalanine-

naringenin reaction, the same five-step pathway was

generated as for the fermentation database, while for the

tyrosine-naringenin reaction, a four-step path composed

of the reactions was generated (R00737, R01616,

R01613, and R02446, see Table 3).

Associated genes and enzymes to the pathways

predicted. In the Table 3 eight reactions belong to the

two predicted pathways are observed. Enzymes and

genes associated are also showed. For the selection of

genes associated with the catalytic enzymes of reactions

belonging to the predicted pathways, the following

criteria were used:

• The gene has been expressed in E. coli.

• The reported Michaelis Menten km constant

must be small, with an order of

magnitude 𝑥10−2.

• Gene must have mutations that can contribute

to the production of naringenin.

Figure 1. Metabolic pathway L-phenylalanine to naringenin. This pathway is composed of reactions PHETA1

(Phenylalanine transaminase), PAL/TAL1 (phenylalanine ammonia-lyase), CNL (p-coumaroyl CoA ligase), CINCOA

(cinnamic acid 4-hydroxylase), CHS (naringenin chalcone synthase) and CHI (chalcone isomerase). The circles and arrows

represent metabolites and reactions respectively. The names associated with the abbreviations used in this figure can be

found in supplementary material 3.

Figure 2. Metabolic pathway L-tyrosine to naringenin. This pathway is composed of reactions TYRTA (Tyrosine

transaminase), PAL/TAL2 (phenylalanine/tyrosine ammonia-lyase), 4CL (4-coumarate—CoA ligase), CHS (naringenin

chalcone synthase) and CHI (chalcone isomerase). The circles and arrows represent metabolites and reactions respectively.

The names associated with the abbreviations used in this figure can be found in supplementary material 3.

The first parameter is the most important because

it guarantees that the chosen gene can be expressed in E.

coli. The second one is related to the reaction rate and

indicates the affinity that the enzyme has for the

substrate and the last parameter is relevant when genes

with mutations that can have a positive impact on

naringenin biosynthesis are included (See Table 3).

Once the genes associated to the enzymes of each

reaction were found, the tyrosine-naringenin pathway

was chosen, this being the shortest, four steps to produce

the flavonoid, which facilitates the heterologous

expression of the genes in a bio-factory such as E. coli

(Table 4).

FBA and OptKnock. After the selection of

genes and enzymes, an FBA was carried out for the

pathway tyrosine-naringenin, using a model for E. coli

ATCC 8739 to verify the viability of the pathways in this

organism. FBA was performed under aerobic conditions

and with a glucose uptake of -10 mmol*gDW-1 *h-1. The

objective function "biomass" of the model obtained an

optimum value of 0.8935 mmol*gDW-1 *h-1 when there

is no production of naringenin (without the introduction

of the pathway).

9

Table 3. Reactions, enzymes, and genes belonging to the predicted pathways.

ID-RXN EC Number Enzyme Name Gene Organism Km Present in the fermentation

R00697 4.3.1.24 4.3.1.25

phenylalanine ammonia-lyase (PAL) phenylalanine/tyrosine ammonia-lyase (PAL/TAL)

BoPAL4 Bambusa oldhammi

0.09737 No

R02255 6.2.1.12 4-coumarate---CoA ligase (4CL)

Os4CL1 Oryza sativa 0.009438 No

R08815 1.14.14.91 cinnamate 4-hydroxylase39 ------------ ------------- ------------- No

R01616 6.2.1.12 4-coumarate---CoA ligase (4CL)

Q274H-F269L Solanum

lycopersicum 0.03640 No

R01613 2.3.1.74 naringenin-chalcona sintase CHS Hypericum

androsaemum 0.004941

Lysinibacillus sphaericus

C3- 41, Bacillus

subtilis168, Bacillus

pumilus ATCC 7061 and

Bacillus clausii KSM-K16.

R02446 5.5.1.6 chalcona isomerase CHI Pueraria

montana var. Lobata42

N/A No

R00737 4.3.1.23 4.3.1.25

tyrosine ammonia-lyase (TYR) phenylalanine/tyrosine ammonia-lyase (PAL/TAL)

TAL, BoPAL4

Rhodobacter capsulatus, Bambusa oldhammi

0.01643, 0.09737

No

The four reactions that form the pathway (Figure

2) were added and a new FBA was executed (R00697,

R01616, R01613, and R02446, see Table 3). In this new

analysis, the biomass growth obtained an optimum value

of 0.893 mmol*gDW-1*h-1 and a naringenin flux of

0.00026 mmol*gDW-1*h-1, which represents around

0.029% with respect to biomass growth. As seen in

Figure 3 as the biomass varies, the naringenin

production also varies, showing a production inversely

proportional to the biomass, directly affecting the growth

of the organism.

In order to carry out the production of naringenin,

it is necessary to take into account its precursor

metabolites. These metabolites are E4P: Erythrose 4-

phosphate associated with the pentose phosphate

pathway, PEP: phosphoenolpyruvate associated with

glycolysis and gluconeogenesis and the production of

acetyl coa. Malonyl coa is derived from acetyl coa. E4P

and PEP are precursors for the generation of the

shikimate pathway, essential for the formation of p-

Coumaroyl-CoA that joined with three molecules of

malonyl coa to react and produce naringenin chalcone,

four molecules of CoA and three molecules of CO2, a

reaction catalyzed by the enzyme chalcone synthase

(CHS). Naringenin chalcone is the direct precursor of

naringenin, which is transformed through the enzyme

chalcone isomerase (CHI)15. The pathways involved in

the production of naringenin are very important because

these are the ones that redirect their fluxes in order to

make this production feasible, decreasing or increasing

the flux in other pathways and reactions of the organism,

thus changing its phenotype.

To optimize the production of naringenin it was

necessary to force the strain to adapt the yield of this

product to a cellular objective such as the biomass,

establishing a relationship between the reactions

EX_naringenin (hypothetical exchange reaction) and

biomass44.

Once the production of naringenin was obtained

in the model, an optKnock analysis was carried out, with

the objective of maximizing this production while

maximizing the biomass of the model. The secretion of

a product can be coupled to the growth of biomass if its

precursor provides an essential component of the

biomass (as is the case of tyrosine) and if pathways that

metabolize the desired product are eliminated. Optknock

finds genes that can be knockout, thus forcing the strain

to secrete the product in optimal growth.

For coupling the following equation was used:

𝒗𝒃𝒊𝒐𝒎𝒂𝒔𝒔 = 𝑿 𝒗𝑬𝑿_𝒏𝒂𝒓[𝒆] (47)

Where 𝑣𝐸𝑋_𝑛𝑎𝑟[𝑒] is the flow of the

naringenin exchange reaction, 𝑣𝑏𝑖𝑜𝑚𝑎𝑠𝑠 is the flow of

the biomass reaction and 𝑋 is a proportional constant.

This equation means that the flux of the

naringenin exchange reaction is X times the biomass

10

flux. For the run of optKnock, a group of 96 candidate

reactions to be eliminated, participants in the central

metabolism, the shikimate pathway and the synthesis

and degradation of tyrosine were chosen.

Table 4. Genes selected to produce naringenin from

tyrosine

Gene Name Gene Organism Mutation Ref.

PAL/TAL BoPAL4 Bambusa oldhammi F133H 37

4CL Q274H-F269L

Solanum lycopersicum

Q274H 40

CHS CHS Hypericum

androsaemum L263M-F265Y

41

CHI CHI Pueraria montana

var. Lobata C119A 42

Just after performing the coupling and in where

the value of X was found through OptKnock iterations

where the value that obtained the highest fluxes for

biomass and naringenin production was chosen. Which

was X=1/7. With this result, a new FBA was carried out,

where a flux of 0.1231 mmol*gDW-1*h-1 of naringenin

was obtained by optimizing the biomass flux (0.8616

mmol*gDW-1*h-1).Increasing the flux of naringenin,

but decreasing the flux of biomass compared to the

previous results. In consequence, the pathways involved

in the naringenin production increase their fluxes in the

same way that Ex_nar[e] reaction flux. Also, 61

reactions were affected after performing the coupling in

the FBA, which mostly are transport reactions (37%) and

others belong to tyrosine, tryptophan and phenylalanine

metabolism (15%), glycerophospholipid metabolism

(15%), exchange reactions (15%), prosthetic group and

alternative carbon (12%) and cofactors metabolism

(10%).

Other effects related to coupling were the

generation of extreme fluxes due to possible

Thermodynamically Infeasible Cycles (TIC) and

activation and deactivation of the reaction. ICTs lead to

non-zero cycles without communication with the system

and the external environment, generating errors in

solving the optimization problem involved in the FBA45.

As such, there cannot be a net flow around a closed cycle

in a steady-state network46. The extreme fluxes are

directly related mainly to transport reactions, while the

functions performed by the reactions that were

deactivated are carried out by means of alternative

reactions or pathways with the aim of seeking balance

within the cell, thus manifesting a different phenotype

than that obtained before coupling. The reactions

involved in the exchange and transport of h+, Fe3 +, in

oxidative phosphorylation in the periplasm and in the

citric acid cycle were activated or deactivated due to

coupling. In addition, other reactions such as the

exchange and transport of Fe2 +, CO2 and H2O and the

formation of fumarate from succinate were affected by

the coupling.

Once the proportion between biomass and

naringenin production fluxes was fixed in the model, the

Optknock analysis was executed. This analysis

generated three sets of reactions which, when knocked

out, can increase the biomass while increasing the

production of naringenin. Supplementary material 2.

The first set consists of the reactions CITL (Citrate lyase

EC-4.1.3.6), MALS (Malate synthase EC-2.3.3.9) and

PPS (Phosphoenolpyruvate synthase EC-2.7.9.2), the

first reaction belongs to the citric acid cycle, the second

is an anaplerotic reaction, and the latter is part of

glycolysis and glycogenesis, this set obtained a flux of

0.1267 mmol*gDW-1*h-1 of naringenin and biomass flux

of 0.887 mmol*gDW-1*h-1. The second set is composed

of the PFK reactions (Phosphofructokinase EC-2.7.1.11)

that are part of the glycolysis and glycogenesis, MALS

and PPS, this set obtained a flux of 0.1258 mmol*gDW-

1*h-of naringenin and biomass flux of 0.8809

mmol*gDW-1*h-. The last set contains the CITL and

MALS reactions, this set obtained a flux of 0.1267

mmol*gDW-1*h-1 of naringenin and biomass flux of

0.887 mmol*gDW-1*h-1. Supplementary material 1. The

fluxes for the production of biomass and naringenin

increased in comparison to the fluxes of these before

optimization for the three sets of reactions given by

Optknock.

Figure 3. Naringenin flux versus biomass

growth. This graph was obtained using an iterative FBA,

varying the stoichiometric coefficients of biomass

growth (variations of 0.05 mmol*gDW-1*h-1) to observe

the behavior of the naringenin flux.

Chemical production of apiforol and

antioxidant capacity assay. From naringenin

compound, a search was made for products that had a

beneficial effect on human health and, in turn, did not

11

have a commercial form. Due to this and after reviewing

the reactions downstream from naringenin in the

metabolism of the flavonoids, 16 EC numbers, 40

reactions and 31 metabolites associated with these were

found. From the review of scientific literature and

specialized databases, two metabolites were found that

met the first criteria, see methods, the first was the

metabolite 8-C-Glucosylnaringenin (C16492) and the

second was Apiforol (C12124).

Apiforol was selected to perform the chemical

synthesis since it met both parameters since currently no

commercial products have been reported from this, in

addition, its antidiabetic properties have been reported

16, while for 8-C-glucosylnaringenin it has not been

reported. They found health benefits. An additional

advantage of apiforol is that it is a direct derivative of

naringenin, through the action of the enzyme Flavanone

4-reductase EC: 1.1.1.234. Apiforol is a 3-

deoxyflavonoids just like luteophorol, known as flavan-

4oles47,48. Flavan-4oles are derived from alcohols and a

family of flavonoids and these compounds are colorless

that polymerize to form red flavophens49. The latter is

red substances soluble in alcohols and insoluble in water,

which can be extracted from the plant or result from the

treatment of tannin extracts with mineral acids such as

hydrochloric acid 50.

The reduction of naringenin was carried out using

the compound sodium borohydride and a metabolic

solution of naringenin. 10 mg of apiforol were obtained

from the reaction, this being a yellow compound.

Subsequently, apiforol were diluted in 5 ml of reagent

grade ethanol and stored at -80°C in dark conditions.

From this solution an aliquot of 50 µl was taken and

mixed with 300 µl of hydrochloric acid and 1 ml of

ethanol, immediately passing from a light-yellow color

to a strong red. Thus, confirming the presence of flavan-

4-ol apiforol.

After obtaining the apiforol, its antioxidant capacity

was measured by the FRAP method. The dose–

response characteristics of apiforol showing that

antioxidant activity is concentration-dependent in the

three concentrations. However, the measurements of

the 0.1 mg / ml curve show very high standard

deviations, which may be due to errors in these.

Figure 4 shows the rate of increase in absorbance

at 593 nm for 100 mmol/L solutions of apiforol. This

reaction does not reach a final point within the

monitoring period and a constant increase in absorbance

is observed throughout the time interval from the

sample/reagent mixture. To observe the variation of the

absorbance for each concentration with respect to the

calibration curves (FeSO4). Supplementary material 2.

The FRAP value for the time interval t was

obtained by the following equation51:

𝑭𝑹𝑨𝑷𝒕(𝑴) = (∆𝒂𝒕𝑭𝑰

∆𝒂𝒕𝑭𝒆𝟐+) ∗ 𝟏𝟎−𝟓 (48)

Figure 4. FRAP reaction kinetics. Rate of increase in absorbance at 593 nm for 100 mmol/L solutions of apiforol

compound.

12

Where:

∆𝑎𝑡𝐹𝐼: Change in absorbance after the time interval t, in

relation to the apiforol tested at a defined concentration.

∆𝑎𝑡𝐹𝑒2+: Change of absorbance in the same time

interval, for ferrous sulfate at the same concentration.

Figure 5. Total antioxidant capacity of apiforol by FRAP

method for concentrations 0.1, 0.2 and 0.3 mg/ml to 8

min.

The FRAP values for apiforol in this study were

4x10-5 M (40 μmol/L), 8x10-5 M (80 μmol/L) and

4.5x10-4 M (450 μmol/L) for concentrations of 401

μmol/L, 693 μmol/L and 981 μmol/L of apiforol and

FeSO4 respectively. These results were for the time

interval of the first four-minutes. For the total study time

(8 minutes), the FRAP values obtained were 8x10-5 M

(80 μmol/L), 1.2x1045 M (120 μmol / L) and 6x10-4 M

(600 μmol/L) for the concentrations of 401 μmol/L, 693

μmol/L and 981 μmol/L of apiforol (see Figure 5).

3. Discussion

Table 3 shows that the only enzyme found in the

organisms of the process of fermentation of cocoa seeds

is the naringenin-chalcone synthase EC: 2.3.1.74,

present in the organisms Lysinibacillus sphaericus C3-

41, Bacillus subtilis168, Bacillus pumilus ATCC 7061

and Bacillus clausii KSM-K16. These organisms are not

classified in the three groups of organisms most relevant

to the process (Y, ALB, and AAB), however, several

Bacillus strains are important in the first days of

fermentation since they have been related to the

degradation of the pectin present in the mucilage of

cocoa seeds52.

Also, could be observed that almost all the

enzymes (Table 3) are found in the genomes of plants

Theobroma cacao and Arabidopsis thaliana, except the

enzymes related to the EC numbers: EC: 4.3.1.25

(PAL/TAL) and 4.3.1.23 (TAL). TAL is found in the

genomes of bacteria such as Ectothiorhodospira

halophila, Actinosynnema mirum, Rhodobacter

sphaeroides, Salinibacter ruber and Cupriavidus

metallidurans, and in yeast as Rhodotorula glutinis.

While, PAL/TAL is present in the genome of plants

Oryza sativa japónica, Oryza brachyantha, and

Brachypodium distachyon among others.

Because the PAL/TAL enzyme has been

previously reported with the particularity of

transforming both tyrosine and phenylalanine, previous

research has shown that the BoPAL4 gene, associated

with this enzyme, could have a double activity for this

enzyme. For this reason, this gene was chosen to be

expressed in E. coli. The rest of the genes were chosen

because of a small km, which means that the enzyme-

substrate complex is stable and the tendency to form

predominates, meaning that there is a high affinity of the

enzyme for the substrate, considering the mutations as

shown in Table 4.

When introducing the chosen metabolic pathway

and carrying out the FBA and optKnock analyzes and

comparing the two phenotypes, with and without

coupling, changes were evidenced in 61 reactions of the

model (of a total of 2875 reactions), which were

activated, deactivated or there was an increase or

decrease in their activity. The sub-systems with the

greatest number of affected reactions are the transport

ones (inside the membrane and through porins), the

alternate metabolism of the carbon, the metabolism of

the glycerophospholipids, and the biosynthesis of the

groups of cofactors and prosthetics, the exchange

reactions and of the metabolism of tyrosine, tryptophan,

and phenylalanine.

From the transport subsystem, the reaction

GLUt4pp (Na+/glutamate symport (periplasm))

transporting Na+ from the periplasm to the cytosol was

deactivated. Several proton transport reactions were also

activated, in which abrupt activity was observed,

probably linked to non-feasible thermodynamic cycles,

in which reactions were observed with fluxes that

oscillated the values of the lower and upper limits of the

reactions (LB = -1000 and UB = 1000) both symport and

antiport pathways. Another reaction that presented

extreme flows was the transport of carnitine between the

cytosol and the periplasm, as well as the consumption of

water and the production of CO2 from and to the

environment, respectively within the network. In

addition, the transport of Fe2 + and Fe3 + ions were

activated through the diffusion pathway with

considerable reaction flows.

In the subsystem of alternative carbon

metabolism, six reactions were deactivated, of which

ACCOAL (Acetate-CoA ligase (ADP-forming)) and

PPCSCT (Propanoyl-CoA: succinate CoA-transferase),

13

involved in the formation of ADP and succinyl-CoA,

had an abrupt drop in their flows. For the

glycerophospholipid metabolism subsystem, three

ACPPAT160 (Acyl-(acyl carrier protein):phosphate

acyltransferase (C16:0)), ACPPAT161 (Acyl-(acyl

carrier protein):phosphate acyltransferase (C16:1)) and

G3PAT181 (Glycerol-3-phosphate acyltransferase (C18

1)) reactions were turn off and three other ACPPAT181

(Acyl-(acyl carrier protein):phosphate acyltransferase

(C18:1)), G3PAT160 (Glycerol-3-phosphate

acyltransferase (C16:0)), and G3PAT161 (Glycerol-3-

phosphate acyltransferase (C16:1)) were activated,

involved with palmitoyl-ACP (n-C16: 0ACP), cis-

hexadeca-9enoyl- [acyl-carrier-protein] (n-C16: 1), cis-

octadeca-11-enoyl- [acyl-carrier protein] (n-C18: 1) in

phosphate transport and h+, activation, and deactivation

indicates that alternate reactions were activated to

transport the metabolites involved.

In the biosynthesis subsystem of the cofactor and

prosthetic groups, the LIPOCT (Lipoyl(octanoyl)

transferase), MECDPDH2 (2C-methyl-D-erythritol 2,4

cyclodiphosphate dehydratase), and NPHS (Naphthoate

synthase) reactions were turned off and the MECDPDH5

(2C-methyl-D-erythritol 2,4 cyclodiphosphate

dehydratase) and OCTNLL (Octanoate non-lipoylated

apo domain ligase) reactions were activated, the

activation and deactivation indicate that alternate

reactions were activated for the production of Octanoate

(protein-bound) and 1-hydroxy-2-methyl-2- (E) -butenyl

4-diphosphate.

The subsystem of exchange reactions involves

the aforementioned, in the case of proton flux, the proton

exchange reaction EX_h_e, was turned off showing that

the need for protons for the metabolism of the organism

is so overwhelmed that there is no export to the outside

of the cell of this cofactor. In the case of the exchange

reactions related to Fe2+ and Fe3+, it is observed that

the consumption of Fe2+ from the medium increased

about seven times and the production of Fe3+ in the

medium increased equally around 7 times. In the

metabolism subsystem of tyrosine, tryptophan and

phenylalanine, it was observed that the fluxes that make

up the shikimate path increased its production flux from

0.3325 to 0.4435 mmol*gDW-1*h-1, which caused an

increase in the flux of the TYRTA reaction, from -0.12

to -0.24 mmol*gDW-1*h-1, which is the direct couple

reaction to the predicted heterologous pathway. The flux

of naringenin production went from 0.00026 to 0.1267

mmol*gDW-1*h-1 when performing the coupling.

From the tests carried out in the optKnock

analysis, we observed that from the ratio 1: 7 (1 mmol-

naringenin*gDW-1*h-1 per 7 mmol*gDW-1*h-1 of

biomass), the relation is satisfactorily coupled, meaning

that the production of naringenin is connected to the

growth of biomass. Of the results thrown by OptKnock,

both the set 1 and the set 2 show a naringenin production

and an equal growth rate, which are greater than those

shown for the set 2. However, the set 1 has seven genes

involved, four for CITL (citE, citD, citX and citF), two

for MALS (aceB, glcB) and one for PPS (ppsA), while

the set 3 has six genes. Also, the set 2 has five genes, two

for PFK (pfkA, pfkB), and two for MALS (aceB, glcB)

and one for PPS (ppsA).

By performing an FBA for each reaction set

thrown by the OptKnock analysis, the following results

were obtained: Set 1. PPS, MALS, and CITL were

knocked out; the knockout of the PPS reaction prevented

the formation of PEP from pyruvate, directing the

pyruvate flow to the formation of actetyl coa. By

knocking out the MALS and CITL reactions, the

additional expense of acetyl coa in the TCA is avoided,

so the flows are concentrated in the production of acetyl

coa, a direct precursor of the malonyl coa molecules

necessary for the formation of the naringenin chalcone.

Set 2. PFK, MALS, and PPS were knocked out; the

knockout of the PFK reaction led to the knockout of the

FBA reaction, redirecting the flow through the F6PA

reaction (Fructose 6-phosphate aldolase) and thus

allowing the formation of the PEP and 4EP

concentrating the fluxes of these metabolites towards the

shikimate pathway, directing them towards the

formation of p-coumaroyl coa, also necessary in the

formation of naringenin chalcone. Set 3. CITL and

MALS were knocked out. From the above it can be

concluded that the three sets sought to improve the

formation of malonyl coa and p-coumaroyl coa, direct

precursors of naringenin, thus improving the formation

of this metabolite. From the above and with the objective

of generating the least number of disturbances within the

network, the most appropriate set to improve naringenin

production was set 3 formed by the MALS and CITL

reactions.

OptKnock results show that naringenin

production increases as the growth rate increases

(Supplementary material 1), since the direct precursor in

this case, of the flavonoid naringenin is the amino acid

tyrosine, which is part of the biomass objective function.

Therefore, by optimizing the growth rate, tyrosine

production will be optimized together, forcing the strain

to improve the secretion of the product of interest30. It

should be considered that the relationship used for the

coupling was estimated through several iterations in

OptKnock since no information was available on the

relationship between the growth rate and the production

of naringenin of any organism, neither modified nor

wild. For this reason, they must carry out studies that

give this information and adjust the model to these

results.

14

From the chemical synthesis of naringenin to

produce apiforol, a yellow substance was generated, as

confirmed by Khalil et al., 201250. In the confirmation of

the presence of apiforol in the product generated from

the synthesis, a substance was obtained between rose and

red, which confirms that flavan 4-ol was obtained.

The FRAP values for apiforol in this study were

4x10-5 M (40 μmol/L), 8x10-5 M (80 μmol/L) and

4.5x10-4 M (450 μmol/L) for concentrations of 401

μmol/L, 693 μmol/L and 981 μmol/L of apiforol and

FeSO4 respectively. These results were for the time

interval of the first four minutes. Previously, it was

reported that for naringenin under similar conditions,

except for the concentration (10 μmol/L), the FRAP

value was 0 umol/L for the 4 minute time interval and

activity was observed at 60 minutes, with a FRAP value

of 0.5 μmol/L, showing a very low activity, reflected in

a very low absorbance change at a wavelength of 595

nm51. The comparison of our results and those shown in

the previous study for naringenin show considerable

variations, since it is observed that apiforol has a higher

antioxidant capacity than naringenin in a shorter time

interval; however, it is suggested to perform the FRAP

test for two compounds under the same conditions and

to reinforce the results of antioxidant capacity with other

methods such as ABTS+, DHHP, among others.

4. Conclusions

It was possible to predict optimal metabolic

pathways to produce naringenin from the metabolic

information of the organisms involved in cocoa seed

fermentation and the KEGG database, using the

OptStoic algorithm, which had already been previously

reported by the researchers 35,36.

One of the cases in which artificial pathways or

de novo synthetic pathways are used is when the product

of interest is generated by a higher organism such as

plants and whose production is not profitable; such is the

case discussed in this document (naringenin production).

Taking this approach, it is formulated to insert the

pathways predicted by the algorithm proposed by the

researchers A. Chowdhury and C. Maranas19, in the

microorganism model E. coli, specifically the pathway

from L-tyrosine, using the proposed heterologous genes

(BoPAL4, Q274H-F269L, CHS, and CHI) (Table 4).

From the FBA, it was found that the model alone

produced a very low naringenin flow, less than 0.1%

when introducing the heterologous pathway, so it was

necessary to make an adjustment to the model, in which

the production was coupled flavonoid the rate of growth

(biomass target function), generating a phenotype able to

produce naringenin with a ratio of 1: 7, forcing the strain

by this coupling. It is suggested that this coupling be

adjusted according to the results by heterologous

expressing the pathway in E. coli, looking for a

relationship between the production of naringenin and

the production of biomass of the organism.

The abrupt variations in the protonation,

phosphate and ferric compounds flow must be

considered since these are fundamental in the catalytic

and stoichiometric processes of the organism.

From the analysis of the results obtained by

optKnock, the best set of reactions to knock out,

depending on the production of naringenin and the

number of genes to be eliminated is the set 3 composed

of the CILT and MALS reactions, which increases the

production flux of naringenin in almost 3%.

It was possible to synthesize the apiforol

compound from naringenin and it was demonstrated by

the FRAP assay, that it has antioxidant activity, which

must be validated by other methods and it is suggested

to perform this same assay under the same conditions for

the naringenin compound.

5. Bibliography

1. Valenzuela, J. F., Fernández, J. A., Restrepo, A.

P. & Aristizábal, R. M. El cultivo de cacao. 12

(2012).

2. Bortolini, C., Patrone, V., Puglisi, E. & Morelli,

L. Detailed analyses of the bacterial

populations in processed cocoa beans of

different geographic origin, subject to varied

fermentation conditions. Int. J. Food Microbiol.

236, 98–106 (2016).

3. Lima, L. J. R. et al. Theobroma cacao L ., “

The Food of the Gods ”: Quality Determinants

of Commercial Cocoa Beans , with Particular

Reference to the Impact of Fermentation

Theobroma cacao L ., “ The Food of the Gods

”: Quality Determinants of Commercial Cocoa

Beans , with Pa. 8398, (2011).

4. Guehi, T. S. et al. Performance of different

fermentation methods and the effect of their

duration on the quality of raw cocoa beans. Int.

J. Food Sci. Technol. 45, 2508–2514 (2010).

5. Kongor, J. E. et al. Factors influencing quality

variation in cocoa (Theobroma cacao) bean

flavour profile - A review. Food Res. Int. 82,

44–52 (2016).

6. Mayorga-Gross, A. L., Quirós-Guerrero, L. M.,

15

Fourny, G. & Vaillant, F. An untargeted

metabolomic assessment of cocoa beans during

fermentation. Food Res. Int. 89, 901–909

(2016).

7. Kalra, E. K. Nutraceutical-definition and

introduction. AAPS PharmSci 5, 27–28 (2003).

8. Biruete, A., Juárez, E., Sieiro, E., Romero, R. &

Silencio, J. Los nutracéuticos. Lo que es

conveniente saber. Rev. Mex. Pediatría 76,

136–145 (2009).

9. Giacometti, J., Muhvić, D., Pavletić, A. &

Dudarić, L. Cocoa polyphenols exhibit

antioxidant, anti-inflammatory,

anticancerogenic, and anti-necrotic activity in

carbon tetrachloride-intoxicated mice. J. Funct.

Foods 23, 177–187 (2016).

10. Goldwasser, J., Cohen, P. Y., Yang, E.,

Balaguer, P. & Yarmush, M. L. Transcriptional

Regulation of Human and Rat Hepatic Lipid

Metabolism by the Grapefruit Flavonoid

Naringenin : Role of PPAR a , PPAR c and

LXR a. 5, 1–9 (2010).

11. Nahmias, Y. et al. Apolipoprotein B-dependent

hepatitis C virus secretion is inhibited by the

grapefruit flavonoid naringenin. Hepatology 47,

1437–1445 (2008).

12. Cavia-saiz, M. et al. Antioxidant properties ,

radical scavenging activity and biomolecule

protection capacity of flavonoid naringenin and

its glycoside naringin : a comparative study.

1238–1244 (2010). doi:10.1002/jsfa.3959

13. Frabasile, S. et al. The citrus flavanone

naringenin impairs dengue virus replication in

human cells. 1–10 (2017).

doi:10.1038/srep41864

14. Gopalan, G. et al. Screening of Musa balbisiana

Colla. seeds for antidiabetic properties and

isolation of apiforol, a potential lead, with

antidiabetic activity. (2018).

15. Marienhagen, J. & Bott, M. Metabolic

engineering of microorganisms for the

synthesis of plant natural products. J.

Biotechnol. 163, 166–178 (2013).

16. Santos, C. N. S., Koffas, M. & Stephanopoulos,

G. Optimization of a heterologous pathway for

the production of flavonoids from glucose.

Metab. Eng. 13, 392–400 (2011).

17. Vieira, G., Carnicer, M., Portais, J. C. & Heux,

S. FindPath: a Matlab solution for in silico

design of synthetic metabolic pathways.

Bioinformatics 30, 2986–2988 (2014).

18. Campodonico, M. A., Andrews, B. A., Asenjo,

J. A., Palsson, B. O. & Feist, A. M. Generation

of an atlas for commodity chemical production

in Escherichia coli and a novel pathway

prediction algorithm, GEM-Path. Metab. Eng.

25, 140–158 (2014).

19. Chowdhury, A. & Maranas, C. D. Designing

overall stoichiometric conversions and

intervening metabolic reactions. Sci. Rep. 5,

16009 (2015).

20. Wattam, A. R. et al. PATRIC, the bacterial

bioinformatics database and analysis resource.

Nucleic Acids Res. 42, D581–D591 (2014).

21. Caspi, R. et al. The MetaCyc database of

metabolic pathways and enzymes and the

BioCyc collection of pathway/genome

databases. Nucleic Acids Res. 44, D471–D480

(2016).

22. Hastings, J. et al. The ChEBI reference

database and ontology for biologically relevant

chemistry: enhancements for 2013. Nucleic

Acids Res. 41, D456 – D463 (2013).

23. Kim, S. et al. PubChem Substance and

Compound databases. Nucleic Acids Res. 44,

D1202-13 (2016).

24. Kanehisa, M. & Goto, S. KEGG: Kyoto

Encyclopedia of Genes and Genomes. Nucleic

Acids Res. 28, 27–30 (2000).

25. Kanehisa, M., Furumichi, M., Tanabe, M.,

Sato, Y. & Morishima, K. KEGG: new

perspectives on genomes, pathways, diseases

and drugs. Nucleic Acids Res. 45, 27899662

(2017).

26. Kanehisa, M., Sato, Y., Kawashima, M.,

Furumichi, M. & Tanabe, M. KEGG as a

reference resource for gene and protein

annotation. Nucleic Acids Res. 44, D457–D462

(2016).

27. King, Z. A. et al. ESCHER. Escher: A web

application for building, sharing, and

16

embedding data-rich visualizations of

biological pathways 11, 1004321 (2015).

28. Savinell, J. M. & Palsson, B. O. Network

analysis of intermediary metabolism using

linear optimization. I. Development of

mathematical formalism. J. Theor. Biol. 154,

421–454 (1992).

29. Zomorrodi, A. R., Suthers, P. F., Ranganathan,

S. & Maranas, C. D. Mathematical optimization

applications in metabolic networks. Metab.

Eng. 14, 672–686 (2012).

30. Burgard, A. P., Pharkya, P. & Maranas, C. D.

OptKnock: A Bilevel Programming Framework

for Identifying Gene Knockout Strategies for

Microbial Strain Optimization. Biotechnol.

Bioeng. 84, 647–657 (2003).

31. Schutt, C. & Netzly, D. Effect of apiforol and

apigeninidin on growth of selected fungi. J.

Chem. Ecol. 17, 2261–2266 (1991).

32. Benzie, I. & Strain, J. the ferric reducing ability

of plasma(FRAP)as a measure of ‘Antioxidan

power’:the FRAP assay analytical

biochemistry. Anal. Biochem. 239, 70–76

(1996).

33. Illeghems, K., de Vuyst, L., Papalexandratou,

Z. & Weckx, S. Phylogenetic analysis of a

spontaneous cocoa bean fermentation

metagenome reveals new insights into its

bacterial and fungal community diversity. PLoS

One 7, (2012).

34. Jirat, B. & Jenkins, A. IUPAC. Compendium of

Chemical Terminology. Blackwell Scientific

Publications (1997).

35. Hwang, E. Il et al. Production of Plant-Speci c

Flavanones by. Appl. Environ. Microbiol. 69,

2699–2706 (2003).

36. Wu, J., Du, G., Zhou, J. & Chen, J. Systems

metabolic engineering of microorganisms to

achieve large-scale production of flavonoid

scaffolds. J. Biotechnol. 188, 72–80 (2014).

37. Hsieh, L. S., Ma, G. J., Yang, C. C. & Lee, P.

D. Cloning, expression, site-directed

mutagenesis and immunolocalization of

phenylalanine ammonia-lyase in Bambusa

oldhamii. Phytochemistry 71, 1999–2009

(2010).

38. Gui, J., Shen, J. & Li, L. Functional

Characterization of Evolutionarily Divergent 4-

Coumarate:Coenzyme A Ligases in Rice. Plant

Physiol. 157, 574–586 (2011).

39. Laboratories, K. KEGG: Kyoto Encyclopedia

of Genes and Genomes. 27, 29–34 (1998).

40. Alberstein, M., Eisenstein, M. & Abeliovich, H.

Removing allosteric feedback inhibition of

tomato 4-coumarate:CoA ligase by directed

evolution. Plant J. 69, 57–69 (2012).

41. Liu, B., Falkenstein-Paul, H., Schmidt, W. &

Beerhues, L. Benzophenone synthase and

chalcone synthase from Hypericum

androsaemum cell cultures: cDNA cloning,

functional expression, and site-directed

mutagenesis of two polyketide synthases. Plant

J. 34, 847–855 (2003).

42. Katsuyama, Y., Funa, N., Miyahisa, I. &

Horinouchi, S. Synthesis of Unnatural

Flavonoids and Stilbenes by Exploiting the

Plant Biosynthetic Pathway in Escherichia coli.

Chem. Biol. 14, 613–621 (2007).

43. Kyndt, J. A., Meyer, T. E., Cusanovich, M. A.

& Van Beeumen, J. J. Characterization of a

bacterial tyrosine ammonia lyase, a

biosynthetic enzyme for the photoactive yellow

protein. FEBS Lett. 512, 240–244 (2002).

44. Vlasov, V. & Pfau, T. Model manipulation.

(2017). doi:10.1016/B978-0-12-409548-

9.10134-4

45. Murabito, E., Simeonidis, E., Smallbone, K. &

Swinton, J. Capturing the essence of a

metabolic network: A flux balance analysis

approach. J. Theor. Biol. 260, 445–452 (2009).

46. Schellenberger, J., Lewis, N. E. & Palsson, B.

Elimination of thermodynamically infeasible

loops in steady-state metabolic models.

Biophys. J. 100, 544–553 (2011).

47. Persson, I. A. L. Tea Flavanols: An Overview.

Tea in Health and Disease Prevention

(Elsevier, 2013). doi:10.1016/B978-0-12-

384937-3.00006-9

48. Quattrocchio, F., Baudry, A., Lepiniec, L. &

Grotewold, E. The regulation of flavonoid

biosynthesis. in The Science of Flavonoids 97–

122 (Springer New York, 2006).

17

doi:10.1007/978-0-387-28822-2_4

49. Styles, E. D. & Ceska, O. Flavonoid pigments

in genetic strains of maize. Phytochemistry 11,

3019–3021 (1972).

50. Khalil, A., Baltenweck-Guyot, R., Ocampo-

Torres, R. & Albrecht, P.

Retrodihydrochalcones in Sorghum species:

Key intermediates in the biosynthesis of 3-

deoxyanthocyanidins? Phytochem. Lett. 5, 174–

176 (2012).

51. Firuzi, O., Lacanna, A., Petrucci, R., Marrosu,

G. & Saso, L. Evaluation of the antioxidant

activity of flavonoids by ‘ferric reducing

antioxidant power’ assay and cyclic

voltammetry. Biochim. Biophys. Acta - Gen.

Subj. 1721, 174–184 (2005).

52. Ouattara, H. G., Reverchon, S., Niamke, S. L.

& Nasser, W. Regulation of the synthesis of

pulp degrading enzymes in Bacillus isolated

from cocoa fermentation. Food Microbiol. 63,

255–262 (2017).

18

Supplementary 1. Reaction sets found with

OptKnock.

Supplementary 2. Antioxidant capacity by apiforol

concentrations compared to the FeSO4 calibration

curve

19

Supplementary 3. Abbreviations used in predicted metabolic pathways

abbreviation Name

akg 2-Oxoglutarate

glu__L L-Glutamate

phe__L L-Phenylalanine

phpyr Phenylpyruvate

nh3 Ammonia

cinnm Trans-Cinnamate

coa Coenzyme A

atp ATP

cinnmcoa Cinnamoyl-CoA

ppi Diphosphate

amp AMP

nadph Nicotinamide adenine dinucleotide phosphate - reduced

o2 O2

h H+

nadph Nicotinamide adenine dinucleotide phosphate - reduced

h2o H2O

coucoa 4-Coumaroyl-CoA

malcoa Malonyl-CoA

narcha naringenin chalcone

co2 CO2

nar naringenin

34hpp 3-(4-Hydroxyphenyl)pyruvate

tyr__L L-Tyrosine

nh4 Ammonium

Th4cinnm Trans 4 Hydroxycinnamate