Endoglucanase là gì

Obtaining bioethanol from cellulosic biomass involves numerous steps, among which the enzymatic conversion of the polymer to individual sugar units has been a main focus of the biotechnology industry. Among the cellulases that break down the polymeric cellulose are endoglucanases that act synergistically for subsequent hydrolytic reactions. The endoglucanases that have garnered relatively more attention are those that can withstand high temperatures, i.e., are thermostable. Although our understanding of thermostability in endoglucanases is incomplete, some molecular features that are responsible for increased thermostability have been recently identified. This review focuses on the investigations of endoglucanases and their implications for biofuel applications.

Introduction

The use of plants as bioreactors is not a new concept. Within the last fifteen years, studies have established that with the knowledge of biotechnology and genetic engineering, plants are indeed a low-cost source to produce stable molecules: plants are harnessed to produce antibodies, biodegradable plastics, recombinant proteins, carbohydrates, and fatty acids. A major goal of plant-based bioreactor technology is the production of stable enzymes to produce industrially useful products. Among these enzymes are manganese-dependent lignin peroxidase [for bleaching of pulp], phytase [for animal feed], [1–3,1-4]-β-glucanase [for brewing], and xylanase [for animal feed, paper and baking], which are used in processing plants such as alfalfa, tobacco, and barley[–]. In addition, as a result of growing environmental concerns of consuming fossil-based fuels, enzymes used in the production of plant-based ethanol [i.e., bioethanol] gained more importance in recent years, including α-amylases and endoglucanases.

α-amylase and endoglucanase are both involved in the conversion of plant material into sugar; however, there is a critical difference when it comes to which part of the plant they catalyze: while α-amylases break down starch from maize grain, which is primarily used as food by humans and animals, endoglucanases break down cell-wall cellulose primarily from maize stover [i.e., leaves and stalk], which has been historically considered as waste. Despite the obvious drawbacks of using a crucial food item as a fuel source, maize grain remains a preferred choice as the biofeedstock for the ethanol production, because of its ease of harvesting in a large acreage and of introducing new traits.

The demand for bioethanol is expected to increase significantly in the near future: according to the Billion Ton Study published in August 2011[], 76 million tons of maize is used for the production of ethanol and yields 14.2 billion gallons of ethanol fuel per year. By 2017, the consumption of biomass for biofuel production is projected to be 103 million tons. Therefore, with the ever-increasing demand for energy sources, there is a strong need to look at other non-grain sources of biomass. While many feedstock sources [e.g., perennials such as switchgrass and Miscanthus, and biomass from forests] may take several years to become fully developed as resources, maize stover is a readily available resource for biofuel production[].

Some efforts have been made to reduce the cost of cellulose breakdown in maize stover to make the process more cost-effective, and therefore more attractive for the bioethanol industry. Traditionally, industrial enzymes are added into a batch of reagents in reactors during chemical processing. Controlling pH and temperature levels become critical, because even small environmental variations can cause enzyme denaturation and subsequent loss of enzymatic activity[]. Cellulose-based bioethanol processing therefore requires several reactors, including one for a high-temperature pre-treatment [~75°C] and another for low-temperature endoglucanase-mediated cellulose conversion into sugar. One strategy currently being explored to improve the efficiency and lower the cost of these conversion steps is to develop feedstocks [such as maize stover] with heterologously expressed thermostable endoglucanases. Such a strategy will allow bypassing the need to add an externally sourced enzyme and possibly reduce number of processing steps[–]. To help in these efforts, we need to understand what factors, including protein sequence, structure, and dynamics, can be used to engineer thermostable endoglucanases.

Thermostability is a complex property that can be controlled by several factors, which may be working additively. Many studies comparing mesostable and thermostable proteins concluded that following properties may increase thermostability in general: selective pressure of certain amino acids[], increase in hydrophobicity[], change in a single amino acid[], increase in compactness[], increase in positively charged amino acids[], and Gibbs free energy change of hydration[]. These studies are usually general in the sense that they compare proteins across protein families, folds, sizes, and cellular location. When it comes to a specific protein family, however, our knowledge is usually limited as to what differentiates thermostable and mesostable proteins in that protein family.

In this review, we will first provide a short description of endoglucanase-mediated cellulose conversion process. Then we will return our attention to the main question of the thermostability-sequence-structure relationship in endoglucanases.

Breaking down cellulose

Although the composition varies between different plants, 40-55% of the plant biomass is comprised of cellulose [a homopolymer made of repeating units of glucose], 25-50% of hemicelluloses [a heteropolymer made of glucose and other sugars][], and the remaining 10-40% is made up of lignin [a complex organic polymer][]. Conversion of cellulose polymers to simple sugars requires the use of cellulases. Cellulase is comprised of three distinct classes of enzymes [endoglucanases, cellobiohydrolases, and β-glucosidases] that act synergistically to break down the polymer. Endoglucanases act by cleaving internal β-glycosidic bonds in the cellulose chain, thereby making chain ends accessible to cellobiohydrolase. The end product cellobiose is further broken down to glucose units by β-glucosidase. Endoglucanases have an enzyme classification number 3.2.1.4 and belong to the broader enzyme group called glycosyl hydrolases, which also includes other cellulases such as exoglucanase and β-glucosidase. According to the CAZy database [//www.cazy.org], endoglucanases are part of 13 distinct glycosyl hydrolase families, distributed in several archeal, bacterial, fungal, and eukaryotic organisms.

Cellulose to biofuel conversion

From the agricultural field to the gas station, biofuel production from cellulosic biomass involves numerous steps that are strategically designed for efficient conversion[]. In one of the earlier steps, pre-processing of the biomass occurs at high temperatures [75°C] coupled with a dilute sulfuric acid treatment[]. This is done to liberate lignin, hemicellulose, and other compounds, and make the cellulosic polymers available for enzymatic degradation. Inefficient liberation of these compounds can lead to the enzyme inhibition, and therefore, reduce the efficiency of cellulose conversion. Currently, the efficiency of ethanol production from lingocellulosic biomass is physically and economically constrained by the pretreatment processes and the subsequent addition of cellulases to affect cellulose breakdown[–].

Industrial cellulases, particularly endoglucanases, are currently obtained from the fermentation of fungal and bacterial sources that are added to the production batch after the pretreatment step. The expression of endoglucanases from these external sources adds to the final cost of the bioethanol product. There is an intense interest in exploiting the potential of thermostable bioprocessing enzymes[, ].

Due to the pH and temperature extremes involved in the biomass-to-biofuel conversion, stable endoglucanases and its synergistic enzymes, such as cellobiohydrolase and β-glucosidase, are sought for the enzymatic conversion of biomass. Thermostable endoglucanases from extremophiles are considered promising because they typically exhibit valuable characteristics in biofuel production including optimal functionality at higher temperatures and the ability to withstand extreme pH changes[].

The strategy of heterologously expressing a thermostable endoglucanase in maize stover bypasses the need to add an externally sourced enzyme[, ]. This way, the enzyme [in the biomass itself] can breakdown cellulose immediately following the pre-processing step. Furthermore, if the stable enzyme is expressed in the host tissue for biomass conversion, there can be substantive gains in production efficiency. This concept was shown to work previously with a non-cellulase enzyme during the breakdown of starch in maize grain: a synthetic chimera of three wild-type α-amylases from the archea Thermococcales that have an optimal growth temperature of above 80°C[]. Introducing endoglucanase into plants reduces the recalcitrance of cellulose[, ], since endoglucanases are capable of making random internal cleavage of the polymer, whose hydrolysis products are used by other enzymes. Transgenic plant expression is ideal for many reasons, including [a] almost unlimited scale-up potential, [b] cheap production – i.e. photosynthesis, [c] correct protein folding, [d] lack of human pathogens, and [e] the potential for direct use[]. Cellulases currently account for approximately $0.68 to $1.47 per gallon of ethanol produced from cellulosic feedstocks[], but this cost component could be reduced 5-fold with in planta expression[–].

A closer look at endoglucanases originating from bacterial sources shows significant diversity in their optimal temperature for enzymatic function. Some of these are optimally functional at elevated temperatures and thus thermostable. In the next section, we are interested in the following questions: What could be the differences between a thermostable and a mesostable endoglucanase? If there are any, can we translate these differences to modify an existing endoglucanase into a more thermostable protein? We will pursue the answers to these questions by reviewing computational analyses of thermostability in endoglucanases.

Thermostability in endoglucanases

Enzymes that can [a] withstand high temperatures, [b] resist unfolding, and [c] perform their optimal activity at higher temperatures are called thermostable or thermophilic. Thermostable enzymes are highly stable at elevated temperatures where mesostable enzymes become denatured and thus lose their optimal activities[, , ].

Generally speaking, thermostability is a desired quality for proteins that have industrial and therapeutic significance[, , –]. Introducing thermostability has been one of the major focuses of protein engineering studies, specifically of computational studies, which can be divided in three broad categories in terms of number of enzymatic families and organisms analyzed: the proteome of a thermophilic organism to the proteome of a mesophilic organism[][], [b] proteins from multiple organisms belonging to a range of different protein families[, , , , –], and [c] a single protein family between thermophilic and mesophilic organisms[, , ].

Such studies have identified various factors imparting thermostability, including sequence-level factors [specific amino acids like Arg and Glu being significantly higher in thermophiles] and structure-level factors [energy of unfolding, number of Van der Waals contacts per residue, number of hydrogen bonds per residue, or number of residues involved in secondary structure]. Other factors such as Gibbs free energy change of hydration, long-range non-bonded energy, and hydrophobicity, have also been mentioned. Although most of the studies attempted to identify “universal” factors imparting thermostability across protein families, at the protein family level, another set of factors [e.g., a different amino acid composition] may determine thermostability[]. This inability to identify common features responsible for thermostability provided evidence for the view that no single rule defines thermostability[].

In the previous studies of thermostable endoglucanases, Panasik et al.[] analyzed the thermostability factors for proteins belonging to GH families that have the identical [α/β]8 fold. The lack of Gly in thermostable glycosyl hydrolase [which includes endoglucanase] was identified as the responsible factor. However, this analysis falls short in the following areas when it comes to its applicability to other endoglucanases: [a] the criteria of selecting the dataset of 29 proteins were based on higher crystallographic resolution rather than on diverse sequence identities, [b] only three endoglucanase structures were studied, and [c] the study does not individually analyze endoglucanases, but as part of a larger group of GH families.

Recently, a directed evolution approach has been used to identify an endoglucanase with higher thermostability from a thermophilic endoglucanase[]. Although directed evolution has the ability to successfully find a mutant with higher stability and similar enzymatic activity, it is not guaranteed to do so. As such, finding a desirable mutant is a matter of trial and error, and does not necessarily explain why a particular mutant is thermostable or not. A promising method is using SCHEMA, where the structure of the protein is also used. This was used in developing thermostable chimeric cellobiohydrolases, of which many exhibited higher stability and optimal activity[, ].

Current computational thermostability studies with respect to Endoglucanases

Protein structures tell a fold-dependent pattern of thermostability

Abundant structural information is present for endoglucanases in three different folds; namely [α/β]8, β-jelly roll, and [α/α]6 fold. Segregating and analyzing the structural data on the basis of these three different fold families has indicated that the type of fold is critical in identifying specific thermostabilizing features for endoglucanases[].

When comparing thermostable and mesostable endoglucanase enzymes as a whole, the amino acids Met and Arg and ionic interactions were significantly more enriched in thermophiles [See Figure ]. However, the enrichment of these amino acids is not statistically significant when considered at the fold-level analysis. In contrast, the solvent accessibility and secondary structure preference [placement in a helix, strand, or loop] showed a fold-dependent preference. Pairwise analysis of structurally similar endoglucanases [i.e. a pair of thermostable and mesostable endoglucanases] from the same fold showed distinct patterns of amino acid substitutions in the mesophile as compared to thermophile: in β-jelly roll fold, the amino acid Arg is replaced by Pro, and hydrophobic amino acids [such as Trp, Tyr, Phe, Iso, Met, Leu] indicating decreased ionic interactions in the mesostable endoglucanases.

Figure 1

Statistically significant amino acids and intramolecular interactions between thermostable and mesostable endoglucanases. They are shown in thermostable endoglucanases as an enzyme group, based on the reaction mechanism, and within each fold, as compared to their mesostable counterparts [p-value 

Chủ Đề