[go: up one dir, main page]

WO2012021883A2 - Nouvelles enzymes fongiques - Google Patents

Nouvelles enzymes fongiques Download PDF

Info

Publication number
WO2012021883A2
WO2012021883A2 PCT/US2011/047734 US2011047734W WO2012021883A2 WO 2012021883 A2 WO2012021883 A2 WO 2012021883A2 US 2011047734 W US2011047734 W US 2011047734W WO 2012021883 A2 WO2012021883 A2 WO 2012021883A2
Authority
WO
WIPO (PCT)
Prior art keywords
seq
acid sequence
protein
nucleic acid
enzyme
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2011/047734
Other languages
English (en)
Other versions
WO2012021883A3 (fr
Inventor
Mark Emalfarb
Johannes Visser
Sandra Hinz
Jan Werij
Jacob Visser
Vivi Joosten
Martijn Koetsier
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dyadic International Inc
Original Assignee
Dyadic International Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dyadic International Inc filed Critical Dyadic International Inc
Publication of WO2012021883A2 publication Critical patent/WO2012021883A2/fr
Publication of WO2012021883A3 publication Critical patent/WO2012021883A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)

Definitions

  • This invention relates to novel enzymes and novel methods for producing the same.
  • this invention relates to enzymes produced by fungi.
  • the invention also relates to a method to convert lignocellulosic biomass or cellulosic substrates to fermentable sugars with enzymes that degrade lignocellulosic, cellulosic, and even more complex plant cell wall material and to novel combinations of enzymes, including those that provide a combined or synergistic release of sugars from plant biomass.
  • the invention also relates to a method to release cellular contents by effecting degradation of the cell walls.
  • the invention also relates to methods of using the novel enzymes and compositions of such enzymes in a variety of other processes, such as washing or treating of clothing or fabrics, detergent processes, animal feed, food, baking, biofuel, starch preparation, liquification, beverage, biorefining, deinking and biobleaching of paper and pulp, oil and waste dispersing, and treatment of waste streams.
  • distillers' dried grains are lignocellulosic byproducts of the corn dry milling process. Milled whole corn kernels are treated with amylases to liquefy the starch within the kernels and hydroryze it to glucose. The glucose so produced is then fermented in a second step to ethanol. The residual solids after the ethanol fermentation and distillation are centrifuged and dried, and the resulting product is DDG, which is used as an animal feed stock.
  • DDG composition can vary, a typical composition for DDG is: about 32% hemicellulose, 22% cellulose, 30% protein, 10% lipids, 4% residual starch, and 4% inorganics.
  • the cellulose and hemicellulose fractions comprising about 54% of the weight of the DDG, can be efficiently hydro lyzed to fermentable sugars by enzymes; however, it has been found that the carbohydrates comprising lignocellulosic materials in DDG are more difficult to digest.
  • the efficiency of hydrolysis of these (hemi) cellulosic polymers by enzymes is much lower than the hydrolytic efficiency of starch, due to the more complex and recalcitrant nature of these substrates. Accordingly, the cost of producing the requisite enzymes is higher than the cost of producing amylases for starch hydrolysis.
  • Major polysaccharides comprising lignocellulosic materials include cellulose and hemicelluloses.
  • the enzymatic hydrolysis of these polysaccharides to soluble sugars (and finally to monomers such as glucose, xylose and other hexoses and pentoses) is catalyzed by several enzymes acting in concert.
  • endo-l ,4-P-glucanases (EGs) and exo-cellobiohydrolases (CBHs) catalyze the hydrolysis of insoluble cellulose to cellooligosachharides (with cellobiose as the main product), while ⁇ - glucosidases (BGLs) convert the oligosaccharides to glucose.
  • EGs endo-l ,4-P-glucanases
  • CBHs exo-cellobiohydrolases
  • BGLs ⁇ - glucosidases
  • xylanases together with other enzymes such as a-L-arabinofuranosidases, ferulic and acetylxylan esterases and ⁇ -xylosidases, catalyze the hydrolysis of hemicelluloses.
  • the hydrohtic efficiency of a multi-enzyme cocktail in the process of lignocellulose saccharification depends on properties of individual enzymes, the synergies between them, and their ratio in the multi-enzyme cocktail.
  • the hydrolytic efficiency of a multi-enzyme complex in the process of lignocellulosic saccharification depends both on properties of the individual enzymes and the ratio of each enzyme within the complex.
  • Enzymes useful for the hydrolysis of complex polysaccharides are also highly useful in a variety of industrial textile applications, as well as industrial paper and pulp applications, and in the treatment of waste streams.
  • methods for treating cellulose- containing fabrics for clothing with hydrolytic enzymes, such as cellulases are known to improve the softness or feel of such fabrics.
  • Cellulases are also used in detergent compositions, either for the purpose of enhancing the cleaning ability of the composition or as a softening agent.
  • Cellulases are also used in combination with polymeric agents in processes for providing a localized variation in the color density of fibers.
  • Such enzymes can also be used for the saccharification of lignocellulosic biomass in waste streams, such as municipal solid waste, for biobleaching of wood pulp, and for deinking of recycled print paper.
  • waste streams such as municipal solid waste
  • biobleaching of wood pulp and for deinking of recycled print paper.
  • hydrolysis of these polysaccharides in lignocellulosic materials for use as feedstocks described above the cost and hydrolytic efficiency of the enzymes are major factors that control the use of enzymes in these processes.
  • Enzymes are also usedful in the food and animal feed industry.
  • esterases can be utilized to degum vegetable oils; improving the production of various food products as well as enhancing the flavor of food products.
  • Esterases can be used in the feed to reduce the amount of phosphate in feed.
  • Carbohydrases can be used to increase the yield of fruit juice and oils; stimulate fermentation in the brewing industry; produce gelling agents; and modify starches, to name a few.
  • Carbohydrases in the feed industry include, but are not limited to, improving feed conversion, reducing the viscosity, and producing oligosaccharides.
  • Filamentous fungi such as Aspergillus sp. and Trichoderma sp. are sources of cellulases and hemicellulases, as well as other enzymes useful in the enzymatic hydrolysis of major polysaccharides.
  • strains of Trichoderma sp. such as T. viride, T. reesei and T. longibrachiatum, and Penicillium sp., and enzymes derived from these strains, have previously been used to hydrolyze crystalline cellulose.
  • the present invention comprises an isolated nucleic acid sequence selected from the group consisting of:
  • a nucleic acid sequence encoding a protein comprising an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:58, SEQ ID NO:60, SEQ ID NO:62, SEQ ID NO:64, SEQ ID NO:66, SEQ ID NO:
  • nucleic acid sequence encoding a fragment of the protein of (a), wherein the fragment has a biological activity of the protein of (a); and c) a nucleic acid sequence encoding an amino acid sequence that is at least about 70% identical to an amino acid sequence of (a) and has a biological activity of the protein comprising the amino acid sequence.
  • the nucleic acid sequence encodes an amino acid sequence that is at least about- 90%, at least about 95%, at least about 97% or at least about 99% identical to the amino acid sequence of (a) and has a biological activity of the protein comprising the amino acid sequence.
  • the nucleic acid sequence encodes a protein comprising an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 1 0, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:58, SEQ ID NO:60, SEQ ID NO:62, SEQ ID NO:64, SEQ ID NO:
  • the nucleic acid sequence comprises a nucleic acid sequence selected from the group consisting of: SEQ ID NO: l , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: l l , SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21 , SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31 , SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41 , SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51 , SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:57, SEQ ID NO:59, SEQ ID NO:61, SEQ ID NO:
  • the present invention comprises nucleic acid sequences that are fully complementary to any of the nucleic acid sequences described above.
  • the present invention comprises an isolated protein comprising an amino acid sequence encoded by any of the nucleic acid molecules described above.
  • the present invention comprises an isolated fusion protein comprising an isolated protein of the present invention fused to a protein comprising an amino acid sequence that is heterologous to the isolated protein.
  • the present invention comprises a kit for degrading a lignocellulosic material to fermentable sugars comprising at least one isolated protein of the present invention.
  • the present invention comprises a detergent comprising at least one isolated protein of the present invention.
  • the present invention comprises a composition for the degradation of a lignocellulosic material comprising at least one isolated protein of the present invention.
  • the present invention comprises a recombinant nucleic acid molecule comprising an isolated nucleic acid molecule of the present invention, operatively linked to at least one expression control sequence.
  • the recombinant nucleic acid molecule comprises an expression vector.
  • the recombinant nucleic acid molecule comprises a targeting vector.
  • the present invention comprises an isolated host cell transfected with a nucleic acid molecule of the present invention.
  • the host cell is a fungus.
  • the host cell is a filamentous fungus.
  • the filamentous fungus is from a genus selected from the group consisting of: Chrysosporium, Thielavia.
  • the host cell is a bacterium.
  • the present invention comprises an oligonucleotide consisting essentially of at least 12 consecutive nucleotides of a nucleic acid sequence selected from the group consisting of: SEQ ID NO: l , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: l 1 , SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21 , SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31 , SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41 , SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51 , SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO
  • the present invention comprises a kit comprising at least one oligonucleotide of the present invention.
  • the present invention comprises methods for producing a protein of the present invention, comprising culturing a cell that has been transfected with a nucleic acid molecule comprising a nucleic acid sequence encoding the protein, and expressing the protein with the transfected cell. In some embodiments, the present invention further comprises recovering the protein from the cell or from a culture comprising the cell.
  • the present invention comprises a genetically modified organism comprising components suitable for degrading a lignocellulosic material to fermentable sugars, wherein the organism has been genetically modified to express at least one protein of the present invention.
  • the genetically modified organism is a plant, alga, fungus or bacterium.
  • the fungus is yeast, mushroom or filamentous fungus.
  • the filamentous fungus is from a genus selected from the group consisting of: Chrysosporium, Thielavia, Neurospora, Aureobasidium, Filibasidium, Piromyces, Corynascus, Cryptococcus, Acremonium, Tolypocladium, Scytalidium, Schizophyllum, Sporotrichum, Penicillium, Talaromyces, Gibberella, Myceliophthora, Mucor, Aspergillus, Fusarium, Humicola, and Trichoderma.
  • the filamentous fungus is selected from the group consisting of: Trichoderma reesei, Trichoderma harzanium, Chrysosporium lucknowense, Aspergillus niger, Aspergillus. oryzae Aspergillus japonicus, Penicillium canescens, Penicillium solitum, Penicillium funiculosum, Talaromyces flavus, Talaromyces emersonii and Myceliophthora thermophila.
  • the genetically modified organism has been genetically modified to express at least one additional enzyme.
  • the additional enzyme is an accessory enzyme selected from the group consisting of: cellulase, glucosidase, xylanase, xylosidase, ligninase, glucuronidase, arabinofuranosidase, arabinase, arabinogalactanase, ferulic acid esterase, lipase, pectinase, glucomannase, amylase, laminarinase, xyloglucanase, galactanase, galactosidase, glucoamylase, pectate lyase, chitosanases, exo-P-D-glucosaminidase, cellobiose dehydrogenase, and acetylxylan esterase.
  • the genetically modified organism is a plant.
  • the present invention comprises a recombinant enzyme isolated from a genetically modified microorganism of the present invention.
  • the recombinant enzyme has been subjected to a purification step.
  • the present invention comprises a crude fermentation product produced by culturing the cells from the genetically modified organism of the present invention, wherein the crude fermentation product contains at least one protein of the present invention.
  • the present invention comprises a multi-enzyme composition comprising enzymes produced by a genetically modified organism of the present invention , and recovered therefrom.
  • the present invention comprises a multi-enzyme composition comprising at least one protein of the present inventions, and at least one additional protein for degrading a lignocellulosic material or a fragment thereof that has biological activity.
  • the multi-enzyme composition comprises at least one cellobiohydrolase, at least one xylanase, at least one endoglucanase, at least one ⁇ - glucosidase, at least one ⁇ -xylosidase, and at least one accessory enzyme.
  • between about 50% and about 70% of the enzymes in the multi-enzyme composition are cellobiohydrolases.
  • between about 10% and about 30% of the enzymes in the composition are xylanases.
  • between about 5% and about 15% of the enzymes in the composition are endoglucanases.
  • between about 1 % and about 5% of the enzymes in the composition are ⁇ -glucosidases. In some embodiments, between about 1% and about 3% of the enzymes in the composition are ⁇ -xylosidases.
  • the multi-enzyme composition comprises about 60% cellobiohydrolases, about 20% xylanases, about 10% endoglucanases, about 3% ⁇ - glucosidases, about 2% ⁇ -xylosidases, and about 5% accessory enzymes.
  • the xylanases are selected from the group consisting of: endoxylanases, exoxylanases, and ⁇ -xylosidases.
  • the accessory enzymes include an enzyme selected from the group consisting of: cellulase, glucosidase, xylanase, xylosidase, ligninase, glucuronidase, arabinofiiranosidase, arabinase, arabinogalactanase, ferulic acid esterase, lipase, pectinase, glucomannase, amylase, laminarinase, xyloglucanase, galactanase, galactosidase, glucoamylase, pectate lyase, chitosanase, ⁇ - ⁇ -D- glucosaminidase, cellobiose dehydrogenase, and acetylxylan esterase.
  • the multi-enzyme composition comprises at least one hemicellulase.
  • the hemicellulase is selected from the group consisting of a xylanase, an arabinofiiranosidase, an acetyl xylan esterase, a glucuronidase, and endo-galactanase, a mannanase, an endo-arabinase, an exo arabinase, an exo-galactanase, a ferulic acid esterase, a galactomannanase, a xyloglucanase, and mixtures thereof.
  • the xylanase is selected from the group consisting of endoxylanases, exoxylanase, and ⁇ - xylosidase.
  • the multi-enzyme composition comprises at least one cellulase.
  • the composition is a crude fermentation product. In some embodiments, the composition is a crude fermentation product that- has been subjected to a purification step.
  • the multi-enzyme composition further comprises one or more accessory enzymes.
  • the accessory enzymes include at least one enzyme selected from the group consisting of: cellulase, glucosidase, xylanase, xylosidase, ligninase, glucuronidase, arabinofuranosidase, arabinase, arabinogalactanase, ferulic acid esterase, lipase, pectinase, glucomannase, amylase, laminarinase, xyloglucanase, galactanase, galactosidase, glucoamylase, pectate lyase, chitosanase, exo- -D-glucosaminidase, cellobiose dehydrogenase, and acetylxylan esterase.
  • the accessory enzyme is selected from the group consisting of a glucoamylase, a pectinase, and a ligninase. In some embodiments, the accessory enzyme is added as a crude or a semi-purified enzyme mixture. In some embodiments, the accessory enzyme is produced by culturing at least one organism on a substrate to produce the enzyme.
  • the multi-enzyme composition comprises at least one protein . of the present invention, and at least one additional protein for degrading an arabinoxylan-containing material or a fragment thereof that has biological activity.
  • the composition comprises at least one endoxylanase, at least one ⁇ -xylosidase, and at least one arabinofuranosidase.
  • the arabinofuranosidase comprises an arabinofuranosidase with specificity towards single substituted xylose residues, an arabinofuranosidase with specificity towards double substituted xylose residues, or a combination thereof.
  • the present invention comprises methods for degrading a lignocellulosic material to fermentable sugars, comprising contacting the lignocellulosic material with at least one isolated protein of the present invention.
  • the methods of the present invention further comprise contacting the lignocellulosic material with at least one additional isolated protein comprising an amino acid sequence that is at least about 95% identical to an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:
  • the additional isolated protein is part of a multi-enzyme composition.
  • the present invention comprises methods for degrading a lignocellulosic material to fermentable sugars, comprising contacting the lignocellulosic material with at least one multi-enzyme composition of the present invention.
  • the present invention comprises a method for producing an organic substance, comprising:
  • the steps of saccharifying and fermenting are performed simultaneously.
  • the organic substance is an alcohol, organic acid, ketone, amino acid, or gas.
  • the alcohol is ethanol.
  • the lignocellulosic material is selected from the group consisting of herbaceous material, agricultural residue, forestry residue, municipal solid waste, waste paper, and pulp and paper mill residue. [059] In some embodiments, the lignocellulosic material is distiller's dried grains (DDG) or DDG with solubles. In some embodiments, the DDG or DDG with solubles is derived from corn.
  • DDG distiller's dried grains
  • the present invention comprises a method for degrading a lignocellulosic material consisting of DDG or DDG with solubles to sugars, the method comprising contacting the DDG or DDG with solubles with a multi-enzyme composition of the present invention, whereby at least about 10% of the fermentable sugars are liberated. In some embodiments, at least about 15%, at least 20%, or at least about 23% of the sugars are liberated.
  • the present invention further comprises a pretreatment process for pretreating the lignocellulosic material.
  • the pretreatment process is selected from the group consisting of physical treatment, metal ion, ultraviolet light, ozone, organosolv treatment, steam explosion treatment, lime impregnation with steam explosion treatment, hydrogen peroxide treatment, hydrogen peroxide/ozone (peroxone) treatment, acid treatment, dilute acid treatment, and base treatment.
  • the pretreatment process is selected from the group consisting of organosolv, steam explosion, heat treatment and AFEX.
  • the heat treatment comprises heating the lignocellulosic material to 121°C for 15 minutes.
  • the present invention further comprises detoxifying the lignocellulosic material.
  • the present invention further comprises recovering the fermentable sugar.
  • the sugar is selected from the group consisting of glucose, xylose, arabinose, galactose, mannose, rhamnose, sucrose and fructose.
  • the present invention further comprises recovering the contacted lignocellulosic material after the fermentable sugars are degraded.
  • the present invention comprises a feed additive comprising the recovered lignocellulosic material of the present invention.
  • the protein content of the recovered lignocellulosic material is higher than that of the starting lignocellulosic material.
  • the present invention comprises methods of improving the performance of an animal which comprises administering to the animal the feed additive of the present invention.
  • the present invention comprises methods for improving the nutritional quality of an animal feed comprising adding the feed additive of the present invention to an animal feed.
  • the present invention comprises methods for stonewashing a fabric, comprising contacting the fabric with at least one isolated protein of the present invention.
  • the present invention comprises methods for stonewashing a fabric, comprising contacting the fabric with at least one multi-enzyme composition of the present invention.
  • the fabric is denim.
  • the present invention comprises methods for enhancing the softness or feel of a fabric or depilling a fabric, comprising contacting the fabric with at least one isolated protein of the present invention, or a fragment thereof comprising a carbohydrate binding module (CBM) of the protein.
  • CBM carbohydrate binding module
  • the present invention comprises methods for enhancing the softness or feel of a fabric or depilling a fabric, comprising contacting the fabric with at least one multi-enzyme composition of the present invention.
  • the present invention comprises methods for restoring color to or brightening a fabric, comprising contacting the fabric with at least one isolated protein of the present invention.
  • the present invention comprises methods for restoring color to or brightening a fabric, comprising contacting the fabric with at least one multi- enzyme composition of the present invention.
  • the present invention comprises methods of biopolishing, defibrillating, bleaching, dyeing or desizing a fabric, comprising contacting the fabric with at least one isolated protein of the present invention.
  • the present invention comprises methods of biopolishing, defibrillating, bleaching, dyeing or desizing a fabric, comprising contacting the fabric with at least one multi-enzyme composition of the present invention.
  • the present invention comprises methods of biorefining, deinking or biobleaching paper or pulp, comprising contacting the paper or pulp with at least one isolated protein of the present invention. [080] In some embodiments, the present invention comprises methods of biorefining, deinking or biobleaching paper or pulp, comprising contacting the paper or pulp with at least one multi-enzyme composition of the present invention
  • the present invention comprises methods for enhancing the cleaning ability of a detergent composition, comprising adding at least one isolated protein of the present invention to the detergent composition.
  • the present invention comprises methods for enhancing the cleaning ability of a detergent composition, comprising adding at least one multi- enzyme composition of the present invention to the detergent composition.
  • the present invention comprises a detergent composition, comprising at least one isolated protein of the present invention and at least one surfactant.
  • the present invention comprises a detergent composition, comprising at least one multi-enzyme composition of the present invention and at least one surfactant.
  • the present invention comprises methods for releasing cellular contents comprising contacting a cell with at least one protein of the present invention.
  • the cell may be a bacterium, an algal cell, a fungal cell or a plant cell. In preferred embodiments, the cell is an algal cell.
  • contacting the cell with at least one protein of the present invention degrades the cell wall.
  • the cellular contents are selected from the group consisting of: alcohols and oils.
  • the present invention comprises compositions for degrading cell walls comprising at least one protein of the present invention.
  • the present invention comprises methods for improving the nutritional quality of food comprising adding to the food at least one protein of the present invention.
  • the present invention comprises methods for improving the nutritional quality of food comprising pretreating the food with at least one protein of the present invention.
  • the present invention comprises methods for improving the nutritional quality of animal feed comprising adding to the animal feed at least one protein of the present invention.
  • the present invention comprises methods for improving the nutritional quality of animal feed comprising pretreating the feed with at least one isolated protein of the present invention.
  • the present invention comprises a genetically modified organism comprising at least one nucleic acid molecule encoding a protein of the present invention, in which the activity of one or more of the proteins is upregulated, the activity of one or more of the proteins downregulated, or the activity of one or more of the proteins is upregulated and the activity of one or more of the proteins is downregulated.
  • Myceliophthora thermophila CI Incubation was performed at 50°C at pH 5 during 24 (white bars) and 72 hours (grey bars).
  • the present invention relates generally to proteins that play a role in the degradation of cellulose and hemicellulose and nucleic acids encoding the same.
  • the present invention relates to enzymes isolated from a filamentous fungal strain denoted herein as CI (Accession No. VKJvl F-3500-D), nucleic acids encoding the enzymes, and methods of producing and using the enzymes.
  • the invention also provides compositions that include at least one of the enzymes described herein for uses including, but not limited to, the hydrolysis of lignocellulose.
  • the invention stems, in part, from the discovery of a variety of novel cellulases and hemicellulases produced by the CI fungus that exhibit high activity toward cellulose and other components of bio mass.
  • the present invention also provides methods and compositions for the conversion of plant biomass to fermentable sugars that can, in turn, be converted to useful products.
  • Such products may include, without limitation, metabolites, bioplastics, biopolymers and biofuels.
  • the methods include methods for degrading lignocellulosic material using enzyme mixtures to liberate sugars.
  • the compositions of the invention include enzyme combinations that break down lignocellulose.
  • biomass or lignocellulosic material” includes materials containing cellulose and/or hemicellulose. Generally, these materials also contain pectin, lignin, protein, carbohydrates (such as starch and sugar) and ash. Lignocellulose is generally found, for example, in the stems, leaves, hulls, husks, and cobs of plants or leaves, branches, and wood of trees.
  • Fermentable sugars refers to simple sugars, such as glucose, xylose, arabinose, galactose, mannose, rhamnose, sucrose and fructose.
  • Biomass can include virgin biomass and/or non-virgin biomass such as agricultural biomass, commercial organics, construction and demolition debris, municipal solid waste, waste paper and yard waste.
  • biomass include trees, shrubs and grasses, wheat, wheat straw, sugar cane bagasse, sugar beet, soybean, corn, corn husks, corn kernel including fiber from kernels, products and by-products from milling of grains such as corn, tobacco, wheat and barley (including wet milling and dry milling) as well as municipal solid waste, waste paper and yard waste.
  • the biomass can also be, but is not limited to, herbaceous material, agricultural residues, forestry residues, municipal solid wastes, waste paper, and pulp and paper mill residues.
  • Agricultural biomass includes branches, bushes, canes, corn and corn husks, energy crops, algae, fruits, flowers, grains, grasses, herbaceous crops, leaves, bark, needles, logs, roots, saplings, short rotation woody crops, shrubs, switch grasses, trees, vegetables, fruit peels, vines, sugar beet pulp, wheat midlings, oat hulls, peat moss, mushroom compost and hard and soft woods (not including woods with deleterious materials).
  • agricultural biomass includes organic waste materials generated from agricultural processes including farming and forestry activities, specifically including forestry wood waste. Agricultural biomass may be any of the aforestated singularly or in any combination or mixture thereof.
  • Energy crops are fast-growing crops that are grown for the specific purpose of producing energy, including without limitation, biofuels, from all or part of the plant.
  • Energy crops can include crops that are grown (or are designed to grow) for their increased cellulose, xylose and sugar contents. Examples of such plants include, without limitation, switchgrass, willow and poplar.
  • Energy crops may also include algae, for example, designer algae that are genetically engineered for enhanced production of hydrogen, alcohols, and oils, which can be further processed into diesel and jet fuels, as well as other bio-based products.
  • biomass high in starch, sugar, or protein such as corn, grains, fruits and vegetables are usually consumed as food.
  • biomass high in cellulose, hemicellulose and lignin are not readily digestible and are primarily utilized for wood and paper products, animal feed, fuel, or are typically disposed.
  • the substrate is of high lignocellulose content, including distillers' dried grains corn stover, corn cobs, rice straw, wheat straw, hay, sugarcane bagasse, sugar cane pulp, citrus peels and other agricultural biomass, switchgrass, forestry wastes, poplar wood chips, pine wood chips, sawdust, yard waste, and the like, including any combination thereof.
  • the lignocellulosic material is distillers' dried grains (DDG).
  • DDG also known as dried distiller's grain, or distiller's spent grain
  • the lignocellulosic material can also be distiller's dried grain with soluble material recycled back (DDGS). While reference will be made herein to DDG for convenience and simplicity, it should be understood that both DDG and DDGS are contemplated as desired lignocellulosic materials. These are largely considered to be waste products and can be obtained after the fermentation of the starch derived from any of a number of grains, including corn, wheat, barley, oats, rice and rye. In one embodiment the DDG is derived from corn.
  • distiller's grains do not necessarily have to be dried.
  • the grains normally are currently dried, water and enzymes are added to the DDG substrate in the present invention. If the saccharification were done on site, the drying step could be eliminated and enzymes could be added to the distiller's grains without drying.
  • the present invention includes enzymes or compositions thereof with, for example, cellobiohydrolase, endoglucanase, xylanase, ⁇ -glucosidase, and hemicellulase activities.
  • Fermentable sugars can be converted to useful value-added fermentation products, non-limiting examples of which include amino acids, vitamins, pharmaceuticals, animal feed supplements, specialty chemicals, chemical feedstocks, plastics, solvents, fuels, or other organic polymers, lactic acid, and ethanol, including fuel ethanol.
  • Specific value-added products that may be produced by the methods of the invention include, but not limited to, biofuels (including ethanol and butanol); lactic acid; plastics; specialty chemicals; organic acids, including citric acid, succinic acid, itaconic acid and maleic acid; solvents; animal feed supplements; pharmaceuticals; vitamins; amino acids, such as lysine, methionine, tryptophan, threonine, and aspartic acid; industrial enzymes, such as proteases, . cellulases, amylases, glucanases, xylanases, arabinanases, lactases, lipases, esterases, lyases, oxidoreductases, transferases ; and chemical feedstocks.
  • biofuels including ethanol and butanol
  • lactic acid plastics
  • specialty chemicals organic acids, including citric acid, succinic acid, itaconic acid and maleic acid
  • solvents including citric acid, succinic acid, itac
  • the enzymes of the present invention may also be used for stone washing cellulosic fabrics such as cotton (e.g., denim), linen, hemp, ramie, cupro, lyocell, newcell, rayon and the like. See, for example, U.S. Patent No. 6,015,707.
  • the enzymes and compositions of the present invention are suitable for industrial textile applications in addition to the stone washing process.
  • cellulases are used in detergent compositions, either for the purpose of enhancing the cleaning ability of the composition or as a softening agent. When so used, the cellulase will degrade a portion of the cellulosic material, e.g., cotton fabric, in the wash, which facilitates the cleaning and/or softening of the fabric.
  • endoglucanase components of fungal cellulases have also been used for the purposes of enhancing the cleaning ability of detergent compositions, for use as a softening agent, and for use in improving the feel of cotton fabrics, and the like.
  • Enzymes and compositions of the present invention may also be used in the treatment of paper pulp (e.g., for improving the drainage or for de-inking of recycled paper) or for the treatment of wastewater streams (e.g., to hydrolyze waste material containing cellulose, hemicellulose and pectins to soluble lower molecular weight polymers).
  • the enzymes of the present invention may also be used to release the contents of a cell.
  • contacting or mixing the cells with the enzymes of the present invention will degrade the cell walls, resulting in cell lysis and release of the cellular contents.
  • Such cells can include bacteria, plant cells, fungi including yeasts, and algae.
  • the enzymes of the present invention may be used to degrade the cell walls of algal cells in order to release the materials contained within the algal cells.
  • such materials may include, without limitation, alcohols and oils. The alcohols and oils so released can be further processed to produce diesel, jet fuels, as well as other economically important bio-products..
  • the enzymes of the present invention may be used alone, or in combination with other enzymes, chemicals or biological materials.
  • the enzymes of the present invention may be used for in vitro applications in which the enzymes or mixtures thereof are added to or mixed with the appropriate substrates to catalyze the desired reactions. Additionally, the enzymes of the present invention may be used for in vivo applications in which nucleic acid molecules encoding the enzymes are introduced into cells and are expressed therein to produce the enzymes and catalyze the desired reactions within the cells.
  • enzymes capable of promoting cell wall degradation may be added to algal cells suspended in solutions to degrade the algal cell walls and release their content, whereas in some embodiments, nucleic acid molecules encoding such enzymes may be introduced into the algal cells to express the enzymes therein, so that these enzymes can degrade the algal cell walls from within.
  • Some embodiments may combine the in vitro applications with the in vivo applications.
  • nucleic acids encoding enzymes capable of catalyzing cell wall degradation may be introduced into algal cells to express the enzymes in those cells and to degrade their cell walls, while enzymes may also added to or mixed with the cells to further promote the cell wall degradation.
  • the enzymes used for in vitro applications may be different from the enzymes used for in vivo applications.
  • an enzyme with the laminarinase activity may be mixed with the cells, while an enzyme with the xyloglucanase activity is expressed within the cells.
  • the present invention includes proteins isolated from, or derived from the knowledge of enzymes from, a fungus such as Myceliophthora thermophila or a mutant or other derivative thereof, and more particularly, from the fungal strain denoted herein as CI (Accession No. VKM F-3500-D).
  • M. thermophila has previously appeared in patent applications and in the literature as Chrysosporium lucknowense or Sporotrichum thermophile.
  • the proteins of the invention possess enzymatic activity.
  • U.S. Patent No. 6,015,707 or U.S. Patent No. 6,573,086 a strain called CI (Accession No.
  • VKM F-3500-D was isolated from samples of forest alkaline soil from Sola Lake, Far East of the Russian Federation. This strain was deposited at the All-Russian Collection of Microorganisms of Russian Academy of Sciences (V M), Bakhurhina St. 8, Moscow, Russia, 1 13184, under the terms of the Budapest Treaty on the International Regulation of the Deposit of Microorganisms for the Purposes of Patent Procedure on August 29, 1996, as Chrysosporium lucknowense Garg 27K, VKM-F 3500 D. Various mutant strains of M. thermophila (C. lucknowense) CI have been produced and these strains have also been deposited at the All-Russian Collection of Microorganisms of Russian Academy of Sciences (VKM), Bakhurhina St.
  • Strain CI was mutagenised by subjecting it to ultraviolet light to generate strain UV13-6 (Accession No. VKM F-3632 D). This strain was subsequently further mutated with N-methyl-N'- nitro-N-nitrosoguanidine to generate strain NG7C- 19 (Accession No. VKM F-3633 D).
  • strain UV 18-25 accesion No. VKM F-3631 D
  • strain W1L accesion No. CBS 122189
  • strain W1L#100L accesion No. CBS122190
  • Strain CI was previously classified as a Chrysosporium lucknowense based on morphological and growth characteristics of the microorganism, as discussed in detail in U.S. Patent No. 6,015,707, U.S. Patent No. 6,573,086 and patent PCT/NL2010/000045.
  • a protein of the invention comprises, consists essentially of, or consists of an amino acid sequence selected from Tables 1 and 2.
  • the present invention also includes homologues or variants of any of the above sequences, including fragments and sequences having a given identity to any of the above sequences, wherein the homologue, variant, or fragment has at least one biological activity of the wild-type protein, as described herein.
  • the proteins disclosed herein possess carbohydrase enzymatic activity, or the ability to degrade carbohydrate-containing materials.
  • carbohydrase enzymatic activity or the ability to degrade carbohydrate-containing materials.
  • a review of enzymes involved in the degradation of polysaccharides can be found in de Vries et al., Microbiol. Mol. Biol. Rev. 65:497-522 (2001 ).
  • the proteins may possess cellulase activity such as endoglucanase activity (e.g., l ,4-p-D-glucan-4- glucanohydrolases), exoglucanase activity (e.g., 1 ,4-P-D-glucan cellobiohydrolases), and ⁇ -glucosidase activity.
  • the proteins may possess hemicellulase activity such as endoxylanase activity, exoxylanase activity, or ⁇ -xylosidase activity.
  • the proteins may possess laminarinase, xyloglucanase, galactanase, glucoamylase, pectate lyase, chitosanase, exo-P-D-glucosaminidase, cellobiose dehydrogenase, acetylxylan esterase, ligninase, amylase, glucuronidase, ferulic acid esterase, arabinofuranosidase, pectin methyl esterase, arabinase, lipase, glucosidase, ⁇ - hexosaminidase, rhamnogalacturonan acetylesterase, exo-rhamnogalacturonase, rhamnogalacturonan
  • carbohydrase refers to any protein that catalyzes the hydrolysis of carbohydrates.
  • glycoside hydrolase refers to a protein that catalyzes the hydrolysis of the glycosidic bonds between carbohydrates or between a carbohydrate and a non-carbohydrate residue.
  • Endoglucanases cellobiohydrolases, ⁇ -glucosidases, a-glucosidases, xylanases, ⁇ - xylosidases, alpha- xylosidases, galactanases, a-galactosidases, ⁇ -galactosidases, ct- amylases, glucoamylases, endo-arabinases, arabinofuranosidases, mannanases, ⁇ - mannosidases, pectinases, acetyl xylan esterases, acetyl mannan esterases, ferulic acid esterases, coumaric acid esterases, pectin methyl esterases, and chitosanases are examples of glycosidases.
  • Cellulase refers to a protein that catalyzes the hydrolysis of l ,4 ⁇ -D-glycosidic linkages in cellulose (such as bacterial cellulose, cotton, filter paper, phosphoric acid swollen cellulose, Avicel ® ); cellulose derivatives (such as carboxymethylcellulose and hydroxyethylcellulose); plant lignocellulosic materials, beta-D-glucans or xyloglucans.
  • Cellulose is a linear beta-(l -4) glucan consisting of anhydrocellobiose units. Endoglucanases, cellobiohydrolases, and ⁇ -glucosidases are examples of cellulases.
  • Endoglucanase refers to a protein that catalyzes the hydrolysis of cellulose to oligosaccharide chains at random locations by means of an endoglucanase activity.
  • Cellobiohydrolase refers to a protein that catalyzes the hydrolysis of cellulose to cellobiose via an exoglucanase activity, sequentially releasing molecules of cellobiose from the reducing or non-reducing ends of cellulose or cello-oligosaccharides.
  • ⁇ - glucosidase refers to an enzyme that catalyzes the conversion of cellobiose and oligosaccharides to glucose.
  • Hemicellulase refers to a protein that catalyzes the hydrolysis of hemicellulose, such as that found in lignocellulosic materials. Hemicelluloses are complex polymers, and their composition often varies widely from organism to organism, and from one tissue type to another. Hemicelluloses include a variety of compounds, such as xylans, arabinoxylans, xyloglucans, mannans, glucomannans,. Hemicellulose can also contain glucan, which is a general term for beta-linked glucose residues. In general, a main component of hemicellulose is beta-l ,4-linked xylose, a five carbon sugar.
  • this xylose is often branched as beta- 1 ,3 linkages or beta- 1 ,2 linkages, and can be substituted with linkages to arabinose, galactose, mannose, glucuronic acid, or by esterification to acetic acid.
  • hemicellulose is very different in dicotyledonous plants (dicots, i.e., plant whose seeds have two cotyledons or seed leaves such as lima beans, peanuts, almonds, peas, kidney beans) as compared to monocotyledonous plants (monocots; i.e., plants having a single cotyledon or seed leaf such as corn, wheat, rice, grasses, barley).
  • dicots i.e., plants having a single cotyledon or seed leaf such as corn, wheat, rice, grasses, barley.
  • hemicellulose is comprised mainly of xyloglucans that are 1,4-beta-linked glucose chains with 1 ,6-alpha- linked xylosyl side chains.
  • heteroxylans In monocots, including most grain crops, the principal components of hemicellulose are heteroxylans. These are primarily comprised of 1 ,4-beta-linked xylose backbone polymers with 1 ,2- or 1 ,3-alpha linkages to arabinose, linkage of galactose and mannose to arabinose or xylose in side chains, as well as xylose modified by ester-linked acetic acids. Also present are branched beta glucans comprised of 1 ,3- and 1 ,4-beta-linked glucosyl chains. In monocots, cellulose, heteroxylans and beta glucans are present in roughly equal amounts, each comprising about 15-25% of the dry matter of cell walls.
  • Hemicellulolytic enzymes i.e. hemicellulases
  • Hemicellulases also include the accessory enzymes, such as acetylesterases, ferulic acid esterases, and coumaric acid esterases.
  • xylanases and acetyl xylan esterases cleave the xylan and acetyl side chains of xylan and the remaining xylo-oligomers are unsubstituted and can thus be hydrolysed with ⁇ -xylosidase only.
  • xylanases, acetylesterases and ⁇ -xylosidases are examples of hemicellulases.
  • Xylanase specifically refers to an enzyme that hydrolyzes the ⁇ - 1 ,4 bond in the xylan backbone, producing short xylooligosaccharides.
  • ⁇ -Mannanase or "endo-l ,4 ⁇ -mannosidase” refers to a protein that hydrolyzes mannan-based hemicelluloses (mannan, glucomannan, galacto(gluco)mannan) and produces short ⁇ -l ,4-mannooligosaccharides.
  • Mannan endo- l ,6-a-mannosidase refers to a protein that hydrolyzes 1 ,6-a- mannosidic linkages in unbranched 1 ,6-mannans.
  • ⁇ -Mannosidase ⁇ - l ,4-mannoside mannohydrolase; EC 3.2.1.25
  • EC 3.2.1.25 refers to a protein that catalyzes the removal of ⁇ -D-mannose residues from the nonreducing ends of oligosaccharides.
  • Galactanase refers to a protein that catalyzes the hydrolysis of endo-l ,4-P-D- galactosidic linkages in arabinogalactans.
  • Glucoamylase refers to a protein that catalyzes the hydrolysis of terminal 1 ,4- linked a-D-glucose residues successively from non-reducing ends of the glycosyl chains in starch with the release of ⁇ -D-glucose.
  • ⁇ -hexosaminidase or " ⁇ - ⁇ -acetylglucosaminidase” refers to a protein that catalyzes the hydrolysis of terminal N-acetyl-D-hexosamine residues in N-acetyl- ⁇ -
  • a-L-arabinofuranosidase refers to a protein that hydrolyzes arabinofuranosyl-containing hemicelluloses or pectins. Some of these enzymes remove arabinofuranoside residues from 0-2 or 0-3 single substituted xylose residues, as well as from 0-2 and/or 0-3 double substituted xylose residues. Some of these enzymes remove arabinose residues from arabinan oligomers.
  • Endo-arabinase refers to a protein that catalyzes the hydrolysis of 1 ,5-a- arabinofuranosidic linkages in 1 ,5-arabinans.
  • Exo-arabinase refers to a protein that catalyzes the hydrolysis of 1 ,5-a-linkages in 1 ,5-arabinans or 1 ,5-a-L arabino-oligosaccharides, releasing mainly arabinobiose, although a small amount of arabinotriose can also be liberated.
  • ⁇ -xylosidase refers to a protein that hydrolyzes short l,4-P-D-xylooligomers into xylose.
  • Cellobiose dehydrogenase refers to a protein that oxidizes cellobiose to cellobionolactone.
  • Chitosanase refers to a protein that catalyzes the endohydrolysis of p- l ,4-linkages between D-glucosamine residues in acetylated chitosan (i.e., deacetylated chitin).
  • Exo-polygalacturonase refers to a protein that catalyzes the hydrolysis of terminal alpha 1 ,4-linked galacturonic acid residues from non-reducing ends thus converting polygalacturonides to galacturonic acid.
  • Alcohol xylan esterase refers to a protein that catalyzes the removal of the acetyl groups from xylose residues.
  • Alcohol mannan esterase refers to a protein that catalyzes the removal of the acetyl groups from mannose residues
  • ferulic esterase or "ferulic acid esterase” refers to a protein that hydrolyzes the ester bond between the arabinose substituent group and ferulic acid.
  • Coumaric acid esterase refers to a protein that hydrolyzes the ester bond between the arabinose substituent group and coumaric acid.
  • Acetyl xylan esterases, ferulic acid esterases and pectin methyl esterases are examples of carbohydrate esterases.
  • Pectate lyase and pectin lyases refer to proteins that catalyze the cleavage of 1 ,4- a-D-galacturonan by beta-elimination acting on polymeric and/or oligosaccharide substrates (pectates and pectins, respectively).
  • Endo- l ,3-P-glucanase or “laminarinase” refers to a protein that catalyzes the cleavage of 1 ,3-linkages in ⁇ -D-glucans such as laminarin or lichenin.
  • Laminarin is a linear polysaccharide made up of p- l ,3-glucan with P- l ,6-linkages.
  • lichenan refers to a protein that catalyzes the hydrolysis of lichenan, a linear, 1 ,3- 1 ,4- ⁇ - ⁇ glucan.
  • Rhamnogalacturonan is composed of alternating a- 1 ,4-rhamnose and a- l ,2-linked galacturonic acid, with side chains linked 1,4 to rhamnose.
  • the side chains include Type I galactan, which is ⁇ -l ,4-linked galactose with a-l ,3-linked arabinose substituents; Type II galactan, which is P-l ,3-l ,6-linked galactoses (very branched) with arabinose substituents; and arabinan, which is ct-l ,5-linked arabinose with a- 1 ,3-linked arabinose branches.
  • the galacturonic acid substituents may be acetylated and/or methylated.
  • Exo-rhamnogalacturonanase refers to a protein that catalyzes the degradation of the rhamnogalacturonan backbone of pectin from the nonreducing end.
  • Rhamnogalacturonan acetylesterase refers to a protein that catalyzes the removal of the acetyl groups ester-linked to the highly branched rhamnogalacturonan (hairy) regions of pectin.
  • Rhamnogalacturonan lyase refers to a protein that catalyzes the degradation of the rhamnogalacturonan backbone of pectin via a ⁇ -elimination mechanism (see, e.g., Pages et al., J. Bacterial. 185:4727-4733 (2003)).
  • Alpha-rhamnosidase refers to a protein that catalyzes the hydrolysis of terminal non-reducing a-L-rhamnose residues in a-L-rhamnosides.
  • Glycosidases glycoside hydrolases; GH
  • GH glycoside hydrolases
  • Glycosidases such as the proteins of the present invention may be assigned to families on the basis of sequence similarities, and there are now over 100 different such families defined (see the CAZy (Carbohydrate Active EnZymes database) website, maintained by the Architecture of Fonction de Macromolecules Bi Anlagens of the Centre National de la Recherche Scientifique, which describes the families of structurally-related catalytic and carbohydrate- binding modules (or functional domains) of enzymes that degrade, modify, or create glycosidic bonds; Coutinho, P.M. & Henrissat, B. (1999) Carbohydrate-active enzymes: an integrated database approach. In “Recent Advances in Carbohydrate Bioengineering", H.J. Gilbert, G. Davies, B.
  • sequence homology may be used to identify particular domains within proteins, such as carbohydrate binding modules (CBMs; also known as carbohydrate binding domains (CBDs), sometimes called cellulose binding domains).
  • CBMs carbohydrate binding modules
  • CBDs carbohydrate binding domains
  • the CAZy homologies of proteins of the present invention are disclosed below.
  • An enzyme assigned to a particular CAZy family may exhibit one or more of the enzymatic activities or substrate specificities associated with the CAZy family. In other embodiments, the enzymes of the present invention may exhibit one or more of the enzyme activities discussed above.
  • Certain proteins of the present invention may be classified as "Family 61 glycosidases" based on homology of the polypeptides to CAZy Family GH61.
  • Family 61 glycosidases may exhibit cellulolytic enhancing activity or endoglucanase activity. Additional information on the properties of Family 61 glycosidases may be found in U.S. Patent Application Publication Nos. 2005/0191736, 2006/0005279, 2007/0077630, and in PCT Publication No. WO 2004/031378.
  • cellulolytic enhancing activity refers to a biological activity that enhances the hydrolysis of a cellulosic material by proteins having cellulolytic activity.
  • saccharifying a cellulosic material with a cellulolytic protein in the presence of a Family 61 glycosidases may increase the degradation of cellulosic material compared to the presence of only the cellulolytic protein.
  • the cellulosic material can be any material containing cellulose.
  • the cellulolytic activity is a biological activity that hydrolyzes a cellulosic material.
  • Cellulolytic enhancing activity can be determined by measuring the increase in sugars from the hydrolysis of a cellulosic material by cellulolytic protein.
  • Proteins of the present invention may also include homologues, variants, and fragments of the proteins disclosed herein.
  • the protein fragments include, but are not limited to, fragments comprising a catalytic domain (CD) and/or a carbohydrate binding module (CBM) (also known as a cellulose-binding domain; both can be referred to herein as CBM).
  • CD catalytic domain
  • CBM carbohydrate binding module
  • the identity and location of domains within proteins of the present invention are disclosed in detail below.
  • the present invention encompasses all combinations of the disclosed domains.
  • a protein fragment may comprise a CD of a protein but not a CBM of the protein or a CBM of a protein but not a CD.
  • domains from different proteins may be combined.
  • Protein fragments comprising a CD, CBM or combinations thereof for each protein disclosed herein can be readily produced using standard techniques known in the art.
  • a protein fragment comprises a domain of a protein that has at least one biological activity of the full- length protein. Homologues or variants of proteins of the invention that have at least one biological activity of the full-length protein are described in detail below.
  • biological activity of a protein refers to any function(s) exhibited or performed by the protein that is ascribed to the naturally occurring form of the protein as measured or observed in vitro or in vivo.
  • a protein fragment comprises a domain of a protein that has the catalytic activity of the full-length enzyme. Specific biological activities of the proteins of the invention, and structures within the proteins that are responsible for the activities, are described below.
  • Esterases represent a category of various enzymes including lipases, phospho lipases, cutinases, and phytases that catalyze the hydrolysis and synthesis of ester bonds in compounds.
  • esterases in the food industry include, but are not limited to, degumming vegetable oils; improving the production of bread (e.g., in situ production of emulsifiers); producing crackers, noodles, and pasta; enhancing flavor development of cheese, butter, and margarine; ripening cheese; removing wax; trans- ester ificat ion of flavors and cocoa butter substitutes; synthesizing structured lipids for infant formula and nutraceuticals; improving the polyunsaturated fatty acid content in fish oil; and aiding in digestion and releasing minerals in food processing.
  • degumming vegetable oils e.g., in situ production of emulsifiers
  • producing crackers, noodles, and pasta enhancing flavor development of cheese, butter, and margarine
  • ripening cheese removing wax
  • trans- ester ificat ion of flavors and cocoa butter substitutes synthesizing structured lipids for infant formula and nutraceuticals
  • improving the polyunsaturated fatty acid content in fish oil and aiding in digestion and releasing
  • esters in the household industry include, but are not limited to, use in laundry and detergent (e.g., removal of stains); cleaning agents; and hydrolysis of tallow for laundry detergent.
  • esters include, but are not limited to, the removal of triglycerides, steryl esters, resin acids, free fatty acids, and sterols (e.g., lipophilic wood extractives).
  • esters for chemical and drug intermediates
  • isopropyl myristate, isopropyl palmitate and 2-ethylpalmitate for use as emollient in personal care products
  • saving of energy and minimization of thermal degradation in oleochemical industry use as a feed additive; and enhancing the recovery of oil (e.g., during drilling).
  • the enzyme CL05183 is encoded by the nucleic acid sequence of SEQ ID NO: 1.
  • the CL05183 nucleic acid sequence encodes a 140 amino acid sequence, of SEQ ID NO: 2. We believe the mature protein spans from about position 1 to about position 140 of the CL05183 amino acid sequence. Within CL05183 a catalytic domain (CD) is present. We believe the amino acid sequence containing the CD of CL05183 spans from a starting point of about position 24 to an ending point of about position 101 of the CL05183 amino acid sequence. CL05183 is expected to have feruloyl esterase activity.
  • the enzyme CL01692 is encoded by the nucleic acid sequence of SEQ ID NO: 3.
  • CL01692 nucleic acid sequence encodes a 599 amino acid sequence, of SEQ ID NO:4. We believe the mature protein spans from about position 1 to about position 599 of the CL01692 amino acid sequence. CL01692 is expected to have feruloyl esterase and/or lipase activity.
  • Carbohydrases represent a category of various enzymes and polypeptides including amylases, cellulases, hemicellulases, pectinases, and chitinases that catalyze and/or enhance the hydrolysis or synthesis of a carbohydrate.
  • Applications of carbohydrases in the food industry include, but are not limited to, increasing the yield of fruit juice production in total liquefaction; increasing the pressing yield of oils e.g. from olives, cleaning filters, reduction of viscosity, hydrolyzing starch, and stimulating fermentation in the brewing industry; increasing the loaf volume and improving crust color in the baking industry; preventing/reducing the staling of bread; removing lactose from milk products; clarifying, filtrating, and extracting aroma and color (e.g., the wine industry); debittering or detoxifying plant glycosidic compounds; processing coffee; aiding in digestion; producing starch (e.g., separating starch from gluten); producing oligosaccharides (e.g., nutraceuticals); producing gelling agents; modifying viscosity; saccharification of starch and other biopolymers; and modifying starch.
  • oils e.g. from olives, cleaning filters, reduction of viscosity, hydrolyzing starch, and stimulating fermentation in the brewing industry
  • Saccharification can also be used to produce fermentable sugars for the production of high value metabolites and chemical building blocks for producing derivatives thereof and fine chemical.
  • the enzyme CL04931 is a GH61 family enzyme which is encoded by the nucleic acid sequence of SEQ ID NO: 5.
  • the CL04931 nucleic acid sequence encodes the 173 amino acid sequence of SEQ ID NO:6.
  • CD catalytic domain
  • CL04931 can be assigned to GH61 and CBM1 of the CAZy families and is expected to have endoglucanase and/or cellulase enhancing activity and/or carbohydrate oxidizing activity.
  • CL04931 possesses significant homology (about 54.81 % from amino acids 7 to 139 of CL04931) with family 61 endoglucanase protein from Phanerochaete chrysosporium (Genbank Accession No. AAM22493.1).
  • the enzyme CL06186 is encoded by the nucleic acid sequence of SEQ ID NO: 7.
  • the CL06186 nucleic acid sequence encodes a roughly 306 amino acid sequence, of SEQ ID NO: 8.
  • a cDNA for this enzyme is shown in SEQ ID NO: 189.
  • SEQ ID NO: 7 contains exons starting at position 144 ending at position 590 and starting at position 687 extending to position 1 160.
  • An intron is present from position 591 to position 686.
  • the signal peptide for CL06186 is located from positions 1 to about position 22 of the CL06186 amino acid sequence, with the mature protein spanning from about position 23 to about position 306 of the CL06186 amino acid sequence.
  • a catalytic domain (CD) is present.
  • CL06186 spans from a starting point of about position 23 to an ending point of about position 306 of the CL06186 amino acid sequence. Based on homology, CL06186 can be assigned to GH61 of the CAZy families and is expected to have endoglucanase and/or cellulase enhancing activity and/or carbohydrate oxidizing activity. CL06186 possesses significant homology (about 54 % from amino acids 23 to 285 of CL06186) with a predicted protein from Neurospora crassa OR74 A (Genbank Accession No. E AA29132.1).
  • the enzyme CL08507 is a GH61 family enzyme encoded by the nucleic acid sequence of SEQ ID NO: 9.
  • the CL08507 nucleic acid sequence encodes a 224 amino acid sequence shown in SEQ ID NO: 10.
  • SEQ ID NO: 9 contains exons starting at position 61 to 350, position 484 to position 599, position 698 to 870 and position 929 to 1021. Introns are present from position 351 to 483, position 600 to 697, and position 871 to 928.
  • a cDNA for this enzyme is shown in SEQ ID NO: 190.
  • CL08507 has been assigned to GH61 of the CAZy families and is expected to have endoglucanase and/or cellulase enhancing activity and/or carbohydrate oxidizing activity.
  • CL08507 possesses significant homology (about 66% from amino acids 21 to 224 of CL08507) with a hypothetical protein MGG 04547 from Magnaporthe oryzae 70-15 (Genbank Accession No. XP_362102.1).
  • the enzyme CL1 1443 is a GH61 family enzyme encoded by the nucleic acid sequence of SEQ ID NO: l l .
  • the CL1 1443 nucleic acid sequence encodes a 324 amino acid sequence shown in SEQ ID NO: 12.
  • SEQ ID NO: 1 1 contains exons starting at position 179 to 259, position 352 to position 501 , position 607 to 1287, and position 1420 to 1482. Introns are present from position 260 to 351 , position 502 to 606, and position 1288 to 1419.
  • a cDNA for this sequence is shown as SEQ ID NO: 191.
  • CL1 1443 has a signal peptide located from positions 1 to about position 19 of the CL1 1443 amino acid sequence, with the mature protein spanning from about position 20 to about position 31 1 of the CL1 1443 amino acid sequence. Based on homology, CL1 1443 can be assigned to GH61 and CBM1 of the CAZy families and is expected to have endoglucanase and/or cellulase enhancing activity and/or carbohydrate oxidizing activity. CL1 1443 possesses significant homology (about 67 % from amino acids 20 to 31 1 of CL1 1443 ) with a protein related to endoglucanase IV from Neurospora crassa (Genbank Accession No. CAE81966.1 ).
  • the enzyme CL04069 is encoded by the nucleic acid sequence of SEQ ID NO 13.
  • the CL04069 nucleic acid sequence encodes a 291 amino acid sequence of SEQ ID NO: 14. We believe the mature protein spans from about position 1 to position 291 of the CL04069 amino acid sequence. Within CL04069 a catalytic domain (CD) is present. We believe the amino acid sequence containing the CD of CL04069 spans from a starting point of about position 8 to an ending point of about position 92 of the CL04069 amino acid sequence.
  • the enzyme CL04872 is encoded by the nucleic acid sequence of SEQ ID NO: 15.
  • the CL04872 nucleic acid sequence encodes a 185 amino acid sequence shown in SEQ ID NO: 16. We believe the mature protein spans from about position 1 to position 185 of the CL04872 amino acid sequence.
  • a catalytic domain (CD) is present.
  • the enzyme CL04872 is encoded by the nucleic acid sequence of SEQ ID NO: 15.
  • the CL04872 nucleic acid sequence encodes a 185 amino acid sequence shown in SEQ ID NO: 16.
  • the mature protein spans from about position 1 to position 185 of the CL04872 amino acid sequence.
  • a catalytic domain (CD) is present.
  • CL04872 can be assigned to GH61 of the CAZy families and is expected to have endoglucanase and/or cellulase enhancing activity and/or carbohydrate oxidizing activity.
  • the enzyme CL06168 is encoded by the nucleic acid sequence of SEQ ID NO: 17.
  • the CL06168 nucleic acid sequence encodes a 106 amino acid sequence shown in SEQ ID NO: 18. We believe the mature protein spans from about position 1 to about position 106 of the CL06168 amino acid sequence.
  • Enzymes CL08099 is encoded by the nucleic acid sequence of SEQ ID NO: 19.
  • the CL08099 nucleic acid sequence encodes a 497 amino acid sequence shown in SEQ ID NO: 20. We believe the mature protein spans from about position 2 to about position 497 of the CL08099 amino acid sequence.
  • the enzyme CL08035 is encoded by the nucleic acid sequence of SEQ ID NO: 21 .
  • the CL08035 nucleic acid sequence encodes the 453 amino acid sequenceof SEQ ID NO 22.
  • a cDNA for this enzyme is shown in SEQ ID NO: 192.
  • the mature protein spans from about position 1 to about position 453 of the CL08035 amino acid sequence.
  • a catalytic domain CD
  • CL08035 can be assigned to GH5 of the CAZy families and is expected to have chitosanase (EC 3.2.1.132)and/or ⁇ -mannosidase (EC 3.2.1.25)and/or Cellulase (EC 3.2.1.4)and/or glucan 1,3- ⁇ - glucosidase (EC 3.2.1.58)and/or licheninase (EC 3.2.1.73)and/or glucan endo- ⁇ , ⁇ - ⁇ - glucosidase (EC 3.2.1.75)and/or mannan endo-P-l ,4-mannosidase (EC 3.2.1.78)and/or endo- ⁇ - 1 ,4-xylanase (EC 3.2.1.8)and/or cellulose ⁇ -1 ,4- cellobiosidase (EC 3.2.1.91 )and/or endo- ⁇ - 1 ,6-galactanase (EC 3.2.1.-)and
  • the enzyme CL00367 is encoded by the nucleic acid sequence of SEQ ID NO: 23.
  • the CL00367 nucleic acid sequence encodes the 582 amino acid sequence, of SEQ ID NO 23.
  • the signal peptide for CL00367 is located from positions 1 to about position 21 of the CL00367 amino acid sequence, with the mature protein spanning from about position 22 to about position 582 of the CL00367 amino acid sequence.
  • a catalytic domain CD
  • the amino acid sequence containing the CD of CL00367 spans from a starting point of about position 22 to an ending point of about position 582 of the CL00367 amino acid sequence.
  • CL00367 can be assigned to GH20 of the CAZy families and is expected to have ⁇ -hexosaminidase (EC 3.2.1.52) and/or lacto- N-biosidase (EC 3.2.1.140) and/or ⁇ - 1 ,6-N-acetylglucosaminidase) (EC 3.2.1 .-) and/or P-6-S03-N-acetylglucosaminidase (EC 3.2.1.-) activity.
  • CL00367 possesses significant homology (about 66.83% from amino 69 to 363 of CL00367) with Sequence 21 from patent US 6521435 (Genbank Accession No. AAP01 161 . 1 ).
  • the enzyme CL10658 is encoded by the nucleic acid sequence of SEQ ID NO: 25.
  • the CL10658 nucleic acid sequence encodes the 374 amino acid sequence of SEQ ID NO: 26.
  • the signal peptide for CL 10658 is located from position 1 to about position 21 of the CL10658 amino acid sequence, with the mature protein spanning from about position 22 to about position 374 of the CL10658 amino acid sequence.
  • a catalytic domain CD
  • the amino acid sequence containing the CD of CL10658 spans from a starting point of about position 27 to an ending point of about position 3 1 1 of the CL10658 amino acid sequence.
  • CL10658 can be assigned to GH76 of the CAZy families and is expected to have a-1 , 6-Mannanase activity.
  • the enzyme CL02715 is encoded by the nucleic acid sequence of SEQ ID NO: 27.
  • the CL02715 nucleic acid sequence encodes the 150 amino acid sequence of SEQ ID NO: 28.
  • the signal peptide for CL02715 is located from position 1 to about position 18 of the CL02715 amino acid sequence, with the mature protein spanning from about position 19 to about position 150 of the CL02715 amino acid sequence.
  • a catalytic domain CD
  • the amino acid sequence containing the CD of CL02715 spans from a starting point of about position 24 to an ending point of about position 146 of the CL02715 amino acid sequence. Based on homology, CL02715 can be assigned to CH76 of the CAZy families and is expected to have a-1 , 6-Mannanase activity.
  • the enzyme CL08540 is encoded by the nucleic acid sequence of SEQ ID NO: 29.
  • the CL08540 nucleic acid sequence encodes the 392 amino acid sequence of SEQ ID NO: 30. We believe the mature protein spans from about position 1 to about position 392 of the CL08540 amino acid sequence. Within CL08540 a catalytic domain (CD) is present. We believe the amino acid sequence containing the CD of CL08540 spans from a starting point of about position 26 to an ending point of about position 391 of the CL08540 amino acid sequence. Based on homology, CL08540 can be assigned to GH105 of the CAZy families and is expected to have rhamnogalacturonyl hydrolase activity.
  • CL08540 possesses significant homology (about 83% from amino acids 1 to 392 of CL08540) with a hypothetical protein from Chaetomium globosum (Genbank Accession No. EAQ84515). CL08540 also possesses homology (about 21% from amino acids 1 to 392 of CL08540) with an unsaturated rhamnogalacturonyl hydrolase from Bacillus subtilis subsp. subtilis (Genbank Accession No. CAB 14990).
  • the enzyme CL01467 is encoded by the nucleic acid sequence of SEQ ID NO: 31.
  • the CL01467 nucleic acid sequence encodes a 210 amino acid sequence of SEQ ID NO: 32 .
  • the signal peptide for CL01467 is located from position 1 to about position 24 of the CL01467 amino acid sequence, with the mature protein spanning from about position 25 to position 210 of the CL01467 amino acid sequence.
  • a catalytic domain CD
  • the amino acid sequence containing the CD of CL01467 spans from a starting point of about position 28 to an ending point of about position 209 of the CL01467 amino acid sequence.
  • CL01467 can be assigned to GH2 of the CAZy and is expected to have ⁇ -Galactosidase, ⁇ -mannosidase (EC 3.2. 1.25) and/or ⁇ - glucuronidase (EC 3.2.1.31 ) and/or mannosylglycoprotein endo- -mannosidase (EC 3.2.1.152) and/or exo-P-glucosaminidase (EC 3.2.1.165) activity.
  • CL01467 possesses significant homology (about 71% from amino acids 28 to 209 of CL01467) with a hypothetical protein from Chaetomium globosum (Genbank Accession No. EAQ89447).
  • CL01467 also possesses homology (about 53% from amino acids 56 to 209 of CL01467) with a ⁇ -galactosidase from Pyrenophora tritici- repentis Pt- lC-BFP (Genbank Accession No. EDU44846).
  • the enzyme CL00647 is encoded by the nucleic acid sequence of SEQ ID NO: 33.
  • CL00647 nucleic acid sequence encodes a 1080 amino acid sequence of SEQ ID NO: 34. We believe the mature protein spans from about position 1 to about position 1080 of the CL00647 amino acid sequence. Based on homology, CL00647 can be assigned to GH2 of the CAZy families and is expected to have ⁇ - Galactosidase, ⁇ -mannosidase (EC 3.2.1.25) and/or ⁇ -glucuronidase (EC 3.2.1.31 ) and/or mannosylglycoprotein endo-P-mannosidase (EC 3.2.1.152) and/or exo- ⁇ - glucosaminidase (EC 3.2.1.165) activity.
  • ⁇ - Galactosidase ⁇ -mannosidase
  • EC 3.2.1.31 ⁇ -glucuronidase
  • EC 3.2.1.152 mannosylglycoprotein endo-P-mannosidase
  • exo- ⁇ - glucosaminidase
  • the enzyme CL04789 is encoded by the nucleic acid sequence of SEQ ID NO: 35.
  • CL04789 nucleic acid sequence encodes a 450 amino acid sequence of SEQ ID NO: 36. We believe the mature protein spans from about position 1 to about position 450 of the CL04789 amino acid sequence. Based on homology, CL04789 can be assigned to GH5 of the CAZy families and is expected to have chitosanase (EC 3.2.1.132) and/or ⁇ -mannosidase (EC 3.2.1.25) and/or Cellulase (EC 3.2.1.4) and/or glucan l ,3 ⁇ -glucosidase (EC 3.2.1.58) and/or licheninase (EC 3.2.1 .73) and/or glucan endo- l ,6-P-glucosidase (EC 3.2.1.75)and/or mannan endo-P- l ,4- mannosidase (EC 3.2.1.78) and/or endo ⁇ - l ,4-xylanase (EC 3.2.
  • the enzyme CL10564 is encoded by the nucleic acid sequence of SEQ ID NO: 37.
  • CL10564 nucleic acid sequence encodes a 764 amino acid sequence of SEQ ID NO:38. We believe the mature protein spans from about position 1 to about position 764 of the CL10564 amino acid sequence. Based on homology, CL10564 can be assigned to GH5 of the CAZy families and is expected to have chitosanase (EC 3.2.
  • the enzyme CL07555 is encoded by the nucleic acid sequence of SEQ ID NO: 39.
  • the CL07555 nucleic acid sequence encodes a 568 amino acid sequence of SEQ ID NO: 40. We believe the mature protein spans from about positionl to about position 568 of the CL07555 amino acid sequence.
  • CL07555 can be assigned to GH13 of the CAZy families and is expected to have a-amylase (EC 3.2.1.1) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1.54) and/or trehalose-6-phosphate hydrolase (EC 3.2.
  • a-amylase EC 3.2.1.1
  • pullulanase EC 3.2.1.41
  • cyclomaltodextrin glucanotransferase EC 2.4.1.19
  • cyclomaltodextrinase EC 3.2.1.54
  • trehalose-6-phosphate hydrolase EC 3.2.
  • CL07555 possesses significant homology (about 46% amino acid sequence of CL07555) with a ⁇ -amylase from Aspergillus clavatus (Genbank Accession No. EAW 14194). CL07555 also possesses homology (about 45% amino acid sequence of CL07555) with An09g03100 (agtA) Aspergillus niger (Genbank Accession No. An09g03100).
  • the enzyme CL07554 is encoded by the nucleic acid of SEQ ID NO: 41 .
  • the CL07554 nucleic acid sequence encodes a 2380 amino acid sequence of SEQ ID NO: 42. We believe the mature protein spans from about position 1 to about position 2380 of the CL07554 amino acid sequence.
  • CL07554 can be assigned to GH13 of the CAZy families and is expected to have a-amylase (EC 3.2.1.1) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1 .54) and/or trehalose-6-phosphate hydrolase (EC 3.2.1.93) and/or oligo-a-glucosidase (EC 3.2.1.10) and/or mahogenic amylase (EC 3.2.1.133) and/or neopullulanase (EC 3.2.1.135) and/or a-glucosidase (EC 3.2.1.20) and/or maltotetraose-forming a- amylase (EC 3.2.1.60) and/or isoamylase (EC 3.2.1.68) and/or glucodextrana
  • CL07554 possesses significant homology (about 51 % amino acid sequence of CL07554) with a a-l ,3-glucan synthase Agsl from Aspergillus fumigates (Genbank Accession No. EAL86374). CL07554 also possesses homology (about 50% amino acid sequence of CL07554) with Anl 2g092450 (agsC) from Aspergillus niger (Genbank Accession No. Anl 2g02450).
  • CL07622 is encoded by the nucleic acid sequence of SEQ ID NO: 43.
  • the CL07622 nucleic acid sequence encodes a 631 amino acid sequence of SEQ ID NO: 44. We believe the mature protein spans from about position 1 to position 63 1 of the CL07622 amino acid sequence. Based on homology, CL07622 can be assigned to GH13 of the CAZy families and is expected to have ⁇ -amylase (EC 3.2.1.1 ) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.
  • CL07622 possesses significant homology (about 50% amino acid sequence of CL07622) with a ⁇ -amylase from Aspergillus flavus (Genbank Accession No.
  • CL07622 also possesses homology (about 47% amino acid sequence of CL07622) with An01 gl 3610 (amyD) from Aspergillus niger (Genbank Accession No. An01 gl 3610).
  • the enzyme CL09312 is encoded by the nucleic acid sequence of SEQ ID NO: 45.
  • CL09312 nucleic acid sequence encodes a 1274 amino acid sequence of SEQ ID NO: 46. We believe the mature protein spans from about position 1 to about position 1274 of the CL09312 amino acid sequence. Based on homology, CL09312 can be assigned to GH13 of the CAZy families and is expected to have a-amylase (EC 3.2.1.1) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1.54) and/or trehalose-6-phosphate hydrolase (EC 3.2.1.93) and/or oligo-a-glucosidase (EC 3.2.1.10) and/or maltogenic amylase (EC 3.2.1.133) and/or neopullulanase (EC 3.2.1.135) and/or a-glucosidase (EC 3.2.1.
  • CL09312 possesses significant homology (about 80% amino acid sequence of CL09312) with a 1 ,4-a-glucan branching enzyme from Neurospora crassa (Genbank Accession No. CAB91480). CL09312 also possesses homology (about 73% amino acid sequence of CL09312) with Anl4g04190 (gbeA) from Aspergillus niger (Genbank Accession No. Anl4g04190).
  • the enzyme CL07207 is encoded by the nucleic acid sequence represented in of SEQ ID NO: 47.
  • the CL07207 nucleic acid sequence encodes a 598 amino acid sequence of SEQ ID NO: 48. We believe the mature protein spans from about position 1 to about position 598 of the CL07207 amino acid sequence.
  • CL07207 can be assigned to GH13 of the CAZy families and is expected to have a-amylase (EC 3.2.1 .1 ) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1.54) and/or trehalose-6-phosphate hydrolase (EC 3.2.1.93) and/or oligo-a- glucosidase (EC 3.2.1.10) and/or maltogenic amylase (EC 3.2.1.133) and/or neopullulanase (EC 3.2.1.135) and/or a-glucosidase (EC 3.2.1 .20) and/or maltotetraose-forming ⁇ -amylase (EC 3.2.1.60) and/or isoamylase (EC 3.2.1.68) and/or glucodex
  • CL07207 possesses significant homology (about 58 % amino acid sequence of CL07207) with a maltase from Aspergillus clavatus (Genbank Accession No. EAW12052). CL07207 also possesses homology (about 44 % amino acid sequence of CL07207) with An02gl 3240 (agdC) from Aspergillus niger (Genbank Accession No. An02gl 3240).
  • the enzyme CL07198 is encoded by the nucleic acid sequence of SEQ ID NO: 49.
  • the CL07198 nucleic acid sequence encodes a 1 128 amino acid sequence of SEQ ID NO: 50. We believe the mature protein spans from about position 1 to about position 1 128 of the CL07198 amino acid sequence.
  • CL07198 can be assigned to GH 13 of the CAZy families and is expected to have a-amylase (EC 3.2.1.1 ) and/or pullulanase (EC 3.2.1 .41 ) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1.54) and/or trehalose-6-phosphate hydrolase (EC 3.2.1.93) and/or oligo-a-glucosidase (EC 3.2.1.10) and/or mahogenic amylase (EC 3.2.1.133) and/or neopullulanase (EC 3.2.1.135) and/or a-glucosidase (EC 3.2.1.20) and/or maltotetraose-forming a- amylase (EC 3.2.1.60) and/or isoamylase (EC 3.2.1.68) and/or glucodextran
  • CL07198 possesses significant homology (about 73 % amino acid sequence of CL07198) with a a-glucosidase from Neurospora crassa (Genbank Accession No. EAA32559). CL07198 also possesses homology (about 43 % amino acid sequence of CL07198) with AN3516.2 from Aspergillus nidulans (Genbank Accession No. AN3516.2).
  • the enzyme CLOl 136 is encoded by the nucleic acid sequence of SEQ ID NO: 51.
  • the CLOl 136 nucleic acid sequence encodes a 646 amino acid sequence of SEQ ID NO: 52. We believe the mature protein spans from about position 1 to about position 646 of the CLOl 136 amino acid sequence. Within CLOl 136 is a CBM20 domain, spanning from position 551 to about position 645.
  • CLOl 136 can be assigned to GH13 of the CAZy families and is expected to have a- amylase (EC 3.2.1.1 ) and/or pullulanase (EC 3.2.1.41) and/or cyclomaltodextrin glucanotransferase (EC 2.4.1.19) and/or cyclomaltodextrinase (EC 3.2.1.54) and/or trehalose-6-phosphate hydrolase (EC 3.2.1.93) and/or oligo-a-glucosidase (EC 3.2.1.10) and/or maltogenic amylase (EC 3.2.1.133) and/or neo pullulanase (EC 3.2.1.135).
  • a- amylase EC 3.2.1.1
  • pullulanase EC 3.2.1.41
  • cyclomaltodextrin glucanotransferase EC 2.4.1.19
  • cyclomaltodextrinase EC
  • a-glucosidase EC 3.2.1.20
  • maltotetraose-forming a- amylase EC 3.2.1.60
  • isoamylase EC 3.2.1.68
  • glucodextranase EC 3.2.1.70
  • maltohexaose-forming a-amylase EC 3.2.1.98
  • maltotriose- forming a-amylase EC 3.2.1.1 16)and or branching enzyme (EC 2.4.1 .18)and/or trehalose synthase (EC 5.4.99.16)and/or 4-a-glucanotransferase (EC 2.4.1.25)and or maltopentaose-forming ⁇ -amylase (EC 3.2.1.-) and/or amylosucrase (EC 2.4.1.4) and/or sucrose phosphorylase (EC 2.4.1.7)and/or malto-oligosyltrehalose tre
  • CLOl 136 possesses significant homology (about 47 % amino acid sequence of CLOl 136) with an ⁇ -amylase from Talaromyces stipitatus (Genbank Accession No. EED21740). CLOl 136 also possesses homology (about 45 % amino acid sequence of CLOl 136) with An04g06930 (amyC) from Aspergillus niger (Genbank Accession No. An04g06930).
  • the enzyme CL01049 is encoded by the nucleic acid sequence of SEQ ID NO: 53.
  • the CLOl 049 nucleic acid sequence encodes a 593 amino acid sequence of SEQ ID NO: 54. We believe the mature protein spans from about position 1 to about position 593 of the CL01049 amino acid sequence. Based on homology, CL01049 can be assigned to GH15 of the CAZy families and is expected to have glucoamylase (EC 3.2.1.3) and/or glucodextranase (EC 3.2.1.70) and/or ⁇ , ⁇ -trehalase (EC 3.2.1.28) activity.
  • Enzyme CL03904 is encoded by the nucleic acid sequence of SEQ ID NO: 55 which encodes the 410 amino acid sequence in SEQ ID NO: 56; enzyme CL08085 is encoded by the nucleic acid sequence of SEQ ID NO: 57 which encodes the 394 amino acid sequence in SEQ ID NO: 58; enzyme CL07129 is encoded by the nucleic acid sequence of SEQ ID NO: 59 which encodes the 419 amino acid sequence in SEQ ID NO: 60; enzyme CL06253 is encoded by the nucleic acid sequence of SEQ ID NO: 61 which encodes the 262 amino acid sequence in SEQ ID NO:62; enzyme CL00361 is encoded by the nucleic acid sequence of SEQ ID NO: 63 which encodes the 674 amino acid sequence in SEQ ID NO: 64; enzyme CL08631 is encoded by the nucleic acid sequence of SEQ ID NO: 65 which encodes the 285 amino acid sequence in SEQ ID NO: 66; enzyme CL10952 is encoded by the nucleic acid
  • CL03904, CL08085, CL07129, CL06253, CL00361 , CL08631 , CL10952, CL04578, CL05406, and CL01088 can be assigned to GH16 of the CAZy families and are expected to have xyloglucan.xyloglucosykransferase (EC 2.4.1.207) and/or keratan-sulfate endo-1 ,4- ⁇ -galactosidase (EC 3.2.1.103) and or endo- l ,3-P-glucanase (EC 3.2.1.39) and/or endo- l ,3(4)- -glucanase (EC 3.2.1.6) and/or licheninase (EC 3.2.1.73) and/or ⁇ - agarase (EC 3.2.1.81) and/or ⁇ -carrageenase (EC 3.2.1.83) and/or xyloglucanase (EC 3.2.1.151 )
  • Enzyme CL07062 is encoded by the nucleic acid sequence of SEQ ID NO: 75 which encodes the 405 amino acid sequence in SEQ ID NO: 76 and enzyme CL01490 is encoded- by the nucleic acid sequence of SEQ ID NO: 77 which encodes the 732 amino acid sequence in SEQ ID NO: 78.. Based on homology, CL07062 and CL01490 can be assigned to GH 17 of the CAZy families and are expected to have ⁇ -glucanase activity.
  • Enzyme CL00266 is encoded by the nucleic acid sequence of SEQ ID NO: 79 which encodes the 497 amino acid sequence in SEQ ID NO: 80: enzyme CL07859 is encoded by the nucleic acid sequence of SEQ ID NO: 81 which encodes the 1422 amino acid sequence in SEQ ID NO: 82: enzyme CL08692 is encoded by the nucleic acid sequence of SEQ ID NO: 83 which encodes the 1468 amino acid sequence in SEQ ID NO: 84: enzyme CL10715 is encoded by the nucleic acid sequence of SEQ ID NO: 85 which encodes the 781 amino acid sequence in SEQ ID NO: 86: enzyme CL1 1520 is encoded by the nucleic acid sequence of SEQ ID NO: 87 which encodes the 51 1 amino acid sequence in SEQ ID NO: 88: enzyme CL 10251 is encoded by the nucleic acid sequence of SEQ ID NO: 89 which encodes the 43 1 amino acid sequence in SEQ ID NO: 90: enzyme CL051 57 is
  • CL00266, CL07859, CL08692, CL10715, CL 1 1520, CL 1025 1 , CL05157, and CL01848 can be assigned to GH18 of the CAZy families and are expected to have chitinase activity.
  • Enzyme CL00553 is encoded by the nucleic acid sequence of SEQ ID NO: 95: enzyme and has the 984 amino acid sequence of SEQ ID NO: 96;
  • CL05538 is encoded by the nucleic acid sequence of SEQ ID NO: 97 which encodes the 2024 amino acid sequence in SEQ ID NO: 98;
  • enzyme CL05124 is encoded by the nucleic acid sequence of SEQ ID NO: 99 which encodes the 2070 amino acid sequence in SEQ ID NO: 100;
  • enzyme and CL07846 is encoded by the nucleic acid sequence of SEQ ID NO: 101 which encodes the 669 amino acid sequence in SEQ ID NO: 102.
  • CL00553, CL05538, CL05124, and CL07846 can be assigned to GH31 of the CAZy families and are expected to have a-glucosidase (EC 3.2. 1 .20) and/or a- l ,3-glucosidase (EC 3.2. 1.84) and/or sucrase-isomaltase (EC 3.2. 1.48) (EC 3.2.1. 10) and/or a-xylosidase (EC 3.2.1.-) and/or a-glucan lyase (EC 4.2.2.13) and/or isomaltosyltransferase (EC 2.4.1 .-) activity.
  • a-glucosidase EC 3.2. 1 .20
  • a- l ,3-glucosidase EC 3.2. 1.84
  • sucrase-isomaltase EC 3.2.1. 10
  • a-xylosidase EC 3.2.1.-
  • CL02470 is encoded by the nucleic acid sequence of SEQ ID NO: 103 which encodes the 888 amino acid sequence in SEQ ID NO: 104. Based on homology, CL02470 can be assigned to GH3 1 of the CAZy families and is expected to have a-glucosidase (EC 3.2.1.20) and/or a- l ,3-glucosidase (EC 3.2.1.84) and/or sucrase-isomaltase (EC 3.2.1 .48) (EC 3.2.1.10) and/or a-xylosidase (EC 3.2.1.-) and/or a-glucan lyase (EC 4.2.2.13) and/or isomaltosyltransferase (EC 2.4.1.-) activity.
  • a-glucosidase EC 3.2.1.20
  • a- l ,3-glucosidase EC 3.2.1.84
  • sucrase-isomaltase EC 3.2.1 .
  • the enzyme CL00928 is encoded by the nucleic acid sequence of SEQ ID NO: 105 which encodes the 1540 amino acid sequence in SEQ ID NO: 106. Based on homology, CL00928 can be assigned to GH38 of the CAZy families and are expected to have a-mannosidase activity.
  • CL07803 is encoded by the nucleic acid sequence of SEQ ID NO: 107 which encodes the 614 amino acid sequence in SEQ ID NO: 108. Based on homology, CL07803 can be assigned to GH43 of the CAZy families and is expected to have ⁇ -xylosidase (EC 3.2.1.37) and/or P-l ,3-xylosidase (EC 3.2.1.-) and/or a-L- arabinofuranosidase (EC 3.2.1.55) and/or arabinanase (EC 3.2.1.99) and/or xylanase (EC 3.2.1.8) and/or galactan l ,3-P-galactosidase (EC 3.2.1.145) activity.
  • ⁇ -xylosidase EC 3.2.1.37
  • P-l ,3-xylosidase EC 3.2.1.-
  • a-L- arabinofuranosidase EC 3.2.1.55
  • Enzyme CL01017 is encoded by the nucleic acid sequence of SEQ ID NO: 109 which encodes the 605 amino acid sequence in SEQ ID NO: 1 10; enzyme CL00184 is encoded by the nucleic acid sequence of SEQ ID NO: 1 1 1 which encodes the 1 1 13 amino acid sequence in SEQ ID NO: 1 12, enzyme CL00425 is encoded by the nucleic acid sequence of SEQ ID NO: 1 13 which encodes the 955 amino acid sequence in SEQ ID NO: 1 14, enzyme CL04755 is encoded by the nucleic acid sequence of SEQ ID NO: 1 15 which encodes the 660 amino acid sequence in SEQ ID NO: 1 16, enzyme CL08058 is encoded by the nucleic acid sequence of SEQ ID NO: 1 17 which encodes the 1010 amino acid sequence in SEQ ID NO: 1 18; enzyme CL08951 is encoded by the nucleic acid sequence of SEQ ID NO: 1 19 which encodes the 646 amino acid sequence in SEQ ID NO: 120; and enzyme CL03133 is
  • CL01017, CL00184, CL00425, CL04755, CL08058, CL08951 , and CL03133 can be assigned to GH47 of the CAZy families and are expected to have a-mannosidase activity.
  • Enzyme CL03845 is encoded by the nucleic acid sequence of SEQ ID NO: 123 which encodes the 497 amino acid sequence in SEQ ID NO: 124;
  • CL 10907 is encoded by the nucleic acid sequence of SEQ ID NO: 125 which encodes the 459 amino acid sequence in SEQ ID NO: 126;
  • CL06178 is encoded by the nucleic acid sequence of SEQ ID NO: 127 which encodes the 780 amino acid sequence in SEQ ID NO: 128;
  • CL00825 is encoded by the nucleic acid sequence of SEQ ID NO: 129 which encodes the 1295 amino acid sequence in SEQ ID NO: 130.. Based on homology, CL03845, CL10907, CL06178, and CL00825 can be assigned to GH55 of the CAZy families and are expected to have ⁇ -glucanase activity.
  • CL04536 is encoded by the nucleic acid sequence of SEQ ID NO: 131 which encodes the 838 amino acid sequence of SEQ ID NO: 132. Based on homology, CL04536 can be assigned to GH 63 of the CAZy families and are expected to have a-glucosidase activity.
  • the enzyme CL08404 is encoded by the nucleic acid sequence of SEQ ID NO: 133 which encodes the 1834 amino acid sequence of SEQ ID NO: 134. Based on homology, CL08404 can be assigned to GH71 of the CAZy families and are expected to have a-l,3-glucanase activity.
  • the enzyme CL1 1557 is encoded by the nucleic acid sequence of SEQ ID NO: 135 which encodes the 648 amino acid sequence of SEQ ID NO: 136 Based on homology, CL1 1557 can be assigned to GH71 of the CAZy families and CBM24 and are expected to have a-l ,3-glucanase activity.
  • Enzyme CL09156 is encoded by the nucleic acid sequence of SEQ ID NO: 137 which encodes the 539 amino acid sequence in SEQ ID NO: 138;
  • CL04709 is encoded by the nucleic acid sequence of SEQ ID NO: 139 which encodes the 745 amino acid sequence in SEQ ID NO: 140;
  • CL05353 is encoded by the nucleic acid sequence of SEQ ID NO: 141 which encodes the 454 amino acid sequence in SEQ ID NO: 142;
  • CL03139 is encoded by the nucleic acid sequence of SEQ ID NO: 143 which encodes the 460 amino acid sequence in SEQ ID NO: 144;.
  • CL09156, CL04709, CL05353, and CL03139 be assigned to GH72 of the CAZy families and are expected to have f ⁇ -l ,3-glucanosyltransglycosylase activity.
  • the enzyme CL08065 is encoded by the nucleic acid sequence of SEQ ID NO: 145 which encodes the 1501 amino acid sequence of SEQ ID NO: 146. Based on homology, CL08065 can be assigned to GH81 of the CAZy families and are expected to have p-l ,3-endoglucanase activity.
  • Enzyme CL03179 is encoded by the nucleic acid sequence of SEQ ID NO: 147 which encodes the 914 amino acid sequence in SEQ ID NO: 148; and CL08218 is encoded by the nucleic acid sequence of SEQ ID NO: 149 which encodes the 1377 amino acid sequence in SEQ ID NO: 150.. Based on homology, CL03179 and CL08218 be assigned to GH92 of the CAZy families and are expected to have a- mannosidase activity. [0206] The enzyme CL02507 is encoded by the nucleic acid sequence of SEQ ID NO: 151 which encodes a 236 amino acid sequence of SEQ ID NO: 152.
  • CL02507 is located from positions 1 to about position 18 of the CL02507 amino acid sequence, with the mature protein spanning from about position 19 to position 236 of the CL02507 amino acid sequence.
  • a catalytic domain CD
  • CL02507 is expected to have carbohydrate hydrolase activity.
  • CL02507 possesses significant homology (about 80% % from amino acids 1 to 236 of CL02507) with a hypothetical protein from Chaetomium globosum (Genbank Accession No. EAQ87122)1
  • CL02507 also possesses homology (about 67% from amino acids 1 to 235 of CL02507) with Cipl from Hypocrea jecorina (Genbank Accession No. AAP57751).
  • Enzyme CL10197 is encoded by the nucleic acid sequence of SEQ ID NO: 153 which encodes the 144 amino acid sequence in SEQ ID NO: 154;
  • CL05515 is encoded by the nucleic acid sequence of SEQ ID NO: 155 which encodes the 262 amino acid sequence in SEQ ID NO: 156.. Based on homology, CL 10197 and CL05515 can be assigned to have expansin activity.
  • Enzyme CL10827 is encoded by the nucleic acid sequence of SEQ ID NO: 157 .
  • the CL10827 nucleic acid sequence encodes a 326 amino acid sequence of SEQ ID NO: 1 58 .
  • signal peptide for CL10827 is located from positions 1 to about position 19 of the CL 10827 amino acid sequence, with the mature protein spanning from about position 20 to position 326 of the CL02507 amino acid sequence.
  • a catalytic domain CD
  • the amino acid sequence containing the CD of CL10827 spans from a starting point of about position 20 to an ending point of about position 230 of the CL10827 amino acid sequence. Based on homology, CL10827 is expected to have carbohydrate hydrolase activity.
  • Within CL10827 is a CBM1 domain.
  • Enzyme CL06181 is encoded by the nucleic acid sequence of SEQ ID NO: 159 which encodes the 447 amino acid sequence of SEQ ID NO 160.
  • SEQ ID NO: 159 contains exons starting at position 201 to 347, position 644 to position 747, position 848 to 1523, position 1587 to 1773 and position 1902 to 1971. Introns are present from position 348 to 643, position 748 to 847, position 1524 to 1586 and position 1774 to 1901 .
  • a CDNA for this amino acid sequence is shown in SEQ ID NO: 193.
  • CL061 81 is expected to have carbohydrate hydrolase and/or carbohydrate hydrolase enhancing activity. Within CL061 81 is a CBM 1 domain.
  • Enzyme CL03850 is encoded by the nucleic acid sequence of SEQ ID NO: 161 which encodes the 344 amino acid sequence in SEQ ID NO: 162.
  • SEQ ID NO: 161 contains exons starting at position 168 to 537, position 680 to position 1044, position 1219 to 1292, position 1390 to 1 562 and position 1643 to 1695. Introns are present from position 538 to 679, position 1045 to 1218, position 1293 to 1389 and position 1 563 to 1642.
  • a cDNA for this amino acid sequence is show in SEQ ID NO: 194.
  • Enzyme CL04757 is encoded by the nucleic acid sequence of SEQ ID NO: 163 which encodes the 351 amino acid sequence in SEQ ID NO: 164 ( a CDNA for this amino acid sequence is show in SEQ ID NO: 195); enzyme CL05156 is encoded by the nucleic acid sequence of SEQ ID NO: 165 which encodes the 488 amino acid sequence in SEQ ID NO: 166; enzyme CL05974 is encoded by the nucleic acid sequence of SEQ ID NO: 167 which encodes the 418 amino acid sequence in SEQ ID NO: 168; enzyme CL06157 is encoded by the nucleic acid sequence of SEQ ID NO: 169 which encodes the 683 amino acid sequence in SEQ ID NO: 1 70; enzyme CL06246 is encoded by the nucleic acid sequence of SEQ ID NO: 163 which encodes the 351 amino acid sequence in SEQ ID NO: 164 ( a CDNA for this amino acid sequence is show in SEQ ID NO: 195); enzyme CL05156 is encoded by the
  • CL03850, CL04757, CL05156, CL05974, CL061 57, CL06246, CL08554, CL09803, CL10302, CL10988, and CL1 1320 are expected to have carbohydrate hydrolase and/or carbohydrate hydrolase enhancing acticity.
  • Enzyme CL09027 is encoded by the nucleic acid sequence of SEQ ID NO: 1 83 which encodes the 348 amino acid sequence in SEQ ID NO: 184.
  • SEQ ID NO: 183 contains exons starting at position 99 to 243, position 301 to position 939, and position 1038 to 1300. Introns are present from position 244 to 300, and position 940 to 1037.
  • a cDNA for this amino acid sequence is show in SEQ ID NO: 196.
  • the signal peptide for CL09027 is located from positions 1 to about position 18 of the CL09027 amino acid sequence, with the mature protein spanning from about position 19 to about position 348 of the CL09027 amino acid sequence.
  • Within CL09027 is a CBM1 domain.
  • enzyme CL06576 is encoded by the nucleic acid sequence of SEQ ID NO: 185 ⁇ which encodes the 252 amino acid sequence in SEQ ID NO: 186; and enzyme CL00796 is encoded by the nucleic acid sequence of SEQ ID NO: 187 which encodes the 335 amino acid sequence in SEQ ID NO: 188. Based on homology, CL09027, CL06576, and CL00796 are expected to have hydrolase and/or hydrolase enhancing activity.
  • an isolated protein or polypeptide in the present invention includes full-length proteins and their glycosylated or otherwise modified forms forms, fusion proteins, or any fragment or homologue or variant of such a protein.
  • an isolated protein such as an enzyme according to the present invention, is a protein (including a polypeptide or peptide) that has been removed from its natural milieu (i.e. , that has been subject to human manipulation) and can include purified proteins, partially purified proteins, recombinantly produced proteins, synthetically produced proteins, proteins complexed with lipids, soluble proteins, and isolated proteins associated with other proteins, for example.
  • a "M. thermophila protein” or “M. thermophila enzyme” refers to a protein (generally including a homologue or variant of a naturally occurring protein) from Myceliophthora thermophila or to a protein that has been otherwise produced from the knowledge of the structure (e.g., sequence) and perhaps the function of a naturally occurring protein from Myceliophthora thermophila
  • a M. thermophila protein includes any protein that has substantially similar structure and function of a naturally occurring M.
  • thermophila protein or that is a biologically active (i.e., has biological activity) homologue or variant of a naturally occurring protein from M. thermophila as described in detail herein.
  • a thermophila protein can include purified, partially purified, recombinant, mutated/modified and synthetic proteins.
  • modifications can be used interchangeably, particularly with regard to the modifications/mutations to the amino acid sequence of a M. thermophila protein (or nucleic acid sequences) described herein.
  • An isolated protein according to the present invention can be isolated from its natural source, produced recombinantly or produced synthetically.
  • modification and “mutation” can be used interchangeably, particularly with regard to the modifications/mutations to the primary amino acid sequences of a protein or peptide (or nucleic acid sequences) described herein.
  • modification can also be used to describe post- translational modifications to a protein or peptide including, but not limited to, methylation, farnesylation, carboxymethylation, geranyl geranylation, glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, and/or amidation. Modification can also included the cleavage of a signal peptide, or methionine, or other portions of the peptide that require cleavage to generate the mature peptide.
  • homologue or “variants” are used to refer to a protein or peptide which differs from a naturally occurring protein or peptide (i.e., the "prototype” or “wild-type” protein) by minor modifications to the naturally occurring protein or peptide, but which maintains the basic protein and side chain structure of the naturally occurring form.
  • Such changes include, but are not limited to: changes in one or a few amino acid side chains; changes in one or a few amino acids, including deletions (e.g., a truncated version of the protein or peptide), insertions and/or substitutions; changes in stereochemistry of one or a few atoms; and/or minor derivatizations, including but not limited to for example methylation, glycosylation and phosphorylation.
  • a homologue or variant can have either enhanced, decreased, or substantially similar properties as compared to the naturally occurring protein or peptide.
  • Homologues or variants can be the result of natural allelic variation or natural mutation.
  • a naturally occurring allelic variant of a nucleic acid encoding a protein is a gene that occurs at essentially the same locus (or loci) in the genome as the gene which encodes such protein, but which, due to natural variations caused by, for example, mutation or recombination, has a similar but not identical sequence.
  • Homologues can also be the result of a gene duplication and rearrangement, resulting in a different location.
  • Allelic variants typically encode proteins having similar activity to that of the protein encoded by the gene to which they are being compared.
  • One class of allelic variants can encode the same protein but have different nucleic acid sequences due to the degeneracy of the genetic code.
  • Allelic variants can also comprise alterations in the 5' or 3' untranslated regions of the gene (e.g., in regulatory control regions). Allelic variants are well known to those skilled in the art.
  • Homologues or variants can be produced using techniques known in the art for the production of proteins including, but not limited to, direct modifications to the isolated, naturally occurring protein, direct protein synthesis, or modifications to the nucleic acid sequence encoding the protein using, for example, classic or recombinant DNA techniques to effect random or targeted mutagenesis.
  • Modifications of a protein may result in proteins having the same biological activity as the naturally occurring protein, or in proteins having decreased or increased biological activity as compared to the naturally occurring protein. Modifications which result in a decrease in protein expression or a decrease in the activity of the protein, can be referred to as inactivation (complete or partial), down-regulation, or decreased action of a protein. Similarly, modifications which result in an increase in protein expression or an increase in the activity of the protein, can be referred to as amplification, overproduction, activation, enhancement, up-regulation or increased action of a protein.
  • an isolated protein including a biologically active homologue, variant, or fragment thereof, has at least one characteristic of biological activity of a wild-type, or naturally occurring, protein.
  • the biological activity or biological action of a protein refers to any function(s) exhibited or performed by the protein that is ascribed to the naturally occurring form of the protein as measured or observed in vivo (i.e., in the natural physiological environment of the protein) or in vitro (i.e., under laboratory conditions).
  • the biological activity of a protein of the present invention can include an enzyme activity (catalytic activity and/or substrate binding activity), such as cellulase activity, hemicellulase activity, ⁇ -glucanase activity, ⁇ -glucosidase activity, a-galactosidase activity, ⁇ -galactosidase activity, xylanase activity or any other activity disclosed herein.
  • an enzyme activity catalytic activity and/or substrate binding activity
  • cellulase activity hemicellulase activity
  • ⁇ -glucanase activity ⁇ -glucosidase activity
  • a-galactosidase activity a-galactosidase activity
  • xylanase activity xylanase activity
  • Such assays include, but are not limited to, measurement of enzyme activity (e.g., catalytic activity), measurement of substrate binding, and the like. It is noted that an isolated protein of the present invention (including homologues or variants) is not required to have a biological activity such as catalytic activity.
  • a protein can be a truncated, mutated or inactive protein, or lack at least one activity of the wild-type enzyme, for example. Inactive proteins may be useful in some screening assays, for example, or for other purposes such as antibody production.
  • Methods to measure protein expression levels of a protein according to the invention include, but are not limited to enzyme activity measurements, western blotting, immunocytochemistry, flow cytometry or other immuno logic-based assays; assays based on a property of the protein including but not limited to, ligand binding or interaction with other protein partners.
  • Homologues or variants of a protein encompassed by the present invention can comprise, consist essentially of, or consist of, in one embodiment, an amino acid sequence that is at least about 35% identical, and more preferably at least about 40% identical, and more preferably at least about 45% identical, and more preferably at least about 50% identical, and more preferably at least about 55% identical, and more preferably at least about 60% identical, and more preferably at least about 65% identical, and more preferably at least about 70% identical, and more preferably at least about 75% identical, and more preferably at least about 80% identical, and more preferably at least about 85% identical, and more preferably at least about 90% identical, and more preferably at least about 95% identical, and more preferably at least about 96% identical, and more preferably at least about 97% identical, and more preferably at least about 98% identical, and more preferably at least about 99% identical, or any percent identity between 35% and 99%, in whole integers (i.e., 36%, 37%, etc.), to an amino acid sequence disclosed
  • the amino acid sequence of the homologue or variant has a biological activity of the wild-type or reference protein or of a biologically active domain thereof ⁇ e.g., a catalytic domain).
  • the amino acid position of the wild-type is typically used.
  • the wild-type can also be referred to as the "parent.” Additionally, any generation before the variant at issue can be a parent.
  • a protein of the present invention comprises, consists essentially of, or consists of an amino acid sequence that, alone or in combination with other characteristics of such proteins disclosed herein, is less than 100% identical to an amino acid sequence selected from SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:58, SEQ ID NO:60
  • a protein of the present invention can be less than 100% identical, in combination with being at least about 35% identical, to a given disclosed sequence.
  • a homologue or variant according to the present invention has an amino acid sequence that is less than about 99% identical to any of such amino acid sequences, and in another embodiment, is less than about 98% identical to any of such amino acid sequences, and in another embodiment, is less than about 97% identical to any of such amino acid sequences, and in another embodiment, is less than about 96% identical to any of such amino acid sequences, and in another embodiment, is less than about 95% identical to any of such amino acid sequences, and in another embodiment, is less than about 94% identical to any of such amino acid sequences, and in another embodiment, is less than about 93% identical to any of such amino acid sequences, and in another embodiment, is less than about 92% identical to any of such amino acid sequences, and in another embodiment, is less than about 91% identical to any of such amino acid sequences, and in another embodiment, is less than about 90% identical to
  • a percent (%) identity refers to an evaluation of homology which is performed using: (1 ) a BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches and blastn for nucleic acid searches with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S.F., Madden, T.L., Schaaffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D.J. (1997) "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res.
  • PSI- BLAST provides an automated, easy-to-use version of a "profile" search, which is a sensitive way to look for sequence homologues or variants.
  • the program first performs a gapped BLAST database search.- The PSI-BLAST program uses the information from any significant alignments returned to construct a position-specific score matrix, which replaces the query sequence for the next round of database searching. Therefore, it is to be understood that percent identity can be determined by using any one of these programs.
  • BLAST 2 sequence alignment is performed in blastp or blastn using the BLAST 2.0 algorithm to perform a Gapped BLAST search (BLAST 2.0) between the two sequences allowing for the introduction of gaps (deletions and insertions) in the resulting alignment.
  • BLAST 2.0 Gapped BLAST search
  • a BLAST 2 sequence alignment is performed using the standard default parameters as follows.
  • a protein of the present invention can also include proteins having an amino acid sequence comprising at least 10 contiguous amino acid residues of any of the sequences described herein (i.e. , 10 contiguous amino acid residues having 100% identity with 10 contiguous amino acids of the amino acid sequences of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:
  • a homologue or variant of a protein amino acid sequence includes amino acid sequences comprising at least 20, or at least 30, or at least 40, or at least 50, or at least 75, or at least 100, or at least 125, or at least 150, or at least 175, or at least 150, or at least 200, or at least 250, or at least 300, or at least 350 contiguous amino acid residues of any of the amino acid sequence represented disclosed herein.
  • Even small fragments of proteins without biological activity are useful in the present invention, for example, in the preparation of antibodies against the full-length protein or in a screening assay (e.g., a binding assay).
  • Fragments can also be used to construct fusion proteins, for example, where the fusion protein comprises functional domains from two or more different proteins (e.g., a CBM from one protein linked to a CD from another protein).
  • a homologue or variant has a measurable or detectable biological activity associated with the wild-type protein (e.g., enzymatic activity).
  • the term "contiguous” or “consecutive”, with regard to nucleic acid or amino acid sequences described herein, means to be connected in an unbroken sequence.
  • a first sequence to comprise 30 contiguous (or consecutive) amino acids of a second sequence means that the first sequence includes an unbroken sequence of 30 amino acid residues that is 100% identical to an unbroken sequence of 30 amino acid residues in the second sequence.
  • a first sequence to have "100% identity" with a second sequence means that the first sequence exactly matches the second sequence with no gaps between nucleotides or amino acids.
  • a protein of the present invention includes a protein having an amino acid sequence that is sufficiently similar to a natural amino acid sequence that a nucleic acid sequence encoding the homologue or variant is capable of hybridizing under moderate, high or very high stringency conditions (described below) to (i.e., with) a nucleic acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid strand encoding the natural amino acid sequence).
  • a homologue or variant of a protein of the present invention is encoded by a nucleic acid molecule comprising a nucleic acid sequence that hybridizes under low, moderate, or high stringency conditions to the complement of a nucleic acid sequence that encodes a protein comprising, consisting essentially of, or consisting of, an amino acid sequence represented by any of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:40
  • a nucleic acid sequence complement of nucleic acid sequence encoding a protein of the present invention refers to the nucleic acid sequence of the nucleic acid strand that is complementary to the strand which encodes the protein. It will be appreciated that a double stranded DNA which encodes a given amino acid sequence comprises a single strand DNA and its complementary strand having a sequence that is a complement to the single strand DNA.
  • nucleic acid molecules of the present invention can be either double-stranded or single-stranded, and include those nucleic acid molecules that form stable hybrids under stringent hybridization conditions with a nucleic acid sequence that encodes an amino acid sequence such as the amino acid sequences of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56
  • hybridization conditions refers to standard hybridization conditions under which nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., (see specifically, pages 9.31 -9.62). In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al., ibid.
  • moderate stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 30% or less mismatch of nucleotides).
  • High stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 80% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 20% or less mismatch of nucleotides).
  • Very high stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 90% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 10% or less mismatch of nucleotides).
  • conditions permitting about 10% or less mismatch of nucleotides i.e., one of skill in the art can use the formulae in Meinkoth et al., ibid, to calculate the appropriate hybridization and wash conditions to achieve these particular levels of nucleotide mismatch. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for DNA:DNA hybrids are 10°C less than for DNA:RNA hybrids.
  • stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 6X SSC (0.9 M Na*) at a temperature of between about 20°C and about 35°C (lower stringency), more preferably, between about 28°C and about 40°C (more stringent), and even more preferably, between about 35°C and about 45°C (even more stringent), with appropriate wash conditions.
  • stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic strength of 6X SSC (0.9 M Na at a temperature of between about 30°C and about 45°C, more preferably, between about 38°C and about 50°C, and even more preferably, between about 45°C and about 55°C, with similarly stringent wash conditions.
  • hybridization conditions can include a combination of salt and temperature conditions that are approximately 20-25°C below the calculated T m of a particular hybrid, and wash conditions typically include a combination of salt and temperature conditions that are approximately 12-20°C below the calculated T m of the particular hybrid.
  • hybridization conditions suitable for use with DNA:DNA hybrids includes a 2-24 hour hybridization in 6X SSC (50% formamide) at about 42°C, followed by washing steps that include one or more washes at room temperature in about 2X SSC, followed by additional washes at higher temperatures and lower ionic strength (e.g., at least one wash as about 37°C in about 0. 1 X-0.5X SSC, followed by at least one wash at about 68°C in about 0.1X-0.5X SSC).
  • the minimum size of a protein and/or homologue or variant of the present invention is a size sufficient to have biological activity or, when the protein is not required to have such activity, sufficient to be useful for another purpose associated with a protein of the present invention, such as for the production of antibodies that bind to a naturally occurring protein.
  • the protein of the present invention is at least 20 amino acids in length, or at least about 25 amino acids in length, or at least about 30 amino acids in length, or at least about 40 amino acids in length, or at least about 50 amino acids in length, or at least about 60 amino acids in length, or at least about 70 amino acids in length, or at least about 80 amino acids in length, or at least about 90 amino acids in length, or at least about 100 amino acids in length, or at least about 125 amino acids in length, or at least about 150 amino acids in length, or at least about 1 75 amino acids in length, or at least about 200 amino acids in length, or at least about 250 amino acids in length, and so on up to a full length of each protein, and including any size in between in increments of one whole integer (one amino acid).
  • the protein can include a portion of a protein or a full-length protein, plus additional sequence (e.g., a fusion protein sequence), if desired.
  • the present invention also includes a fusion protein that includes a domain of a protein of the present invention (including a homologue or variant) attached to one or more fusion segments, which are typically heterologous in sequence to the protein sequence (i.e., different than protein sequence).
  • Suitable fusion segments for use with the present invention include, but are not limited to, segments that can: enhance a protein's stability; provide other desirable biological activity; and/or assist with the purification of the protein (e.g., by affinity chromatography).
  • a suitable fusion segment can be a domain of any size that has the desired function (e.g. , imparts increased stability, solubility, action or biological activity; and/or simplifies purification of a protein).
  • Fusion segments can be joined to amino and/or carboxyl termini of the domain of a protein of the present invention and can be susceptible to cleavage in order to enable straight-forward recovery of the protein.
  • Fusion proteins are preferably produced by culturing a recombinant cell transfected with a fusion nucleic acid molecule that encodes a protein including the fusion segment attached to either the carboxyl and/or amino terminal end of a domain of a protein of the present invention.
  • proteins of the present invention also include expression products of gene fusions (for example, used to overexpress soluble, active forms of the recombinant protein), of mutagenized genes (such as genes having codon modifications to enhance gene transcription and translation), and of truncated genes (such as genes having membrane binding modules removed to generate soluble forms of a membrane protein, or genes having signal sequences removed which are poorly tolerated in a particular recombinant host).
  • gene fusions for example, used to overexpress soluble, active forms of the recombinant protein
  • mutagenized genes such as genes having codon modifications to enhance gene transcription and translation
  • truncated genes such as genes having membrane binding modules removed to generate soluble forms of a membrane protein, or genes having signal sequences removed which are poorly tolerated in a particular recombinant host.
  • any of the amino acid sequences described herein can be produced with from at least one, and up to about 20, additional heterologous amino acids flanking each of the C- and/or N-terminal ends of the specified amino acid sequence.
  • the resulting protein or polypeptide can be referred to as "consisting essentially of the specified amino acid sequence.
  • the heterologous amino acids are a sequence of amino acids that are not naturally found (i.e., not found in nature, in vivo) flanking the specified amino acid sequence, or that are not related to the function of the specified amino acid sequence, or that would not be encoded by the nucleotides that flank the naturally occurring nucleic acid sequence encoding the specified amino acid sequence as it occurs in the gene, if such nucleotides in the naturally occurring sequence were translated using standard codon usage for the organism from which the given amino acid sequence is derived.
  • the present invention also provides enzyme combinations that break down lignocellulose material.
  • Such enzyme combinations or mixtures can include a multi- enzyme composition that contains at least one protein of the present invention in combination with one or more additional proteins of the present invention or one or more enzymes or other proteins from other microorganisms, plants, or similar organisms.
  • Synergistic enzyme combinations and related methods are contemplated.
  • the invention includes methods to identify the optimum ratios and compositions of enzymes with which to degrade each lignocellulosic material. These methods entail tests to identify the optimum enzyme composition and ratios for efficient conversion of any lignocellulosic substrate to its constituent sugars.
  • the Examples below include assays that may be used to identify optimum ratios and compositions of enzymes with which to degrade lignocellulosic materials.
  • any combination of the proteins disclosed herein is suitable for use in the multi- enzyme compositions of the present invention. Due to the complex nature of most biomass sources, which can contain cellulose, hemicellulose, pectin, lignin, protein, and ash, among other components, preferred enzyme combinations may contain enzymes with a range of substrate specificities that work together to degrade biomass into fermentable sugars in the most efficient manner.
  • a multi-enzyme complex for lignocellulose saccharification is a mixture of cellobiohydrolase(s), xylanase(s), endoglucanase(s), -glucosidase(s), ⁇ - xylosidase(s), and accessory enzymes.
  • the cellobiohydrolase(s) comprise between about 30% and about 90% or between about 40% and about 70% of the enzymes in the composition, and more preferably, between about 55% and 65%, and more preferably, about 60% of the enzymes in the composition (including any percentage between 40% and 70% in 0.5% increments (e.g., 40%, 40.5%, 41 %, etc.).
  • the xylanase(s) comprise between about 10% and about 30% of the enzymes in the composition, and more preferably, between about 15% and about 25%, and more preferably, about 20% of the enzymes in the composition (including any percentage between 10% and 30% in 0.5% increments).
  • the endoglucanase(s) comprise between about 5% and about 15% of the enzymes in the composition, and more preferably, between about 7% and about 13%, and more preferably, about 10% of the enzymes in the composition (including any percentage between 5% and 1 5% in 0.5% increments).
  • the P-glucosidase(s) comprise between about 1 % and about 1 5% of the enzymes in the composition, and preferably between about 2% and 10%, and more preferably, about 3% of the enzymes in the composition (including any percentage between 1 % and 1 5% in 0.5% increments).
  • the P-xylosidase(s) comprise between about 1 % and about 3% of the enzymes in the composition, and preferably, between about 1.5% and about 2.5%, and more preferably, about 2% of the enzymes in the composition (including any percentage between 1 % and 3% in 0.5% increments.
  • the accessory enzymes comprise between about 2% and about 8% of the enzymes in the composition, and preferably, between about 3% and about 7%, and more preferably, about 5% of the enzymes in the composition (including any percentage between 2% and 8% in 0.5% increments.
  • One particularly preferred example of a multi-enzyme complex for lignocellulose saccharification is a mixture of about 60% cellobiohydrolase(s), about 20% xylanase(s), about 10% endoglucanase(s), about 3% P-glucosidase(s), about 2% ⁇ - xylosidase(s) and about 5% accessory enzyme(s).
  • Enzymes and multi-enzyme compositions of the present invention may also be used to break down arabinoxylan or arabinoxylan-containing substrates.
  • Arabinoxylan is a polysaccharide composed of xylose and arabinose, wherein a-L-arabinofuranose residues are attached as branch-points to a P-( l ,4)-linked xylose polymeric backbone.
  • the xylose residues may be mono-substituted at the C2 or C3 position, or di-substituted at both positions.
  • Ferulic acid or coumaric acid may also be ester- linked to the C5 position of arabinosyl residues. Further details on the hydrolysis of arabinoxylan can be found in International Publication No. WO 2006/1 14095.
  • the substitutions on the xylan backbone can inhibit the enzymatic activity of xylanases, and the complete hydrolysis of arabinoxylan typically requires the action of several different enzymes.
  • a multi-enzyme complex for arabinoxylan hydrolysis is a mixture of endoxylanase(s), P-xylosidase(s), and arabinofuranosidase(s), including those with specificity towards single and double substituted xylose residues.
  • the multi-enzyme complex may further comprise one or more carbohydrate esterases, such as acetyl xylan esterases, ferulic acid esterases, coumaric acid esterases or pectin methyl esterases. Any combination of two or more of the above-mentioned enzymes is suitable for use in the multi-enzyme complexes. However, it is to be understood that the invention is not restricted or limited to the specific exemplary combinations listed herein.
  • the endoxylanase(s) comprise at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 40%, at least about 50%, at least about 70% of the enzymes in the composition (including any percentage between 5% and 70% in 0.5% increments (e.g., 5.0%, 5.5%, 6.0%, etc.).
  • Endoxylanase(s) either alone or as part of a multi- enzyme complex, may be used in amounts of 0.001 to 2.0 g kg, 0.005 to 1 .0 g kg, or 0.05 to 0.2 g/kg of substrate.
  • the P-xylosidase(s) comprise at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 40%, at least about 50%, at least about 70% of the enzymes in the composition (including any percentage between 5% and 70% in 0.5% increments (e.g., 5.0%, 5.5%, 6.0%, etc.).
  • P-xylosidase(s) either alone or as part of a multi- enzyme complex, may be used in amounts of 0.001 to 2.0 g kg, 0.005 to 1.0 g/kg, or 0.05 to 0.2 g/kg of substrate.
  • the arabinofuranosidase(s) comprise at least about 5%, at least about 10%), at least about 1 5%, at least about 20%, at least about 25%, at least about 30%, at least about 40%, at least about 50%, at least about 70% of the enzymes in the composition (including any percentage between 5% and 70% in 0.5% increments (e.g., 5.0%, 5.5%, 6.0%, etc.).
  • the total percentage of arabinofiiranosidase(s) present in the composition may include arabinofuranosidase(s) with specificity towards single substituted xylose residues, arabinofuranosidase(s) with specificity towards double substituted xylose residues, or any combination thereof.
  • Arabinofuranosidase(s) either alone or as part of a multi-enzyme complex, may be used in amounts of 0.001 to 2.0 g/kg, 0.005 to 1 .0 g kg, or 0.05 to 0.2 g kg of substrate.
  • One or more components of a multi-enzyme composition can be obtained from or derived from a microbial, plant, or other source or combination thereof, and will contain enzymes capable of degrading lignocellulosic material.
  • Examples of enzymes included in the multi-enzyme compositions of the invention include cellulases, hemicellulases (such as xylanases, including endoxylanases, exoxylanases, and ⁇ -xylosidases; mannanases, including endomannanases, exomannanases, and ⁇ -mannosidases), ligninases, amylases, glucuronidases, proteases, esterases (including ferulic acid esterase), lipases, glucosidases (such as ⁇ -glucosidase), and xyloglucanases.
  • cellulases such as xylanases, including endoxylanases, exoxylanases, and ⁇ -xylosidases
  • mannanases including endomannanases, exomannanases, and ⁇ -mannosidases
  • ligninases amylases,
  • the multi-enzyme composition may contain many types of enzymes, mixtures comprising enzymes that increase or enhance sugar release from biomass are preferred, including hemicellulases.
  • the hemicellulase is selected from a xylanase, an arabinofuranosidase, an acetyl xylan esterase, a glucuronidase, an endo-galactanase, a mannanase, an endo-arabinase, an exo-arabinase, an exo- galactanase, a ferulic acid esterase, a galactomannanase, a xyloglucanase, or mixtures of any of these.
  • the enzymes can include glucoamylase, ⁇ - xylosidase and/or ⁇ -glucosidase.
  • mixtures comprising enzymes that are capable of degrading cell walls and releasing
  • the enzymes of the multi-enzyme composition can be provided by a variety of sources.
  • the enzymes can be produced by growing organisms such as bacteria, algae, fungi, and plants which produce the enzymes naturally or by virtue of being genetically modified to express the enzyme or enzymes.
  • at least one enzyme of the multi-enzyme composition is a commercially available enzyme.
  • the multi-enzyme compositions comprise an accessory enzyme.
  • An accessory enzyme is any additional enzyme capable of hydrolyzing lignocellulose or enhancing or promoting the hydrolysis of lignocellulose, wherein the accessory enzyme is typically provided in addition to a core enzyme or core set of enzymes.
  • An accessory enzyme can have the same or similar function or a different function as an enzyme or enzymes in the core set of enzymes. These enzymes have been described elsewhere herein, and can generally include cellulases, xylanases, ligninases, amylases, lipidases, or glucuronidases, for example.
  • Accessory enzymes can include enzymes that when contacted with biomass in a reaction, allow for an increase in the activity of enzymes (e.g., hemicellulases) in the multi-enzyme composition.
  • An accessory enzyme or enzyme mix may be composed of enzymes from ( 1 ) commercial suppliers; (2) cloned genes expressing enzymes; (3) complex broth (such as that resulting from growth of a microbial strain in media, wherein the strains secrete proteins and enzymes into the media); (4) cell lysates of strains grown as in (3); and, (5) plant material expressing enzymes capable of degrading lignocellulose.
  • the accessory enzyme is a glucoamylase, a pectinase, or a ligninase.
  • a ligninase is an enzyme that can hydrolyze or break down the structure of lignin polymers, including lignin peroxidases, manganese peroxidases, laccases, and other enzymes described in the art known to depolymerize or otherwise break lignin polymers. Also included are enzymes capable of hydrolyzing bonds formed between hemicellulosic sugars (notably arabinose) and lignin.
  • the multi-enzyme compositions comprise a biomass comprising microorganisms or a crude fermentation product of microorganisms.
  • a crude fermentation product refers to the fermentation broth which has ' been separated from the microorganism biomass (by filtration, for example).
  • the microorganisms are grown in fermentors, optionally centrifuged or filtered to remove biomass, and optionally concentrated, formulated, and dried to produce an enzyme(s) or a multi-enzyme composition that is a crude fermentation product.
  • enzyme(s) or multi-enzyme compositions produced by the microorganism are subjected to one or more purification steps, such as ammonium sulfate precipitation, chromatography, and/or ultrafiltration, which result in a partially purified or purified enzyme(s).
  • the enzyme(s) will include recombinant enzymes.
  • the enzyme(s) may include both naturally occurring and recombinant enzymes.
  • compositions comprising at least about 500 ng, and preferably at least about 1 ⁇ g, and more preferably at least about 5 ⁇ g, and more preferably at least about 10 g, and more preferably at least about 25 ⁇ g, and more preferably at least about 50 ⁇ g, and more preferably at least about 75 ⁇ g, and more preferably at least about 100 ⁇ g, and more preferably at least about 250 ⁇ , and more preferably at least about 500 ⁇ g, and more preferably at least about 750 ⁇ g, and more preferably at least about 1 mg, and more preferably at least about 5 mg, of an isolated protein comprising any of the proteins or homologues, variants, or fragments thereof discussed herein.
  • composition of the present invention may include any carrier with which the protein is associated by virtue of the protein preparation method, a protein purification method, or a preparation of the protein for use in any method according to the present invention.
  • a carrier can include any suitable buffer, extract, or medium that is suitable for combining with the protein of the present invention so that the protein can be used in any method described herein according to the present invention.
  • an immobilized enzyme includes immobilized isolated enzymes, immobilized microbial cells which contain one or more enzymes of the invention, other stabilized intact cells that produce one or more enzymes of the invention, and stabilized cell/membrane homogenates.
  • Stabilized intact cells and stabilized cell/membrane homogenates include cells and homogenates from naturally occurring microorganisms expressing the enzymes of the invention and preferably, from genetically modified microorganisms as disclosed elsewhere herein.
  • nucleic acid molecules that encode a protein of the present invention, as well as homologues, variants, or fragments of such nucleic acid molecules.
  • a nucleic acid molecule of the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence encoding any of the isolated proteins disclosed herein, including a fragment or a homologue or variant of such proteins, described above.
  • Nucleic acid molecules can include a nucleic acid sequence that encodes a fragment of a protein that does not have biological activity, and can also include portions of a gene or polynucleotide encoding the protein that are not part of the coding region for the protein (e.g., introns or regulatory regions of a gene encoding the protein). Nucleic acid molecules can include a nucleic acid sequence that is useful as a probe or primer (oligonucleotide sequences).
  • a nucleic acid molecule of the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence represented in SEQ ID NO: l , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: l l , SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21 , SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31 , SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41 , SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51 , SEQ ID NO:53, SEQ ID NO:55, SEQ ID
  • a nucleic molecule of the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence encoding an amino acid sequence represented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, SEQ ID NO:58, SEQ ID NO:60,
  • the nucleic acid sequence encodes a protein (including fragments and homologues or variants thereof) useful in the invention, or encompasses useful oligonucleotides or complementary nucleic acid sequences.
  • such nucleic acid molecules include isolated nucleic acid molecules that hybridize under moderate stringency conditions, and more preferably under high stringency conditions, and even more preferably under very high stringency conditions, as described above, with the complement of a nucleic acid sequence encoding a protein of the present invention (i.e., including naturally occurring allelic variants encoding a protein of the present invention).
  • an isolated nucleic acid molecule encoding a protein of the present invention comprises a nucleic acid sequence that hybridizes under moderate, high, or very high stringency conditions to the complement of a nucleic acid sequence that encodes a protein comprising an amino acid sequence represented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:
  • an isolated nucleic acid molecule is a nucleic acid molecule (polynucleotide) that has been removed from its natural milieu (i.e., that has been subject to human manipulation) and can include DNA, RNA, or derivatives of either DNA or RNA, including cDNA. As such, “isolated” does not reflect the extent to which the nucleic acid molecule has been purified.
  • nucleic acid molecule primarily refers to the physical nucleic acid molecule
  • nucleic acid sequence primarily refers to the sequence of nucleotides on the nucleic acid molecule
  • the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a protein.
  • An isolated nucleic acid molecule of the present invention can be isolated from its natural source or produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis.
  • PCR polymerase chain reaction
  • Isolated nucleic acid molecules can include, for example, genes, natural allelic variants of genes, coding regions or portions thereof, and coding and/or regulatory regions modified by nucleotide insertions, deletions, substitutions, and/or inversions in a manner such that the modifications do not substantially interfere with the nucleic acid molecule's ability to encode a protein of the present invention or to form stable hybrids under stringent conditions with natural gene isolates.
  • An isolated nucleic acid molecule can include degeneracies.
  • nucleotide degeneracy refers to the phenomenon that one amino acid can be encoded by different nucleotide codons.
  • nucleic acid sequence of a nucleic acid molecule that encodes a protein of the present invention can vary due to degeneracies. It is noted that a nucleic acid molecule of the present invention is not required to encode a protein having protein activity. A nucleic acid molecule can encode a truncated, mutated or inactive protein, for example. In addition, nucleic acid molecules of the invention are useful as probes and primers for the identification, isolation and/or purification of other nucleic acid molecules.
  • the nucleic acid molecule is an oligonucleotide, such as a probe or primer
  • the oligonucleotide preferably ranges from about 5 to about 50 or about 500 nucleotides, more preferably from about 10 to about 40 nucleotides, and most preferably from about 15 to about 40 nucleotides in length.
  • a gene includes all nucleic acid sequences related to a natural (i.e. wild-type) gene, such as regulatory regions, that control production of the protein encoded by that gene (such as, but not limited to, transcription, translation or post-translation control regions) as well as the coding region itself.
  • a gene can be a naturally occurring allelic variant that includes a similar but not identical sequence to the nucleic acid sequence encoding a given protein. Allelic variants have been previously described above. Genes can include or exclude one or more introns or any portions thereof or any other sequences or which are not included in the cDNA for that protein.
  • the phrases "nucleic acid molecule" and “gene” can be used interchangeably when the nucleic acid molecule comprises a gene as described above.
  • an isolated nucleic acid molecule of the present invention is produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning, etc.) or chemical synthesis.
  • Isolated nucleic acid molecules include any nucleic acid molecules and homologues or variants thereof that are part of a gene described herein and/or that encode a protein described herein, including, but not limited to, natural allelic variants and modified nucleic acid molecules (homologues or variants) in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect on protein biological activity or on the activity of the nucleic acid molecule.
  • Allelic variants and protein homologues or variants e.g., proteins encoded by nucleic acid homologues or variants
  • a nucleic acid molecule homologue or variant (i.e., encoding a homologue or variant of a protein of the present invention) can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al.).
  • nucleic acid molecules can be modified using a variety of techniques including, but not limited to, by classic mutagenesis and recombinant DNA techniques (e.g., site- directed mutagenesis, chemical treatment, restriction enzyme cleavage, ligation of nucleic acid f agments and/or PCR amplification), or synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture of nucleic acid molecules and combinations thereof.
  • classic mutagenesis and recombinant DNA techniques e.g., site- directed mutagenesis, chemical treatment, restriction enzyme cleavage, ligation of nucleic acid f agments and/or PCR amplification
  • nucleic acid molecule homologues or variants can be selected by hybridization with a gene or polynucleotide, or by screening for the function of a protein encoded by a nucleic acid molecule ⁇ i.e., biological activity).
  • the minimum size of a nucleic acid molecule of the present invention is a size sufficient to encode a protein (including a fragment, homologue, or variant of a full- length protein) having biological activity, sufficient to encode a protein comprising at least one epitope which binds to an antibody, or sufficient to form a probe or oligonucleotide primer that is capable of forming a stable hybrid with the complementary sequence of a nucleic acid molecule encoding a natural protein (e.g., under moderate, high, or high stringency conditions).
  • the size of the nucleic acid molecule encoding such a protein can be dependent on nucleic acid composition and percent homology or identity between the nucleic acid molecule and complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration).
  • the minimal size of a nucleic acid molecule that is used as an oligonucleotide primer or as a probe is * typically at least about 12 to about 15 nucleotides in length if the nucleic acid molecules are GC-rich and at least about 15 to about 18 bases in length if they are AT-rich.
  • nucleic acid molecule of the present invention can include a portion of a protein encoding sequence, a nucleic acid sequence encoding a full-length protein (including a gene), including any length fragment between about 20 nucleotides and the number of nucleotides that make up the full length cDNA encoding a protein, in whole integers (e.g., 20, 21, 22, 23, 24, 25 nucleotides), or multiple genes, or portions thereof.
  • the heterologous nucleotides are not naturally found ⁇ i.e., not found in nature, in vivo) flanking the nucleic acid sequence encoding the specified amino acid sequence as it occurs in the natural gene or do not encode a protein that imparts any additional function to the protein or changes the function of the protein having the specified amino acid sequence.
  • the polynucleotide probes or primers of the invention are conjugated to detectable markers.
  • Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.
  • Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., DynabeadsTM), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like), radiolabels (e.g., 3 H, 125 I, 35 S, 1 4 C, or 32 P), eri2ymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads.
  • the polynucleotide probes are immobilized on a substrate such as: artificial membranes, organic supports, bioporymer supports and inorganic supports.
  • One embodiment of the present invention relates to a recombinant nucleic acid molecule which comprises the isolated nucleic acid molecule described above which is operatively linked to at least one expression control sequence. More particularly, according to the present invention, a recombinant nucleic acid molecule typically comprises a recombinant vector and any one or more of the isolated nucleic acid molecules as described herein. According to the present invention, a recombinant vector is an engineered ⁇ i.e., artificially produced) nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice and/or for introducing such a nucleic acid sequence into a host cell.
  • the recombinant vector is therefore suitable for use in cloning, sequencing, and/or otherwise manipulating the nucleic acid sequence of choice, such as by expressing and/or delivering the nucleic acid sequence of choice into a host cell to form a recombinant cell.
  • a vector typically contains nucleic acid sequences that are not naturally found adjacent to nucleic acid sequence to be cloned or delivered, although the vector can also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) which are naturally found adjacent to nucleic acid sequences of the present invention or which are useful for expression of the nucleic acid molecules of the present invention (discussed in detail below).
  • the vector can be either RNA or DNA, either prokaryotic or eukaryotic, and typically is a plasmid.
  • the vector can be maintained as an extrachromosomal element (e.g., a plasmid) or it can be integrated into the chromosome of a recombinant host cell, although it is preferred if the vector remains separate from the genome for most applications of the invention.
  • the entire vector can remain in place within a host cell, or under certain conditions, the plasmid DNA can be deleted, leaving behind the nucleic acid molecule of the present invention.
  • An integrated nucleic acid molecule can be under chromosomal promoter control, under native or plasmid promoter control, or under a combination of several promoter controls. Single or multiple copies of the nucleic acid molecule can be integrated into the chromosome.
  • a recombinant vector of the present invention can contain at least one selectable marker.
  • a recombinant vector used in a recombinant nucleic acid molecule of the present invention is an expression vector.
  • expression vector is used to refer to a vector that is suitable for production of an encoded product (e.g., a protein of interest, such as an enzyme of the present invention).
  • a nucleic acid sequence encoding the product to be produced e.g., the protein or homologue or variant thereof is inserted into the recombinant vector to produce a recombinant nucleic acid molecule.
  • the nucleic acid sequence encoding the protein to be produced is inserted into the vector in a manner that operatively links the nucleic acid sequence to regulatory sequences in the vector which enable the transcription and translation of the nucleic acid sequence within the recombinant host cell.
  • a recombinant nucleic acid molecule includes at least one nucleic acid molecule of the present invention operatively linked to one or more expression control sequences (e.g., transcription control sequences or translation control sequences).
  • expression control sequences e.g., transcription control sequences or translation control sequences.
  • the phrase "recombinant molecule” or “recombinant nucleic acid molecule” primarily refers to a nucleic acid molecule or nucleic acid sequence operatively linked to a transcription control sequence, but can be used interchangeably with the phrase “nucleic acid molecule", when such nucleic acid molecule is a recombinant molecule as discussed herein.
  • the phrase "operatively linked” refers to linking a nucleic acid molecule to an expression control sequence in a manner such that the molecule is able to be expressed when transfected (i.e., transformed, transduced, transfected, conjugated or conducted) into a host cell.
  • Transcription control sequences are sequences which control the initiation, elongation, or termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in a host cell or organism into which the recombinant nucleic acid molecule is to be introduced. Transcription control sequences may also include any combination of one or more of any of the foregoing.
  • Recombinant nucleic acid molecules of the present invention can also contain additional regulatory sequences, such as translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell.
  • a recombinant molecule of the present invention including those which are integrated into the host cell chromosome, also contains secretory signals (i.e., signal segment nucleic acid sequences) to enable an expressed protein to be secreted from the cell that produces the protein.
  • Suitable signal segments include a signal segment that is naturally associated with the protein to be expressed or any heterologous signal segment capable of directing the secretion of the protein according to the present invention.
  • a recombinant molecule of the present invention comprises a leader sequence to enable an expressed protein to be delivered to and inserted into the membrane of a host cell.
  • Suitable leader sequences include a leader sequence that is naturally associated with the protein, or any heterologous leader sequence capable of directing the delivery and insertion of the protein to the membrane of a cell.
  • the term "transfection” is generally used to refer to any method by which an exogenous nucleic acid molecule (i.e., a recombinant nucleic acid molecule) can be inserted into a cell.
  • transformation can be used interchangeably with the term “transfection” when such term is used to refer to the introduction of nucleic acid molecules into microbial cells or plants and describes an inherited change due to the acquisition of exogenous nucleic acids by the microorganism that is essentially synonymous with the term “transfection.”
  • Transfection techniques include, but are not limited to, transformation, particle bombardment, electroporation, microinjection, lipofection, adsorption, infection and protoplast fusion.
  • One or more recombinant molecules of the present invention can be used to produce an encoded product (e.g., a protein) of the present invention.
  • an encoded product is produced by expressing a nucleic acid molecule as described herein under conditions effective to produce the protein.
  • a preferred method to produce an encoded protein is by transfecting a host cell with one or more recombinant molecules to form a recombinant cell. Suitable host cells to transfect include, but are not limited to, any bacterial, fungal (e.g., filamentous fungi or yeast or mushrooms), algal, plant, insect, or animal cell that can be transfected. Host cells can be either untransfected cells or cells that are already transfected with at least one other recombinant nucleic acid molecule.
  • Suitable cells may include any microorganism (e.g., a bacterium, a protist, an alga, a fungus, or other microbe), and is preferably a bacterium, a yeast or a filamentous fungus.
  • Suitable bacterial genera include, but are not limited to, Escherichia, Bacillus, Lactobacillus, Pseudomonas and Streptomyces.
  • Suitable bacterial species include, but are not limited to, Escherichia coli, Bacillus subtilis, Bacillus licheniformis, Bacillus stearothermophilus, Lactobacillus brevis, Pseudomonas aeruginosa and Streptomyces lividans.
  • Suitable genera of yeast include, but are not limited to, Saccharomyces, Schizosaccharomyces, Candida, Hansenula, Pichia, Kluyveromyces, and Phaffia.
  • Suitable yeast species include, but are not limited to, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Candida albicans, Hansenula polymorpha, Pichia pastoris, P. canadensis, Kluyveromyces marxianus and Phaffia rhodozyma.
  • Suitable fiingal genera include, but are not limited to, Chrysosporium, Thielavia, Talaromyces, Neurospora, Aureobasidium, Filibasidium, Piromyces, Corynascus, Cryptococcus, Acremonium, Tolypocladium, Scytalidium, Schizophyllum, Sporotrichum, Penicillium, Gibberella, Myceliophthora, Mucor, Aspergillus, Fusarium, Humicola, and Trichoderma, and anamorphs and teleomorphs thereof
  • Suitable fungal species include, but are not limited to, Aspergillus niger, Aspergillus oryzae, Aspergillus nidulans, Aspergillus japonicus, Absidia coerulea, Rhizopus oryzae, Chrysosporium lucknowense, Neurospora crassa, Neurospora intermedia, Trichoderma reesei,
  • the host cell is a fungal cell of the species M. thermophila (Chrysosporium lucknowense) .
  • a while (low cellulose) strain is sued.
  • the host cell is a fungal cell of Strain CI (VKM F-3500-D) or a mutant strain derived therefrom ⁇ e.g., UV13-6 (Accession No. VKM F-3632 D); NG7C-19 (Accession No. VKM F-3633 D); UV18-25 (VKM F-3631D), W1L (CBS122189), or WIUIOOL (CBS122190)).
  • Host cells can be either untransfected cells or cells that are already transfected with at least one other recombinant nucleic acid molecule. Additional embodiments of the present invention include any of the genetically modified cells described herein.
  • one or more protein(s) expressed by an isolated nucleic acid molecule of the present invention are produced by culturing a cell that expresses the protein (i.e., a recombinant cell or recombinant host cell) under conditions effective to produce the protein.
  • the protein may be recovered, and in others, the cell may be harvested in whole, either of which can be used in a composition.
  • Microorganisms used in the present invention are cultured in an appropriate fermentation medium.
  • An appropriate, or effective, fermentation medium refers to any medium in which a cell of the present invention, including a genetically modified microorganism (described below), when cultured, is capable of expressing enzymes useful in the present invention and/or of catalyzing the production of sugars from lignocellulosic biomass.
  • the microorganisms can be cultured by any fermentation process which includes, but is not limited to, batch, fed-batch, cell recycle, and continuous fermentation.
  • the fungal strains are grown in fermentors, optionally centrifuged or filtered to remove biomass, and optionally concentrated, formulated, and dried to produce an enzyme(s) or a multi-enzyme composition that is a crude fermentation product.
  • fermentors optionally centrifuged or filtered to remove biomass, and optionally concentrated, formulated, and dried to produce an enzyme(s) or a multi-enzyme composition that is a crude fermentation product.
  • Particularly suitable conditions for culturing filamentous fungi are described, for example, in U.S. Patent No. 6,015,707 and U.S. Patent No. 6,573,086, supra.
  • resultant proteins of the present invention may either remain within the recombinant cell; be secreted into the culture medium; be secreted into a space between two cellular membranes; or be retained on the outer surface of a cell membrane.
  • the phrase "recovering the protein” refers to collecting the whole culture medium containing the protein and need not imply additional steps of separation or purification.
  • Proteins produced according to the present invention can be purified using a variety of standard protein purification techniques, such as, but not limited to, affinity chromatography, ion exchange chromatography, filtration, electrophoresis, hydrophobic interaction chromatography, gel filtration chromatography, reverse phase chromatography, concanavalin A chromatography, chromatofocusing and differential precipitation or solubilization.
  • standard protein purification techniques such as, but not limited to, affinity chromatography, ion exchange chromatography, filtration, electrophoresis, hydrophobic interaction chromatography, gel filtration chromatography, reverse phase chromatography, concanavalin A chromatography, chromatofocusing and differential precipitation or solubilization.
  • Proteins of the present invention are preferably retrieved, obtained, and/or used in "substantially pure” form.
  • substantially pure refers to a purity that allows for the effective use of the protein in any method according to the present invention.
  • a protein to be useful in any of the methods described herein or in any method utilizing enzymes of the types described herein according to the present invention it is substantially free of contaminants, other proteins and/or chemicals that might interfere or that would interfere with its use in a method disclosed by the present invention (e.g., that might interfere with enzyme activity), or that at least would be undesirable for inclusion with a protein of the present invention (including homologues and variants) when it is used in a method disclosed by the present invention (described in detail below).
  • a "substantially pure" protein is a protein that can be produced by any method (i.e., by direct purification from a natural source, recombinantly, or synthetically), and that has been purified from other protein components such that the protein comprises at least about 80% weight/weight of the total protein in a given composition (e.g., the protein of interest is about 80% of the protein in a solutioiVcomposition/buffer)., and more preferably, at least about 85%, and more preferably at least about 90%, and more preferably at least about 91 %, and more preferably at least about 92%, and more preferably at least about 93%, and more preferably at least about 94%, and more preferably at least about 95%, and more preferably at least about 96%, and more preferably at least about 97%, and more preferably at least about 98%, and more preferably at least about 99%, weight/weight of the total protein in a given composition.
  • Recombinant techniques useful for controlling the expression of nucleic acid molecules include, but are not limited to, integration of the nucleic acid molecules into one or more host cell chromosomes, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals (e.g., ribosome binding sites), modification of nucleic acid molecules to correspond to the codon usage of the host cell, and deletion of sequences that destabilize transcripts.
  • transcription control signals e.g., promoters, operators, enhancers
  • substitutions or modifications of translational control signals e.g., ribosome binding sites
  • a genetically modified microorganism that has been transfected with one or more nucleic acid molecules of the present invention.
  • a genetically modified microorganism can include a genetically modified bacterium, alga, yeast, filamentous fungus, or other microbe.
  • Such a genetically modified microorganism has a genome which is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form such that the desired result is achieved (i.e., increased or modified activity and/or production of at least one enzyme or a multi-enzyme composition for the conversion of lignocellulosic material to fermentable sugars).
  • Genetic modification of a microorganism can be accomplished using classical strain development and/or molecular genetic techniques. Such techniques are known in the art and are generally disclosed for microorganisms, for example, in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press or Molecular Cloning: A Laboratory Manual, third edition (Sambrook and Russel, 2001), (jointly referred to herein as "Sambrook”).
  • a genetically modified microorganism can include a microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that such modifications provide the desired effect within the microorganism.
  • a genetically modified microorganism can endogenously contain and express an enzyme or a multi-enzyme composition for the conversion of lignocellulosic material to fermentable sugars, and the genetic modification can be a genetic modification of one or more of such endogenous enzymes, whereby the modification has some effect on the ability of the microorganism to convert lignocellulosic material to fermentable sugars (e.g., increased expression of the protein by introduction of promoters or other expression control sequences, or modification of the coding region by homologous recombination to increase the activity of the encoded protein).
  • a genetically modified microorganism can endogenously contain and express an enzyme for the conversion of lignocellulosic material to fermentable sugars, and the genetic modification can be an introduction of at least one exogenous nucleic acid sequence (e.g., a recombinant nucleic acid molecule), wherein the exogenous nucleic acid sequence encodes at least one additional enzyme useful for the conversion of lignocellulosic material to fermentable sugars and/or a protein that improves the efficiency of the enzyme for the conversion of lignocellulosic material to fermentable sugars.
  • the microorganism can also have at least one modification to a gene or genes comprising its endogenous enzyme(s) for the conversion of lignocellulosic material to fermentable sugars.
  • the genetically modified microorganism does not necessarily endogenously (naturally) contain an enzyme for the conversion of lignocellulosic material to fermentable sugars, but is genetically modified to introduce at least one recombinant nucleic acid molecule encoding at least one enzyme or a multiplicity of enzymes for the conversion of lignocellulosic material to fermentable sugars.
  • a microorganism can be used in a method of the invention, or as a production microorganism for crude fermentation products, partially purified recombinant enzymes, and/or purified recombinant enzymes, any of which can then be used in a method of the present invention.
  • a cell extract that contains the activity to test can be generated. For example, a lysate from the host cell is produced, and the supernatant containing the activity is harvested and/or the activity can be isolated from the lysate. In the case of cells that secrete enzymes into the culture medium, the culture medium containing them can be harvested, and/or the activity can be purified from the culture medium.
  • the extracts/activities prepared in this way can be tested using assays known in the art. Accordingly, methods to identify mutli-enzyme compositions capable of degrading lignocellulosic biomass are provided.
  • DDG dinitrosalicylic acid assay
  • the present invention is not limited to fungi and also contemplates genetically modified organisms such as algae, bacterial, and plants transformed with one or more nucleic acid molecules of the invention.
  • the plants may be used for production of the enzymes, and/or as the lignocellulosic material used as a substrate in the methods of the invention.
  • Methods to generate recombinant plants are known in the art. For instance, numerous methods for plant transformation have been developed, including * biological and physical transformation protocols. See, for example, Miki et al., "Procedures for Introducing Foreign DNA into Plants” in Methods in Plant Molecular Biology and Biotechnology, Glick, B.R. and Thompson, J.E. Eds.
  • Some embodiments of the present invention include genetically modified organisms comprising at least one nucleic acid molecule encoding at least one enzyme of the present invention, in which the activity of the enzyme is downregulated by manipulating the efficiency with which those nucleic acid molecules are transcribed,. translated, or post-translationally modified or by "knocking out" the endogenous copy of the gene, so that the expression of the gene is substantially reduced or eliminated.
  • the activity of the enzyme may be upregulated.
  • the present invention also contemplates downregulating activity of one or more enzymes while simultaneously upregulating activity of one or more enzymes to achieve the desired outcome.
  • Proteins of the present invention at least one protein of the present invention, compositions comprising such protein(s) of the present invention, and multi-enzyme compositions (examples of which are described above) may be used in any method where it is desirable to hydrolyze glycosidic linkages in lignocellulosic material, or any other method wherein enzymes of the same or similar function are useful.
  • the present invention includes the use of at least one protein of the present invention, compositions comprising at least one protein of the present invention, or multi-enzyme compositions in methods for hydrolyzing lignocellulose and the generation of fermentable sugars therefrom.
  • the method comprises contacting the lignocellulosic material with an effective amount of one or more proteins of the present invention, composition comprising at least one protein of the present invention, or a multi-enzyme composition, whereby at least one fermentable sugar is produced (liberated).
  • the lignocellulosic material may be partially or completely degraded to fermentable sugars. Economical levels of degradation at commercially viable costs are contemplated.
  • the amount of enzyme or enzyme composition contacted with the lignocellulose will depend upon the amount of glucan present in the lignocellulose.
  • the amount of enzyme or enzyme composition contacted with the lignocellulose may be from about 0.1 to about 200 mg enzyme or enzyme composition per gram of glucan; in other embodiments, from about 3 to about 20 mg enzyme or enzyme composition per gram of glucan.
  • the invention encompasses the use of any suitable or sufficient amount of enzyme or enzyme composition between about 0.1 mg and about 200 mg enzyme per gram glucan, in increments of 0.05 mg (i. e., 0.1 mg, 0.1 5 mg, 0.2 mg... 199.9 mg, 199.95 mg, 200 mg).
  • the invention provides a method for degrading DDG, preferably, but not limited to, DDG derived from corn, to sugars.
  • the method comprises contacting the DDG with a protein of the present invention, a composition comprising at least one protein of the present invention, or a multi-enzyme composition. In certain embodiments, at least 10% of fermentable sugars are liberated.
  • the at least 15% of the sugars are liberated, or at least 20% of the sugars are liberated, or at least 23% of the sugars are liberated, or at least 24% of the sugars are liberated, or at least 25% of the sugars are liberated, or at least 26% of the sugars are liberated, or at least 27% of the sugars are liberated, or at least 28% of the sugars are liberated.
  • the invention provides a method for producing fermentable sugars comprising cultivating a genetically modified microorganism of the present invention in a nutrient medium comprising a lignocellulosic material, whereby fermentable sugars are produced.
  • Accessory enzymes have been described elsewhere herein.
  • the accessory enzyme or enzymes may be added at the same time, prior to, or following the addition of a protein of the present invention, a composition comprising at least one protein of the present invention, or a multi- enzyme composition, or can be expressed (endogenously or overexpressed) in a genetically modified microorganism used in a method of the invention.
  • the protein of the present invention, a composition comprising at least one protein of the present invention, or a multi-enzyme composition will be compatible with the accessory enzymes selected.
  • a composition comprising at least one protein of the present invention, or a multi-enzyme composition
  • the conditions such as temperature and pH
  • the accessory enzyme may also be present in the lignocellulosic material itself as a result of genetically modifying the plant.
  • the nutrient medium used in a fermentation can also comprise one or more accessory enzymes.
  • the method comprises a pretreatment process.
  • a pretreatment process will result in components of the lignocellulose being more accessible for downstream applications or so that it is more digestible by enzymes following treatment in the absence of hydrolysis.
  • the pretreatment can be a chemical, physical or biological pretreatment.
  • the lignocellulose may have been previously treated to release some or all of the sugars, as in the case of DDG. Physical treatments, such as grinding, boiling, freezing, milling, vacuum infiltration, and the like may also be used with the methods of the invention.
  • the heat treatment comprises heating the lignocellulosic material to 121 °C for 15 minutes.
  • a physical treatment such as milling can allow a higher concentration of lignocellulose to be used in the methods of the invention.
  • a higher concentration refers to about 20%, up to about 25%, up to about 30%, up to about 35%, up to about 40%, up to about 45%, or up to about 50% lignocellulose.
  • the lignocellulose may also be contacted with a metal ion, ultraviolet light, ozone, and the like.
  • Additional pretreatment processes are known to those skilled in the art, and can include, for example, organosolv treatment, steam explosion treatment, lime impregnation with steam explosion treatment, hydrogen peroxide treatment, hydrogen peroxide/ozone (peroxone) treatment, acid treatment, dilute acid treatment, and base treatment, including ammonia fiber explosion (AFEX) technology.
  • the method comprises detoxifying the lignocellulosic material.
  • Dextoxification may be desirable in the event that inhibitors are present in the lignocellulosic material. Such inhibitors can be generated by a pretreatment process, deriving from sugar degradation or are direct released from the lignocellulose polymer.
  • Detoxifying can include the reduction of their formation by adjusting sugar extraction conditions; the use of inhibitor-tolerant or inhibitor- degrading strains of microorganisms. Detoxifying can also be accomplished by the addition of ion exchange resins, active charcoal, enzymatic detoxification using, e.g., laccase, and the like.
  • the proteins, compositions or products of the present invention further comprises detoxifying agents.
  • the methods may be performed one or more times in whole or in part. That is, one may perform one or more pretreatments, followed by one or more reactions with a protein of the present invention, composition or product of the present invention and/or accessory enzyme.
  • the enzymes may be added in a single dose, or may be added in a series of small doses. Further, the entire process may be repeated one or more times as necessary. Therefore, one or more additional treatments with heat and enzymes are contemplated.
  • the fermentable sugars may be recovered and or purified by any method known in the art.
  • the sugars can be subjected to further processing; e.g., they can also be sterilized, for example, by filtration.
  • the invention provides means for improving quality of lignocellulosic material, including DDG for animal nutrition.
  • the treated lignocellulosic material e.g., a lignocellulosic material which has been saccharified
  • the recovered material can be used as an animal feed additive. It is believed that the recovered material will have beneficial properties for animal nutrition, possibly due to a higher protein content.
  • the amount of enzyme or enzyme composition contacted with the lignocellulosic material may be from about 0.0001 % to about 1.0 % of the weight of the lignocellulosic material; in other embodiments, from about 0.005 % to about 0.1 % of the weight of the lignocellulosic material.
  • the invention includes the use of any amount of enzyme or enzyme composition between about 0.0001 % and about 1.0 %, in increments of 0.0001 (i.e., 0.0001 , 0.0002, 0.0003...etc.).
  • the invention provides a method for producing an organic substance, comprising saccharifying a lignocellulosic material with an effective amount of a protein of the present invention or a composition comprising at least one protein of the present invention, fermenting the saccharified lignocellulosic material obtained with one or more fermentating microorganisms, and recovering the organic substance from the fermentation.
  • Sugars released from biomass can be converted to useful fermentation products including but not limited to amino acids, vitamins, pharmaceuticals, animal feed supplements, specialty chemicals, chemical feedstocks, plastics, solvents, fuels, or other organic polymers, lactic acid, and ethanol, including fuel ethanol.
  • Specific products that may be produced by the methods of the invention include, but are not limited to, bio fuels (including ethanol); lactic acid; plastics; specialty chemicals; organic acids, including citric acid, succinic acid, itaconic acid and maleic acid; solvents; animal feed supplements; pharmaceuticals; vitamins; amino acids, such as lysine, methionine, tryptophan, threonine, and aspartic acid; industrial enzymes, such as proteases, cellulases, amylases, glucanases, lactases, lipases, lyases, oxidoreductases, and transferases; and chemical feedstocks.
  • the methods of the invention are also useful to generate feedstocks for fermentation by fermenting microorganisms.
  • the method further comprises the addition of at least one fermenting organism.
  • fertilizing organism refers to an organism capable of fermentation, such as bacteria and fungi, including yeast. Such feedstocks have additional nutritive value above the nutritive value provided by the liberated sugars.
  • the present invention provides methods for improving the nutritional quality of food (or animal feed) comprising adding to the food (or the animal feed) at least one protein of the present invention.
  • the present invention provides methods for improving the nutritional quality of the food (or animal feed) comprising pretreating the food (or the animal feed) with at least one isolated protein of the present invention.
  • use of the enzymes xylanases and arabinofuranosidases in bread making has been known to improve the nutritional quality of the dough by degrading the arabinoxylans in the dough.
  • Improving the nutritional quality can mean making the food (or the animal feed) more digestible and/or less allergenic, and encompasses changes in the caloric value, taste and/or texture of the food.
  • the proteins of the present invention may be used as part of nutritional supplements.
  • the proteins of the present invention may be used as part of digestive aids, and may help in providing relief from digestive disorders such as acid reflux and celiac disease.
  • Proteins of the present invention and compositions comprising at least one protein of the present invention are also useful in a variety of other applications involving the hydrolysis of glycosidic linkages in lignocellulosic material, such as stone washing, color brightening, depilling and fabric softening, as well as other applications well known in the art. Proteins of the present invention and compositions comprising at least one protein of the present invention are also readily amenable to use as additives in detergent and other media used for such applications. These and other methods of use will readily suggest themselves to those of skill in the art based on the invention described herein.
  • proteins and compositions of the present invention can be used in stone washing procedures for fabrics or other textiles.
  • the proteins and compositions can be used in stone washing procedures for denim jeans. Examples of these uses can be found in U.S. Patent Application Publication No. 2003/0157595.
  • the cellulase compositions of this invention can be used to reduce or eliminate the harshness associated with a fabric or textile by contacting the fabric or textile with a protein or composition of the present invention.
  • the fabric or textile may be made from cellulose or cotton.
  • a preferred range for reducing or eliminating the harshness associated with a fabric or textile is between about pH 8 to about 12, or between about pH 10 to about 1 1.
  • the proteins or compositions of the subject invention can be used in detergent compositions.
  • the detergent composition may comprise at least one protein or composition of the present invention and one or more surfactants.
  • the detergent compositions may also include any additional detergent ingredient known in the art.
  • the detergent compositions of this invention preferably employ a surface active agent, i.e., surfactant, including anionic, non-ionic, and ampholytic surfactants well known for their use in detergent compositions.
  • the detergent compositions of this invention can additionally contain one or more components. Examples of detergent compositions employing cellulases are exemplified in U.S. Pat. Nos. 4,435,307; 4,443,355; 4,661 ,289; 4,479,881 ; 5, 120,463.
  • a detergent base used in the present invention when in the form of a powder, it may be one which is prepared by any known preparation method including a spray- drying method and/or a granulation method.
  • the granulation method are the most preferred because of the non-dusting nature of granules compared to spray dry products.
  • the detergent base when a liquid, it may be either a homogenous solution or an inhomogeneous solution.
  • compositions of the present invention include, but are not limited to, garment dyeing applications such as enzymatic mercerizing of viscose, bio-polishing applications, enzymatic surface polishing; biowash (washing or washing down treatment of textile materials), enzymatic microfibrillation, enzymatic "cottonization” of linen, ramie and hemp; and treatment of Lyocel® or Newcell® (i.e., "TENCEL®” from Courtauld's), Cupro® and other cellulosic fibers or garments, dye removal from dyed cellulosic substrates such as dyed cotton (Leisola & Linko-(1976) Analytical Biochemistry, v.
  • garment dyeing applications such as enzymatic mercerizing of viscose, bio-polishing applications, enzymatic surface polishing
  • biowash washing or washing down treatment of textile materials
  • enzymatic microfibrillation enzymatic "cottonization” of linen, ramie and hemp
  • the amount of enzyme or enzyme composition contacted with a textile may vary with the particular application. Typically, for biofinishing and denim washing applications, from about 0.02 wt. % to about 5 wt. % of an enzyme or enzyme composition may be contacted with the textile. In some embodiments, from about 0.5 wt. % to about 2 wt. % of an enzyme or enzyme composition may be contacted with the textile. For bioscouring, from about 0. 1 to about 10, or from about 0. 1 to about 1.0 grams of an enzyme or enzyme composition per kilogram of textile may be used, including any amount between about 0. 1 grams and about 10 grams, in increments of 0.1 grams.
  • proteins or compositions of the present invention can be used in the saccharification of lignocellulose biomass from agriculture, forest products, municipal solid waste, and other sources, for biobleaching of wood pulp, and for de- inking of recycled print paper all by methods known to one skilled in the art.
  • the amount of enzyme or enzyme composition used for pulp and paper modification typically varies depending upon the stock that is used, the pH and temperature of the system, and the retention time.
  • the amount of enzyme or enzyme composition contacted with the paper or pulp may be from about 0.01 to about 50 U; from about 0. 1 to about 1 5 U; or from about 0. 1 to about 5 U of enzyme or enzyme composition per dry gram of fiber, including any amount between about 0.01 and about 50 U, in 0.01 U increments.
  • the amount of enzyme or enzyme composition contacted with the paper or pulp may be from about 1 to about 2000 grams or from about 100 to about 500 grams enzyme or enzyme composition per dry ton of pulp, including any amount between about 1 and about 2000 grams, in 1 gram increments.
  • Proteins or compositions of the present invention can added to wastewater to reduce the amount of solids such as sludge or to increase total biochemical oxygen demand (BOD) and chemical oxygen demand (COD) removal.
  • BOD biochemical oxygen demand
  • COD chemical oxygen demand
  • proteins or compositions of the present invention can be used to transform particulate COD to soluble COD in wastewater produced from grain/fruit/cellulose industrial processes or to increase the BOD/COD ratio by increasing waste biodegradability (soluble lower molecular weight polymers in cellulosic hemicellulosic wastes are typically more readily biodegradable than non-soluble material).
  • proteins or compositions of the present invention can also be used to increase waste digestion by aerobic and/or anaerobic bacteria.
  • Chitosanases of the present invention can hydro lyze the P-l ,4-linkages between D- glucosamine residues in acetylated chitosan (i.e., deacetylated chitin) and thus may be used to degrade chitin- or chitosan-containing materials.
  • acetylated chitosan i.e., deacetylated chitin
  • Examples of chitin- containing materials include fungal cell walls, insect exoskeletons, the eggs of parasitic worms, and crustacean shells.
  • Chitosanases may be used to inhibit or reduce fungal growth, including the treatment of fungal infections.
  • Chitosanases or compositions containing chitosanases may be used as a biological control agent such as an insecticide (see, e.g., Kramer et al., Insect Biochem Mol Biol. 27:887 (1997).
  • Chitin-degrading enzymes such as chitinases and chitosanases have been shown to be effective for controlling white-fly larvae in laboratory tests.
  • chitosanases may be applied to crops, plants and the like to control insect infestations.
  • This assay measures the release of p-nitrophenol by the action of a- arabinofuranosidase on p-nitrophenyl a-L-arabinofuranoside (PNPA).
  • PNPA p-nitrophenyl a-L-arabinofuranoside
  • One a- arabinofuranosidase unit of activity is the amount of enzyme that liberates 1 micromole of p-nitrophenol in one minute at 37°C and pH 5.0.
  • Acetate buffer (0.1 M, pH 5.0) is prepared as follows: 8.2 g of anhydrous sodium acetate or 13.6 g of sodium acetate * 3H 2 0 is dissolved in distilled water so that the final volume of the solution is 1000 ml (Solution A). In a separate flask, 6.0 g (5.72 ml) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 ml (Solution B). The final 0.1 M acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • PNPA (Fluka, Switzerland, cat. # 73616) is used as the assay substrate. 8.25 mg of PNPA is dissolved in 5 mL of distilled water and 5 mL 0.1 M acetate buffer using a magnetic stirrer to obtain a 1 mM stock solution. The solution is stable for 2 days with storage at 4°C.
  • the stop reagent (0.25 M sodium carbonate solution) is prepared as follows: 26.5 g of anhydrous sodium carbonate is dissolved in 800 ml of distilled water, and the solution volume is adjusted to 1000 ml. This reagent is used to terminate the enzymatic reaction.
  • ⁇ 405 A s - A S B
  • DF the enzyme dilution factor
  • 21 is the dilution of 10 ⁇ enzyme solution in 210 ⁇ reaction volume
  • 1.33 is the conversion factor of microtiter plates to cuvettes
  • 13.700 is the extinction coefficient 13700 M " 1 cm “1 of p-nitrophenol released corrected for moI/L to ⁇ niL
  • RT is the reaction time in minutes.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH3, GH43, GH51 , GH54, and GH62 enzymes.
  • enzymes such as, but not limited to, GH3, GH43, GH51 , GH54, and GH62 enzymes.
  • CL07803 exhibits activity in this assay.
  • arabinoxylans For the complete degradation of arabinoxylans to arabinose and xylose, several enzyme activities are needed, including endo-xylanases and arabinofuranosidases.
  • the arabinoxylan molecule from wheat is highly substituted with arabinosyl residues. These can be substituted either to the C 2 or the C 3 position of the xylosyl residue (single substitution), or both to the C2 and C3 position of the xylose (double substitution).
  • AXTk An arabinoftiranosidase from Bifidobacterium adolescentis (AXTk ) has previously been isolated which is able to liberate the arabinosyl residue substituted to the C 3 position of a double substituted xylose (Van Laere et al. 1997, Van den Broek et al. 2005). Most of the known arabinofuranosidases are only active towards single arabinosyl substituted xyloses.
  • Single and double substituted oligosaccharides are prepared by incubating wheat arabinoxylan (WAX; 10 mg/mL; Megazyme, Bray, Ireland) in 50 mM acetate buffer pH 5 with 0.3 mg Pentopan Mono (monocomponent endo-l ,4-P-xylanase, an enzyme from Thermomyces lanuginosus produced in Aspergillus oryzae; Sigma, St. Louis, USA) for 16 hours at 30°C. The reaction is stopped by heating the samples at 100°C for 10 minutes. The samples are centrifuged for 5 minutes at 3100 x g. The supernatant is used for further experiments. Degradation of the arabinoxylan is followed by analysis of the formed reducing sugars and High Performance Anion Exchange Chromatography (HPAEC).
  • HPAEC High Performance Anion Exchange Chromatography
  • Double substituted arabinoxylan oligosaccharides are prepared by incubation of 800 ⁇ of the supernatant described above with 0.18 mg of the arabinofuranosidase Abfl (Abfl is arabinofuranosidase from C. lucknowense with activity towards single arabinose substituted xylose residues and is disclosed in U.S. Application No. 1 1/833, 133, filed August 2, 2007, the contents of which are incorporated herein by reference) in 50 mM acetate buffer pH 5 for 20 hours at 30°C. The reaction is stopped by heating the samples at 100°C for 10 minutes. The samples are centrifuged for 5 minutes at 10,000 x g, and the supernatant is used for further experiments. Degradation of the arabinoxylan is followed by analysis of the formed reducing sugars and HPAEC.
  • the enzyme (25 ⁇ g total protein) is incubated with single and double substituted arabinoxylan oligosaccharides ( 100 ⁇ L ⁇ supernatant of Pentopan Mono treated WAX) in 50 mM acetate buffer at 30°C during 20 hours. The reaction is stopped by heating the samples at 100°C , for 10 minutes. The samples are centrifuged for 5 minutes at 10,000 x g. Degradation of the arabinoxylan is followed by analysis of the formed reducing sugars and HPAEC.
  • T e enzyme 25 ⁇ g total protein from B. adolescentis ( ⁇ ⁇ , 0.02 U; Megazyme, Bray, Ireland) is incubated with double substituted arabinoxylan oligosaccharides (125 ⁇ supernatant of Pentopan Mono and Abfl treated WAX) in 50 mM acetate buffer at 35°C during 24 hours. The reaction is stopped by heating the samples at 100°C for 10 minutes. The samples are centrifuged for 5 minutes at 10,000 x g. Degradation of the arabinoxylan is followed by analysis of the formed reducing sugars and HPAEC.
  • he amount of reducing sugars is measured with help of the DNS (3,5- dinitrosalicylic acid) assay.
  • DNS 3,5- dinitrosalicylic acid
  • 0.5 mL of DNS reagent (3,5-dinitrosalicylic acid and sodium potassium tartrate dissolved in dilute sodium hydroxide) is added to the sample (50 ⁇ ), containing 0 - 5 mg/ml reducing sugar.
  • the reaction mixture is heated at 100°C for 5 minutes and rapidly cooled in ice to room temperature. The absorbance at 570 nm is measured. Glucose is used as a standard.
  • the analysis of the samples via HPAEC is performed using a Dionex HPLC system equipped with a Dionex CarboPac PA-1 (2 mm ID x 250 mm) column in combination with a CarboPac PA guard column (1 mm ID x 25 mm) and a Dionex EDetl PAD-detector (Dionex Co., Sunnyvale).
  • a flow rate of 0.3 ml/min is used with the following gradient of sodium acetate in 0.1 M NaOH: 0-5 minutes, 0- 100 mM; 5-45 minutes, 100-400 mM.
  • Single and double substituted arabinoxylan oligosaccharides are prepared by xylanase treatment as described above. After overnight incubation, the amount of reducing sugars increases from 0 to 4.3 mM (glucose is used as the standard). The addition of an extra amount of enzyme did not increase this number, indicating that the reaction is at an end point. Assuming that the average degree of polymerization is 3, 1.8 mg of oligosaccharides are formed, which corresponds to 30% of WAX degradation. Oligosaccharides are identified according to Kormelink et al., 1993 and protagonist et al., 1992.
  • oligosaccharides are also present after xylanase treatment.
  • the single substituted oligosaccharides is removed from the xylanase-treated WAX mixture by the enzyme Abfl as described above.
  • Samples containing only double substituted oligosaccharides are treated with the target enzyme or AXHd 3 from B. adolescentis as a reference enzyme as described above.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH3, GH43, GH51 , GH54, and GH62 enzymes.
  • enzymes such as, but not limited to, GH3, GH43, GH51 , GH54, and GH62 enzymes.
  • CL07803 exhibits activity in this assay.
  • Such activity is demonstrated by using xyloglucan as substrate and a reducing sugars assay (PAHBAH) as detection method.
  • PAHBAH reducing sugars assay
  • the cellulase standard solution is prepared, which contains 2 units of cellulase per ml of 0.2 M HAc/NaOH, pH 5. Subsequently, a standard series of 0 to 2 U/ml is prepared ( 12 samples).
  • Reagent A 10 g of -Hydroxy benzoic acid hydrazide (PAHBAH) is suspended in 60 mL water. 10 mL of concentrated hydrochloric acid is added and the volume is adjusted to 200 ml.
  • Reagent B 24.9 g of trisodium citrate is dissolved in 500 ml of water. To this solution 2.2 g of calcium chloride and 40 g sodium hydroxide are added. The volume is adjusted to 2 L with water. Both reagents are stored at room temperature.
  • Working Reagent 10 ml of Reagent A is added to 90 ml of Reagent B. This solution is prepared freshly every day, and is stored on ice between uses.
  • the assay is conducted in micro titer plate format. Each well contains 50 ⁇ of
  • xyloglucan substrate (0,25%(w/v) tamarind xyloglucan in water), 30 ⁇ of 0,2 M HAc NaOH pH 5, 20 ⁇ xyloglucanase sample or cellulase standard sample. These are incubated at 37 ° C for 2 hours. After incubation 25 ⁇ of each well are mixed with 125 ⁇ working reagent. These solutions are heated at 95°C for 5 minutes. After cooling down, the samples are analyzed by measuring the absorbance at 410 nm ( )io) as As (enzyme sample). The standard curve is determined and from that the enzyme activities are determined.
  • xyloglucan substrate (0,25%(w/v) tamarind xyloglucan in water) is mixed with 50 ⁇ 0.2 M sodium acetate buffer pH 5.0 and incubated at 37 °C for 2 hours.
  • 50 ⁇ . of xyloglucan substrate (0,25%(w/v) tamarind xyloglucan in water) is mixed with 50 ⁇ 0.2 M sodium acetate buffer pH 5.0 and incubated at 37 °C for 2 hours.
  • 125 ⁇ _. of working solution is added.
  • the samples are heated for 5 minutes at 95°C. After cooling down, the samples are ' analyzed by measuring the absorbance at 410 nm (A410) as ASB (substrate blank sample).
  • Activity is calculated as follows: xyloglucanase activity is determined by reference to a standard curve of the cellulase standard solution.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH5, GH12, GH16, GH44, and GH74 enzymes.
  • enzymes CL10564, CL04789, CL08035, CL03904, CL08085, CL07129, CL06253, CL00361 , CL08631 , CL10952, CL04578, CL05406, and CL01088 exhibit activity in this assay.
  • This assay measures the release of p-nitrophenol by the action of ⁇ -xylosidase on p- nitrophenyl ⁇ -D-xylopyranoside (PNPX).
  • PNPX p- nitrophenyl ⁇ -D-xylopyranoside
  • Sodium acetate buffer (0.1 M, pH 5.0) is prepared as follows. 8.2 g of anhydrous sodium acetate or 13.6 g of sodium acetate * 3H 2 O is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 6.0 g (5.72 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.1 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • PNPX from Extrasynthese (France, cat. # 4244) is used as the assay substrate. 16.5 mg of PNPX is dissolved in 5 mL of distilled water and 5 mL 0.1 M sodium acetate buffer using magnetic stirrer to obtain 2 mM stock solution. The solution is stable for 2 days on storage at 4 °C.
  • the stop reagent (0.25 M sodium carbonate solution) is prepared as follows. 26.5 g of anhydrous sodium carbonate is dissolved in 800 mL of distilled water, and the solution volume is adjusted to 1000 mL. This reagent is used to terminate the enzymatic reaction.
  • ⁇ 405 As (enzyme sample) - ASB (substrate blank);
  • DF is the enzyme dilution factor;
  • 21 is the dilution of 10 ⁇ enzyme solution in 210 ⁇ reaction volume;
  • 1.33 is the conversion factor of microtiter plates to cuvettes;
  • 13.700 is the extinction coefficient 13700 M '1 cm "1 of -nitrophenol released corrected for mol/L to ⁇ /mL; and RT is the reaction time in minutes.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH3, GH30, GH39, GH43, GH52, and GH54 enzymes.
  • enzymes such as, but not limited to, GH3, GH30, GH39, GH43, GH52, and GH54 enzymes.
  • CL07803 exhibits activity in this assay.
  • This assay measures the action of ⁇ -galactosidase on 5-Bromo-4-ch!oro-3-indolyl ⁇ - D-galactoside (X-Gal) to yield galactose and 5-bromo-4-chloro-3-hydroxyindole.
  • 5- bromo-4-chloro-3-hydroxyindole is oxidized into 5,5'-dibromo-4,4'-dichloro-indigo, which is an insoluble blue product.
  • Sodium acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1 g of anhydrous sodium acetate or 6.8 g of sodium acetate * 3H 2 0 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 3.0 g (2.86 mL) of glacial acetic acid is mixed with distilled water to total volume of 1000 mL (Solution B). The final 0.05 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is 5.0.
  • X-Gal from Fermentas (St. Leon Rot, Germany) is used as the assay substrate. 1.0 mg of X-Gal is dissolved in 10 mL 0.05 M sodium acetate buffer using magnetic stirrer.
  • the assay is performed as detailed below.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH2 and GH42 enzymes.
  • enzymes such as, but not limited to, GH2 and GH42 enzymes.
  • the enzymes CL00647 and CL01467 exhibit acivity in this assay.
  • This assay measures the release of >-nitrophenol by the action of acetyl esterase on p-nitrophenyl acetate (PNPAc).
  • PNPAc p-nitrophenyl acetate
  • One acetyl esterase unit of activity is the amount of enzyme that liberates 1 micromole ofp-nitrophenol in one minute at 37 °C and pH 5.
  • Sodium acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1 g of anhydrous sodium acetate or 6.8 g of sodium acetate * 3H 2 0 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 3.0 g (2.86 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.05 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is 5.0.
  • PNPAc from Fluka (Switzerland, cat. # 46021) is used as the assay substrate. 3.6 mg of PNPAc is dissolved in 10 mL of 0.05 M sodium acetate buffer using magnetic stirrer to obtain 2 mM stock solution. The solution is stable for 2 days on storage at 4 °C.
  • the stop reagent (0.25 M Tris-HCl, pH 8.8) is prepared as follows. 30.29 g of Tris is dissolved in 900 mL of distilled water (Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by mixing solution A with 37% HC1 until the pH of the resulting solution reached 8.8. The solution volume is adjusted to 1000 mL. This reagent is used to terminate the enzymatic reaction.
  • ⁇ 405 As (enzyme sample) - ASB (substrate blank)
  • DF is the enzyme dilution factor
  • 21 is the dilution of 10 ⁇ enzyme solution in 210 ⁇ reaction volume
  • 1.33 is the conversion factor of microtiter plates to cuvettes
  • 13.700 is the extinction coefficient 13700 M "1 cm '1 of p-nitrophenol released corrected for mol/L to ⁇ 1/ ⁇ _
  • RT is the reaction time in minutes.
  • This assay can be used to test the activity of enzymes such as, but not limited to, CE1 , CE2, CE3, CE4, CE5, CE6, CE7, CE12, CE13 and CE16 enzymes.
  • enzymes CL05183 and CL01692 exhibit activity in this assay.
  • the following example illustrates the assay used to measure arabinofuranosidase/arabinase activity. This assay measures the release of arabinose by the action of the a-arabinofuranosidase on linear and branched arabinan.
  • Sodium acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4. 1 g of anhydrous sodium acetate or 6.8 g of sodium acetate * 33 ⁇ 40 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 3.0 g (2.86 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.05 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • Linear and branched arabinan is purchased from British Sugar.
  • the enzyme sample (40-55 ⁇ g total protein) is incubated with 5 mg/mL of linear or branched arabinan in 50 mM sodium acetate buffer at 40°C during 24 hours. The reaction is stopped by heating the samples at 100°C for 10 minutes. The samples are centrifuged for 5 minutes at 10,000 x g. Degradation of the arabinan is followed by HPAEC analysis.
  • the analysis of the samples is performed using a Dionex HPLC system equipped with a Dionex CarboPac PA- 1 (2 mm ID x 250 mm) column in combination with a CarboPac PA guard column ( 1 mm ID x 25 mm) and a Dionex EDet l PAD-detector (Dionex Co., Sunnyvale).
  • a flow rate of 0.3 mL/min is used with the following gradient of sodium acetate in 0. 1 M NaOH: 0-40 min, 0-400 mM.
  • This assay can be used to test the activity of enzymes such as, but not limited to,
  • This example illustrates the assay used to determine the ability of a protein to bind chitin.
  • the bound proteins, including the matrix are heated for 1 0 minutes at 95°C in sample buffer and separated by SDS-gel electrophoresis. Specific bands from this gel are analyzed by MS/MS.
  • CL10251 , CL05157, and CL01848 are found to bind to chitin.
  • lichenan (which is a P(l ,3)-P(l ,4)-linked glucan).
  • CL 10952, CL04 78, CL05406, and CL01088 is found to bind to lichenan.
  • Sodium acetate buffer (0.2 M, pH 5.0) is prepared as follows. 16.4 g of anhydrous sodium acetate or 27.2 g of sodium acetate * 3H2O is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 12.0 g ( 1 1.44 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.2 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • AZO-wheat arabinoxylan from Megazyme (Bray, Ireland, Cat. # I- AWAXP) is used as the assay substrate. 1 g of AZO-WAX is suspended in 3 mL ethanol and adjusted to 100 mL with 0.2 sodium acetate buffer pH 5.0 using magnetic stirrer.
  • 0.2 mL of 10 mg/ml AZO-WAX stock solution is preheated at 40° C for 10 minutes.
  • This preheated stock solution is mixed with 0.2 mL of the enzyme sample (preheat at 40° C for 1 0 min) and incubated at 40 °C for 10 minutes. After exactly 10 minutes of incubation, 1 .0 mL of 96% ethanol is added and then the absorbance at 590 nm (A590) is measured as As (enzyme sample).
  • 0.2 mL of 10 mg/ml AZO-WAX stock solution is preheated at 40° C for 10 minutes.
  • This preheated stock solution is mixed with 200 ⁇ of 0.2 M sodium acetate buffer pH 5.0 (preheat at 40° C for 10 min) and incubated at 40 °C for 10 minutes. After exactly 10 minutes of incubation, 1.0 mL of 96% ethanol is added and then the absorbance at 590 nm (A 590 ) is measured as ASB (substrate blank).
  • Activity is calculated as follows: endo-xylanase activity is determined by reference to a standard curve, produced from an endo-xylanase with known activity towards AZO-WAX.
  • ⁇ 590 A s (enzyme sample) - A S B (substrate blank)
  • SC the slope of the standard curve
  • DF the enzyme dilution factor
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH5, GH8, Ghl O, and GH1 1.
  • enzymes such as, but not limited to, GH5, GH8, Ghl O, and GH1 1.
  • enzymnes CL10564, CL04789, and CL08035 exhibit activity in this assay.
  • This example illustrates the assay to measure xylanase activity.
  • This assay measures the release of xylose and xylo-oligosaccharides by the action of xylanases on wheat arabinoxylan oligosaccharides (WAX).
  • Sodium acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1 g of anhydrous sodium acetate or 6.8 g of sodium acetate * 3H 2 0 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 3.0 g (2.86 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.05 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is 5.0.
  • Wheat arabinoxylan is purchased from Megazyme (Bray Ireland, Cat. # P-WAXYI).
  • the analysis is performed using a Dionex HPLC system equipped with a Dionex CarboPac PA-1 (2 mm ID x 250 mm) column in combination with a CarboPac PA guard column (1 mm ID x 25 mm) and a Dionex EDetl PAD-detector (Dionex Co., Sunnyvale).
  • a flow rate of 0.3 mL/min is used with the following gradient of sodium acetate in 0.1 M NaOH: 0-50 min, 0-500 mM.
  • Each elution is followed by a washing step of 5 min 1 ,000 mM sodium acetate in 0.1 M NaOH and an equilibration step of 15 min O. l M NaOH.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH5, GH8, Ghl O, and GH1 1.
  • enzymes CL10564, CL04789, and CL08035 exhibit activity in this assay.
  • This example illustrates the assay to determine the ability of a protein to bind xylan.
  • the bound proteins including the matrix, are heated for 10 minutes at 95°C in sample buffer and separated by SDS-gel electrophoresis. Specific bands from this gel are analyzed by MS/MS.
  • the following example illustrates the assay to measure ⁇ -galactosidase activity.
  • This assay measures the release of p-nitrophenol by the action of ⁇ -galactosidase on p- nitrophenyl-P-D-galactopyranoside (PNPGa).
  • PNPGa p- nitrophenyl-P-D-galactopyranoside
  • One ⁇ -galactosidase unit of activity is the amount of enzyme that liberates 1 micromole of p-nitrophenol in one minute.
  • Mcllvain buffer (pH 4.0) is prepared as follows. 21.01 g of citric acid monohydrate ( ⁇ 63 ⁇ 4 ⁇ 7* ⁇ 2 ⁇ ) is dissolved in Millipore water so that the final volume of the solution to be 1000 mL (Solution A). In a separate flask, 53.62 g of Na 2 HP0 4 *7H 2 0 is dissolved in Millipore water to make the total volume of 1000 mL (Solution B). The final Mcllvain buffer, pH 4.0, is prepared by mixing 614.5 mL Solution A with 385.5 mL Solution B. The pH of the resulting solution is equal to 7.0
  • PNPGa from Fluka (Switzerland, cat. # 46021) is used as the assay substrate. 2.7 mg of PNPGa is dissolved in 10 mL of Mcllvain buffer using magnetic stirrer to obtain 1 .5 mM stock solution. The solution is stable for 2 days on storage at 4 °C.
  • the stop reagent (0.25 M sodium carbonate solution) is prepared as follows. 26.5 g of anhydrous sodium carbonate is dissolved in 800 mL of distilled water, and the solution volume is adjusted to 1000 mL. This reagent is used to terminate the enzymatic reaction.
  • ⁇ 410 A s (enzyme sample) - A S B (substrate blank);
  • DF the enzyme dilution factor;
  • 20 is the dilution of 50 ⁇ enzyme solution in 1000 ⁇ reaction volume;
  • 1.33 is the conversion factor of microtiter plates to cuvettes;
  • 13.700 is the extinction coefficient 13700 M "1 cm “1 of p-nitrophenol released corrected for mol/L to ⁇ /mL;
  • RT is the reaction time in minutes.
  • This assay can be used to test the activity of enzymes such as GH2 and GH24.
  • enzymes such as GH2 and GH24.
  • enzymnes CL00647 and CL01467 exhibit activity in this assay.
  • This example illustrates the assay to measure acetyl xylan esterase activity towards arabinoxylan oligosaccharides from Eucalyptus wood. This assay measures the release of acetate by the action of the acetyl xylan esterases on the arabinoxylan oligosaccharides.
  • Phosphate buffer (0.05 M, pH 7.0) is prepared as follows. 13.8 g of NaH 2 P0 4 * H 2 0 is dissolved in 1 L of Millipore water. 26.8 g is dissolved in Millipore water. 195 mL of the NaH 2 PC>4 solution is mixed with 305 mL of the Na 2 HP0 4 solution and adjusted to 1000 mL with Millipore water. The pH of the resulting solution is equal to 7.0.
  • AIS and Eucalyptus globulus xylan polymer are obtained using the method described in Lucas et al. 2002.
  • CE-LIF is performed using ProteomeLab PA800 Protein Characterization System (Beckman Coulter), controlled by 32 Karat Software.
  • the capillary column used is polyvinyl alcohol coated capillary (N-CHO capillary, Beckman Coulter), having 50 ⁇ ID, 50.2 cm length and 40 cm to detector window. 25 mM sodium acetate buffer pH 4.75 containing 0.4% polyethyleneoxide (Carbohydrate separation buffer, Beckman Coulter) is used as running buffer.
  • the sample (ca. 3.5 nL) is injected to the capillary by a pressure of 0.5 psi for 3 seconds. The separation is done for 20 minutes at 30 kV separating voltage, with reversed polarity.
  • the samples are stored at 10°C.
  • the labeled EW-XOS are detected using LIF detector at 488 nm excitation and 520 nm emission wavelengths.
  • This assay can be used to test the activity of enzymes such as, but not limited to, CE 1 , CE2, CE3, CE4, CE5, CE6, CE7, CE 12, and CE 16 enzymes.
  • enzymes CL051 83 and CL01692 exhibit activity in this assay.
  • This example illustrates the assay to measure ⁇ -glucosidase activity.
  • This assay measures the release of p-nitrophenol by the action of ⁇ -glucosidase on p- nitrophenyl ⁇ -D-glucopyranoside (PNPG).
  • PNPG p- nitrophenyl ⁇ -D-glucopyranoside
  • One ⁇ -glucosidase unit of activity is the amount of enzyme that liberates 1 micromole of p-nitrophenol in one minute.
  • Sodium acetate buffer (0.2 M, pH 5.0) is prepared as follows. 16.4 g of anhydrous sodium acetate or 27.2 g of sodium acetate * 3H 2 0 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 12 g ( 1 1 .44 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1 000 mL (Solution B). The final 0.1 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • PNPG from Sigma (St.Louis, USA) is used as the assay substrate. 20 mg of PNPG is dissolved in 5 mL of sodium acetate buffer using magnetic stirrer. The solution is stable for 2 days on storage at 4 °C. [0495] The stop reagent (0.25 M Tris-HCl, pH 8.8) is prepared as follows. 30.29 g of Tris is dissolved in 900 mL of distilled water (Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by mixing solution A with 37% HC1 until the pH of the resulting solution is equal to 8.8. The solution volume is adjusted to 1000 mL. This reagent is used to terminate the enzymatic reaction.
  • PNPG stock solution 0.025 mL of PNPG stock solution is mixed with 1 ⁇ of the enzyme sample, 0.075 mL buffer and 0.099 mL Millipore water and incubated at 37 °C for 4 minutes. Every minute during 4 minutes a 0.04 mL sample is taken and added to 0.06 mL stop reagent. The absorbance at 410 nm (Auo) is measured in microtiter plates as As (enzyme sample).
  • PNPG stock solution 0.025 mL of PNPG stock solution is mixed with 0.075 mL buffer and 0.1 mL Millipore water and incubated at 37 °C for 4 minutes. Every minute during 4 minutes a 0.04 mL sample is taken and added to 0.06 mL stop reagent.
  • the absorbance at 410 nm (Auo) is measured in microtiter plates as A S B (substrate blank sample).
  • dA slope in A/min
  • Va reaction volume in 1 (0.0002 1)
  • d dilution factor of assay mix after adding stop reagent (2.5);
  • extinction coefficient (0.0137 ⁇ "1 cm “1 );
  • Vp volume of protein stock added to assay (0.001 ml)
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH1 , GH3, GH9, and GH30 enzymes.
  • This example illustrates the assay to measure ⁇ -xylosidase activity. This assay measures the release of xylose by the action of ⁇ -xylosidase on xylobiose.
  • Sodium acetate buffer (0.05 M, pH 5.0) is prepared as follows. 4.1 g of anhydrous sodium acetate or 6.8 g of sodium acetate * 3H O is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 3.0 g (2.86 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0.05 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • Xylobiose is purchased from Megazyme (Bray Ireland, Cat. # P-WAXYI). 25 mg is dissolved in 5 mL sodium acetate buffer pH 5.0.
  • a) mL of 5.0 mg/mL substrate solution is mixed with 0.02 mL of the enzyme sample at 50 °C and pH 5.0 for 24 hours. The reaction is stopped by heating the samples for 10 minutes at 100°C. The release of xylose and arabinoxylan oligosaccharides are analyzed by High Performance Anion Exchange Chromatography.
  • a) mL of 5.0 mg mL substrate solution is mixed with 0.02 mL of the buffer at 50 °C and pH 5.0 for 24 hours. The reaction is stopped by heating the samples for 10 minutes at 100°C. The release of xylose and arabinoxylan oligosaccharides are analyzed by High Performance Anion Exchange Chromatography.
  • the analysis is performed using a Dionex HPLC system equipped with a Dionex CarboPac PA-1 (2 mm ID x 250 mm) column in combination with a CarboPac PA guard column (1 mm ID x 25 mm) and a Dionex EDetl PAD-detector (Dionex Co., Sunnyvale).
  • a flow rate of 0.25 mL/min is used with the following gradient of sodium acetate in 0.1 M NaOH: 0- 15 min, 0- 150 mM.
  • Each elution is followed by a washing step of 5 min using 1 M sodium acetate in 0.1 M NaOH and an equilibration step of 15 min using 0.1 M NaOH.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH3, GH30, GH39, GH43, GH52, and GH54 enzymes.
  • Reagent A 10 g of p-Hydroxy benzoic acid hydrazide (PAHBAH) is suspended in 60 mL water. 10 mL of concentrated hydrochloric acid is added and the volume is adjusted to 200 ml.
  • Reagent B 24.9 g of trisodium citrate is dissolved in 500 ml of water. To this solution 2.2 g of calcium chloride and 40 g sodium hydroxide are added. The volume is adjusted to 2 L with water. Both reagents are stored at room temperature.
  • Working Reagent 10 ml of Reagent A is added to 90 ml of Reagent B. This solution is prepared freshly every day, and is stored on ice between uses.
  • the assay is conducted in micro titer plate format.
  • Each well contains 50 ⁇ of ⁇ - glucan substrate ( 1 % (w/v) Barley ⁇ -glucan, laminarin, lichenan or curdlan in water), 30 ⁇ ⁇ of 0,2 HAc/NaOH pH 5, 20 ⁇ P- l ,3-glucanase sample. These are incubated at 37 ° C for 2 hours. After incubation 25 ⁇ of each well are mixed with 125 ⁇ working reagent. These solutions are heated at 95°C for 5 minutes. After cooling down, the samples are analyzed by measuring the absorbance at 41 0 nm ( l io) as As (enzyme sample). The standard curve is determined and from that the enzyme activities are determined.
  • Activity is calculated as follows: P- l ,3-glucanase activity is determined by reference to a standard curve of the cellulase standard solution.
  • SC substrate blank
  • DF enzyme dilution factor
  • This assay can be used to test the activity of enzymes such as the GH5, GH12, GH16, GH17, GH55, GH64 and GH81 enzymes.
  • enzymes CL10564, CL04789, CL08035, CL03904, CL08085, CL07129, CL06253, CL00361 , CL08631 , CL10952, CL04578, CL05406, CL01088, CL07062, CL01490, and CL08065 exhibit activity in this assay.
  • Such activity is demonstrated by using a- l ,6-linked mannobiose as the substrate and a D-mannose detection kit (Megazyme International) as the detection method, using a four enzyme coupled assay, using ATP and NADP+.
  • Reactions are conducted at 37°C in 100 mM MOPS (pH 7.0), containing 0.1 mM ZnS04, 1 mg mL-1 BSA, and 20 ⁇ of a-1 , 6-Mannanase sample.
  • Mannose liberated by a-1 , 6-Mannanase is phosphorylated to mannose-6-phosphate by hexokinase (HK).
  • Mannose-6-phosphate is subsequently converted to fructoses- phosphate by phosphomannose isomerase (PMI) which is then isomerized to glucose-6-phosphate by phosphoglucose isomerase (PGI) .
  • PMI phosphomannose isomerase
  • glucoses- phosphate is oxidized to gluconate-6-phosphate by glucose-6-phosphate dehydrogenase (G6P-DH)
  • G6P-DH glucose-6-phosphate dehydrogenase
  • the concurrent reduction of the NADP+ cofactor to NADPH is monitored at 340 nm using an extinction coefficient of 6223 (M-l cm-l ).
  • the enzymes are individually obtained from Sigma.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH38, GH76, and GH92 enzymes.
  • enzymes CL00928, CL02715, and CL10658 exhibit activity in this assay.
  • Example 19 illustrates the assay to measure Rhamnogalacturonyl hydrolase activity. Such activity is demonstrated by using rhamnogalacturonan as a substrate and and a reducing sugars assay (PAHBAH) as the detection method.
  • PAHBAH reducing sugars assay
  • Reagent A 10 g of p-Hydroxy benzoic acid hydrazide (PAHBAH) is suspended in 60 mL water. 10 mL of concentrated hydrochloric acid is added and the volume is adjusted to 200 ml.
  • Reagent B 24.9 g of trisodium citrate is dissolved in 500 ml of water. To this solution 2.2 g of calcium chloride and 40 g sodium hydroxide are added. The volume is adjusted to 2 L with water. Both reagents are stored at room temperature.
  • Working Reagent 10 ml of Reagent A is added to 90 ml of Reagent B. This solution is prepared freshly every day, and is stored on ice between uses.
  • the assay is conducted in micro titer plate format. Each well contains 50 ⁇ of rhamnogalacturonan substrate ( 1 %(w/v) in water), 30 ⁇ of 0,2 M HAc/ aOH pH 5, 20 ⁇ rhamnogalacturonyl hydrolase sample. These are incubated at 37°C for 2 hours. After incubation 25 ⁇ of each well are mixed with 125 ⁇ working reagent. These solutions are heated at 95°C for 5 minutes. After cooling down, the samples are analyzed by measuring the absorbance at 410 run (A410) as As (enzyme sample). The standard curve is determined and from that the enzyme activities are determined.
  • Activity is calculated as follows: rhamnogalacturonyl hydrolase activity is determined by reference to a standard curve of the cellulase standard solution.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH28 and GH105.
  • enzyme CL08540 exhibits activity in this assay.
  • the following examples illustrates the assay to measure a-Amylase activity. Such activity is demonstrated by using amylose as a substrate and a reducing sugars assay (PAHBAH) as the detection method.
  • PAHBAH reducing sugars assay
  • Reagent A 10 g of p-Hydroxy benzoic acid hydrazide (PAHBAH) is suspended in 60 mL water. 10 mL of concentrated hydrochloric acid is added and the volume is adjusted to 200 ml.
  • Reagent B 24.9 g of trisodium citrate is dissolved in 500 ml of water. To this solution 2.2 g of calcium chloride and 40 g sodium hydroxide are added. The volume is adjusted to 2 L with water. Both reagents are stored at room temperature.
  • Working Reagent 10 ml of Reagent A is added to 90 ml of Reagent B. This solution is prepared freshly every day, and is stored on ice between uses.
  • the assay is conducted in micro titer plate format. Each well contains 50 ⁇ of amylose substrate (0.15 % (w/v) in water), 30 ⁇ of 0,2 M HAc/NaOH pH 5, 20 ⁇ ⁇ -amylase sample. These are incubated at 37 ° C for 15 minutes. After incubation 25 ⁇ of each well are mixed with 125 ⁇ working reagent. These solutions are heated at 95 ° C for 5 minutes. After cooling down, the samples are analyzed by measuring the absorbance at 410 nm (A 10) as As (enzyme sample).
  • a-amylose substrate (0. 15 %(w/v) in water) is mixed with 50 iL 0.2 M sodium acetate buffer pH 5.0 and incubated at 37 °C for 15 minutes. To 25 ⁇ ⁇ of this reaction mixture, 125 ⁇ of working solution is added. The samples are heated for 5 minutes at 95°C. After cooling down, the samples are analyzed by measuring the absorbance at 410 nm (A 10) as A S B (substrate blank sample).
  • Activity is calculated as follows: a-amylase activity is determined by reference to a standard curve of the cellulase standard solution.
  • This assay can be used to test the activity of enzymes such as, but not limited to, GH 13 and GH57 enzymes.
  • enzymes such as, but not limited to, GH 13 and GH57 enzymes.
  • CL07198, CL07207, CL09312, CL07622, CL07554, CL07555, and CLOl 136 exhibit activity in this assay.
  • This example illustrates the assay to measure a-glucosidase activity.
  • This assay measures the release of ?-nitrophenol by the action of a-glucosidase on p- nitrophenyl a-D-glucopyranoside.
  • One ⁇ -glucosidase unit of activity is the amount of enzyme that liberates 1 micromole of -nitrophenol in one minute.
  • Sodium acetate buffer (0.2 M, pH 5.0) is prepared as follows. 1 6.4 g of anhydrous sodium acetate or 27.2 g of sodium acetate * 3H 2 0 is dissolved in distilled water to a final volume of 1000 mL (Solution A). In a separate flask, 12 g (1 1.44 mL) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 mL (Solution B). The final 0. 1 M sodium acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • p-nitrophenyl a-D-glucopyranoside (3 mM) from Sigma (#N1377) is used as the assay substrate. 4.52 mg of -nitrophenyl ⁇ -D-glucopyranoside is dissolved in 5 mL of sodium acetate buffer using magnetic stirrer.
  • the stop reagent (0.25 M Tris-HCl, pH 8.8) is prepared as follows. 30.29 g of Tris is dissolved in 900 mL of distilled water (Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by mixing solution A with 37% HC1 until the pH of the resulting solution is equal to 8.8. The solution volume is adjusted to 1000 mL. This reagent is used to terminate the enzymatic reaction.
  • 0.025 mL of -nitrophenyl ⁇ -D-glucopyranoside stock solution is mixed with 0.075 mL buffer and 0.1 mL Millipore water and incubated at 37 °C for 4 minutes. Every minute during 4 minutes a 0.04 mL sample is taken and added to 0.06 mL stop reagent.
  • the absorbance at 410 nm (A410) is measured in microtiter plates as ASB (substrate blank sample).
  • [protein] protein stock concentration in mg/ml
  • Vp volume of protein stock added to assay in ml.
  • This assay can be used to test the activity of enzymes such as, but not limited to, the GH4, GH13, GH31 and GH63 enzymes.
  • enzymes CL02470, CL00553, CL05538, CL05124, CL07846, and CL04536 exhibit activity in this enzyme.
  • This assay measures the release of /?-nitrophenol by the action of glucoamylase on p- nitrophenyl-P-maltoside (PNPM).
  • PNPM p- nitrophenyl-P-maltoside
  • Acetate buffer (0.1 M, pH 5.0) is prepared as follows: 8.2 g of anhydrous sodium acetate or 13.6 g of sodium acetate * 3H2O is dissolved in distilled water so that the final volume of the solution is 1000 ml (Solution A). In a separate flask, 6.0 g (5.72 ml) of glacial acetic acid is mixed with distilled water to make the total volume of 1000 ml (Solution B). The final 0.1 M acetate buffer, pH 5.0, is prepared by mixing Solution A with Solution B until the pH of the resulting solution is equal to 5.0.
  • PNPM Sigma-Aldrich, cat. # N1884. is used as the assay substrate. 18.54 mg of PNPM is dissolved in 5 mL of distilled water and 5 mL 0.1 M acetate buffer using a magnetic stirrer to obtain a 4 mM stock solution.
  • the stop reagent (0. 1 M sodium tetraborate solution) is prepared as follows: 20.1 g of anhydrous sodium tetraborate is dissolved in 800 ml of distilled water, and the solution volume is adjusted to 1000 ml. This reagent is used to terminate the enzymatic reaction.
  • ⁇ 05 As - ASB, DF is the enzyme dilution factor; 21 is the dilution of 10 ⁇ enzyme solution in 210 ⁇ reaction volume; 1.33 is the conversion factor of microtiter plates to cuvettes; 13.700 is the extinction coefficient 13700 M "1 cm '1 of 7-nitrophenol released corrected for mol/L to ⁇ /mL; and 360 minutes is the reaction time.
  • This assay can be used to test the activity of enzymes such as, but not limited to, the
  • enzyme CL01049 exhibits activity in this assay.
  • gliicanase activity is demonstrated by using a glucan (e.g. dextran, glycogen, pullulan, amylose, amylopectin, cellulose, curdlan, laminarin, chrysolaminarin, lentinan, lichenin, pleuran, zymosan, etc.) as the substrate and a reducing sugars assay (PAHBAH) as the detection method.
  • glucan e.g. dextran, glycogen, pullulan, amylose, amylopectin, cellulose, curdlan, laminarin, chrysolaminarin, lentinan, lichenin, pleuran, zymosan, etc.
  • PAHBAH reducing sugars assay
  • Reagent A 10 g of p-Hydroxy benzoic acid hydrazide (PAHBAH) is suspended in 60 mL water. 10 mL of concentrated hydrochloric acid is added and the volume is adjusted to 200 ml.
  • Reagent B 24.9 g of trisodium citrate is dissolved in 500 ml of water. To this solution 2.2 g of calcium chloride and 40 g sodium hydroxide are added. The volume is adjusted to 2 L with water. Both reagents are stored at room temperature.
  • Working Reagent 10 ml of Reagent A is added to 90 ml of Reagent B. This solution is prepared freshly every day, and is stored on ice between uses.
  • the assay is conducted in micro titer plate format. Each well contains 50 ⁇ of glucan substrate (1 % (w/v) glucan in water), 30 ⁇ of 0,2 M HAc NaOH pH 5, 20 ⁇ endo-glucanase sample. These are incubated at 37°C for 2 hours. After incubation 25 ⁇ of each well are mixed with 125 ⁇ working reagent. These solutions are heated at 95°C for 5 minutes. After cooling down, the samples are analyzed by measuring the absorbance at 410 nm (A410) as A s (enzyme sample). The standard curve is determined and from that the enzyme activities are determined.
  • Activity is calculated as follows: endo-glucanase activity is determined by reference to a standard curve of the cellulase standard solution.
  • substrate blank (substrate blank); SC is the slope of the standard curve; and DF is the enzyme dilution factor.
  • This assay can be used to test the activity of enzymes such as, but not limited to, the GH5, GH6, GH7, GH8, GH9, GH12, GH13, GH14, GH15, GH16, GH17, GH30, GH44, GH48, GH49, GH51 , GH55, GH57, GH64, GH71 , GH74, GH81 enzymes.
  • enzymes such as, but not limited to, the GH5, GH6, GH7, GH8, GH9, GH12, GH13, GH14, GH15, GH16, GH17, GH30, GH44, GH48, GH49, GH51 , GH55, GH57, GH64, GH71 , GH74, GH81 enzymes.
  • the following example illustrates the assay that will be used to measure the ferulic acid esterase enzymatic activity.
  • This assay measures the release of p-nitrophenol by the action of ferulic acid esterase on p-nitrophenylbutyrate (PNBu).
  • PNBu p-nitrophenylbutyrate
  • One ferulic acid esterase unit of activity is the amount of enzyme that liberates 1 micromole of -nitrophenol in one minute at 37°C and pH 7.2.
  • Phosphate buffer (0.01 M, pH 7.2) is prepared as follows: 0.124 g Of NaH2P04 * H20 and 0.178 g Na2HP04 are dissolved in distilled water so that the final volume of the solution is 500 ml and the pH of the resulting solution is equal to 7.2.
  • PNPBu (Sigma, USA, cat. # N9876-5G) is used as the assay substrate. 10 ⁇ of PNPBu is mixed with 25 ml of 0.01 M phosphate buffer using a magnetic stirrer to obtain a 2 mM stock solution. The solution is stable for 2 days with storage at 4°C.
  • the stop reagent (0.25 M Tris-HCl, pH 8.5) is prepared as follows: 30.29 g of Tris is dissolved in 900 ml of distilled water (Solution A). The final 0.25 M Tris-HCl pH 8.5 is prepared by mixing solution A with 37% HC1 until the pH of the resulting solution is equal to 8.5. The solution volume is adjusted to 1000 ml. This reagent is used to terminate the enzymatic reaction. Using the above reagents, the assay is performed as detailed below.
  • ⁇ 405 As - ASB, DF is the enzyme dilution factor; 21 is the dilution of 10 ⁇ enzyme solution in 210 ⁇ reaction volume; 1.33 is the conversion factor of microtiter plates to cuvettes; 13.700 is the extinction coefficient 13700 M "1 cm '1 of p-nitrophenol released corrected for mol/L to ⁇ / ⁇ ; and 10 minutes is the reaction time.
  • This assay can be used to test the activity of enzymes such as,, but not limited to, CL05183 and CL01692 enzymes.
  • the following assay is used to measure the enzymatic activity of a ferulic acid esterase towards wheat bran (WB) oligosaccharides by measuring the release of ferulic acid.
  • Wheat bran oligosaccharides are prepared by degradation of wheat bran (obtained from Nedalco, The Netherlands) by endo-xylanase III from A. niger (enzyme collection Laboratory of Food Chemistry, Wageningen University, The Netherlands). 50 mg of WB is dissolved in 10 ml of 0.05 M acetate buffer pH 5.0 using a magnetic stirrer. 1.0 ml of WB stock solution is mixed with 0.0075 mg of the enzyme and incubated at 35°C for 24 hours. The reaction is stopped by heating the samples for 10 minutes at l OOOC. The residual material is removed by centrifugation (15 minutes at 14000 rpm), and the supernatant is used as the substrate in the assay detailed below.
  • This assay can be used to test the activity of enzymes such as, but not limited to, CL05183 and CL01692 enzymes.
  • the following assay was used to measure the cellulase enhancing activity of expansines. This assay measures the release of glucose by the action of cellulases with and without addition of expansin on pretreated corn stover.
  • a cellulase mixture derived from Myceliophthora thermophila CI was incubated with acid pretreated corn stover (substrate), alone and in combination with Expansin (CL10197) from M. thermophila CI .
  • the release of glucose was analyzed by GOPOD assay (Megazyme).
  • the experiments were performed at pH 5.0 and at 50°C, during 24 and 72 hours.
  • the ratio E/S was 0.4% (4.0 ⁇ g enzyme/mg substrate) for the CI cellulase mixture, the ratio E/S was 0.08% (0.8 ⁇ g enzyme/mg substrate) for the expansin. After incubation the samples were centriiuged and the supernatants were analyzed by the GOPOD assay according to the supplier.
  • the CI cellulase mixture showed a glucose release of 24 g/L after 24 hours of incubation and 32 g L after 72 hours.
  • Addition of expansin to the cellulase mixture increased the glucose release to 26 g/L and 34 g/L, after 24 and 72 hours of incubation, respectively.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Detergent Compositions (AREA)

Abstract

La présente invention concerne de nouvelles enzymes et de nouvelles méthodes pour les produire. Plus précisément, la présente invention concerne diverses enzymes fongiques. Des molécules d'acide nucléique codant ces enzymes, des compositions, des cellules hôtes recombinantes et génétiquement modifiées et des méthodes d'utilisation sont décrites. L'invention concerne également une méthode de conversion d'une biomasse lignocellulosique en sucres fermentables avec des enzymes qui dégradent la matière lignocellulosique et de nouvelles combinaisons d'enzymes, notamment des combinaisons qui permettent une libération synergique de sucres d'une biomasse végétale. L'invention concerne également une méthode permettant de libérer le contenu d'une cellule par la dégradation des parois cellulaires. L'invention concerne en outre des méthodes d'utilisation des nouvelles enzymes et de compositions contenant ces enzymes dans divers autres procédés, comme le lavage des vêtements, les procédés de nettoyage, le bioraffinage, le désencrage et le bioblanchiment du papier et de la pâte à papier, et le traitement des déchets.
PCT/US2011/047734 2010-08-13 2011-08-15 Nouvelles enzymes fongiques Ceased WO2012021883A2 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US37376210P 2010-08-13 2010-08-13
US61/373,762 2010-08-13
US60/373,762 2010-08-13

Publications (2)

Publication Number Publication Date
WO2012021883A2 true WO2012021883A2 (fr) 2012-02-16
WO2012021883A3 WO2012021883A3 (fr) 2012-05-10

Family

ID=45568239

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2011/047734 Ceased WO2012021883A2 (fr) 2010-08-13 2011-08-15 Nouvelles enzymes fongiques

Country Status (1)

Country Link
WO (1) WO2012021883A2 (fr)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2609195A4 (fr) * 2010-08-23 2014-03-05 Codexis Inc Enzymes recombinantes capables de dégrader la lignocellulose utilisables en vue de la production de sucres solubles à partir de biomasse cellulosique
US10240159B2 (en) 2012-01-05 2019-03-26 Glykos Finland Oy Protease deficient filamentous fungal cells and methods of use thereof
CN110241249A (zh) * 2019-07-18 2019-09-17 华中农业大学 一种快速检测覆土中双孢蘑菇疣孢霉病病原菌的引物及方法
US10513724B2 (en) 2014-07-21 2019-12-24 Glykos Finland Oy Production of glycoproteins with mammalian-like N-glycans in filamentous fungi
WO2021055395A1 (fr) 2019-09-16 2021-03-25 Novozymes A/S Polypeptides dotés d'une activité bêta-glucanase et polynucléotides codant pour ces polypeptides
US20210095232A1 (en) * 2015-04-29 2021-04-01 The Procter & Gamble Company Method of treating a fabric
US20220033739A1 (en) * 2018-10-11 2022-02-03 Novozymes A/S Cleaning compositions and uses thereof
WO2023225459A2 (fr) 2022-05-14 2023-11-23 Novozymes A/S Compositions et procédés de prévention, de traitement, de suppression et/ou d'élimination d'infestations et d'infections phytopathogènes

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE602005024918D1 (fr) * 2004-01-16 2011-01-05 Novozymes Inc
CN102943085A (zh) * 2005-03-15 2013-02-27 维莱尼姆公司 纤维素酶、编码它们的核酸及其制备和应用的方法
EA015172B1 (ru) * 2006-02-23 2011-06-30 ДСМ АйПи АССЕТС Б.В. Новые липазы и их применение
CA2655645A1 (fr) * 2006-06-22 2007-12-27 Iogen Energy Corporation Compositions d'enzymes pour l'hydrolyse enzymatique amelioree de la cellulose et procedes d'utilisation de celles-ci
WO2008008793A2 (fr) * 2006-07-10 2008-01-17 Dyadic International Inc. Procédé et compositions pour dégradation de matériaux lignocellulosiques
WO2008095033A2 (fr) * 2007-01-30 2008-08-07 Verenium Corporation Enzymes pour le traitement de matières lignocellulosiques, des acides nucléiques les codant et procédés pour leur fabrication et leur utilisation
CA2736661A1 (fr) * 2007-09-07 2009-03-12 Dyadic International, Inc. Enzymes fongiques inedites
EP2260105B1 (fr) * 2008-02-29 2016-08-17 The Trustees Of The University Of Pennsylvania Production et utilisation de matériaux dégradant les plantes

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2609195A4 (fr) * 2010-08-23 2014-03-05 Codexis Inc Enzymes recombinantes capables de dégrader la lignocellulose utilisables en vue de la production de sucres solubles à partir de biomasse cellulosique
US10240159B2 (en) 2012-01-05 2019-03-26 Glykos Finland Oy Protease deficient filamentous fungal cells and methods of use thereof
US10731168B2 (en) 2012-01-05 2020-08-04 Glykos Finland Oy Protease deficient filamentous fungal cells and methods of use thereof
US11180767B2 (en) 2012-01-05 2021-11-23 Glykos Finland Oy Protease deficient filamentous fungal cells and methods of use thereof
US11827891B2 (en) 2012-01-05 2023-11-28 Vtt Technical Research Centre Of Finland Ltd Protease deficient filamentous fungal cells and methods of use thereof
US10513724B2 (en) 2014-07-21 2019-12-24 Glykos Finland Oy Production of glycoproteins with mammalian-like N-glycans in filamentous fungi
US20210095232A1 (en) * 2015-04-29 2021-04-01 The Procter & Gamble Company Method of treating a fabric
US20220033739A1 (en) * 2018-10-11 2022-02-03 Novozymes A/S Cleaning compositions and uses thereof
CN110241249A (zh) * 2019-07-18 2019-09-17 华中农业大学 一种快速检测覆土中双孢蘑菇疣孢霉病病原菌的引物及方法
WO2021055395A1 (fr) 2019-09-16 2021-03-25 Novozymes A/S Polypeptides dotés d'une activité bêta-glucanase et polynucléotides codant pour ces polypeptides
US12275967B2 (en) 2019-09-16 2025-04-15 Novozymes A/S Processes for producing fermentation products and compositions used therein
WO2023225459A2 (fr) 2022-05-14 2023-11-23 Novozymes A/S Compositions et procédés de prévention, de traitement, de suppression et/ou d'élimination d'infestations et d'infections phytopathogènes

Also Published As

Publication number Publication date
WO2012021883A3 (fr) 2012-05-10

Similar Documents

Publication Publication Date Title
EP2197893B1 (fr) Enzymes fongiques inédites
WO2012027374A2 (fr) Glucides-hydrolases fongiques inédites
CA2657684C (fr) Construction de compositions de cellulase hautement efficaces pour une hydrolyse enzymatique de la cellulose
US7923236B2 (en) Fungal enzymes
US9447400B2 (en) Beta-glucosidase I variants with improved properties
US9133448B2 (en) Polypeptide having cellobiohydrolase activity and uses thereof
EP3318574B1 (fr) Polypeptide doté d'une activité bêta-glucosidase et utilisations associées
US20130280764A1 (en) Method of improving the activity of cellulase enzyme mixtures in the saccharification (ligno)cellulosic material
WO2012018691A2 (fr) Nouvelles enzymes fongiques
US8790894B2 (en) Mutant cellobiohydrolase
CN102597213A (zh) 篮状菌转化体
BR112012033404B1 (pt) célula recombinante, método para a preparação de um polipeptídeo, composição, método para o tratamento de um substrato que compreende material de carboidrato e uso de um polipeptídeo
EA023558B1 (ru) ТЕРМОСТАБИЛЬНЫЙ ПОЛИПЕПТИД С АКТИВНОСТЬЮ β-ЦЕЛЛОБИОГИДРОЛАЗЫ И ЕГО ПРИМЕНЕНИЕ ДЛЯ ПРОИЗВОДСТВА САХАРОВ ИЗ ЛИГНОЦЕЛЛЮЛОЗНОГО МАТЕРИАЛА
CN103391999B (zh) 具有乙酰基木聚糖酯酶活性的多肽及其用途
WO2012021883A2 (fr) Nouvelles enzymes fongiques
MX2012014990A (es) Polipeptido que tiene actividad suolenina y sus usos.
WO2012078741A2 (fr) Nouvelles estérases fongiques
BR122019021575B1 (pt) Célula recombinante, método para a preparação de um polipeptídeo, composição compreendendo um polipleptídeo, método para o tratamento de um substrato que compreende material de carboidrato e uso de um polipeptídeo

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11817166

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11817166

Country of ref document: EP

Kind code of ref document: A2