WO1996001320A2 - Sequence genomique complete du virus autographa californica de la polyhedrose nucleaire - Google Patents
Sequence genomique complete du virus autographa californica de la polyhedrose nucleaire Download PDFInfo
- Publication number
- WO1996001320A2 WO1996001320A2 PCT/IB1995/000578 IB9500578W WO9601320A2 WO 1996001320 A2 WO1996001320 A2 WO 1996001320A2 IB 9500578 W IB9500578 W IB 9500578W WO 9601320 A2 WO9601320 A2 WO 9601320A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- acnpv
- orf
- virus
- gene
- Prior art date
Links
- 241000201370 Autographa californica nucleopolyhedrovirus Species 0.000 title abstract description 76
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 355
- 241000700605 Viruses Species 0.000 claims abstract description 151
- 241000701447 unidentified baculovirus Species 0.000 claims abstract description 49
- 238000013518 transcription Methods 0.000 claims abstract description 35
- 230000035897 transcription Effects 0.000 claims abstract description 35
- 102000004169 proteins and genes Human genes 0.000 claims description 80
- 241000238631 Hexapoda Species 0.000 claims description 76
- 210000004027 cell Anatomy 0.000 claims description 69
- 108091026890 Coding region Proteins 0.000 claims description 54
- 238000000034 method Methods 0.000 claims description 43
- 239000013604 expression vector Substances 0.000 claims description 39
- 108020004705 Codon Proteins 0.000 claims description 35
- 150000001413 amino acids Chemical class 0.000 claims description 31
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 27
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 26
- 229920001184 polypeptide Polymers 0.000 claims description 24
- 230000010076 replication Effects 0.000 claims description 19
- 238000004519 manufacturing process Methods 0.000 claims description 17
- 102000040430 polynucleotide Human genes 0.000 claims description 9
- 108091033319 polynucleotide Proteins 0.000 claims description 9
- 239000002157 polynucleotide Substances 0.000 claims description 9
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 claims description 8
- 101000953580 Pseudomonas phage Pf1 8.6 kDa protein Proteins 0.000 claims description 8
- 101000953577 Pseudomonas phage Pf3 7.9 kDa protein Proteins 0.000 claims description 8
- 108091005804 Peptidases Proteins 0.000 claims description 7
- 101000850960 Pseudomonas phage Pf1 3.2 kDa protein Proteins 0.000 claims description 7
- -1 ORF 32 Proteins 0.000 claims description 6
- 239000004365 Protease Substances 0.000 claims description 6
- 108020003175 receptors Proteins 0.000 claims description 5
- 102000005962 receptors Human genes 0.000 claims description 5
- 241000193388 Bacillus thuringiensis Species 0.000 claims description 4
- 101710091045 Envelope protein Proteins 0.000 claims description 4
- 108700028146 Genetic Enhancer Elements Proteins 0.000 claims description 4
- 241000700721 Hepatitis B virus Species 0.000 claims description 4
- 101710138657 Neurotoxin Proteins 0.000 claims description 4
- 101710188315 Protein X Proteins 0.000 claims description 4
- 241000239226 Scorpiones Species 0.000 claims description 4
- 229940097012 bacillus thuringiensis Drugs 0.000 claims description 4
- 239000002581 neurotoxin Substances 0.000 claims description 4
- 231100000618 neurotoxin Toxicity 0.000 claims description 4
- 239000002243 precursor Substances 0.000 claims description 4
- 230000002194 synthesizing effect Effects 0.000 claims description 4
- 101800000385 Transmembrane protein Proteins 0.000 claims description 3
- 239000005556 hormone Substances 0.000 claims description 3
- 229940088597 hormone Drugs 0.000 claims description 3
- 101710132601 Capsid protein Proteins 0.000 claims description 2
- 101710151559 Crystal protein Proteins 0.000 claims description 2
- 102000003951 Erythropoietin Human genes 0.000 claims description 2
- 108090000394 Erythropoietin Proteins 0.000 claims description 2
- 101710177291 Gag polyprotein Proteins 0.000 claims description 2
- 101001010573 Heliothis virescens Juvenile hormone esterase Proteins 0.000 claims description 2
- 101001111439 Homo sapiens Beta-nerve growth factor Proteins 0.000 claims description 2
- 101100005713 Homo sapiens CD4 gene Proteins 0.000 claims description 2
- 101001033280 Homo sapiens Cytokine receptor common subunit beta Proteins 0.000 claims description 2
- 101000987586 Homo sapiens Eosinophil peroxidase Proteins 0.000 claims description 2
- 101000920686 Homo sapiens Erythropoietin Proteins 0.000 claims description 2
- 101001002657 Homo sapiens Interleukin-2 Proteins 0.000 claims description 2
- 101001076408 Homo sapiens Interleukin-6 Proteins 0.000 claims description 2
- 101000611183 Homo sapiens Tumor necrosis factor Proteins 0.000 claims description 2
- 108090000144 Human Proteins Proteins 0.000 claims description 2
- 102000003839 Human Proteins Human genes 0.000 claims description 2
- 241000713772 Human immunodeficiency virus 1 Species 0.000 claims description 2
- 241000713340 Human immunodeficiency virus 2 Species 0.000 claims description 2
- 102000006992 Interferon-alpha Human genes 0.000 claims description 2
- 108010047761 Interferon-alpha Proteins 0.000 claims description 2
- 102000014150 Interferons Human genes 0.000 claims description 2
- 108010050904 Interferons Proteins 0.000 claims description 2
- 101710125418 Major capsid protein Proteins 0.000 claims description 2
- 241000255908 Manduca sexta Species 0.000 claims description 2
- 101000879976 Manduca sexta Eclosion hormone Proteins 0.000 claims description 2
- 241001481690 Mesobuthus eupeus Species 0.000 claims description 2
- 108091000080 Phosphotransferase Proteins 0.000 claims description 2
- 101710192141 Protein Nef Proteins 0.000 claims description 2
- 101710150344 Protein Rev Proteins 0.000 claims description 2
- 101800001271 Surface protein Proteins 0.000 claims description 2
- 108010015780 Viral Core Proteins Proteins 0.000 claims description 2
- 239000000427 antigen Substances 0.000 claims description 2
- 108091007433 antigens Proteins 0.000 claims description 2
- 102000036639 antigens Human genes 0.000 claims description 2
- 238000013320 baculovirus expression vector system Methods 0.000 claims description 2
- 239000002158 endotoxin Substances 0.000 claims description 2
- 229940105423 erythropoietin Drugs 0.000 claims description 2
- 108010027225 gag-pol Fusion Proteins Proteins 0.000 claims description 2
- 102000055647 human CSF2RB Human genes 0.000 claims description 2
- 102000044890 human EPO Human genes 0.000 claims description 2
- 102000055277 human IL2 Human genes 0.000 claims description 2
- 102000057041 human TNF Human genes 0.000 claims description 2
- 229940116886 human interleukin-6 Drugs 0.000 claims description 2
- 230000010354 integration Effects 0.000 claims description 2
- 229940079322 interferon Drugs 0.000 claims description 2
- 102000020233 phosphotransferase Human genes 0.000 claims description 2
- 108010089520 pol Gene Products Proteins 0.000 claims description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 claims description 2
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 claims 4
- 241001219494 Androctonus australis hector Species 0.000 claims 1
- 101000960969 Homo sapiens Interleukin-5 Proteins 0.000 claims 1
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 claims 1
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 claims 1
- 229960000187 tissue plasminogen activator Drugs 0.000 claims 1
- 108700026244 Open Reading Frames Proteins 0.000 abstract description 213
- 239000002773 nucleotide Substances 0.000 abstract description 45
- 125000003729 nucleotide group Chemical group 0.000 abstract description 45
- 238000004458 analytical method Methods 0.000 abstract description 34
- 230000006870 function Effects 0.000 abstract description 32
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 abstract description 9
- 229930182817 methionine Natural products 0.000 abstract description 9
- 230000004543 DNA replication Effects 0.000 abstract description 7
- 230000000977 initiatory effect Effects 0.000 abstract description 7
- 230000033228 biological regulation Effects 0.000 abstract description 5
- 108020005202 Viral DNA Proteins 0.000 abstract description 4
- 108020004414 DNA Proteins 0.000 description 104
- 235000018102 proteins Nutrition 0.000 description 75
- 239000012634 fragment Substances 0.000 description 53
- 230000014509 gene expression Effects 0.000 description 42
- 208000015181 infectious disease Diseases 0.000 description 40
- 101150066555 lacZ gene Proteins 0.000 description 40
- 108091028043 Nucleic acid sequence Proteins 0.000 description 32
- 230000029812 viral genome replication Effects 0.000 description 30
- 238000004113 cell culture Methods 0.000 description 28
- 230000002458 infectious effect Effects 0.000 description 28
- 101710182846 Polyhedrin Proteins 0.000 description 24
- 241000256251 Spodoptera frugiperda Species 0.000 description 24
- 108091081024 Start codon Proteins 0.000 description 23
- 235000001014 amino acid Nutrition 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 23
- 230000014616 translation Effects 0.000 description 23
- 241001367049 Autographa Species 0.000 description 22
- 229940024606 amino acid Drugs 0.000 description 22
- 108091008146 restriction endonucleases Proteins 0.000 description 21
- 238000013519 translation Methods 0.000 description 21
- 239000013598 vector Substances 0.000 description 21
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 20
- 108091007065 BIRCs Proteins 0.000 description 20
- 230000027455 binding Effects 0.000 description 19
- 230000014621 translational initiation Effects 0.000 description 19
- 230000003612 virological effect Effects 0.000 description 19
- 230000000692 anti-sense effect Effects 0.000 description 17
- 239000002299 complementary DNA Substances 0.000 description 16
- 238000011144 upstream manufacturing Methods 0.000 description 16
- 108020004999 messenger RNA Proteins 0.000 description 15
- 230000004048 modification Effects 0.000 description 15
- 238000012986 modification Methods 0.000 description 15
- 101100144928 Autographa californica nuclear polyhedrosis virus PNK/PNL gene Proteins 0.000 description 14
- 102000055031 Inhibitor of Apoptosis Proteins Human genes 0.000 description 14
- 108010022172 Chitinases Proteins 0.000 description 13
- 108010005774 beta-Galactosidase Proteins 0.000 description 13
- 108700026226 TATA Box Proteins 0.000 description 12
- 230000002068 genetic effect Effects 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 241000894007 species Species 0.000 description 12
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 11
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 11
- 108700039887 Essential Genes Proteins 0.000 description 11
- 239000013600 plasmid vector Substances 0.000 description 11
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 11
- 102000012286 Chitinases Human genes 0.000 description 10
- 108700010070 Codon Usage Proteins 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 10
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 10
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 10
- 230000002939 deleterious effect Effects 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 229920000669 heparin Polymers 0.000 description 10
- 229960002897 heparin Drugs 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 239000011701 zinc Substances 0.000 description 10
- 229910052725 zinc Inorganic materials 0.000 description 10
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 9
- 108060004795 Methyltransferase Proteins 0.000 description 9
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 9
- 239000011159 matrix material Substances 0.000 description 9
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 8
- 102000009572 RNA Polymerase II Human genes 0.000 description 8
- 108010009460 RNA Polymerase II Proteins 0.000 description 8
- 108700009124 Transcription Initiation Site Proteins 0.000 description 8
- 108091023040 Transcription factor Proteins 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 108700003860 Bacterial Genes Proteins 0.000 description 7
- 102000053602 DNA Human genes 0.000 description 7
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 102000005936 beta-Galactosidase Human genes 0.000 description 7
- 238000013461 design Methods 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 230000018109 developmental process Effects 0.000 description 7
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 6
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 6
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 6
- 101150099406 GTA gene Proteins 0.000 description 6
- 101710141347 Major envelope glycoprotein Proteins 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- 238000012300 Sequence Analysis Methods 0.000 description 6
- 102000040945 Transcription factor Human genes 0.000 description 6
- 230000006907 apoptotic process Effects 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 5
- 108060002716 Exonuclease Proteins 0.000 description 5
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 102000013165 exonuclease Human genes 0.000 description 5
- 239000002917 insecticide Substances 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 101000818108 Acholeplasma phage L2 Uncharacterized 81.3 kDa protein Proteins 0.000 description 4
- 101000743047 Autographa californica nuclear polyhedrosis virus Protein AC23 Proteins 0.000 description 4
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 4
- 101710197780 E3 ubiquitin-protein ligase LAP Proteins 0.000 description 4
- 101000912350 Haemophilus phage HP1 (strain HP1c1) DNA N-6-adenine-methyltransferase Proteins 0.000 description 4
- 108700005087 Homeobox Genes Proteins 0.000 description 4
- 101000879661 Homo sapiens Chitotriosidase-1 Proteins 0.000 description 4
- 101000790844 Klebsiella pneumoniae Uncharacterized 24.8 kDa protein in cps region Proteins 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 108091081548 Palindromic sequence Proteins 0.000 description 4
- 108700005077 Viral Genes Proteins 0.000 description 4
- 241000607479 Yersinia pestis Species 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 210000004748 cultured cell Anatomy 0.000 description 4
- 235000018417 cysteine Nutrition 0.000 description 4
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical class C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 108010013770 ecdysteroid UDP-glucosyltransferase Proteins 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 4
- 239000003112 inhibitor Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 239000000575 pesticide Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000002123 temporal effect Effects 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 101000748781 Anthoceros angustus Uncharacterized 3.0 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- 101100335652 Autographa californica nuclear polyhedrosis virus GP64 gene Proteins 0.000 description 3
- 108010084457 Cathepsins Proteins 0.000 description 3
- 102100037328 Chitotriosidase-1 Human genes 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- 101000792449 Cyanophora paradoxa Uncharacterized 3.4 kDa protein in atpE-petA intergenic region Proteins 0.000 description 3
- 108050006400 Cyclin Proteins 0.000 description 3
- 238000001712 DNA sequencing Methods 0.000 description 3
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 3
- 101001117015 Escherichia coli (strain K12) Poly(A) polymerase I Proteins 0.000 description 3
- 108700024394 Exon Proteins 0.000 description 3
- 102000003886 Glycoproteins Human genes 0.000 description 3
- 108090000288 Glycoproteins Proteins 0.000 description 3
- 101000702559 Homo sapiens Probable global transcription activator SNF2L2 Proteins 0.000 description 3
- 101000702545 Homo sapiens Transcription activator BRG1 Proteins 0.000 description 3
- 101150032161 IAP1 gene Proteins 0.000 description 3
- 102100024319 Intestinal-type alkaline phosphatase Human genes 0.000 description 3
- 101000626970 Marchantia polymorpha Uncharacterized 3.3 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 3
- 101150092861 ORF71 gene Proteins 0.000 description 3
- 101150071814 ORF86 gene Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 101100263767 Orgyia pseudotsugata multicapsid polyhedrosis virus GP16 gene Proteins 0.000 description 3
- 101100317133 Orgyia pseudotsugata multicapsid polyhedrosis virus p91 gene Proteins 0.000 description 3
- 101150027323 PCNP gene Proteins 0.000 description 3
- 108010029182 Pectin lyase Proteins 0.000 description 3
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 3
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 3
- 102100036691 Proliferating cell nuclear antigen Human genes 0.000 description 3
- 102000001253 Protein Kinase Human genes 0.000 description 3
- 101100346651 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MSS18 gene Proteins 0.000 description 3
- 241000700584 Simplexvirus Species 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- 108020005038 Terminator Codon Proteins 0.000 description 3
- 101710183015 Trans-activating transcriptional regulatory protein Proteins 0.000 description 3
- 102000006290 Transcription Factor TFIID Human genes 0.000 description 3
- 108010083268 Transcription Factor TFIID Proteins 0.000 description 3
- 102100031027 Transcription activator BRG1 Human genes 0.000 description 3
- 101100166027 Trichoplusia ni ascovirus 2c MCP-2 gene Proteins 0.000 description 3
- 101000764204 Trieres chinensis Uncharacterized 3.3 kDa protein in rpl11-trnW intergenic region Proteins 0.000 description 3
- 108020000999 Viral RNA Proteins 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000009089 cytolysis Effects 0.000 description 3
- 238000009795 derivation Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000006386 neutralization reaction Methods 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 230000000644 propagated effect Effects 0.000 description 3
- 108060006633 protein kinase Proteins 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 230000005026 transcription initiation Effects 0.000 description 3
- 101800003158 5 kDa peptide Proteins 0.000 description 2
- 101150113556 ALK-EXO gene Proteins 0.000 description 2
- 101000748061 Acholeplasma phage L2 Uncharacterized 16.1 kDa protein Proteins 0.000 description 2
- 101000827329 Acholeplasma phage L2 Uncharacterized 26.1 kDa protein Proteins 0.000 description 2
- 102100021569 Apoptosis regulator Bcl-2 Human genes 0.000 description 2
- 241001203868 Autographa californica Species 0.000 description 2
- 101100495846 Autographa californica nuclear polyhedrosis virus CHIA gene Proteins 0.000 description 2
- 101000781183 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.4 kDa protein in IAP1-SOD intergenic region Proteins 0.000 description 2
- 108091012583 BCL2 Proteins 0.000 description 2
- 241000409811 Bombyx mori nucleopolyhedrovirus Species 0.000 description 2
- 102000005600 Cathepsins Human genes 0.000 description 2
- 101000947615 Clostridium perfringens Uncharacterized 38.4 kDa protein Proteins 0.000 description 2
- 102100033195 DNA ligase 4 Human genes 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 101150093002 EGT gene Proteins 0.000 description 2
- 101000964391 Enterococcus faecalis UPF0145 protein Proteins 0.000 description 2
- 101100066648 Escherichia phage T5 D17 gene Proteins 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 101150021185 FGF gene Proteins 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 101000748063 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 11.1 kDa protein in rep-hol intergenic region Proteins 0.000 description 2
- 101000818057 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 14.9 kDa protein in rep-hol intergenic region Proteins 0.000 description 2
- 101000927810 Homo sapiens DNA ligase 4 Proteins 0.000 description 2
- 101150118344 IAP2 gene Proteins 0.000 description 2
- 101001015100 Klebsiella pneumoniae UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase Proteins 0.000 description 2
- 101000790840 Klebsiella pneumoniae Uncharacterized 49.5 kDa protein in cps region Proteins 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 101150038414 LAP gene Proteins 0.000 description 2
- 102400000401 Latency-associated peptide Human genes 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 101150034674 ME53 gene Proteins 0.000 description 2
- 101000788492 Marchantia polymorpha Uncharacterized mitochondrial protein ymf28 Proteins 0.000 description 2
- 101150083029 ORF147 gene Proteins 0.000 description 2
- 101150077302 ORF88 gene Proteins 0.000 description 2
- 101100281854 Orgyia pseudotsugata multicapsid polyhedrosis virus GP64 gene Proteins 0.000 description 2
- 101100028042 Orgyia pseudotsugata multicapsid polyhedrosis virus OPEP-3 gene Proteins 0.000 description 2
- 101100484850 Orgyia pseudotsugata multicapsid polyhedrosis virus P15 gene Proteins 0.000 description 2
- 101150030083 PE38 gene Proteins 0.000 description 2
- 101150051210 PK2 gene Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- 101710092489 Protein kinase 2 Proteins 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- 101710086015 RNA ligase Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 108091058545 Secretory proteins Proteins 0.000 description 2
- 102000040739 Secretory proteins Human genes 0.000 description 2
- 101000992423 Severe acute respiratory syndrome coronavirus 2 Putative ORF9c protein Proteins 0.000 description 2
- 101000953979 Streptomyces lividans Uncharacterized 6.6 kDa protein Proteins 0.000 description 2
- 241000255993 Trichoplusia ni Species 0.000 description 2
- 101710172411 Uncharacterized protein ycf68 Proteins 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 230000009418 agronomic effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- ZWPRYVATYZPCDP-UHFFFAOYSA-M bis(dibutylamino)methylidene-dibutylazanium;fluoride Chemical compound [F-].CCCCN(CCCC)C(N(CCCC)CCCC)=[N+](CCCC)CCCC ZWPRYVATYZPCDP-UHFFFAOYSA-M 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000030833 cell death Effects 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 150000001945 cysteines Chemical class 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000001066 destructive effect Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000008030 elimination Effects 0.000 description 2
- 238000003379 elimination reaction Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 229940126864 fibroblast growth factor Drugs 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 239000003228 hemolysin Substances 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 101150100002 iap gene Proteins 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000011835 investigation Methods 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 230000001418 larval effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 150000003212 purines Chemical class 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- HLXHCNWEVQNNKA-UHFFFAOYSA-N 5-methoxy-2,3-dihydro-1h-inden-2-amine Chemical group COC1=CC=C2CC(N)CC2=C1 HLXHCNWEVQNNKA-UHFFFAOYSA-N 0.000 description 1
- 101150044182 8 gene Proteins 0.000 description 1
- 101150023956 ALK gene Proteins 0.000 description 1
- 101710115267 ATP synthase protein MI25 Proteins 0.000 description 1
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 1
- 101000977065 Acidithiobacillus ferridurans Uncharacterized 11.6 kDa protein in mobS 3'region Proteins 0.000 description 1
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 1
- 101800002638 Alpha-amanitin Proteins 0.000 description 1
- 102000013455 Amyloid beta-Peptides Human genes 0.000 description 1
- 108010090849 Amyloid beta-Peptides Proteins 0.000 description 1
- 241000239239 Androctonus Species 0.000 description 1
- 108700031308 Antennapedia Homeodomain Proteins 0.000 description 1
- 101100064323 Arabidopsis thaliana DTX47 gene Proteins 0.000 description 1
- 101100214862 Autographa californica nuclear polyhedrosis virus AC152 gene Proteins 0.000 description 1
- 101100171547 Autographa californica nuclear polyhedrosis virus E27 gene Proteins 0.000 description 1
- 101100394491 Autographa californica nuclear polyhedrosis virus HE65 gene Proteins 0.000 description 1
- 101100070304 Autographa californica nuclear polyhedrosis virus HELI gene Proteins 0.000 description 1
- 101100127793 Autographa californica nuclear polyhedrosis virus LEF-4 gene Proteins 0.000 description 1
- 101100135329 Autographa californica nuclear polyhedrosis virus P6.9 gene Proteins 0.000 description 1
- 101100351191 Autographa californica nuclear polyhedrosis virus PCNA gene Proteins 0.000 description 1
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 1
- 101000666833 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.8 kDa protein in FGF-VUBI intergenic region Proteins 0.000 description 1
- 101000847476 Autographa californica nuclear polyhedrosis virus Uncharacterized 54.7 kDa protein in IAP1-SOD intergenic region Proteins 0.000 description 1
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 1
- 101000977027 Azospirillum brasilense Uncharacterized protein in nodG 5'region Proteins 0.000 description 1
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 1
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 1
- 101000736075 Bacillus subtilis (strain 168) Uncharacterized protein YcbP Proteins 0.000 description 1
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 1
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 1
- 101000962005 Bacillus thuringiensis Uncharacterized 23.6 kDa protein Proteins 0.000 description 1
- 102000051819 Baculoviral IAP Repeat-Containing 3 Human genes 0.000 description 1
- 108700003785 Baculoviral IAP Repeat-Containing 3 Proteins 0.000 description 1
- 241000701412 Baculoviridae Species 0.000 description 1
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 1
- 241000701083 Bovine alphaherpesvirus 1 Species 0.000 description 1
- 101100054773 Caenorhabditis elegans act-2 gene Proteins 0.000 description 1
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 1
- 241001164374 Calyx Species 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- 101000748765 Chlorella vulgaris Uncharacterized 16.5 kDa protein in psaC-atpA intergenic region Proteins 0.000 description 1
- 241000255942 Choristoneura fumiferana Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 1
- 206010010144 Completed suicide Diseases 0.000 description 1
- 101000861180 Cupriavidus necator (strain ATCC 17699 / DSM 428 / KCTC 22496 / NCIMB 10442 / H16 / Stanier 337) Uncharacterized protein H16_B0147 Proteins 0.000 description 1
- 101000764209 Cyanophora paradoxa Uncharacterized 11.2 kDa protein in ycf23-apcF intergenic region Proteins 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 230000003682 DNA packaging effect Effects 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 101100499270 Drosophila melanogaster Diap1 gene Proteins 0.000 description 1
- 101000785191 Drosophila melanogaster Uncharacterized 50 kDa protein in type I retrotransposable element R1DM Proteins 0.000 description 1
- 101150112474 EXO gene Proteins 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 101000791598 Enterobacteria phage 82 Uncharacterized protein in rusA 5'region Proteins 0.000 description 1
- 101000747704 Enterobacteria phage N4 Uncharacterized protein Gp1 Proteins 0.000 description 1
- 241000701832 Enterobacteria phage T3 Species 0.000 description 1
- 101000861206 Enterococcus faecalis (strain ATCC 700802 / V583) Uncharacterized protein EF_A0048 Proteins 0.000 description 1
- 101900264058 Escherichia coli Beta-galactosidase Proteins 0.000 description 1
- 101000769180 Escherichia coli Uncharacterized 11.1 kDa protein Proteins 0.000 description 1
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 1
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 1
- 241001524679 Escherichia virus M13 Species 0.000 description 1
- 101710086766 FP protein Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108010011145 Fushi Tarazu Transcription Factors Proteins 0.000 description 1
- 241000255896 Galleria mellonella Species 0.000 description 1
- 241000951956 Galleria mellonella MNPV Species 0.000 description 1
- 101100272587 Gallus gallus ITA gene Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- 101000626971 Guillardia theta Uncharacterized 8.1 kDa protein Proteins 0.000 description 1
- 101001066788 Haemophilus phage HP1 (strain HP1c1) Probable portal protein Proteins 0.000 description 1
- 101000743335 Haemophilus phage HP1 (strain HP1c1) Probable terminase, endonuclease subunit Proteins 0.000 description 1
- 101000976893 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 14.1 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101000708358 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 23.3 kDa protein in lys 3'region Proteins 0.000 description 1
- 101000786921 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 26.0 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 229920002971 Heparan sulfate Polymers 0.000 description 1
- 101000748192 Herpetosiphon aurantiacus Uncharacterized 15.4 kDa protein in HgiDIIM 5'region Proteins 0.000 description 1
- 101000929495 Homo sapiens Adenosine deaminase Proteins 0.000 description 1
- 101000836540 Homo sapiens Aldo-keto reductase family 1 member B1 Proteins 0.000 description 1
- 101000771674 Homo sapiens Apolipoprotein E Proteins 0.000 description 1
- 101000959437 Homo sapiens Beta-2 adrenergic receptor Proteins 0.000 description 1
- 101000746373 Homo sapiens Granulocyte-macrophage colony-stimulating factor Proteins 0.000 description 1
- 101000820589 Homo sapiens Succinate-hydroxymethylglutarate CoA-transferase Proteins 0.000 description 1
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 201000001096 IGSF1 deficiency syndrome Diseases 0.000 description 1
- 108091029795 Intergenic region Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 1
- 101000790838 Klebsiella pneumoniae UPF0053 protein in cps region Proteins 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 101000976301 Leptospira interrogans Uncharacterized 35 kDa protein in sph 3'region Proteins 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101000788487 Marchantia polymorpha Uncharacterized mitochondrial protein ymf25 Proteins 0.000 description 1
- 101000747938 Marchantia polymorpha Uncharacterized mitochondrial protein ymf31 Proteins 0.000 description 1
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 1
- 101100446506 Mus musculus Fgf3 gene Proteins 0.000 description 1
- 101000658690 Neisseria meningitidis serogroup B Transposase for insertion sequence element IS1106 Proteins 0.000 description 1
- 101100289047 Novosphingobium sp. (strain KA1) ligU gene Proteins 0.000 description 1
- 108020003217 Nuclear RNA Proteins 0.000 description 1
- 102000043141 Nuclear RNA Human genes 0.000 description 1
- 101150089976 ORF144 gene Proteins 0.000 description 1
- 101150075249 ORF40 gene Proteins 0.000 description 1
- 101150050790 ORF49 gene Proteins 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 101150080573 ORF90 gene Proteins 0.000 description 1
- 101150034596 ORF95 gene Proteins 0.000 description 1
- 241001465800 Orgyia Species 0.000 description 1
- 101100181496 Orgyia pseudotsugata multicapsid polyhedrosis virus LEF-5 gene Proteins 0.000 description 1
- 101100181498 Orgyia pseudotsugata multicapsid polyhedrosis virus LEF-7 gene Proteins 0.000 description 1
- 101100372859 Orgyia pseudotsugata multicapsid polyhedrosis virus P25 gene Proteins 0.000 description 1
- 101100428663 Orgyia pseudotsugata multicapsid polyhedrosis virus P39 gene Proteins 0.000 description 1
- 101100463342 Orgyia pseudotsugata multicapsid polyhedrosis virus PE38 gene Proteins 0.000 description 1
- 101000770899 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 24.3 kDa protein Proteins 0.000 description 1
- 101100064055 Ostreid herpesvirus 1 (isolate France) ORF100 gene Proteins 0.000 description 1
- 101100103570 Ostreid herpesvirus 1 (isolate France) ORF123 gene Proteins 0.000 description 1
- 101150110481 PNK/PNL gene Proteins 0.000 description 1
- 101100156835 Paenarthrobacter nicotinovorans xdh gene Proteins 0.000 description 1
- 241000500437 Plutella xylostella Species 0.000 description 1
- 101710093543 Probable non-specific lipid-transfer protein Proteins 0.000 description 1
- 102000002727 Protein Tyrosine Phosphatase Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 102000052575 Proto-Oncogene Human genes 0.000 description 1
- 108700020978 Proto-Oncogene Proteins 0.000 description 1
- 101000748660 Pseudomonas savastanoi Uncharacterized 21 kDa protein in iaaL 5'region Proteins 0.000 description 1
- 241000238706 Pyemotes Species 0.000 description 1
- 241001456341 Rachiplusia ou Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241001068263 Replication competent viruses Species 0.000 description 1
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 1
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 1
- 101000757825 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc01784 Proteins 0.000 description 1
- 101000748499 Rhodobacter capsulatus Uncharacterized 104.1 kDa protein in hypE 3'region Proteins 0.000 description 1
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 1
- 101000584469 Rice tungro bacilliform virus (isolate Philippines) Protein P1 Proteins 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- RXGJTYFDKOHJHK-UHFFFAOYSA-N S-deoxo-amaninamide Natural products CCC(C)C1NC(=O)CNC(=O)C2Cc3c(SCC(NC(=O)CNC1=O)C(=O)NC(CC(=O)N)C(=O)N4CC(O)CC4C(=O)NC(C(C)C(O)CO)C(=O)N2)[nH]c5ccccc35 RXGJTYFDKOHJHK-UHFFFAOYSA-N 0.000 description 1
- 101150112782 SNF2 gene Proteins 0.000 description 1
- 101000953093 Salmonella phage P22 Uncharacterized 9.0 kDa protein in gp15-gp3 intergenic region Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 101000818096 Spirochaeta aurantia Uncharacterized 15.5 kDa protein in trpE 3'region Proteins 0.000 description 1
- 241000931755 Spodoptera exempta Species 0.000 description 1
- 241000985245 Spodoptera litura Species 0.000 description 1
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 1
- 101000766081 Streptomyces ambofaciens Uncharacterized HTH-type transcriptional regulator in unstable DNA locus Proteins 0.000 description 1
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 1
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 1
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 1
- 102100021652 Succinate-hydroxymethylglutarate CoA-transferase Human genes 0.000 description 1
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 1
- 206010042566 Superinfection Diseases 0.000 description 1
- 101000804403 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HIT-like protein Synpcc7942_1390 Proteins 0.000 description 1
- 101000750910 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HTH-type transcriptional regulator Synpcc7942_2319 Proteins 0.000 description 1
- 101000644897 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Uncharacterized protein SYNPCC7002_B0001 Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 1
- 101000768114 Triticum aestivum Uncharacterized protein ycf70 Proteins 0.000 description 1
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 1
- 241001672648 Vieira Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 241000282485 Vulpes vulpes Species 0.000 description 1
- 208000028265 X-linked central congenital hypothyroidism with late-onset testicular enlargement Diseases 0.000 description 1
- 101000916336 Xenopus laevis Transposon TX1 uncharacterized 82 kDa protein Proteins 0.000 description 1
- 101001000760 Zea mays Putative Pol polyprotein from transposon element Bs1 Proteins 0.000 description 1
- 101000678262 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 65 kDa protein Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 239000004007 alpha amanitin Substances 0.000 description 1
- CIORWBWIBBPXCG-SXZCQOKQSA-N alpha-amanitin Chemical compound O=C1N[C@@H](CC(N)=O)C(=O)N2C[C@H](O)C[C@H]2C(=O)N[C@@H]([C@@H](C)[C@@H](O)CO)C(=O)N[C@@H](C2)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@H]1C[S@@](=O)C1=C2C2=CC=C(O)C=C2N1 CIORWBWIBBPXCG-SXZCQOKQSA-N 0.000 description 1
- CIORWBWIBBPXCG-UHFFFAOYSA-N alpha-amanitin Natural products O=C1NC(CC(N)=O)C(=O)N2CC(O)CC2C(=O)NC(C(C)C(O)CO)C(=O)NC(C2)C(=O)NCC(=O)NC(C(C)CC)C(=O)NCC(=O)NC1CS(=O)C1=C2C2=CC=C(O)C=C2N1 CIORWBWIBBPXCG-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000019552 anatomical structure morphogenesis Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000005735 apoptotic response Effects 0.000 description 1
- 230000001873 bacteriocinogenic effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000000853 biopesticidal effect Effects 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000001876 chaperonelike Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000011281 clinical therapy Methods 0.000 description 1
- 238000010954 commercial manufacturing process Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 108050003126 conotoxin Proteins 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- UFJPAQSLHAGEBL-RRKCRQDMSA-N dITP Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(N=CNC2=O)=C2N=C1 UFJPAQSLHAGEBL-RRKCRQDMSA-N 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 102000043395 human ADA Human genes 0.000 description 1
- 102000053020 human ApoE Human genes 0.000 description 1
- 230000005934 immune activation Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 231100000636 lethal dose Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108091064355 mitochondrial RNA Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 238000001807 normal pulse voltammetry Methods 0.000 description 1
- 230000005937 nuclear translocation Effects 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000007505 plaque formation Effects 0.000 description 1
- 101150048568 pnl gene Proteins 0.000 description 1
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003334 potential effect Effects 0.000 description 1
- 239000013615 primer Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108020000494 protein-tyrosine phosphatase Proteins 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 108091035233 repetitive DNA sequence Proteins 0.000 description 1
- 102000053632 repetitive DNA sequence Human genes 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000011451 sequencing strategy Methods 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 241000701451 unidentified granulovirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000007484 viral process Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 229960005502 α-amanitin Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/14011—Baculoviridae
- C12N2710/14111—Nucleopolyhedrovirus, e.g. autographa californica nucleopolyhedrovirus
- C12N2710/14141—Use of virus, viral particle or viral elements as a vector
- C12N2710/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Definitions
- This invention relates to Autographa califomica nuclear polyhedrosis virus DNA sequences and particularly to the DNA sequence of the complete virus genome.
- Autographa califomica nuclear polyhedrosis virus (AcNPV) is a widely studied baculovirus which has been used to form the basis of a polypeptide expression systems (see e.g. US-P-4,745,051 and EP 0 327 626). Modified baculoviruses have also been proposed for use as viral insecticides.
- Baculoviruses are invertebrate-specific viruses with large, circular, covalently closed, double-stranded DNA genomes (Francki et al., 1991).
- early genes and early gene promoters are identified in the AcNPV C6 sequence, genes that have not been reported hitherto and which via substitution of the downstream gene, or by promoter duplication, alone, or in concert with other promoters of AcNPV, or other baculoviruses will allow the expression of foreign genes, or operons, or duplicated AcNPV genes at defined times in the baculovirus infection process.
- the genome data provides information on which specific restriction enzymes do not cut the AcNPV C6 genome.
- the data also identifies restriction enzymes that cut the sequence only once, or twice, or thrice, etc, and the location of all such sites.
- the latter sites can now be removed by deletion (for non-essential, including coding sequences), or by site directed mutage ⁇ esis (for essential, including coding sequences).
- AcNPV derivatives can be constructed that only cut the genome at defined locations (new sites) by these specific enzymes. This will allow the linearisation of the virus DNA at defined locations in order to facilitate the introduction of foreign genes.
- the new sites may be located within a reporter gene sequence for the efficient identification of recombinant expression vectors by the loss of the reporter gene function.
- Additional sequences representing these restriction sites may also be placed in flanking sequences of essential genes to improve the efficient recovery of recombinants using transfer vectors that provide both the foreign gene and the unmodified essential flanking sequences. Further, the use of a number of such enzyme sites strategically located in the virus genome, will allow the preparation of genetically stable, multiple gene expression vectors.
- the genome sequence allows for the identification of essential and non- essential genes in relation to the infection course of the virus in different types of cultured cells and host insects.
- genes that will be proven to be essential to the infection course of the virus in cultured cells and insect hosts and other genes that are non-essential to one or other or both substrates can now be specifically removed from the AcNPV genome without affecting the expression of essential, including flanking, genes, or the replication of the virus in certain cultured cells. Removal of such genes and corresponding reduction in the AcNPV genome size and hence cost to the overall transcription, translation and other processes induced by the virus, or certain other processes and structures naturally operative in the host cell, will provide a preferred expression vector system and improved virus replication.
- the modifications will allow the time when foreign gene products are made to be regulated and improvements to the amounts and quality of such products.
- removal of such genes will be to the benefit of commercial manufacturing processes and environmental safety.
- the removal of natural AcNPV genes that facilitate the persistence of AcNPV in the environment and, or that provide for the productive infection of insect larvae and, or that facilitate the transmission of infectious virus in the environment by affecting characters such as determinants of host range, cell death and larval degradation will be suitable candidate genes to remove.
- the loss of any or all such functions and the derivation of disabled virus expression vectors will prohibit the occurrence of any adverse consequences of virus escape from laboratories or manufacturing establishments, by eliminating any potential effect on natural insect populations in the environment, or the likelihood of re-acquisition of such genes and functions from natural sources.
- both ⁇ uclease and protease genes deleterious to the transcription, expression and product accumulation of foreign genes expressed by baculovirus vectors have been identified. Removal of such genes will also provide for improved expression vectors.
- sequence information allows new sites to be identified for the insertion of single or multiple gene expression cassettes composed of viral promoters, foreign gene(s) of choice, including new polyadenylation sites and transcription terminators.
- cassettes can now be positioned so that they do not affect resident genes, their promoters, terminators, polyadenylation sites, or give mRNA species that act as antisense sequences to required viral genes.
- the sites may be contiguous. Additionally, or alternatively, the sites may be non-contiguous thereby facilitating expression of foreign genes without incurring deleterious positional effects on mRNA transcription.
- the genome sequence allows genetically engineered virus insecticides to be produced by exploiting the advantages described above with regard to tailored genome size, genetic stability, multiple foreign gene expression, and by the exploitation of gene dose.
- the ability to introduce genes into proscribed sites in the AcNPV genome and derivatives without affecting resident genes thereof includes the ability to transfer from other baculoviruses and other origins individual genes, cassettes of genes, and other DNA sequences that will affect the virus host range, its transmission and stability in the environment.
- the benefits will include effects on the LD50 (lethal doses required to kill 50% of target species) and LT50 (lethal time in 50% of members of an infected host species) and other biological properties of the natural virus.
- Such sequences will include, for example, genes representing baculoviruses with alternative host ranges, including genes from viruses that have proved impossible to grow, or to clone in cultured cells.
- the AcNPV genome contains genes and sequences that alone or in concert with host factors regulate the expression of viral genes generally in a temporally controlled fashion.
- the genome sequence allows the identification of all such regulatory viral genes and sequences.
- sequence information contained in SEQ LD NO. 1 may be used in the manufacture of a range of novel polynucleotides which may be used industrially.
- the invention according to one aspect thereof provides the use of sequence information derivable from the complete genomic sequence of AcNPV in the manufacture of a polynucleotide for use in an industrially applicable process.
- the invention further provides the use of sequence information derivable from the complete genomic sequence of AcNPV in the manufacture of a polynucleotide capable of acting as a control sequence in the expression of a foreign gene in an insect or insect cell.
- sequence information is derivable solely and/or primarily from said complete genomic sequence.
- the information may be derived from sequence data present in said complete genomic sequence, but essentially absent from or present in incomplete form in previously available sequence data.
- sequence analysis of the complete genomic sequence contained in SEQ ID NO. 1 has revealed the presence of 154 open reading frames of which 91 have not hitherto been described.
- These novel open reading frames are identified in Table 1 as ORF 13, 22-26, 28-30, 32, 38, 41-46, 50-60, 62-63, 66, 68-79, 81-87, 91-92, 96-98, 101-103, 106-126, 129-130, 140-146, 148-150, 152 and 154.
- the present invention thus includes isolated polynucleotides containing a nucleotide sequence which corresponds to one of the aforementioned ORFs.
- corresponding to as used herein is meant a nucleotide sequence which is identical to the disclosed sequence or which has sufficient homology to hybridize to the aforementioned sequence under hybridization conditions corresponding to TM -19 to TM -25.
- the corresponding sequences may be at least 80%, preferably at least 90% and most preferably at least 95% homologous to the stated sequence. Desirably the degree of homology is not less than 98%),
- the invention also includes polypeptides obtainable by expressing polynucleotides corresponding to the aforementioned ORFs. Such expression may be achieved by incorporating an insert having a sequence corresponding to one of the aforementioned polynucleotides into a suitable expression vector in association with and under the control of appropriate expression control sequences.
- Information derived from the SEQ ID NO. 1 may be used to optimize polypeptide expression in expression systems based upon baculoviruses by selecting appropriate control sequences.
- the present invention further provides a method of synthesizing a polypeptide by expressing the polypeptide in an insect or cultured insect cell which has been transformed by an expression vector derived from AcNPV, the expression vector containing a coding sequence coding for the polypeptide and control sequences responsible for control of replication of the expression vector and/or transcription of the coding sequence, characterized in that the control sequences are selected on the basis of sequence information derived from SEQ ID NO. 1.
- the information derived from SEQ ID NO. 1 additionally enables the efficiency of polypeptide expression to be increased by modifying the nucleotide sequence being expressed so as to take advantage of the preferred codon usage which is characteristic of the ORFs which have been identified in SEQ ID NO. 1.
- the invention provides a method of synthesizing a polypeptide by expressing the polypeptide in an insect or cultured insect cell which has been transformed by an expression vector derived from AcNPV, the expression vector containing a coding sequence coding for the polypeptide and control sequences responsible for control of replication of the expression vector and transcription of the coding sequence, characterized in that the coding sequence is adapted by selecting codons in accordance with the preferred codon usage of AcNPV.
- Preferred codon usage differs between species and expression of foreign polypeptides can often .be hampered if codons contained in the coding sequence to be expressed correspond to less preferred codons in the expression host.
- Knowledge of the preferred condo ⁇ usage for AcNPV allows the DNA sequence of the insert being expressed to be modified so as to increase the proportion of codons which are preferred for AcNPV.
- the coding sequence should be modified (if necessary) so as to ensure that one or more (and preferably at least ten, most preferably at least 15) of the amino acids indicated below are encoded by the indicated codons:
- Val GTG A person of ordinary skill in this art could therefore employ the preferred codons for the different amino acids as described herein in order to the optimize expression of a variety of different heterologous proteins using the claimed expression vector and the claimed methods.
- the genes encoding a desired heterologous protein could be modified to include the more preferred codons (see list above) and to exclude the less preferred codons (see list below for codons to avoid).
- DNA sequences encoding different enzymes, hormones, toxins, antibodies and receptors may be modified as described herein to enhance production.
- proteins useful in agriculture proteins are modified to alter insect behavior in a desirable way), clinical therapy, and or diagnosing disease could be modified.
- these different proteins include, but are not limited to the following: hepatitis B virus core antigen, hepatitis B virus surface antigen, bovine Herpesvirus-1 glycoprotein glV, Human immunodeficiency virus type 1 (HIV-l) envelope protein gp 120, HTV-l envelope protein gp 160, HTV-l Gag protein, HTV-l Gag-pol fusion protein, HTV-l Integration protein, HTV-l Major core p24, HTV-l Nef protein, HTV-l Pol protein, HTV-l protease, HTV-l Rev protein, Human immunodeficiency virus type 2 Gag precursor protein, Human T-cell lymphotxophic virus type 1 (HTLV-1) p20E protein, HTLV-1 gp46 protein, HTLV-1 040* protein, Bacillus thuringiensis subspecies kurstaki HD-73 delta endotoxin, Bacillus thuringiensis subspecies aizawai 7.21 crystal
- the coding sequence should be modified so that the following codons are avoided (these being less preferred codons for the indicated amino acids): Amino Acid Codon(s) to be avoided
- Chaperon sequences shall be defined as a sequence encoding a protein which contains a nucleotriphosphate and which is capable of leading, escorting or "chaperoning" a different protein into the nucleus from the cytoplasm.
- open reading frame shall refer to a specific length of DNA with a methioni ⁇ e start codon and terminated by a translation stop codon.
- Predicted sequences describes a sequence of putative protein as derived from the DNA sequence in the open reading frame. Using the genetic code one of ordinary skill in this art could readily define a protein sequence corresponding to each of the 154 open reading frames presented in Table 1. Putative is defined as "assumed to exist” e.g. “encodes a putative alkaline exonuclease” (infra under the Heading "Gene functions", last para.).
- data is used to define nucleotide sequences based on computer predictions; particularly when assuming the function of putative gene product.
- a consensus sequence is defined as a sequence specific for a biological function or characteristic as determined by computer sequence analysis. Consensus sequences may also be used to define a sequence (and corresponding characteristic or function for this sequence) which is shared or found to be homologous among different species.
- DNA wobble is a term used to explain how the third nucleotide of a codon can vary or "wobble" and still encode the same amino acid.
- TTT and TTC both encode the amino acid phenylalanine and that ATT, ATC, and ATA all encode for the amino acid isoleucine.
- Protease sequence defines those amino acid sequences found on certain proteins which are known or presumed (because of a consensus sequence) to play a role in the enzymatic digestion of other proteins.
- Ligase sequences refers to an amino acid sequence that is capable of joining or ligating the ends of RNA molecules or joining the ends of DNA molecules.
- T4 DNA ligase is used to join or ligate compatible "sticky” or “blunt” ends of DNA derived after restriction enzyme digestion.
- "Sticky” and “blunt” are terms in the art to define how the ends of DNA molecules appear after restriction enzyme digestion.
- Helicase sequences are protein sequences in enzymes associated with the unfolding of DNA molecules.
- polymerase sequences refer to either RNA or DNA polymerases. These enzymes are responsible for synthesizing RNA or DNA from the appropriate template.
- Deleterious sequences refer to a sequence that can have a deleterious effect on the production or efficiency of certain proteins being produced in the host cells.
- a protease sequence might be deleterious if this portease specifically breaks down the foreign recombinant protein synthesized in the insect cell via a baculovirus expression vector.
- Enhancer sequences are DNA sequences which increase the transcription of a virus gene. For example, dot matrix analysis of the AcNPV sequence against itself and its complement revealed eight regions of direct and inverted repetitive DNA sequences (hr ⁇ -hr5). The hrs are involved in enhancing early mRNA transcription and act as origins of DNA replication (infra, first two sentences under the heading "AcNPV genomic organization and repetitive DNA”.
- disrupted, interrupted, mutated, and deleted are sometimes used interchangeably in reference to specific ORFs. It is intended that these terms refer to a condition where the encoded protein is no longer functional due to a disruption, interruption, mutation, or some other interference that prevents, shuts down, nullifies or inhibits the otherwise named function.
- SEQ ID NO. 1 was derived from the C6 clone of AcNPV
- sequence information provided according to the invention may be used to optimize expression in other baculovirus expression systems.
- published partial sequence data, restriction enzyme and hybridization analysis can be used to identify other clones and baculovirus isolates from insects which may be strains, variants or varieties of AcNPV.
- isolates include viruses obtained from Autograph califomica, Autographa gamm, Galleria mellonella, Plutella xylostella, Rachiplusia ou, Spodoptera exempta, Spodoptera litura and Trichoplusio ni.
- Such viruses are likely to possess DNA sequences, genes, origins and replication, transcriptional promoters, terminators and regulatory factors in common with those of AcNPV C6 and such entities are likely to be involved in directing the course of infection, multiplication and morphogenesis of these viruses as well as their interactions with hosts, host cells and components thereof. Accordingly, the information provided according to the invention of SEQ ID No. 1 may be used in the development of expression systems utilizing these alternative viruses and virus strains.
- the complete nucleotide sequence of the genome of clone 6 of the baculovirus Autographa califomica nuclear polyhedrosis virus (AcNPV) has been determined.
- the molecule comprises 133,894 base-pairs and has an overall A + T content of 59%.
- Our analysis suggests that the virus enclodes some 154 methionine-initiated, and potentially expressed, open reading frames (ORFs) of 150 nucleotides or greater. These ORFs are distributed evenly throughout the virus genome on either strand.
- the ORFs are arranged as adjacent, non-overlapping reading frames, separated by short intergenic regions.
- Figure 1 A physical map and summary of coding strategy of the
- Figure 2. A dot matrix analysis of AcNPV genomic DNA.
- Figure 3. A circular map of the AcNPV genome.
- Figures 4 - 14. A construct for modififying the following respective genes to identify which genes are dispensable (non-essential) and which genes are indispensable (essential) for viral replication in cell culture or insect larvae.
- Figure 15 Single restriction enzyme site within the AcNPV EGT gene.
- AcNPV genomic DNA was prepared as described by Possee (1986).
- the DNA was digested with an appropriate restriction endonuclease (Bam ⁇ I, Bg ⁇ i, EcoBl, HindTH, Pstl, Sstl, Sst ⁇ ).
- the derived DNA fragments were inserted into pUC18/19, pUC118/119 or pT7T318/19 vectors using standard protocols (Sambrook et al., 1989).
- plasmids containing larger regions of virus DNA were digested with a restriction enzyme to release the insert, the virus DNA purified using agarose gel electrophoresis and then digested with another restriction enzyme. These smaller DNA fragments were inserted into plasmid vectors to provide materials more convenient for DNA sequencing.
- Reaction mixtures contained the dGTP analogue, 7-deaza dGTP, in lieu of dGTP in order to reduce sequence compressions.
- dITP was substituted for dGTP in the sequencing reactions.
- the M13 primer (5' GTAAAACGACGGCCAGT) was used to sequence the ends of each virus DNA fragment.
- Oligonucleotide primers prepared using an Applied Biosystems Instruments synthesizer (ABI, Model 380B, Warrington, UK), were employed to obtain the internal sequences of the viral fragments. Where appropriate, double-stranded DNA templates were used to complete regions of the AcNPV sequence not analysed as single-stranded DNA.
- An ABI automated sequencer (model 370A) was also used on occasion. Using the established nomenclature for describing (by rank of size) the AcNPV restriction endonuclease fragments (e.g., A, B, C, etc.,), the following cloned virus DNA fragments were completely sequenced: BamHl-D, -E and -G; BgHl-G', HinaH-C to -K, -O to -S, -U, -W and -X; PsrI-J to -M, Sstl-F to -H.
- the AcNPV restriction endonuclease fragments e.g., A, B, C, etc.
- Partially sequenced fragments included: BamHI-E; BgU -E and -H; HindHl-L; Pstl-B and -C; Sstl-O and Ssf ⁇ -I. All the DNA sequences between adjoining virus DNA fragments were determined using appropriate subclones spanning the respective junctions.
- the DNA sequences of the AcNPV C6 homologous region (hr) 1, .EcoBI-I and -R fragments have been reported (Possee et al., 1991).
- the remaining sequence of this AcNPV clone was determined from a data set comprising approximately 106 nucleotides.
- the complete AcNPV genomic sequence has been determined to consist of 133,894 base-pairs (bp) and has an A+T content of 59%.
- the distributions of purines and A+T nucleotides for the plus strand (+ strand; see convention established by Vlak and Smith, 1982) throughout a linearized representation of the circular AcNPV genome is shown in Fig. 1, using a moving window of 250 nucleotides.
- FIG. 1 A physical map of the genome was derived from the sequence data and is also illustrated in Fig. 1. This shows the arrangement of some of the common restriction enzyme sites frequently used to map the virus DNA (Ec ⁇ Rl, HindUL, Pstl, Sstl, BgiU, Xhol). Although circular, the map is presented with the first JEcoRI site of ⁇ rl as the left end of the genome.
- the virus DNA fragments shown in Fig. 1 are labelled alphabetically, in decreasing order of size (Vlak and Smith, 1982).
- a small fragment of 38 nucleotides is present between the HindUI-L and -M fragments and a 12 nucleotide fragment between the Hind ⁇ I-C and -W fragments (see Lu and Carstens, 1991 for the data on the clone HR3).
- the only exceptions to labelling fragments uniquely according to their size are the HindUl-Al (15,293 bp) and -A2 (7,576 bp) fragments. These are designated Al and A2 in Fig. 1 solely for convenience of comparison with previously published data.
- the Sstl map is modified to interchange the SsrI-A and -B fragments and the BgUL map is modified to interchange the B ⁇ ZII-G and -H fragments.
- the Ar4c represents an imperfect copy of the typical AcNPV 30 bp palindrome since there is a base change that mutates to AAATTC the characteristic JBCORI site (GAATTC) found in the centre of all other AcNPV hr palindromes (Table 2).
- Fig. 1 shows the positions (black boxes) of 337 open reading frames (ORFs) that are initiated with a methionine codon (vertical bars) and which could encode polypeptides of at least 50 amino acids.
- ORFs open reading frames
- This strategy of analysis does not identify gene products that may be smaller than 50 amino acids, or products that are generated by removal of introns from primary mRNA transcripts representing larger regions of the genome.
- ORFs open reading frames
- ORF 1 encodes a virus protein tyrosine/serine phosphatase (FTP) previously identified by Kim and Weaver (1993).
- FTP virus protein tyrosine/serine phosphatase
- Table 1 provides a more detailed summary of the information concerning the selected ORFs.
- the left end of each ORF identified in Table 1 (column Left) represents the site of either the translation initiation or termination codon, as determined by the orientation of the ORF.
- the right end of each ORF (Table 1, column Right) indicates the respective translation termination or initiation codon.
- the .direction of transcription (Table 1, column D), relative to that of the polyhedrin gene, is indicated by an arrow.
- the predicted number of amino acids (Table 1, column aa) per methionine initiated polypeptide derived from the ORF, and the M r of that polypeptide are also given.
- ORF128, Fig. 1 the large ORF encoded entirely within the region of gp67 (ORF128, Fig. 1), but on the opposite strand, was excluded from our final dataset.
- ORF100 which encodes the basic DNA binding protein, p6.9, of AcNPV (Wilson, et al., 1987), was included in our final dataset. As a consequence the two similar sized ORFs that overlap ORFIOO were not. Further analyses of the selected and non-selected ORFs will determine whether these assumptions are correct.
- ORF6 (lef-2) starts within the 3' region of ORF5.
- ORF14 (lef-1) overlaps the start of ORF13.
- ORF25 in Table 1 was recorded as 2 smaller ORFs by the same authors. In the vicinity of residue 7,497 there are 4 extra nucleotides compared to the previous published AcNPV C6 sequence data (Possee et al., 1991). This causes a frameshift in the coding region and results in an extension of a predicted protein, PKl (ORF10), from 196 to 272 amino acids.
- PKl predicted protein
- hrl Dot matrix analysis of the AcNPV sequence against itself and its complement revealed 8 regions of direct and inverted repetitive DNA (Fig. 2, identified as hrl, Aria, hrl, hrZ, Ar4a, hr4b, ⁇ r4c, hr ⁇ ).
- the hr regions are involved in enhancing early mRNA transcription and as origins of DNA replication (Pearson et ⁇ l., 1992; .Leisy.and Rohrmann, 1993; Kool et ⁇ l., 1993a,b).
- Other regions of DNA sequence were identified that have direct or inverted repetitive DNA that meet the minimal 21/24 bp matching criteria. The significance of these sequences is unknown.
- Table 2 is listed a number of the larger, non- ⁇ r inverted repeats that could in single-stranded forms produce hairpin structures. These may be relevant to the secondary structure of mRNA species and affect the transcriptional or translational efficiencies of a particular ORF. In this regard, it is noted that most of these sequences occur within ORFs, rather than in intergenic sequences (Table 2). Their presence may be solely a consequence of the encoded amino acid sequence and the codons used. However, of particular note is the palindromic sequence found within the 25K gene (FP-protein; ORF61) and its similarity to the hr palindromic sequences (see Table 2).
- RNA polymerase II RNA polymerase II
- AGT first potential translation start codon
- TATA boxes shown in Table 3 represents a sampling of several of the core DNA elements that are recognised to bind transcription factors (TFIID and TFUD-like proteins) (Ghosh, 1992).
- TFIID transcription factors
- TFIID transcription factors
- One general, loosely-defined consensus for the TFIID binding site is TATA(A/T)A(A/T) (Nikolov et al., 1992).
- the patterns that were employed were selected to limit the number of matches obtained when only TATA was used as the search motif. In the TATA motif search it was observed that the two patterns that favoured the A residue at position 6 were preferred over the third pattern (TA AAT, see Table 3).
- the TATAAA motif occurs in 46% of the cases, the TATATA motif in 34%, and the TATAAT motif in 19%.
- the CAGT motif is not always found at the start site of AcNPV early mRNA species. It should also be noted that in identifying possible RNA pol II promoter sites, we only considered the relative positions of the TATA box and CAGT motif (i.e., a TATA box 5' to a CAGT motif within the 5' leader sequence that was analysed, see above). Generally, however, in eukaryotes the TATA box motif is within 20 to 40 nucleotides of the mRNA cap site (Roeder, 1991; Zawel and Reinberg, 1992).
- AcNPV late genes are transcribed from a consensus late promoter transcription start signal (TAAG; Blissard and Rohrmann, 1990).
- the TAAG motif shows a dramatic difference in occurrence within the leader sequences of the selected ORFs (71 ORFs, 46%, Tables 1, 3) compared to the non- selected ORFs (11 ORFs, 6%; Tables 1, 3).
- A- T rich regions flank AcNPV ORFs (Kuzio et al., 1984). While the nucleotide composition of the genome is 59% A+T, A+T rich regions are not uniformly ( randomly) distributed.
- Fig. 1 shows several regions of A+T composition th approaches 85% when measured with a 250 nucleotide moving window.
- Althoug A+T rich regions often flank AcNPV genes this characteristic is not absolute
- the region 5' to the viral DNA polymerase (ORF65) is not especiall A+T rich.
- the TAAG motif occurs less frequently than would b expected for a random sequence.
- GAAT a sequence of similar composition, GAAT, occurs 574 times on the strand and 595 times on the — strand.
- th expected frequency of a sequence conforming to the composition (A2TG) in 133,894 bp genome of the base composition of AcNPV and involving randoml distributed bases is 705 occurrences per strand.
- a frequency distribution profile of the nucleotides surrounding the start codon of the 154 selected ORFs is shown in Table 4.
- the dominance of an A residue at the -3 and perhaps -2 positions relative to the A of the ATG translation start sites in the corresponding DNA is the only significant characteristic of the selected ORFs. G at -3 is not favoured in the selected ORFs.
- ORFs in AcNPV initiate translation at an ATG downstream of an in-frame ATG in the transcribed mRNA (Table 1, column K, identified as "2"). These are gp67 (ORF128) and PCNA (ORF49) (O'Reilly et al., 1989; Whitford et al, 1989).
- gp67 ORF1278
- PCNA ORF49
- the amino acids and predicted M r of the selected ORFs are based on the calculations for the largest potential ORF initiated with a methionine. This assumption over-estimates the size of the primary translation products for gp67 and PCNA, and for any other product for which translation is initiated at a downstream in-frame ATG.
- mini-cistrons There are 15 short ORFs (mini-cistrons) that are located immediately upstream (within 80 nucleotides) of the translation start site of the selected ORFs. All these mini-cistrons have ATG flanking sequences that conform to Kozak's rules. These are identified as "! in Table 1, column K. For mini-cistrons that are out-of-frame with respect to the larger ORF, a termination codon occurs either upstream of the selected ORF, or within a short distance into its coding region. Mini-cistrons have been reported in the 5' leaders of other baculovirus genes (Tomalski et a/., 1988; Blissard and Rohrmann, 1989) and may have regulatory roles in the translation of mRNA species.
- codons that are used (Table 5), for example AGG and CGG (arginine), GGG (glycine), CTA, CTC, CTT (leucine) which are each used at less than half of the frequencies that may be expected if all the possible codons were utilized equally. While some codons appear to be discriminated against in the selected ORFs, others appear to be favoured (Table 5), for example CAA (glutamine), GAA (glutamic), GGC (glycine), ATT (isoleucine), TTG (leucine), and AAA (lysine). To what extent codon bias affects the expression level of AcNPV genes, or foreign genes expressed from AcNPV- derived expression vectors, remains to be determined.
- the predominant translation termination codon utilized by the selected ORFs is TAA. It terminates 117 of the 154 ORFs (76%, Table 5).
- CpGV LAP Cydi ⁇ pomonell ⁇ granulosis virus
- AcNPV encodes a gene with identity to the acidic and basic fibroblast growth factors (FGFs), also known as heparin binding growth factors (HBGF, reviewed by Burgess and Maciag, 1989; Klagsbrun and D'Armore, 1991).
- FGFs acidic and basic fibroblast growth factors
- HBGF heparin binding growth factors
- the AcNPV FGF- like gene product shows c ⁇ . 35% identity (75% similarity) with known members of the FGF superfamily.
- GTAs Global transactivators
- D. mel ⁇ nog ⁇ ster brahma gene is encoded by a 1638 codon ORF (Tamkun et al., 1992) while the yeast SNF2 gene contains an ORF of 1703 codons (Laurent era/., 1991).
- PNK/PNL encodes a protein that may have multiple functions.
- the amino terminal portion is strongly related to T4 RNA ligase (31% identity, 72% similarity) while the carboxy terminal half of this protein is related to T4 polynucleotide kinase (26% identity, 66% similarity).
- AcNPV encodes a chitinase (ORF126) that resembles those of other organisms, most notably Serr ⁇ ti ⁇ m ⁇ rcescens (57% identity; 88% similarity; Jones et ⁇ l., 1986). Analyses of the function of the viral chitinase indicates that it has a role in the liquefaction of infected larvae (R. Hawtin and R.D. Possee, manuscript in preparation).
- AcNPV also encodes a putative alkaline exonuclease (ORF133).
- ORF133 has 53% identity with its Orgyi ⁇ pseudotsug ⁇ t ⁇ NPV (OpNPV) homologue (Gombart etal., 1989).
- RNA binding motifs As part of our search for potential virus-encoded RNA polymerase subunits, we searched for DNA binding motifs. A sample of the motifs used for the searches are shown in Table 3. They include zinc fingers (Table 1, Dom column, “Z”), leucine zippers (Table 1, Dom column, “L”), nucleoside triphosphate binding domains (Table 1, Dom column, “NTP”) and nuclear translocation signals (Table 1, Dom column, “NTS”).
- Zinc fingers were found in two potential apoptosis inhibitory proteins IAPl (ORF27) and IAP2 (ORF71) (Table 1). Zinc fingers were also found in the early genes IE-1 (ORF147), ME53 (ORF139) and PE38 (ORF153). The zinc finger suggested to be in cg30 was not identified by our analysis. However, the leucine zipper in the cg30 protein (ORF88) was identified. Leucine zippers were found in 7 other potential polypeptides, including the calyx protein, pp34 (Table 1).
- NTP binding motif was identified in 4 ORFs, 3 of which are known as late enhancing factors (lefs, Table 1).
- the fourth protein was PNK/PNL (ORF86).
- searches with a simplified motif for the ATP-binding site in protein kinases would not have found matches in either PKl (ORF10), or PK2 (ORF123), both of which have extensive overall identity with known protein kinases.
- PKl lacks a consensus ATP-binding motif, having IxGxxG at the ATP-binding site, while PK2 completely lacks this N- terminal domain.
- NTS motifs were found in 12 of the selected ORFs.
- Known nuclear localising proteins that have an NTS include 39K, DNA polymerase, and p6.9.
- No NTS was found for the plO protein, which is the component of fibrous bodies present in the nuclei of AcNPV infected cells. It is possible that this and other viral proteins enter the nucleus using an alternative pathway, or are chaperoned by a protein containing an NTS. None of the AcNPV proteins that are known to be solely cytoplasmic had a predicted NTS.
- the cDNA sequence information for A. califomica can be used to design a vector which is capable of optimally expressing a desired protein product (called a "designer vector").
- a design vector An investigator of ordinary skill in this art would analyze a variety of different factors prior to deciding on which genetic elements should be included in a specific designer vector. For example, an investigator might study the following factors before designing a vector; the protein to be synthetically produced, the host cells to be used, desired temporal timing for protein production, available insertion sites for the non-natural promoters, any known deleterious sequences or proteases that could reduce the amount of protein being produced, etc.
- the designer vector can include a single promoter, multiple promoters, tandem promoters, combinations of synthetically constructed promoters, natural promoters and derivatives thereof.
- the choice of promoters depends on several factors and is usually performed on a vector to vector basis (case by case basis). Additionally, many different genetic elements can be included in the designer vector and deciding which to include or exclude depends on the desired protein to be recombina ⁇ tly produced in the baculovirus expression vector system.
- a vector can be designed and constructed to optimize the isolation and recovery of the desired protein.
- the vector can be designed to include specifically identified secretion sequences determined from the cDNA sequence data.
- califomica cDNA sequence information locations of transcription and translation signal sequences can be determined. Additionally, specific flanking sequences near the ATG sequence of the open reading frame can be identified and then used in order to optimally transcribe the ORF.
- the A. califomica cDNA sequence information can be used to identify new genes. Once these new genes are identified, their promoters (early, late, immediate early or immediate late) may then be obtained and used in vectors. The new promoters from these new late genes may then be used to drive the expression of desired genes more efficiently and effectively when compared to the polyhedrin.
- essential and non-essential gene regions can be identified.
- essential and non-essential genes refer to the virus replication in cell culture (e.g. Spodoptera frugiperda cells).
- ORFs 126 chitinase
- 127 cathespin
- ORFs 126 and 127 have been shown to be non-essential genes.
- these two gene could be eliminated from the A. califo ica sequence and not affect the nature of the sequence. Elimination of these two non-essential genes could be performed by standard protocols known to those skilled in the art.
- the rationale for identifying non-essential genes is to reduce the genome size to smaller and more functional pieces in order to create a more effective, and environmentally acceptable pesticide or in order to create a more effective vector.
- regions that are essential for enabling a virus to live in an insect cell can be identified. Once these essential regions are identified, the essential sequence can be used to produce a virus that will not propagate in live insects. One use of such an environmentally safe virus would be used as a selective pesticide.
- the A califomica cDNA sequences claimed in this invention can be used to design a plasmid vector capable of optimizing expression of the desired protein.
- One way in which this plasmid vector can be tailored to more effectively and efficiently produce the desired protein of choice is to optimize it for the particular host.
- SF9 cells are the optimal cells of choice for production of desired proteins in the baculoviral expression vector system.
- a designer vector as in Example 11 above can be constructed for optimal expression of the desired protein in the SF9 cells by deleting selected deleterious sequences and/or providing enhancer sequences.
- the A. califomica cDNA sequences claimed in this invention can be used to design a complete virus which is specifically constructed to contain specific and unique elements which will enhance the infectivity of this virus in a particular insect cell.
- a viral particle can -be designed to infect and kill the insect at an early stage.
- the claimed sequence can also be used to produce a virus capable of infecting larvae and not adult insects.
- An additional embodiment of this invention is to use the claimed A. califomica cDNA sequence to tailor or design a virus which is capable of infecting only specific insects, thereby constructing a very host specific virus.
- a self destructive mechanism may be included in the viral particle. This mechanism can be designed such that once the viral particle has killed the host specific insect, the virus destroys itself via a time, chemical, or enzymatic attack. This self destructive mechanism will effectively eliminate any residual virus and therefore produce a more environmentally acceptable pesticide.
- a sequence known to trigger lysis may be inserted adjacent to a late or early promoter.
- the availability of the complete AcNPV sequence and subsequent experimental data will allow the identification of those virus genes with roles in determining those insect species which can be infected with the virus.
- the virus could be modified to limit infection to the target pest species, while leaving other species unaffected.
- baculoviruses may be engineered to expand their host range to include several pest species.
- AcNPV has a wide host range in comparison to other baculoviruses and therefore may be a source of "host range genes" which can be added to these other baculoviruses.
- Certain proteins are naturally expressed in A. califomica (for example, heparin binding factor).
- the cDNA sequence information of the claimed invention can be used to enhance or increase production of the proteins that are naturally expressed in califomica, for example, by inserting additional promoter sequences and/or by deleting certain sequences deleterious to the production of the desired protein.
- the deletion of the annihilator gene (ORF 135) from the virus results in a phenotype in which virus-infected cells die through a process of apoptosis or early cell death. In effect, the cell commits suicide to prevent replication of the virus.
- ORF 135 The deletion of the annihilator gene (ORF 135) from the virus results in a phenotype in which virus-infected cells die through a process of apoptosis or early cell death. In effect, the cell commits suicide to prevent replication of the virus.
- ORF 135 The deletion of the annihilator gene (ORF 135) from the virus results in a phenotype in which virus-infected cells die through a process of apoptosis or early cell death. In effect, the cell commits suicide to prevent replication of the virus.
- other genes could be identified with a similar function to the annihilator gene, i.e. preventing the cell from undergoing an apoptotic response. These genes would
- Fig. 1 a linear representation of the map is shown. Since the virus genome is circular, a more conventional map for the AcNPV genome is given in Fig. 3. In this map the identified genes (hatched arrows), and unassigned selected ORFs (open arrows) are shown as well as their orientations. Also indicated in Fig. 3 are the sites of Ar sequences and insertion (IS) and retroposon sequences (RP). This circular map includes the revised Eco ⁇ l (outer ring) and HindUL (inner ring) fragment lengths of AcNPV C6.
- ORFs were identified within the virus genome that could potentially encode proteins of greater than 50 amino acids. This selection allowed inclusion of the 55 amino acid, arginine-rich p6.9 protein (basic protein, Wilson et al., 1987). It disregards smaller ORFs, some of which may encode proteins or peptides that are made during the virus infection process.
- the 154 ORFs were selected on the basis of their possession of a methionine codon and the absence of a larger, overlapping ORF. Again these assumptions may prove to be incorrect in some cases (e.g., where a spliced mRNA is involved).
- the number of gene products encoded by the AcNPV genome may be larger or smaller than 154, depending on the extent that the assumptions made in these analyses prove to be correct.
- other strains of the virus may include additional sequences (insertions, or ORFs), or lack sequences by comparison to those in the C6 virus. Since it is valuable to have a reference point for comparison purposes, it is suggested that the AcNPV C6 ORF numbering nomenclature is adopted pro temporis and until virus gene functions are described for the particular ORFs.
- the complete AcNPV sequence was analysed using a neutral-net ORF identification programme, GRATL (Uberbacher and Mural, 1991), in order to predict potential protein coding regions.
- GRAIL was originally designed as a programme for identifying coding exons in human and other DNA sequences.
- the GRATL coding recognition module incorporates seven sensor algorithms. Each component of the module provides an indication of the coding potential of the DNA sequence. The various sensor outputs are integrated using a neutral network which also predicts the locations of the coding regions.
- the system has been demonstrated to be effective in the identification of 90% of exons over 100 bases long in human DNA (Uberbacher and Mural, 1991). In part this success rate depends on the G+C content of the DNA.
- Coding regions are recognised less easily in DNA sequences with a lower G+C than A+T content.
- the G+C content of the AcNPV genome is only 41%, so the coding regions predicted from the GRATL an lysis must be treated with some caution.
- the candidate ORFs that were identified by GRATL were rated as excellent, good, marginal or null (Table 1).
- Most of the AcNPV genes which have been assigned functions gave excellent or good ratings using this method. The most notable exceptions were the protein tyrosine phosphatase (Kim and Weaver, 1993), p6.9 (Wilson et al., 1987) and conotoxin (Eldridge et al., 1992).
- GRAIL provided a complementary analysis of the likely coding potential of the AcNPV genome. The value is confirmed by the fact that GRAIL predicted 84% of the 154 selected ORFs (Table 1), whereas only 4% of the 183 non-selected ORFs were identified by GRATL as having potential protein coding capacity.
- TATA boxes TFITD binding sites
- CAGT possible mRNA transcription start sites
- the CAGT motif is associated with many baculovirus early gene promoters and is probably a good indicator of whether or not a virus gene is transcribed in the early phase of the replication cycle.
- the TATA boxes are more problematic, in part due to the high A+T content of the AcNPV genome and its intergenic regions. More than one TATA box was present upstream of many of the ORFs.
- Table 3 we located TATA boxes upstream of 40% of the selected ORFs. Of the 3 TATA box patterns utilised to identify possible TFIID-type binding motifs (Table 3),. the TATAAA motif, which is the preferred TFIID binding site, was the most frequent in the selected ORFs identified to be early genes.
- RNA pol II promoter within 160 nucleotides of the ATG codon of the respective ORF.
- the known AcNPV early genes identified by this procedure include: ME53 (ORF139), IE-1 (ORF147), IE-N (ORF151), and PE38 (ORF153).
- the presence of a TATA motif does not prove that it is used in early transcription by RNA pol EL This can only be determined by experimentation.
- lef-3 has consensus TATA and CAGT motifs in its 5' leader, but no evidence has been reported that these are utilised in early mRNA synthesis (Li et al., 1993).
- the polyhedrin gene has an RNA pol II motif within its promoter region.
- CGTGC motif Alternative transcription start sites, initiating from a CGTGC motif, have been identified in some AcNPV early gene promoters.
- the CGTGC motif is utilised as early start sites for pl43 (Lu and Carstens, 1991), DNA polymerase (Tomalski et al., 1988) and p47 (Carstens et al., 1993).
- pl43 Long and Carstens, 1991
- DNA polymerase Tomalski et al., 1988
- p47 Carstens et al., 1993.
- this motif is involved in the expression of the AcNPV delayed-early genes and may be a site of recognition by virus-encoded, trans-activating proteins.
- the CGTGC motif is broadly similar to sequences found in AcNPV Ar regions, i.e., TYC(A/T)(A/T)A(AT)CGXGTRA (where Y is a pyrimidine, R a purine and X any nucleotide).
- the CGTGC motif is evenly distributed between the selected ORFs and the non-selected ORFs, suggesting that the definition of this motif is not refined enough toie of predictive value. If it is important, its placement may not be confined to the immediate 5' leader sequence of a neighbouring gene.
- the late and very late transcription start sites involve a TAAG motif (Blissard and Rohrmann, 1990).
- TAAG rather than the canonical ATAAG or RTAAG sequences to search for ORFs that might be transcribed late in infection in an endeavour to maximise the chance of finding matches.
- the 46% of the selected ORFs that are identified as probable late/very late genes may under ⁇ estimate such genes.
- cg30 ORF88
- initiates from the sequence ATTAG Wang and Miller 1989.
- the late gene p74 (ORF138) initiates transcription at the sequence TATTG (Kuzio et al, 1989) and p47 (ORF40) has a late transcription start site GTAAAAC (Carstens et al., 1993).
- a search for similar matches to the start site used in p47 revealed a good match at nucleotide 66,740 in the coding region for gp41 (ORF80).
- an ATAAG motif is present 145 nucleotides upstream of this site.
- codon usage table for the 154 selected ORFs presented in this study (Table 5). There appears to be some codon bias.
- the codon usage bias shown by the AcNPV ORFs may reflect some state of the tRNAs available to the virus during the infection process. However although the sample base is low, so far we have not been able to detect a differential codon bias between early and late expressed genes.
- C ⁇ s-acting elements (hrs) involved in the origins of AcNPV DNA replication have been shown to be A+T rich.
- OpNPV appears to have at least one origin that is slightly G+C rich, but with a neutral purine composition, i.e., different from the hrs of AcNPV, or the transcription enhancer regions found within OpNPV (Pearson et al, 1993).
- the region in AcNPV homologous to the OpNPV origin of replication lies within ORF13 and ORF14 (lef-1).
- Choristoneura fumiferana NPV CfNPV
- Bombyx mori NPV BmNPV
- the Ar sites of baculoviruses may be active in inter- or intra-molecular recombination. If recombination was involved in the one or other inversion, how this occurred is not certain since there is no obvious relationship between the left and right arms of the second inverted region in
- OpNPV the corresponding regions of AcNPV are A+T rich. This suggests that an intramolecular inversion may have taken place in OpNPV. However, a detailed analysis of this region in that virus has yet to be undertaken.
- the Ar regions of AcNPV have been implicated in replication of the AcNPV genomic DNA and may act as origins of replication (Pearson et al., 1992; Leisy and Rohrmann, 1993; Kool et al., 1993a,b). Furthermore, recent studies have identified regions of the AcNPV genome that encode products that act on the origins of replication (Kool et al., 1994).
- the AcNPV + strand G-rich sequence at position 78,300 of AcNPV shows no overall bias in A+T content but has a pronounced spike with respect to total purine composition (ca. 78%, Fig. 1).
- Purine-rich tracts can potentially form an intrastrand triple-helix and tetrads.
- Triple helical DNA has been implicated as an origin of replication for some plasmids, as well as having other potential regulatory functions (Caddie et al., 1990).
- the only other region of elevated purine composition in AcNPV occurs within the coding region of ORF66 (ca. 68% purines).
- the purine rich region within ORF66 is also A+T rich, thus A residues- contribute highly to the purine composition of the + strand.
- the CpGV IAP gene provides the 3 ⁇ k gene function in AcNPV 35k-negative mutants, thereby preventing the annihilator phenotype of the mutant (Crook et al., 1993).
- the CpGV IAP provides the 3 ⁇ k gene function in AcNPV 35k-negative mutants, thereby preventing the annihilator phenotype of the mutant (Crook et al., 1993).
- there is no sequence identity between AcNPV 35k and CpGV IAP In view of the structural homologies between the CpGV IAP and the AcNPV IAPl and IAP2 genes, the roles and functions of these AcNPV genes warrant further investigation. It has been shown that the 35k-negative AcNPV mutant, while unable to replicate efficiently in S. frugiperda cells in culture, or in whole larvae, can be propagated in T. ni cells, or insects.
- the AcNPV IAPl and IAP2 genes may prevent apoptosis in AcNPV infections of other cell types or larval species. Further, it has been shown that over-expression of a human inhibitor of apoptosis (BCL2) in S. frugiperda cells (Alnemri et at., 1992), using an AcNPV expression vector, results in the protection of the cells against apoptosis. These recombinant virus infected cells have an extended survival time and do no . show the degradation of host cell DNA that is evident in cells infected with wild- type AcNPV. It is not known if over-expression of the AcNPV IAP genes results in extended survival of virus-infected cells. The AcNPV LAP genes do not share any structural similarity with BCL2, or any other known IAP gene. However, the viral LAP genes are similar to certain DNA binding proteins by the possession of 3 copies of a zinc finger motif.
- AcNPV encodes a gene with identity to the FGFs and HBGF family of growth factors.
- Two conserved cysteines have been identified in all the human FGFs sequenced to-date. These are Cys31 and Cys98 (relative to human acidic FGF). These cysteines have been implicated in intramolecular disulfide bond formation (Burgess and Maciag, 1989). The N-terminal cysteine is lacking from the putative AcNPV FGF.
- site-directed mutagenesis of cDNA clones has implicated Lysl33 in heparin binding (Burgess and Maciag, 1989).
- the AcNPV FGF has an arginine at this position.
- Hbg3 This substitution of one basic residue for another also occurs in the int-2 proto-oncogene precursor Hbg3.
- Free heparin is known to inhibit the growth of herpes simplex viruses (Nahmias and Kilbrick, 1964). More recently, it has been shown that heparin binds to HSV-1 virions via the glycoprotein gC (WuDunn and Spear, 1989; Herold et al., 1991) and prevents their adsorption to heparin sulphate moieties resident on cell surface proteogl cans. Heparin similarly inhibits plaque formation of pseudorabies virus by binding to glycoprotein gELI (Mettenleiter et al., 1990).
- heparin binding factor by the baculovirus could be a method to complex free heparin (or heparin-related compounds) thereby facilitating virus spread within the host.
- the virus FGF has a signal peptide sequence at its amino terminal sequence which may facilitate secretion from virus-infected cells.
- the GTAs are non-DNA binding proteins thought to have a role in the regulation of homeotic genes (Tamkun et al., 1992).
- Homeotic genes are involved in the expression of a large group of other genes that have been implicated in directed development and growth of an organism (McGinnis et al., 1984a,b; Scott and Weiner, 1984; Levine and Hoey, 1988; Hayashi and Scott, 1990).
- the AcNPV ORF42 has homology with the GTAs of D. mel ⁇ nog ⁇ ster (Tamkun etal., 1992) and yeast (Laurent et al., 1991).
- the AcNPV GTA-like protein does not have either the early CAGT, or late TAAG transcription initiation sites, so it is difficult to predict when it may be expressed in virus-infected cells. Transcriptional analysis is required to determine if and when it is synthesized.
- a viral GTA might be involved in regulating a number of genes involved in viral processes, such as late gene transactivation. It is also conceivable that the AcNPV GTA-like gene acts as a repressor to inhibit host gene expression.
- RNA polymerase has at least 8 subunits with apparent sizes of 95, 76, 50, 47.5, 40, . 33.5, 27.5, and 26 kDa (Yang et ⁇ l., 1991). These subunits are believed to be distinct from host encoded RNA polymerase subunits.
- the level of processing of viral RNA polymerase subunits i.e., cleavage of primary products phosphorylation
- ORF144 encodes a 33.5 kDa peptide that has similarity to the yeast MSS18 protein (Seraphin et al., 1988). MSS18 is known to be involved with yeast mitochondrial RNA splicing. Also, ORF124 encodes a 28.5 kDa peptide with similarity to a plasmid copy number protein from Clostridium perfringens (Gamier and Cole, 1988).
- lefs late enhancing factors
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 27: 22600 > 23458 and named "Inhibitor of Apoptosis-Like Gene 1" (IAPl).
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- the cassette was chosen to derive an in-frame fusion between the IAPl and beta-galactosidase coding regions.
- the plasmid was designated pUC118.IAPl.lacZ. This was used to cotransfect Spodoprera frugiperda cells with infectious AcMNPV C6 DNA to produce recombinant virus with a copy of the beta- galactosidase gene in frame with the IAPl, this disrupting IAPl function.
- the results from this Example demonstrated that the recombinant virus (AcIAPl.lacZ) replicated normally in S. frugiperda cells and Trichoplusia ni insect larvae.
- ORF 30 24315 ⁇ 25704: Haemolysin Secretory Protein (HSP)
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 30: 24315 ⁇ 25704 and named "Haemolysin Secretory Protein” (HSP).
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- ORF 32 27041 ⁇ 27584: Fibroblast Growth Factor (FGF)
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 32: 27041 ⁇ 27584 and named "Fibroblast Growth Factor: (FGF).
- FGF Fibroblast Growth Factor
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- ORF 71 61016 > 61763: Inhibitor of Apoptosis-Like Gene 2 (IAP2)
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 71: 61016 > 61763 and named "Inhibitor of Apoptosis-Like Gene 2" (IAP2).
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- an AcMNPV DNA fragment coordinates 60448 (Sad Site) to 63194 (Sad site) was subcloned into pUC118 digested with SacI and treated with CEP to derive pUC118.IAP2 (See Figure 7, Panels a-e).
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 86: 72131 ⁇ 74213 and named "Polynucleotide Kinase/Polynucleotide Ligase" (PNK/PNL).
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- an AcMNPV DNA fragment coordinates 71417 (Hindm site) to 83121 (Hin ⁇ T ⁇ site) was subcloned into pAT153 digested with HindEQ and treated with CEP to derive PAT153.PNK/PNL (See Figure 8, Panels a-e).
- the plasmid was designated pUCll ⁇ .PNK/PNL.lacZ. This was used to cotransfect S. frugiperda cells with infectious ACMNPV C6 DNA to produce recombinant virus with a copy of the beta-galactosidase gene in frame with the PNK/PNL, thus disrupting PNK/PNL function. The results showed that the recombinant virus (AcPNK/PNL.lacZ) replicated normally in S. frugiperda cells. EXAMPLE 26
- ORF 123 102964 ⁇ 103609: Protein Kinase 2 (PK2)
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 123: 102964 ⁇ 103609 and named "Protein Kinase 2" (PK2).
- PK2 Protein Kinase 2
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- an AcMNPV DNA fragment coordinates 102148 (Pstl site) to 105164 (Pstl site), was subcloned into pUCll ⁇ digested with Pstl and treated with CEP to derive pUC118.PK2 (See Figure 9, Panels a-e).
- Apal-BglH adaptor AGATCTGGCC
- coli lacZ coding region to provide an in-frame fusion between the virus and bacterial genes.
- the plasmid was designated pUC118.PK2.lacZ. This was used to cotransfect S. frugiperda cells with infectious AcMNPV C6 DNA to produce recombinant virus with a copy of the beta-galactosidase gene in frame with the PK2, thus disrupting PK2 function. The results demonstrated that the recombinant virus (AcPK2.1acZ) replicated normally in S. frugiperda cells. 56 EXAMPLE 27
- ORF 126 105282 ⁇ 106935: Chitinase (CHID
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 126: 105282 ⁇ 106935 and named "Chitinase” (CHIT.
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- an AcMNPV DNA fragment coordinates 105164 (Pstl site) to 107943 (Pstl site), was subcloned into pUC118 (lacking a Hind HI site) digested with Pstl and treated with CIP to derive pUC118.CHIT (See Figure 10, Panels a-e). This was digested with Hind (106337), treated with CIP and ligated with a Hindm-BamHl adaptor (AGCTGGATCC) to insert a BamHl site within the CHIT gene to derive pUC118.CHIT-BamHl. This was digested with BamHl, treated with CEP and ligated with a DNA cassette containing the E.
- the plasmid was designated pUC118.CHIT.lacZ. This was used to cotransfect S. frugiperda cells with infectious ACMNPV C6 DNA to produce recombinant virus with a copy of the beta- galactosidase gene in frame with the chitinase, thus disrupting chitinase function.
- the recombinant virus (AcCHIT.lacZ) replicated normally in S. frugiperda cells. In T. ni insect larvae, the virus replicated but failed to induce liquefaction of the host.
- This Example identifies a new AcMNPV gene which is dispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 127: 106983 > 107952 and is named "Cathepsin” (CATH).
- This AcMNPV gene can be deleted from the baculovirus genome to: (a) provide additional sites for inserting single or multiple copies of foreign genes and (b) to reduce the size of the virus genome.
- Each gene is ascribed a number corresponding with the order of open reading frames (ORFs) within the AcMNPV genome. This is followed by the precise coordinates of the left and right ends of the coding region. Genes which are on the same strand as the polyhedrin gene are indicated by " > " between the left and right coordinates. Genes which are antisense to the polyhedrin are indicated by " ⁇ " between the left and right coordinates.
- the translation initiation codon (ATG) for polyhedrin-sense genes is located at the left coordinate while the translation initiation codon for antisense genes is located at the right coordinate. All coordinates in this description are relative to the AcMNPV genomic sequence, even after subcloning of virus DNA fragments into plasmid vectors.
- This mutated plasmid was designated pUC119.M.CHTT- /CATH-. It was used to cotransfect S. frugiperda cells with infectious virus DNA, purified from the AcCHTT.lacZ, which had been digested with Bsu361 to enhance the recovery of recombinant viruses.
- the recombinant virus, AcCH_T-/CATH- replicated normally in S. frugiperda cells. In T. ni insect larvae, the virus replicated but failed to induce liquefaction of the host.
- ORF 42 34010 > 33924: Global Transactivator (GTA)
- This Example identifies a new AcMNPV gene which is indispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 42: 34010 > 33924 and is named "Global Transactivator" (GTA).
- GTA Global Transactivator
- This AcMNPV gene was modified in a similar manner as Examples 21-28 (described above), but the modification did not result in the production of an infectious virus stock. This information is a strong indication that this virus gene is indispensable for replication and cannot be removed from the virus genome.
- an AcMNPV DNA fragment coordinates 33403 (EcoRI site) to 37088 (Asp718 site) was inserted into pUC118 digested with EcoRI and Asp718 and treated with CEP, to derive pUCll ⁇ .GTA (See Figure 12, Panels a-e).
- This plasmid was designated pUC118.GTA.lacZ. This was used to cotransfect S. frugiperda cells with infectious AcMNPV C6 DNA to produce recombinant virus. Although some blue plaques were derived, these could not be titrated to genetic homogeneity and it was concluded that the GTA gene is essential for virus replication in cell culture.
- This Example identifies a new AcMNPV gene which is indispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 124: 103793 > 104534 and named "Plasmid Copy Number Protein” (PCNP).
- This AcMNPV gene was modified in a similar manner as Examples 21-28 (described above), but the modification did not result in the production of an infectious virus stock. This information is a strong indication that this virus gene is indispensable for replication and cannot be removed from the virus genome.
- an AcMNPV DNA fragment coordinates 102148 (Pstl site) to 105164 (Pstl site) was inserted into pUC118, digested with Pstl and treated with CEP, to derive pUC118.PCNP (See Figure 13, Panels a-e).
- This plasmid was designated pUC118.PCNP.lacZ. This was used to cotransfect Spodoptera frugiperda cells with infectious AcMNPV C6 DNA to produce recombinant virus. Although some blue plaques were derived, these could not be titrated to genetic homogeneity and it was concluded that the PCNP gene is essential for virus replication in cell culture.
- ORF 132 112560 > 113817: Alkaline Exonuclease (ALK-EXO)
- This Example identifies a new AcMNPV gene which is indispensable for virus replication in cell culture or insect larvae. This gene is located at ORF 132: 112560 > 113817 and named "Alkaline Exonuclease" (ALK-EXO).
- This AcMNPV gene was modified in a similar manner as Examples 21-28 (described above), but the modification did not result in the production of an infectious virus stock. This information is a strong indication that this virus gene is indispensable for replication and cannot be removed from the virus genome.
- an AcMNPV DNA fragment coordinates 112044 (Smal site) to 113913 (HindELT site) was subcloned into pUC118 digested with HindHT and Smal and treated with CEP (See Figure 14, Panels a-e).
- This combination of enzymes served to remove an intervening BamHl site within the polylinker of the plasmid.
- the plasmid was designated pUCl 18.
- ALK-EXO This was digested with BamHl (113033), treated with CEP and Ugated with a DNA cassette containing the E. coli lacZ coding region to provide an in-frame fusion between the virus and bacterial genes.
- the plasmid was designated pUC118.ALK-EXO.lacZ. This was used to cotransfect S. frugiperda cells with infectious AcMNPV C6 DNA to produce recombinant virus. Although some blue plaques were derived, these could not be titrated to genetic homogeneity and it was concluded that the ALK-EXO gene is essential for virus replication in cell culture.
- This Example identifies three restriction enzymes which do not have recognition sites within the AcNPV genome. These three enzymes are:
- Bsu36I sites were inserted within the ORF 9 (immediately downstream of the polyhedrin gene in AcNPV) and ORF 7 (immediately upstream of the polyhedrin gene).
- the polyhedrin gene was replaced with the beta-galactosidase coding region which also contains a Bsu36I site.
- Srfl and Sse8387I could be utilized in a similar manner in other regions of the virus genome. For example, they could be used to alter the AcNPV genome to incorporate these sites and facilitate genomic DNA linearization.
- This Example identifies two restriction sites which only digest AcNPV DNA once: Avril (See Figure 15) and Fsel (See Figure 16).
- AvriT digests within the non-essential EGT (ecdysteroid UDP-glucosyltransferase) gene and Fsel digests within the essential GTA (global transactivator) gene.
- EGT ecdysteroid UDP-glucosyltransferase
- GTA global transactivator
- Information derived from the entire AcMNPV genomic sequence could afford development of novel baculovirus transfer vectors that encode baculoviruses with favorable agronomic properties. Identification of genes encoding proteins that modify viral host range would lead to generation of recombinant NPVs wherein said recombinant viruses would be capable of infecting and therefore neutralizing a wider spectrum of important agronomic pests. Alternatively, genetic manipulation could lead to changes in viral properties that render the virus capable of infecting only a very narrow spectrum of insect pests, thus affording precise control of targeted insect species while sparing beneficial insect populations.
- genes involved in viral replication could be identified. Manipulation of these genes could afford recombinant baculoviruses that multiply more rapidly within infected insect cells, thus leading to more rapid neutralization of the infected insect.
- the Global Transactivator Gene ORF 42; see Example 29
- ORF 42 the Global Transactivator Gene
- Other genes influencing viral infectivity could also be identified and modified in order to raise the efficiency of the infectious process. This would also afford more rapid neutralization of targeted populations, and thus approach the rapidity of insect neutralization commonly associated with appUcation of traditional chemical insect control agents.
- genes could be identified that qu ⁇ ditatively control viral repUcation outside of a permissive propagation system.
- viral mutants deficient in a protein or proteins required for in vivo infectivity could be propagated in an insect cell culture system that is permissive to viral replication. While efficient viral repUcation in cell culture takes place, as well as initial infection of target insects, further viral replication in vivo is curtailed, and environmental impact of appUcation of recombinant baculoviruses is minimized.
- ALTSCHUL S.F., GISH, W., MILLER, W prisms, E.W. and LTPMAN, D.J.
- COCHRAN M.A.
- CARSTENS E.B.
- EATON B.T.
- FAULKNER FAULKNER
- Viral transcription during Autographa califo ica nuclear polyhedrosis virus infection a novel
- baculovirus polyhedral envelope-associated protein genetic location nucleotide sequence, and immunocytochemical characterization.
- Glycoprotein C of herpes simplex virus type 1 plays a principal role in the adsorption of virus to cells and in infectivity. J. Virol.65, 1090-1098. HODGMAN, T.C (1988a). A new superfamily of replicative proteins. Nature 333,
- Baculovirus gene ME53 which contains a putative zinc finger motif, is one of the major early-transcribed genes. J. Virol. 67, 753-758. KOGAN, P.H. and BLISSARD, G.W. (1994). A baculovirus gp64 early promoter is activated by host transcription factor binding to CACGTG and GATA elements. J. Virol. 68, 813-822. KOOL, M. and VLAK, J.M. (1993). The structural and functional organization of the Autographa califomica nuclear polyhedrosis virus genome. Arch. Virol.
- Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes.
- RAWLINGS N.D., PEARL, L.H. and BUTTLE, D.J. (1992).
- the baculovirus Autographa califomica nuclear polyhedrosis virus genome includes a papain- like sequence. Biol. Chem. Hoppe-Seyler 373, 1211-1215. ROEDER, R.G. (1991).
- the complexities of eukaryotic transcription initiation regulation of preinitiation complex assembly. TIBS 16, 402-408.
- Brahma a regulator of Drosophila homeotic genes structurally related to the yeast transcriptional activator SNF2/SW12. Cell 68, 561-572. THIEM, S.M. and MELLER, L.K. (1989a). Identification, sequence, and transcriptional mapping of the major capsid protein gene of the baculovirus
- FIG. 1 Physical map and summary of coding strategy of the AcNPV genome.
- the upper part of each panel represents a map of the sites in the virus genome for the commonly used restriction endonucleases (see text). Also shown are the hrs within the EcoRI map.
- the middle part of each panel summarizes the coding potential of all six reading frames of the virus DNA (1,2,3, ,2',3').
- ORFs are identified as black boxes starting at methionine codons (vertical Unes).
- the selected ORFs (see text) are numbered 1-154, with appropriate designations for the genes which have been characterized previously (see Table 1).
- Non-selected ORFs represent potential genes which overlap with other coding regions (see text).
- the lower section in each panel summarizes the percent purine or A+T composition for the + strand of the virus genome, using a sliding 250 nucleotide window. Units at the bottom are in base-pairs.
- FIG. 2 Dot matrix analysis of AcNPV genomic DNA. The genomic sequence of AcNPV
- + strand was compared to itself (left panel), or to its complementary - strand (right panel), using a 24 nucleotide moving window.
- the direction of sequence (strandedness) relative to the standard map in each comparison is indicated by the arrows on the x and y axes. Dots represent sites where there is 21 out of 24 or greater nucleotide sequence match (88% identity).
- matches in the left panel indicate sites of positional identity (diagonal Une mnning lower left to upper right), or direct DNA repeats (dots off the diagonal Une).
- Matches in the right panel indicate regions of inverted repetitive DNA.
- Dots close to the position where a diagonal Une should be in the right panel represent potential stem-and-loop (hairpin) structures.
- the columns and rows of dots marking the positions of the repetitive DNA associated with hrs are labelled across the top and on the right-side y-axis. Scales on the x and y axes are in kilobase-pairs.
- FIG. 3 Circular map of the AcNPV genome. The sites for the EcoRI (outer ring) and
- HindUL (inner ring) restriction enzymes are presented. The positions of the 154 ORFs described PCMB95/00578
- Fig. 1 Fig. 1
- arrows representing the direction of transcription for these putative genes. Shaded arrows indicate that the gene is known to be expressed, or has a weU characterized homologue in the protein sequence databases.
- Insertion sites and names of well characterized insertion sequences (IS) and retroposons (RP) are indicated, as are the positions of the hr sequences.
- the scale on the inner circle is in 100 map units.
- Fig. 10 Modification of the AcNPV CHITINASE gene.
- Panel (a) Pstl restriction maps for AcNPV (linearized form).
- Panel (b) Exploded view of genome coordinates 105164-107943 within pUC118.CHTE.
- Panel (c) pUC118.CHrr-Bgi ⁇ .
- Panel (d) pUC118.CHTr.LacZ.
- Fig. 12. Modification of the AcNPV GTA gene.
- Fig. 13 Modification of the AcNPV PCNP gene.
- Fig. 14 Modification of the AcNPV ALK-EXO gene.
- Panel (a) Hindm/Smal restriction map for AcNPV (linearized form).
- Fig. 15 Single restriction enzyme site (AvrEQ within the AcNPV EGT gene. Panel (a):
- Avrll restriction enzyme map for AcNPV (linearized form).
- Fig. 16 Single restriction enzyme site (Fsel) within the AcNPV GTA gene.
- Fsel restriction enzyme map for AcNPV (linearized form).
- the selected ORF's are numbered sequentially, in their order of appearance in the + strand of the genome (see text and Fig. 1).
- the left (column Left) and right (column Right) columns define the ends of the ORF irrespective of its encoding strand.
- the direction of the transcripts (column D) that could express the ORF is indicated by arrows.
- the number of amino acids encoded by the ORF (column aa) and the predicted molecular mass of the primary translation product (column M r ) from the first ATG are Usted (see text).
- the transcription column (Trans) indicates if at least one early (e/E), or TATA-like (t/T), or cap (c/Q motif is present in the 160 nucleotides upstream of an ORF (see text and Table 3). where a TATA-box is positioned 5' to a CAGT in a poUL- like promoter orientation, this is indicated by "TC".
- TC late promoter motif
- TAAG L
- ORFs that have an initiation methionine that conforms to Kozak-rules (column K) for higher eukaryotes are indicated (k).
- ORFs representing potential mini-cistrons initiating upstream of one of the selected ORFs and with an ATG condon that conforms to Kozak-rules are indicated (*, see text).
- ORFs that initiate at an ATG codon downstream of the first ATG or an ORF and producing a translation product that is smaller than the computer predicted product are marked (*, see text).
- Representative motifs in putative translation products (Table 3) are indicated in the domains column (Dom).
- the motifs included signal peptide (S), zinc finger (Z), leucine zipper (L), nuclear translation signal (N), and NTP binding domain (P).
- S signal peptide
- Z zinc finger
- L leucine zipper
- N nuclear translation signal
- P NTP binding domain
- the comments column includes differences in genomic organization pubUshed for other strains of AcNPV, functional properties of predicted peptide products, or other relevant features. References are Usted as a guide to the Uterature regarding previously pubUshed sequences or studies defining AcNPV gene functions.
- TATA box TATAAA; TATATA; TATAAT " 61/154 40/183
- Late promoter TAAG 71/154 1 1/183 oza consensus AxxATG(A/G); GxxATGG 91/154 52/183
- Zinc finger C/H X 2 -5 C/H X1 L13 C/H X2/5 C/H 31/154
- the motifs, their patterns and the number of the selected and non-selected ORFs with at least one copy of the indicated motifs are presented.
- the searches for motifs representing putative early transcription sites involved analyses of DNA sequences 160 nucleotides upstream of the first ATG codon (i.e., CGTGC, TATA box, Cap site and Pol II promoter motifs).
- the •search involved 80 nucleotides upstream of the ATG codon.
- Only the selected ORFs were analysed for motifs in the putative gene products (see text).
- Val GTA 508 18 lie ATA 822 30 Val GTC 4S2 18 lie ATC 590 22 Val GTG 1083 39 lie ATT 1286 48 Val GTT 678 25 SEQUENCE LISTING
- MOLECULE TYPE DNA (genomic)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Virology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
La présente invention concerne la séquence nucléotidique complète du génome du clone 6 du baculovirus Autographa californica du virus de la polyhédrose nucléaire (AcNPV). La molécule comporte 133 894 paires de bases et une teneur globale en A+T de 59 %. Des analyses effectuées, il semble découler que le virus est capable de coder environ 154 cadres de lecture ouverts (ORF) d'au moins 150 nucléotides à médiation méthioninique, et potentiellement exprimés. Ces ORF sont régulièrement répartis sur chacun des brins du génome de tout le virus. Ces ORF sont disposés en cadres de lecture adjacents et ne se chevauchant pas, séparés grâce à de courtes régions intergéniques. D'après la séquence nucléotidique primaire, il semble possible de déterminer les fonctions de certains gènes, les sites initiaux de réplication de l'ADN viral, les modalités de régulation des transcriptions géniques précoce et tardive, et des facteurs susceptibles d'affecter l'efficacité translationnelle du gène de l'AcNPV. Les données concernant la séquence génomique confirment, à quelques différences mineures près, les informations obtenues dans le cas des autres clones d'AcNPV. Le clone C6 semble donc proposable comme archétype de l'AcNPV pour les modalités comparatives.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU28972/95A AU2897295A (en) | 1994-07-04 | 1995-06-30 | Autographa californica complete genome sequence |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB9413420A GB9413420D0 (en) | 1994-07-04 | 1994-07-04 | Autographa californica nuclear polyhedrosis virus dna sequences |
| GB9413420.2 | 1994-07-04 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO1996001320A2 true WO1996001320A2 (fr) | 1996-01-18 |
| WO1996001320A3 WO1996001320A3 (fr) | 1996-07-25 |
Family
ID=10757768
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/IB1995/000578 WO1996001320A2 (fr) | 1994-07-04 | 1995-06-30 | Sequence genomique complete du virus autographa californica de la polyhedrose nucleaire |
Country Status (3)
| Country | Link |
|---|---|
| AU (1) | AU2897295A (fr) |
| GB (1) | GB9413420D0 (fr) |
| WO (1) | WO1996001320A2 (fr) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000005391A1 (fr) * | 1998-07-21 | 2000-02-03 | Dow Agrosciences Llc | Regulation negative de proteines vegetales a mediation par anticorps |
| US6635748B2 (en) * | 1997-12-31 | 2003-10-21 | Chiron Corporation | Metastatic breast and colon cancer regulated genes |
| CN114058598A (zh) * | 2021-11-04 | 2022-02-18 | 中国科学院精密测量科学与技术创新研究院 | 新的重组杆状病毒基因组插入位点及其应用 |
| CN114317608A (zh) * | 2020-12-28 | 2022-04-12 | 陕西杆粒生物科技有限公司 | 一种基因敲除型杆状病毒表达载体 |
| CN118086400A (zh) * | 2024-04-17 | 2024-05-28 | 和元生物技术(上海)股份有限公司 | 核酸分子、包含其的重组杆状病毒及其应用 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AUPN570295A0 (en) * | 1995-09-29 | 1995-10-26 | Commonwealth Scientific And Industrial Research Organisation | Biologically active proteins of viral origin |
-
1994
- 1994-07-04 GB GB9413420A patent/GB9413420D0/en active Pending
-
1995
- 1995-06-30 WO PCT/IB1995/000578 patent/WO1996001320A2/fr active Application Filing
- 1995-06-30 AU AU28972/95A patent/AU2897295A/en not_active Abandoned
Non-Patent Citations (4)
| Title |
|---|
| ARCH. VIROL., vol.130, pages 1 - 16 M. KOOL AND J.M. VLAK; 'The structural and functional organization of the autographa californica nuclear polyhedrosis virus genome' cited in the application * |
| VIROLOGY, vol.185, 19 October 0 pages 229 - 241 R.D. POSSEE ET AL.; 'Nucleotide sequence of the Autographa californica nuclear polyhedrosis 9.4 kbp Eco RI-I and -R (polyhedrin gene) region' cited in the application * |
| VIROLOGY, vol.191, pages 1003 - 1008 S.C. BRAUNAGEL ET AL.; 'Sequence, genomic organization of the EcoRI-A fragment of Autographica californica nuclear polyhedrosis virus, and identification of a viral-encoded protein resembling the outer capsid protein VP8 of Rotavirus' cited in the application * |
| VIROLOGY, vol.202, pages 586 - 605 M.D. AYRES ET AL.; 'The complete DNA sequence of Autographa californica nuclear polyhedrosis virus' * |
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6635748B2 (en) * | 1997-12-31 | 2003-10-21 | Chiron Corporation | Metastatic breast and colon cancer regulated genes |
| US7279307B2 (en) | 1997-12-31 | 2007-10-09 | Chiron Corporation | Metastatic breast and colon cancer regulated genes |
| US7795407B2 (en) | 1997-12-31 | 2010-09-14 | Novartis Vaccines And Diagnostics, Inc. | Metastatic breast and colon cancer regulated genes |
| WO2000005391A1 (fr) * | 1998-07-21 | 2000-02-03 | Dow Agrosciences Llc | Regulation negative de proteines vegetales a mediation par anticorps |
| CN114317608A (zh) * | 2020-12-28 | 2022-04-12 | 陕西杆粒生物科技有限公司 | 一种基因敲除型杆状病毒表达载体 |
| CN114317608B (zh) * | 2020-12-28 | 2023-08-22 | 陕西杆粒生物科技有限公司 | 一种基因敲除型杆状病毒表达载体 |
| CN114058598A (zh) * | 2021-11-04 | 2022-02-18 | 中国科学院精密测量科学与技术创新研究院 | 新的重组杆状病毒基因组插入位点及其应用 |
| CN114058598B (zh) * | 2021-11-04 | 2023-04-28 | 中国科学院精密测量科学与技术创新研究院 | 新的重组杆状病毒基因组插入位点及其应用 |
| CN118086400A (zh) * | 2024-04-17 | 2024-05-28 | 和元生物技术(上海)股份有限公司 | 核酸分子、包含其的重组杆状病毒及其应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2897295A (en) | 1996-01-25 |
| WO1996001320A3 (fr) | 1996-07-25 |
| GB9413420D0 (en) | 1994-08-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DK2467489T3 (en) | Baculovirus-based production of biopharmaceuticals free of contaminating baculoviral virions | |
| Cummings et al. | The complete DNA sequence of the mitochondrial genome of Podospora anserina | |
| KR102147007B1 (ko) | Fad3 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질 | |
| AU2013312198B2 (en) | Fluorescence activated cell sorting (FACS) enrichment to generate plants | |
| CN111163803B (zh) | 表达car t细胞靶物的溶瘤病毒及其用途 | |
| AU2017353868C1 (en) | Synthetic chimeric poxviruses | |
| CN112543806B (zh) | 合成嵌合痘苗病毒 | |
| KR102080055B1 (ko) | 식물 조절 요소 및 그의 용도 | |
| CN113215109B (zh) | 非洲猪瘟多基因联合缺失减毒株的构建及作为疫苗的应用 | |
| KR20220165731A (ko) | Sars-cov-2 바이러스에 대한 재조합 폭스바이러스 기반 백신 | |
| KR20230113832A (ko) | 키메라 폭스바이러스 조성물 및 이의 용도 | |
| Van Oers et al. | The baculovirus 10-kDa protein | |
| CN112899290B (zh) | 一种天然免疫抑制基因缺失的减毒非洲猪瘟病毒株及应用 | |
| CN113025629A (zh) | 一种基因缺失的减毒非洲猪瘟病毒株及应用 | |
| KR20220148823A (ko) | 천연 또는 합성 dna에 의해 생산된 폭스바이러스-기반 벡터 및 그의 용도 | |
| CN116670153A (zh) | 允许在稳定细胞系中高效生长的非洲猪瘟疫苗的基因组缺失 | |
| WO1996001320A2 (fr) | Sequence genomique complete du virus autographa californica de la polyhedrose nucleaire | |
| CN112261951A (zh) | 包含合成嵌合痘苗病毒的干细胞及其使用方法 | |
| US20040185565A1 (en) | High throughput system for producing recombinant viruses using site-specific recombination | |
| EP0975787A1 (fr) | Vecteur d'administration de gene a base d'entomopoxvirus pour vertebres | |
| US6180098B1 (en) | Recombinant helicoverpa baculoviruses expressing heterologous DNA | |
| HK40078138A (en) | Poxvirus-based vectors produced by natural or synthetic dna and uses thereof | |
| KR100270928B1 (ko) | 누에 핵다각체병 바이러스 p10 유전자를 이용한 전이 벡터 및 제조방법 | |
| KR100325394B1 (ko) | 신규한 진핵세포용 유전자 클로닝 전달운반체와 재조합 바이러스 운반체 및 이들을 이용한 재조합 단백질 제조방법 | |
| HK40042130A (en) | Stem cells comprising synthetic chimeric vaccinia virus and methods of using them |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AM AT AU BB BG BR BY CA CH CN CZ DE DK EE ES FI GB GE HU IS JP KE KG KP KR KZ LK LR LT LU LV MD MG MN MW MX NO NZ PL PT RO RU SD SE SG SI SK TJ TM TT UA UG US UZ VN |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): KE MW SD SZ UG AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase in: |
Ref country code: CA |