US20160145604A1 - An Integrated System for Library Construction, Affinity Binder Screening and Expression Thereof - Google Patents
An Integrated System for Library Construction, Affinity Binder Screening and Expression Thereof Download PDFInfo
- Publication number
- US20160145604A1 US20160145604A1 US14/774,722 US201314774722A US2016145604A1 US 20160145604 A1 US20160145604 A1 US 20160145604A1 US 201314774722 A US201314774722 A US 201314774722A US 2016145604 A1 US2016145604 A1 US 2016145604A1
- Authority
- US
- United States
- Prior art keywords
- display
- acid sequence
- nucleic acid
- tag
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000011230 binding agent Substances 0.000 title claims description 39
- 238000012216 screening Methods 0.000 title claims description 16
- 238000010276 construction Methods 0.000 title description 4
- 239000013598 vector Substances 0.000 claims abstract description 112
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 76
- 238000000034 method Methods 0.000 claims abstract description 74
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 64
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 64
- 239000002157 polynucleotide Substances 0.000 claims abstract description 64
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 58
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 54
- 239000013604 expression vector Substances 0.000 claims abstract description 35
- 108090000623 proteins and genes Proteins 0.000 claims description 73
- 210000004027 cell Anatomy 0.000 claims description 53
- 230000027455 binding Effects 0.000 claims description 51
- 102000004169 proteins and genes Human genes 0.000 claims description 42
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 31
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 31
- 230000004927 fusion Effects 0.000 claims description 29
- 238000002823 phage display Methods 0.000 claims description 24
- 108091008146 restriction endonucleases Proteins 0.000 claims description 24
- 150000001413 amino acids Chemical group 0.000 claims description 23
- 238000006243 chemical reaction Methods 0.000 claims description 22
- 108020001507 fusion proteins Proteins 0.000 claims description 14
- 102000037865 fusion proteins Human genes 0.000 claims description 14
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 12
- 101710132601 Capsid protein Proteins 0.000 claims description 11
- 101710094648 Coat protein Proteins 0.000 claims description 11
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 claims description 11
- 101710125418 Major capsid protein Proteins 0.000 claims description 11
- 101710141454 Nucleoprotein Proteins 0.000 claims description 11
- 101710083689 Probable capsid protein Proteins 0.000 claims description 11
- 238000012512 characterization method Methods 0.000 claims description 9
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 claims description 8
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 8
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 8
- 239000005090 green fluorescent protein Substances 0.000 claims description 7
- 238000002818 protein evolution Methods 0.000 claims description 6
- 238000002819 bacterial display Methods 0.000 claims description 5
- 102000002260 Alkaline Phosphatase Human genes 0.000 claims description 4
- 108020004774 Alkaline Phosphatase Proteins 0.000 claims description 4
- 108010071023 Bacterial Outer Membrane Proteins Proteins 0.000 claims description 4
- 241000724791 Filamentous phage Species 0.000 claims description 4
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 claims description 4
- 102100038895 Myc proto-oncogene protein Human genes 0.000 claims description 4
- 101710135898 Myc proto-oncogene protein Proteins 0.000 claims description 4
- 101710150448 Transcriptional regulator Myc Proteins 0.000 claims description 4
- 108010005400 cutinase Proteins 0.000 claims description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 4
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000007865 diluting Methods 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 11
- 235000018102 proteins Nutrition 0.000 description 37
- 102000004196 processed proteins & peptides Human genes 0.000 description 33
- 229920001184 polypeptide Polymers 0.000 description 25
- 239000000427 antigen Substances 0.000 description 24
- 108091007433 antigens Proteins 0.000 description 24
- 102000036639 antigens Human genes 0.000 description 24
- 108020004705 Codon Proteins 0.000 description 21
- 235000001014 amino acid Nutrition 0.000 description 19
- 239000012634 fragment Substances 0.000 description 18
- 102000039446 nucleic acids Human genes 0.000 description 17
- 108020004707 nucleic acids Proteins 0.000 description 17
- 230000008569 process Effects 0.000 description 17
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 14
- 238000013459 approach Methods 0.000 description 14
- 102000023732 binding proteins Human genes 0.000 description 14
- 108091008324 binding proteins Proteins 0.000 description 14
- 108060003951 Immunoglobulin Proteins 0.000 description 13
- 102000018358 immunoglobulin Human genes 0.000 description 13
- 241000282414 Homo sapiens Species 0.000 description 12
- 125000003729 nucleotide group Chemical group 0.000 description 12
- 108020004414 DNA Proteins 0.000 description 10
- 102000018697 Membrane Proteins Human genes 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 239000002773 nucleotide Substances 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 8
- 108010052285 Membrane Proteins Proteins 0.000 description 8
- 102100024952 Protein CBFA2T1 Human genes 0.000 description 8
- 210000004962 mammalian cell Anatomy 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- 108020005038 Terminator Codon Proteins 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 241000894007 species Species 0.000 description 7
- 230000009870 specific binding Effects 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 6
- 102000012410 DNA Ligases Human genes 0.000 description 6
- 108010061982 DNA Ligases Proteins 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 239000002245 particle Substances 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 229920000642 polymer Polymers 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 230000007017 scission Effects 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 241000700605 Viruses Species 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 4
- 102000025171 antigen binding proteins Human genes 0.000 description 4
- 108091000831 antigen binding proteins Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 239000003446 ligand Substances 0.000 description 4
- 238000004091 panning Methods 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical group C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 239000012707 chemical precursor Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 238000004806 packaging method and process Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical compound COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 2
- 108010077805 Bacterial Proteins Proteins 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 241000283073 Equus caballus Species 0.000 description 2
- 241001524679 Escherichia virus M13 Species 0.000 description 2
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 2
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 108010067902 Peptide Library Proteins 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- -1 auxotropic markers Proteins 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- 230000004540 complement-dependent cytotoxicity Effects 0.000 description 2
- 230000009260 cross reactivity Effects 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000013537 high throughput screening Methods 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 101150066555 lacZ gene Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000001322 periplasm Anatomy 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108020004418 ribosomal RNA Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 102000009109 Fc receptors Human genes 0.000 description 1
- 108010087819 Fc receptors Proteins 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102000013463 Immunoglobulin Light Chains Human genes 0.000 description 1
- 108010065825 Immunoglobulin Light Chains Proteins 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- 241000254158 Lampyridae Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 101100467813 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RBS1 gene Proteins 0.000 description 1
- 241000242583 Scyphozoa Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 238000003314 affinity selection Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000033115 angiogenesis Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000003782 beta lactam antibiotic agent Substances 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000006258 combinatorial reaction Methods 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000030944 contact inhibition Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000012866 crystallographic experiment Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000006240 deamidation Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000011577 humanized mouse model Methods 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000006882 induction of apoptosis Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000009878 intermolecular interaction Effects 0.000 description 1
- 230000008863 intramolecular interaction Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 231100001231 less toxic Toxicity 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000002824 mRNA display Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000009343 monoculture Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000006654 negative regulation of apoptotic process Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108700010839 phage proteins Proteins 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000013391 scatchard analysis Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000009394 selective breeding Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001296 transplacental effect Effects 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1037—Screening libraries presented on the surface of microorganisms, e.g. phage display, E. coli display
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1058—Directional evolution of libraries, e.g. evolution of libraries is achieved by mutagenesis and screening or selection of mixed population of organisms
Definitions
- This application pertains to the construction and screening of display libraries, particularly the design and use of vectors therefor.
- affinity binders such as antibodies, that bind specifically to a target, such as antigens
- surface display technologies including phage display, ribosomal/mRNA display, yeast display and mammalian display.
- Various display vectors have been developed such that upon expression, libraries of peptides or proteins can be displayed on the surface of phage, bacteria, yeast or mammalian cells.
- Affinity binders can then be identified through library screening and selection processes such as phage panning or fluorescence-activated cell sorting (FACS). Thereafter, clones corresponding to the affinity binders must be further characterized to ascertain the identity of the binders, as well as for downstream applications.
- inserts from binding clones are transferred or subcloned into expression vectors, either individually or en masse, such that the inserts can be expressed, purified and/or further characterized.
- this subcloning process for transferring inserts from display vectors to expression vectors is time consuming, laborious, and low throughput.
- the efficiency of subcloning is low and unpredictable, varying from about 50% to 80%.
- aspects of the invention relate to the design and use of vectors for high-throughput conversion, e.g., from display to expression.
- the design requires judicious engineering of restriction enzyme sites at specific sequence locations, and employment of combinations of restriction enzymes and DNA ligases to facilitate the conversion.
- Various libraries can be constructed using vectors of the present invention to enable high-throughput library screening, binder selection/recovery, and binder characterization.
- a high conversion rate e.g., over 90%, over 95%, or close to 100%
- a recombinant polynucleotide suitable for use to construct various vectors comprises from 5′ to 3′: a first nucleic acid sequence encoding an amino acid sequence to be displayed on a surface; a first restriction site selected from the group consisting of XbaI, NcoI, SalI and XhoI sites; a second nucleic acid sequence encoding a surface peptide capable of being displayed on the surface; and a second restriction site selected from the group consisting of XbaI, NcoI, SalI and XhoI sites.
- the first nucleic acid sequence is engineered in-frame with the second nucleic acid sequence. The first and second restriction sites, when cleaved by corresponding restriction endonuclease thereto, produce compatible sticky ends.
- the first nucleic acid sequence encodes an antibody fragment such as Fab.
- the second nucleic acid sequence may encode a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, a cell surface tether domain, or an adapter, or a truncation or derivative thereof.
- the second nucleic acid sequence can be gene III of filamentous phage M13, or a truncation or derivative thereof.
- the second nucleic acid sequence can also encode an adapter capable of binding to a binding partner, wherein said binding partner is expressed as a fusion and directly displayed on the surface.
- the surface peptide can be for phage display, yeast display, bacterial display or mammalian display, or shuttling display between different hosts (e.g., via adapters).
- the surface peptide and the amino acid sequence encoded by the first nucleic acid sequence are displayed as a fusion protein on the surface.
- the first and second restriction site each encode amino acids that do not interfere with binding affinity of the amino acid sequence, and/or display of the surface peptide or fusion protein.
- the first and/or second restriction site is XbaI site.
- a display vector for high-throughput conversion into expression vector comprises the recombinant polynucleotide described herein and a fusion tag sequence 5′ to the first restriction site or 3′ to the second restriction site.
- the fusion tag sequence when the fusion tag sequence is 3′ to the second restriction site, it is engineered such that upon (a) removal of the second nucleic acid sequence by cleaving the first and second restriction site, and (b) religation of the compatible sticky ends produced therefrom, the first nucleic acid sequence is in-frame with the fusion tag sequence.
- the fusion tag sequence can be selected from one or more of: an alkaline phosphatase tag, an AviTag, a cutinase tag, a halotag, a flag tag, a c-myc tag, a histidine tag, a GST tag, a green fluorescent protein tag, an HA tag, an E-tag, a Strep tag, a Strep tag II and a YoI 1/34 tag.
- Such display vector can be provided in a library of display vectors in which each display vector has a unique first nucleic acid sequence, thereby forming a library for selection.
- a method of converting a display vector to an expression vector includes: providing the display vector described herein; cleaving the first and second restriction site with corresponding restriction endonuclease thereto, thereby producing said compatible sticky ends; and religating the compatible sticky ends to produce an expression vector in which the first nucleic acid sequence is in-frame with the fusion tag sequence.
- the method can further include cleaving within the second nucleic acid sequence to increase religation efficiency in the religating step.
- a product from the cleaving step can be diluted to increase intramolecular ligation.
- the expression vector produced therefrom can be introduced into a host, to further characterize the first nucleic acid sequence.
- Exemplary further characterization includes sequencing the first nucleic acid sequence and/or expressing the first nucleic acid sequence.
- the method is a high-efficiency method.
- the providing, cleaving, religating and introducing steps can be completed in less than 12 hours, in less than 8 hours, or in less than 4 hours.
- the method can be performed in a high-throughput manner where a plurality of display vectors can be converted to a plurality of expression vectors in parallel. Thereafter, the plurality of expression vectors can be introduced into a population of hosts for further characterization.
- Such high-throughput conversion can have a conversion rate (from the plurality of display vectors to the plurality of expression vectors) that is higher than 90%, higher than 95%, or higher than 98%.
- a method of identifying an affinity binder to a target includes: screening a population of first hosts each containing the display vector described herein, to obtain a subpopulation of first hosts having binding affinity to a target, wherein each first host displays, on a surface of said first host, the amino acid sequence encoded by a unique first nucleic acid sequence in the display vector, and wherein said subpopulation of first hosts each display an affinity binder to said target; converting display vectors isolated from said subpopulation of first hosts to expression vectors by cleaving the first and second restriction site to remove the second nucleic acid sequence and religating the compatible sticky ends produced therefrom; and introducing said expression vectors into a population of second hosts, to further characterize the affinity binder.
- the method can be performed in a high-throughput fashion, wherein the converting step can have a conversion rate that is higher than 90%, higher than 95%, or higher than 98%.
- libraries and kits constructed using the vectors described herein and for carrying out the methods described herein. Further provided are sublibraries generated during the screening process and affinity binders obtained from methods described herein.
- FIG. 1 provides a schematic restriction map showing an exemplary vector of the present invention.
- FIG. 2 provides a gel electrophoresis image showing the conversion rate of an exemplary vector at different concentrations.
- One restriction enzyme, XbaI was used.
- FIG. 3 provides a gel electrophoresis image showing the conversion rate of an exemplary vector at different concentrations. Two restriction enzymes, XbaI and XmnI were used.
- This invention relates to a high-throughput process which integrates affinity binder discovery with downstream characterization thereof.
- Such high-throughput process has been surprisingly shown to save tremendous amount of time and effort.
- the entire process from library screening, binder selection, to binder characterization can be streamlined, while dramatically accelerating the pace of affinity binder discovery and characterization.
- the high-throughput process of the present invention is commercially desirable in affinity reagents discovery, and can be applied to various display platforms, including phage display, bacterial display, yeast display, mammalian display, as well as other display systems.
- an element means one element or more than one element.
- a population of hosts is meant a group of hosts into which a library of polynucleotides can be introduced and displayed.
- the host can be phages, yeasts, bacteria or mammalian cells.
- a population of cells from a monoculture i.e., wherein each cell in the population is of the same cell type can be used.
- mixed cultures of cells can also be used.
- Cells may be adherent, i.e., cells which grow attached to a solid substrate, or, alternatively, the cells may be in suspension.
- Mammalian cells may be cells derived from primary tumors, cells derived from metastatic tumors, primary cells, cells which have lost contact inhibition, transformed primary cells, immortalized primary cells, cells which may undergo apoptosis, and cell lines derived there from.
- the term “about” means within 20%, more preferably within 10% and most preferably within 5%.
- affinity binder refers to a peptidic chain having a specific or general affinity with another protein or molecule. Proteins are brought into contact and form a complex when binding is possible.
- the affinity binder of the invention expressed on the surface of a host can preferably be an antibody, a fragment or derivative of an antibody, a protein or a peptide.
- An antibody or a peptide “affinity binds,” “specifically binds” or “preferentially binds” to an antigen or a target if it binds with greater affinity, avidity, more readily, and/or with greater duration than it binds to other substances.
- an antibody or a peptide that specifically or preferentially binds to a first target may or may not specifically or preferentially bind to a second target.
- affinity binding does not necessarily require (although it can include) exclusive binding.
- specific binding means that the protein exhibits appreciable affinity for a particular antigen or epitope and, generally, does not exhibit significant cross-reactivity.
- An antigen binding to protein that “does not exhibit significant cross-reactivity” is one that will not appreciably bind to an entity other than its target (e.g., a different epitope or a different molecule).
- an antigen specific protein specific for a particular epitope will, for example, not significantly cross-react with remote epitopes on the same protein or peptide.
- Specific binding can be determined according to any art-recognized means for determining such binding. Preferably, specific binding is determined according to Scatchard analysis and/or competitive binding assays. “Affinity” binding includes binding with an affinity of at least 10 6 , 10 7 , 10 8 , 10 9 M ⁇ 1 , or 10 10 M ⁇ 1 . Antigen binding proteins with affinities greater than 10 7 M ⁇ 1 or 10 8 M ⁇ 1 typically bind with correspondingly greater specificity.
- amino acid sequence refers to a sequence of contiguous amino acid residues of any length.
- polypeptide peptide
- oligopeptide oligopeptide
- protein may be used interchangeably herein with the term “amino acid sequence.”
- an “antibody” is an immunoglobulin molecule capable of specific binding to a target, such as a carbohydrate, polynucleotide, lipid, polypeptide, etc., through at least one antigen recognition site, located in the variable region of the immunoglobulin molecule.
- the term encompasses not only intact polyclonal or monoclonal antibodies, but also fragments thereof (such as Fab, Fab′, F(ab′) 2 , Fv), single chain (ScFv), mutants thereof, naturally occurring variants, fusion proteins comprising an antibody portion with an antigen recognition site of the required specificity, humanized antibodies, chimeric antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
- fragments thereof such as Fab, Fab′, F(ab′) 2 , Fv), single chain (ScFv), mutants thereof, naturally occurring variants, fusion proteins comprising an antibody portion with an antigen recognition site of the required specificity, humanized antibodies, chimeric antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
- Antibody fragments comprise only a portion of an intact antibody, generally including an antigen binding site of the intact antibody and thus retaining the ability to bind antigen.
- Examples of antibody fragments encompassed by the present definition include: (i) the Fab fragment, having VL, CL, VH and CH1 domains; (ii) the Fab′ fragment, which is a Fab fragment having one or more cysteine residues at the C-terminus of the CH1 domain; (iii) the Fd fragment having VH and CH1 domains; (iv) the Fd′ fragment having VH and CH1 domains and one or more cysteine residues at the C-terminus of the CH1 domain; (v) the Fv fragment having the VL and VH domains of a single antibody; (vi) the dAb fragment which consists of a VH domain; (vii) isolated CDR regions; (viii) F(ab′) 2 fragments, a bivalent fragment including two Fab′ fragments linked by a disulfide bridge at the
- antibody variant refers to an antibody with single or multiple mutations in the heavy chains and/or light chains.
- the mutations exist in the variable region. In some embodiments, the mutations exist in the constant region.
- binding partner is used interchangeably with “target” to refer to a molecule that is recognized and binds to a peptide or protein.
- the binding partner can be an antigen or epitope thereof when binding to an antibody or an antibody fragment. Binding partners can be used in a screen to identify affinity binders thereto.
- Biopanning is an affinity selection technique which selects for peptides that bind to a given target. Biopanning involves three major steps: capturing affinity binders with a target (“panning”) using a previously constructed library, washing, and elution. Details are provided hereunder.
- cell surface tether domains include, for example, transmembrane domains or glycosidylphosphatidylinositol signal sequences.
- Chimeric antibodies refers to those antibodies wherein one portion of each of the amino acid sequences of heavy and light chains is homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular class, while the remaining segment of the chains is homologous to corresponding sequences in another.
- the variable region of both light and heavy chains mimics the variable regions of antibodies derived from one species of mammals, while the constant portions are homologous to the sequences in antibodies derived from another.
- the variable regions can conveniently be derived from presently known sources using readily available hybridomas or B cells from non human host organisms in combination with constant regions derived from, for example, human cell preparations.
- variable region has the advantage of ease of preparation, and the specificity is not affected by its source, the constant region being human, is less likely to elicit an immune response from a human subject when the antibodies are injected than would the constant region from a non-human source.
- the definition is not limited to this particular example.
- a “codon” is a series of three nucleotides (triplets) that encodes a specific amino acid residue in a polypeptide chain or for the termination of translation (stop codons). There are 64 different codons (61 codons encoding for amino acids plus 3 stop codons) but only 20 different translated amino acids. The overabundance in the number of codons allows many amino acids to be encoded by more than one codon. Different organisms (and organelles) often show particular preferences or biases for one of the several codons that encode the same amino acid.
- a “constant region” of an antibody refers to the constant region of the antibody light chain or the constant region of the antibody heavy chain, either alone or in combination.
- the constant regions of the light chain (CL) and the heavy chain (CH1, CH2 or CH3, or CH4 in the case of IgM and IgE) confer important biological properties such as secretion, transplacental mobility, Fc receptor binding, complement binding, and the like.
- CL constant regions of the light chain
- CH1, CH2 or CH3, or CH4 in the case of IgM and IgE confer important biological properties such as secretion, transplacental mobility, Fc receptor binding, complement binding, and the like.
- the numbering of the constant region domains increases as they become more distal from the antigen binding site or amino-terminus of the antibody.
- degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues.
- the degeneracy can also be selective or constrained where only a selected subset of positions is subject to substitutions, and/or one or more or all positions are substituted with constraints by optimizing the composition and ratio of the mixed-base and/or deoxyinosine residues.
- Display refers to presentation of different recombinant polypeptides on the surface of a host such as phages, yeasts, bacteria and mammalian cells.
- display vector refers to a plasmid or phage DNA or other DNA sequence which is able to replicate autonomously in a host, and capable of expressing and displaying an insert in the vector as part of a fusion protein on the surface of the host.
- “Diversity” of a library refers to the number of different recombinant polypeptides encoded by the polynucleotides in the library.
- expression vector refers to a vector capable of expressing of a gene or any open reading frame that has been cloned into it. Such expression can occur after transformation into a host cell, or in in vitro systems.
- the cloned DNA or insert is usually operably linked to one or more regulatory sequences, such as promoters, activator/repressor binding sites, terminators, enhancers and the like.
- fuse refers to the covalent linkage between two polypeptides in a single protein.
- the polypeptides are typically joined via a peptide bond, either directly to each other or via an amino acid linker.
- the peptides can be joined via non-peptide covalent linkages known to those of skill in the art.
- fusion tag is a peptide or protein located either on the C- or N-terminal of the target protein, which improves one or more of: solubility, detection, purification, expression of the target protein.
- the fusion tag is generally engineered in-frame with the target protein.
- Commonly used fusion tags include but are not limited to: alkaline phosphatase tag, AviTag, cutinase tag, halotag, flag tag, c-myc tag, histidine (His) tag, GST tag, green fluorescent protein (GFP) tag, HA tag, E-tag, Strep tag, Strep tag II and YoI 1/34 tag.
- heavy chain refers to the larger immunoglobulin subunit which associates, through its amino terminal region, with the immunoglobulin light chain.
- the heavy chain comprises a variable region (VH) and a constant region (CH).
- the constant region further comprises the CH1, hinge, CH2, and CH3 domains.
- the heavy chain comprises a CH4 domain but does not have a hinge domain.
- immunoglobulin subclasses e.g., IgG1, IgG2, IgG3, IgG4, IgA1, etc. are well characterized and are known to confer functional specialization.
- a “host” is intended to include any individual virus or cell or culture thereof that can be or has been a recipient for vectors or for the incorporation of exogenous nucleic acid molecules, polynucleotides, and/or proteins. It also is intended to include progeny of a single virus or cell. The progeny may not necessarily be completely identical (in morphology or in genomic or total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation.
- the virus can be phage.
- the cells may be prokaryotic or eukaryotic, and include but are not limited to bacterial cells, yeast cells, insect cells, animal cells, and mammalian cells, e.g., murine, rat, simian, or human cells.
- Humanized antibodies refer to a molecule having an antigen-binding site that is substantially derived from an immunoglobulin from a non-human species and the remaining immunoglobulin structure of the molecule based upon the structure and/or sequence of a human immunoglobulin.
- the antigen-binding site may comprise either complete variable domains fused onto constant domains or only the complementarity determining regions (CDRs) grafted onto appropriate framework regions in the variable domains.
- Antigen binding sites may be wild type or modified by one or more amino acid substitutions, e.g., modified to resemble human immunoglobulin more closely.
- Some forms of humanized antibodies preserve all CDR sequences (for example, a humanized mouse antibody which contains all six CDRs from the mouse antibodies).
- Other forms of humanized antibodies have one or more CDRs (one, two, three, four, five, six) which are altered with respect to the original antibody, which are also termed one or more CDRs “derived from” one or more CDRs.
- a gene or reading frame is “in-frame” with another when the two genes or reading frames are cloned together to generate a contiguous reading frame, without disrupting the codons therein, such that when expressed, a fusion protein expressing both genes or reading frames are produced.
- the upstream gene or reading frame is an open reading frame.
- a reading frame is a way of dividing the sequence of nucleotides in a nucleic acid into a set of consecutive, non-overlapping codons.
- An open reading frame (ORF) is a reading frame that contains a start codon, and a subsequent region which usually has a length which is a multiple of 3 nucleotides, but does not contain a stop codon in a given reading frame.
- an “insert” as used herein, is a heterologous nucleic acid sequence that is ligated into a compatible site into a vector.
- An insert may comprise one or more nucleic acid sequences that encode a polypeptide or polypeptides.
- An insert may comprise regulatory regions or other nucleic acid elements.
- an “isolated” or “purified” polypeptide or polynucleotide e.g., an “isolated polypeptide,” or an “isolated polynucleotide” is purified to a state beyond that in which it exists in nature.
- the “isolated” or “purified” polypeptide or polynucleotide can be substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the protein or polynucleotide is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized.
- antigen binding protein having less than about 50% of non-antigen binding protein also referred to herein as a “contaminating protein”
- contaminating protein also referred to herein as a “contaminating protein”
- chemical precursors 40%, 30%, 20%, 10% and more preferably 5% (by dry weight), of non-antigen binding protein, or of chemical precursors is considered to be substantially free.
- Library used herein refers to a diverse collection or mixture of polynucleotides comprising polynucleotides encoding different recombinant polypeptides.
- a library of polynucleotides may comprise at least 10 6 , 10 7 , 10 8 , 10 9 , 10 10 , 10 11 , 10 12 , 10 13 , 10 14 , 10 15 , or more or less different polynucleotides within a given collection of polynucleotides.
- the different polynucleotides in the library are related through, for example, their origin from a single animal species (for example, human, mouse, rabbit, goat, horse), tissue type, organ, or cell type.
- a “library” may comprise polynucleotides of a common genus.
- the genus can be polynucleotides encoding an immunoglobulin subunit polypeptide of a certain type and class, e.g., a library might encode an antibody ⁇ , ⁇ 1, ⁇ 2, ⁇ 3, ⁇ 4, ⁇ 1, ⁇ 2, ⁇ , or ⁇ heavy chain, or an antibody ⁇ or ⁇ light chain.
- each member of any one library described herein may encode the same heavy or light chain constant region
- the library may collectively comprise at least 10 6 , 10 7 , 10 8 , 10 9 , 10 10 , 10 11 , 10 12 , 10 13 , 10 14 , 10 15 or more or less different variable regions, i.e., a “plurality” of variable regions associated with the common constant region.
- the different polynucleotides in the library can also be related through, for example, their origin from a single animal species (for example, human, mouse, rabbit, goat, horse, etc.), tissue type, organ, or cell type.
- light chain refers to the smaller immunoglobulin subunit which associates with the amino terminal region of a heavy chain.
- a light chain comprises a variable region (VL) and a constant region (CL).
- Light chains are classified as either kappa or lambda ( ⁇ , ⁇ ). A pair of these can associate with a pair of any of the various heavy chains to form an immunoglobulin molecule.
- Also encompassed in the meaning of light chain are light chains with a lambda variable region (V-lambda) linked to a kappa constant region (C-kappa) or a kappa variable region (V-kappa) linked to a lambda constant region (C-lambda).
- reporter refers to a gene or protein that can be attached to a regulatory sequence of another gene or protein of interest, so that upon expression in a host cell or organism, the reporter can confer certain characteristics that can be relatively easily selected, identified and/or measured. Reporter genes are often used as an indication of whether a certain gene has been introduced into or expressed in the host cell or organism.
- reporter examples include: antibiotic resistance genes, auxotropic markers, ⁇ -galactosidase (encoded by the bacterial gene lacZ), luciferase (from lightning bugs), chloramphenicol acetyltransferase (CAT; from bacteria), GUS ( ⁇ -glucuronidase; commonly used in plants) and green fluorescent protein (GFP; from jelly fish).
- Reporters or markers can be selectable or screenable.
- a selectable marker e.g., antibiotic resistance gene, auxotropic marker
- a selectable marker is a gene confers a trait suitable for artificial selection; typically host cells expressing the selectable marker is protected from a selective agent that is toxic or inhibitory to cell growth.
- a screenable marker e.g., gfp, lacZ generally allows researchers to distinguish between wanted cells (expressing the marker) and unwanted cells (not expressing the marker or expressing at insufficient level).
- Nucleic acid means at least two nucleotides, either deoxyribonucleotides or ribonucleotides, or analogs thereof, covalently linked together.
- Polynucleotides are polymers of any length, including, e.g., 20, 50, 100, 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc.
- a polynucleotide described herein generally contains phosphodiester bonds, although in some cases, nucleic acid analogs are included that may have at least one different linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages, and peptide nucleic acid backbones and linkages.
- linkage e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages, and peptide nucleic acid backbones and linkages.
- polynucleotides a gene or gene fragment, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, cRNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers.
- a polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer.
- sequence of nucleotides may be interrupted by non-nucleotide components.
- a polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
- the term also includes both double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of this invention that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form.
- a polynucleotide is composed of a specific sequence of four nucleotide bases: adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U) for thymine when the polynucleotide is RNA.
- polynucleotide sequence is the alphabetical representation of a polynucleotide molecule.
- a particular polynucleotide sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences as well as the sequence explicitly indicated.
- degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues.
- operably linked refers to a juxtaposition of two or more components, wherein the components so described are in a relationship permitting them to function in their intended manner.
- a promoter and/or enhancer is operably linked to a coding sequence if it acts in cis to control or modulate the transcription of the linked sequence.
- the DNA sequences that are “operably linked” are contiguous and, where necessary to join two protein coding regions or in the case of a secretory leader, contiguous and in frame.
- an operably linked promoter is generally located upstream of the coding sequence, it is not necessarily contiguous with it.
- a polyadenylation site is operably linked to a coding sequence if it is located at the downstream end of the coding sequence such that transcription proceeds through the coding sequence into the polyadenylation sequence.
- Linking is accomplished by recombinant methods known in the art, e.g., using PCR methodology, by annealing, or by ligation at convenient restriction sites. If convenient restriction sites do not exist, then synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice.
- peptide refers to polymers of amino acid residues. These terms also apply to amino acid polymers in which one or more amino acid residues is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those containing modified residues, and non-naturally occurring amino acid polymers.
- polypeptide encompasses an antibody or a fragment thereof.
- recombinant polynucleotide or nucleic acid herein is meant a polynucleotide or nucleic acid not normally found in its natural environment, e.g., any nucleic acid comprising at least two sequences that are not present together in nature.
- a recombinant nucleic acid may be generated in vitro, for example by using the methods of molecular biology, or in vivo, for example by insertion of a nucleic acid at a novel chromosomal location by homologous or non-homologous recombination.
- Recovering is used herein to mean a crude separation of a desired species from the rest of the pool which are not desired.
- “Restriction endonucleases” or “restriction enzymes” are enzymes that bind to a double-stranded nucleic acid (e.g., DNA) at one site, referred to as the recognition site, and make a single double stranded cut outside of the recognition site.
- the double stranded cut referred to as the restriction site or cleavage site, is generally situated within or at short distances away from the recognition site. Some restriction sites occur frequently in DNA (e.g., every several hundred base pairs, others much less frequently (rare-cutter; e.g., every 10,000 base pairs).
- the recognition site is generally about 4-8 bp long.
- restriction sites are “compatible” if, once cleaved by appropriate restriction enzymes, can be ligated by a DNA ligase.
- the compatible restriction sites include those double-stranded sequences that, once cleaved by appropriate restriction enzymes, generate “sticky ends” with complementary overhang sequences that can be joined by a DNA ligase.
- Screening used herein refers to the method in which a pool comprising the desired species is subject to an assay in which the desired species can be detected, and subsequently an aliquot of the pool in which the desired species is detected and optionally enriched is recovered or obtained.
- “Surface protein” or “surface peptide” refers to an amino acid sequence that confers the ability of a polypeptide to be associated with and/or presented on the surface of a host, such as phages, yeasts, bacteria and mammalian cells.
- the surface protein can be a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, a cell surface tether domain, or an adapter, or a truncation or derivative thereof.
- Surface proteins can be used for phage display, yeast display, bacterial display or mammalian display, or shuttling display between different hosts.
- transcription refers to the synthesis of RNA from a DNA template; the term “translation” refers to the synthesis of a polypeptide from an mRNA template. Transcription and translation collectively are known as “expression.”
- transfected or “transformed” or “transduced” as used herein refers to a process by which exogenous nucleic acid is transferred or introduced into the host cell.
- a transformed cell includes the primary subject cell and its progeny.
- the host cell can be bacteria, yeasts, mammalian cells, and plant cells.
- variable region of an antibody refers to the variable region of the antibody light chain or the variable region of the antibody heavy chain, either alone or in combination.
- the variable regions of both the light (VL) and heavy (VH) chain portions determine antigen recognition and specificity.
- VL and VH each consist of four framework regions (FR) connected by three complementarity determining regions (CDRs) also known as hypervariable regions.
- the CDRs complement an antigen's shape and determine the antibody's affinity and specificity for the antigen.
- CDRs there are at least two techniques for determining CDRs: (1) an approach based on cross-species sequence variability (the Kabat numbering scheme; see Kabat et al., Sequences of Proteins of Immunological Interest (5th ed., 1991, National Institutes of Health, Bethesda Md.)); and (2) an approach based on crystallographic studies of antigen-antibody complexes (the Chothia numbering scheme which corrects the sites of insertions and deletions (indels) in CDR-L1 and CDR-H1 suggested by Kabat; see Al-lazikani et al. (1997) J. Molec. Biol. 273:927-948)). Other numbering approach or scheme can also be used.
- a CDR may refer to CDRs defined by either approach or by a combination of both approaches or by other desirable approaches.
- a new definition of highly conserved core, boundary and hyper-variable regions can be used.
- vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- a vector includes any genetic element, such as a plasmid, phage vector, phagemid, transposon, cosmid, chromosome, artificial chromosome, episome, virus, virion, etc., capable of replication when associated with the proper control elements and which can transfer gene sequences into or between hosts.
- One type of vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication.
- Another type of vector is an integrative vector that is designed to recombine with the genetic material of a host cell.
- Vectors may be both autonomously replicating and integrative, and the properties of a vector may differ depending on the cellular context (i.e., a vector may be autonomously replicating in one host cell type and purely integrative in another host cell type).
- Vectors generally contain one or a small number of restriction endonuclease recognition sites and/or sites for site-specific recombination. A foreign DNA fragment may be cleaved and ligated into the vector at these sites.
- the vector may contain a marker suitable for use in the identification of transformed or transfected cells. For example, markers may provide antibiotic resistant, fluorescent, enzymatic, as well as other traits. As a second example, markers may complement auxotrophic deficiencies or supply critical nutrients not in the culture media.
- Display technology is used to present different recombinant polypeptides on the surface of a host such as phages, yeasts, bacteria and mammalian cells.
- Various display vectors are used to express and display an insert in the vector as part of a fusion protein on the surface of the host. By exposing a plurality of such fusion proteins to a target, various in vitro selection processes generally known in the art can then be used to identify affinity binders to the target.
- the first step of biopanning is to construct display libraries.
- the diversity or size of the library can be 10 6 , 10 7 , 10 8 , 10 9 , 10 10 , 10 11 , 10 12 , 10 13 , 10 14 , 10 15 , or more or less different members (e.g., polynucleotides).
- Library construction involves inserting desired genes or reading frames into display vectors described herein.
- the inserts can encode an antibody fragment.
- the inserts can be a plurality of random mutations or degenerate oligonucleotides designed to introduce variations/mutations into an antibody fragment (for example, each CDR region).
- the inserts can also be flanked by two restriction sites to facilitate cloning. The restriction sites can be selected to achieve high efficiency cloning (e.g., above about 80%, above about 85%, above about 90%, or above about 9%).
- the display library can be constructed using a vector containing a recombinant polynucleotide of the present invention.
- the recombinant polynucleotide can include from 5′ to 3′: a first nucleic acid sequence (or insert) encoding an amino acid or polypeptide sequence to be displayed on a surface; a first pre-selected restriction site; a second nucleic acid sequence encoding a surface protein for display; and a second pre-selected restriction site. These components can be operably linked together and to the vector.
- the first and second restriction sites produce compatible sticky ends.
- the first and second restriction site can be selected to each encode amino acids that do not interfere with binding affinity of the amino acid or polypeptide sequence to a target and/or display of the surface protein.
- the first and second restriction sites can be selected according to one or more of the following criteria:
- the first and second restriction site can be selected to meet all of the criteria above.
- sites include XbaI, NcoI, SalI and XhoI sites.
- the first restriction site is XbaI site.
- the second restriction site is XbaI site.
- the first nucleic acid sequence can be an open reading frame and can be engineered in-frame with the second nucleic acid sequence such that a fusion protein expressing both sequences can be produced upon expression. Such fusion protein can then be presented on a surface of a host.
- the second nucleic acid sequence can be selected to encode a surface protein such as a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, or a cell surface tether domain, or a truncation or derivative thereof.
- a surface protein such as a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, or a cell surface tether domain, or a truncation or derivative thereof.
- gene III of filamentous phage M13, or a truncation or derivative thereof can be used. Accordingly, the surface protein enables phage display, yeast display, bacterial display or mammalian display, or shuttling display therebetween.
- a display vector for high-throughput conversion into expression vector can contain the recombinant polynucleotide described above, as well as a fusion tag sequence 3′ to the second restriction site.
- the fusion tag sequence can be engineered such that upon (a) removal of the second nucleic acid sequence by cleaving the first and second restriction site, and (b) religation of the compatible sticky ends produced therefrom, the first nucleic acid sequence is in-frame with the fusion tag sequence.
- Exemplary fusion tag sequence include one or more of: an alkaline phosphatase tag, an AviTag, a cutinase tag, a halotag, a flag tag, a c-myc tag, a histidine tag, a GST tag, a green fluorescent protein tag, an HA tag, an E-tag, a Strep tag, a Strep tag II, a YoI 1/34 tag, and other tags known by one of ordinary skill in the art. Fusion tag sequences can be operably linked to other elements in the vector. In some embodiments, specific restriction sites can be selected to flank the fusion tag sequences, to facilitate exchange and/or cloning. For example, as shown in FIG.
- a FlagHis6 tag is used in the Fad22 phagemid.
- the FlaHis6 tag is flanked by a SalI site (e.g., upstream) and an XbaI site (e.g., downstream).
- the sites can be engineered through silent mutagenesis.
- the SalI site is placed immediately downstream of and is the closest restriction site to the C-terminus of the CH1 domain. These sites can facilitate the convenient exchange of tags downstream of CH1.
- Many other restriction sites, in addition to SalI or in replacement of SalI can also be introduced downstream and outside of the CH1 ORF to facilitate introduction and removal of various vector elements.
- each display vector can contain a unique first nucleic acid sequence that encode a unique peptide or antibody fragment.
- the next step of biopanning is the capturing or panning step where the displayed peptide binds to a desired target.
- Panning utilizes the binding interactions between an affinity binder presented by the host with its target, so that only specific peptides that have a binding affinity are bound to the target.
- the target is usually attached to or fixed in a solid phase, either directly or indirectly.
- antibodies presented by bacteriophage can be selected with coated antigen in microtiter plates.
- a mild washing step is generally used to wash away the unbound hosts that do not present affinity binders on their surfaces; only the bound hosts with desired affinity remain.
- the final step involves an elution or recovering step where the bound hosts are eluted through protease cleavage, changing of pH or other environmental conditions.
- the end result is that specific peptides displayed on the bound hosts are collected and analyzed.
- the cycle can occur many times (e.g., with increasingly stringent wash conditions) to screen for strong affinity binders to the target.
- markers or reporters can facilitate selection.
- One commonly used reporter is the GFP protein, where affinity binders can be selected through fluorescence-activated cell sorting (FACS).
- the corresponding polynucleotide needs to be expressed, sequenced, and/or further characterized.
- display vectors can be converted to expression vectors in a simple cleavage-religation step, eliminating the need to subclone the polynucleotide of interest.
- the present invention provides an attractive alternative for more efficient and faster downstream evaluation of affinity binders.
- phagemid vectors e.g., M13 phage
- coat proteins e.g., GP3, GP8 or GP7
- yeast outer wall proteins e.g., bacterial outer membrane proteins
- cell surface tether domains or adapters e.g., cell surface tether domains or adapters.
- antibody fragments including, for example, Fab, Fab′ Fd, Fd′, Fv, dAb, isolated CDR regions, F(ab′) 2 , and scFv.
- antibody fragments identified using the vectors and methods of the present invention can be used to construct various forms of antibodies, such as humanized antibodies, chimeric antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
- polypeptide such as an antibody
- desired properties include, for example, specific binding to a partner, higher binding affinity to a binding partner, antibody-dependent cellular cytotoxicity (ADCC), complement-dependent cytotoxicity (CDC), agonist or antagonist functions, induction or inhibition of apoptosis, angiogenesis, proliferation, activation of inhibition of signaling pathway.
- ADCC antibody-dependent cellular cytotoxicity
- CDC complement-dependent cytotoxicity
- agonist or antagonist functions induction or inhibition of apoptosis, angiogenesis, proliferation, activation of inhibition of signaling pathway.
- Multiple properties may be screened simultaneously or individually. Assay methods for these desired properties are known in the art.
- the phage display method is a technology used for studying interactions of proteins. This technology is based on expressing on the surface of a phage (display) the binding protein of interest and selecting said binding protein on its capacity to form a complex with a binding partner.
- a phage display
- a sequence encoding a binding protein of interest is inserted into said phage genome.
- the sequence insertion is localized next to a gene encoding a protein forming the coat protein complex of the phage.
- Said coat is composed of different proteins, such as GP3 and GP8 proteins which are the most commonly used.
- the insertion of a sequence of interest next to the gene encoding these coat proteins enables the fusion of the binding protein of interest to the coat protein of the phage.
- the recombinant phage i.e., a phage vector
- infects bacteria, and its genome is replicated therein.
- the expression of the recombinant phage genome leads to the production of phages expressing on their surface the heterologous binding protein to be screened, in which all copies of the coat protein display the heterologous protein (i.e., the heterologous protein is displayed in the multiple on a given phage, or multivalently).
- different proteins or molecules referred to as targets or binding partners, are brought into contact with said protein of interest.
- targets or binding partners are brought into contact with said protein of interest.
- the complex is purified and the nucleotidic sequence encoding the binding protein of interest can then be determined from the recombinant phage genome.
- Exemplary phage vectors include M13IX30, M13IX11, M13IX34, M13IX13 or M13IX60 described in International Patent Publication No. WO92/06204, and any derived phage vectors such as the vector 668-4 used in a method for generating multivalent display library described in the U.S. Pat. No. 6,057,098. Although most phage display methods have used filamentous phage, lambdoid phage display systems (WO 95/34683; U.S. Pat. No.
- T4 phage display systems (Ren et al., Gene, 215: 439 (1998); Zhu et al., Cancer Research, 58(15): 3209-3214 (1998); Jiang et al., Infection & Immunity, 65(11): 4770-4777 (1997); Ren et al., Gene, 195(2):303-311 (1997); Ren, Protein Sci., 5: 1833 (1996); Efimov et al., Virus Genes, 10: 173 (1995)) and T7 phage display systems (Smith and Scott, Methods in Enzymology, 217: 228-257 (1993); U.S. Pat. No. 5,766,905) are also known.
- the second phage display method combines the use of phagemid (display) vectors and helper phages for the generation of libraries of phages expressing on their surface a binding protein of interest.
- Phagemid vectors comprise the sequence encoding the binding protein of interest and phage sequences, especially the sequence encoding the coat protein to be fused with the binding protein of interest (i.e., the displayed protein fusion gene), which is cloned into a small plasmid.
- Phagemid vectors are also constituted of different functional sequences for the replication of the phage genome (e.g., phage replication origin such as Ff or f1 origin) and the maintenance of the vector in the host cell (e.g., plasmid replication origin such as pBR322 or pMB1).
- the phagemid vectors do not contain the whole phage genome, which is why this method is combined with the use of a helper phage for the production of phages expressing binding proteins as a fusion on their surface.
- the helper phage upon infection of the host cell (e.g., E. coli ), enables the replication and packaging of the phage genome by complementing proteins from the complete phage genome that is absent in the phagemid.
- helper phage includes M13K07 from New England Biolabs and R408 and VCSM13 from Stratagene, which typically have mutations that reduce packaging efficiency, to ensure phagemid genomes are preferentially packaged.
- engineered helper phage with complement adapter or adapters to prevent contamination from unrelated phage or phagmid particles As a result of the use of helper phages, all wide-type phage proteins from the helper phage genome, as well as a small amount of the fusion protein encoded by the phagemid are expressed, so that phage particles extruded by the cells contain both proteins, usually with the wild-type in considerable excess.
- coli harboring a GP3 phagemid display vector and infected with helper phage will exhibit a Poisson distribution of fusion protein expression: 10% or less of the particles will display one copy of the fusion protein; a very small percentage will display two copies; and the remaining majority of the particles will display only wild-type GP3 and no fusion protein.
- the main displaying species is monovalent, while most particles do not display at all.
- the valency of display is important primarily due to its impact on the ability to discriminate binders of differing affinities.
- multivalent display prevented the highest-affinity clones in a selection form being identified, because multivalency conferred a high apparent affinity (avidity) on weakObinding clones.
- Monovalent display allows selection based on pure affinity, and is therefore generally preferred for the many studies where the aim is to identify the tightest binding variants from a library.
- the initial selectants are of law affinity for example, the de novo selection of peptides that bind a given target multivalency increases the changes of isolating rare and weakly binding clones.
- a frequently used strategy is to start with multivalent display, and then move to monovalent display as the affinity of the displayed peptides matures.
- helper phage can be eliminated by using the bacterial packaging cell line technology, which is described in detail in U.S. Pat. No. 8,227,242, the entire disclosure incorporated herein by reference.
- sorting phage libraries of mutants also requires a strategy for constructing and propagating a large number of variants, a procedure for affinity purification using the target receptor, and a means of evaluating the results of binding enrichments. See U.S. Pat. Nos. 5,223,409; 5,403,484; 5,571,689; and 5,663,143.
- phage display libraries have been used to analyze and control bimolecular interactions (WO 98/20169; WO 98/20159) and properties of constrained helical peptides (WO 98/20036).
- WO 97/35196 describes a method of isolating an affinity ligand in which a phage display library is contacted with one solution in which the ligand will bind to a target molecule and a second solution in which the affinity ligand will not bind to the target molecule, to selectively isolate binding ligands.
- WO 97/46251 describes a method of biopanning a random phage display library with an affinity purified antibody and then isolating binding phage, followed by a micropanning process using microplate wells to isolate high affinity binding phage.
- WO 97/47314 describes the use of substrate subtraction libraries to distinguish enzyme specificities using a combinatorial library which may be a phage display library.
- a method for selecting enzymes suitable for use in detergents using phage display is described in WO 97/09446. Additional methods of selecting specific binding proteins are described in U.S. Pat. Nos. 5,498,538 and 5,432,018, and WO 98/15833.
- adaptor directed display and cross-species display including but not limited to those described in Wang et al, Adapter-Directed Display: A Modular Design for Shuttling Display on Phage Surfaces, J. Mol. Biol (2010) 395, 1088-1101 and Wang et al., Yeast surface display of antibodies via the heterodimeric interaction of two coiled-coil adapters, Journal of Immunological Methods (2010) 354, 11-19, can also be used in the present invention.
- phage display is achieved through fusions of an antibody fragment to one of the coat proteins on M13 phage, such as GP3, GP8, or GP7.
- a phagemid named Fad22 is constructed, where the GP3 C-terminal (“CT”) fragment is used instead of full-length GP3 protein.
- CT C-terminal
- restriction sites can be selected to flank the GP3 CT fragment.
- sites include XbaI, NcoI, SalI and XhoI sites.
- the first restriction site is XbaI site.
- the second restriction site is XbaI site.
- a desired affinity binder e.g., an antibody fragment
- a target can be identified through screening and recovering steps such as the biopanning process described above.
- a host e.g., E. coli
- the first approach is a genetic approach, in which an amber stop codon (TAG) is inserted between the gene encoding the antibody fragment and that encoding the GP3 fragment.
- TAG amber stop codon
- a genetic allele SupE suppresses the amber stop codon and insert an amino acid (usually a glutamine) instead, thus allowing surface display of the antibody fragment through the GP3 fragment.
- the bacterial translation machinery ribosome stops at the amber stop codon between the antibody fragment and the GP3 fragment, which results in the synthesis and export to bacterial periplasm of the antibody fragment, but not its fusion with GP3 fragment.
- the advantage of this approach is that no further manipulation of display vector is required. Instead, the antibody fragment can be expressed in E. coli by simply moving the same display vector from an amber-suppressing strain to a non-suppressing strain.
- the second “cleavage-religation” approach differs from the above-mentioned approach in that, after the display vector is extracted from E. coli , the vector is digested with a specific restriction enzyme to release the GP3 fragment, and then the vector is self-ligated using T4 DNA ligase, followed by transformation into specific competent E. coli strains for expression.
- Pershad et al. (Anal. Biochem. 412 (2011): 210-216) used MfeI, an infrequently used and relatively expensive restriction enzyme to excise GP3 coding sequence from phage display vector.
- MfeI enzyme is also known to display star activity (NEB), and its High Fidelity version MfeI-HF reduces but does not eliminate the star activity.
- MfeI restriction site, CAATTG encodes Gln and Leu, and it is known that Gln is susceptible to posttranslational modifications such as deamidation, which may adversely affect the structure and function of the affinity binder.
- the vector of the present invention utilizes restriction site having one or more of the following advantages:
- display vectors can be converted to expression vectors in a high-throughput, high-efficiency fashion.
- a method of converting a display vector to an expression vector includes: providing the display vector described herein; cleaving the first and second restriction site with corresponding restriction endonuclease thereto, thereby producing compatible sticky ends; and religating the compatible sticky ends to produce an expression vector in which the first nucleic acid sequence is in-frame with the fusion tag sequence.
- the method can further include additionally cleaving within the second nucleic acid sequence to increase religation efficiency in the religating step. Such cleaving may produce blunt ends that are less likely to ligate than sticky ends.
- the product from the cleaving step can be diluted before the religating step, to increase religation efficiency.
- the dilution can be with water, e.g., 1:10 or 1:20 or 1:100 with water, to facilitate intramolecular interaction (i.e., vector religation) over intermolecular interactions (i.e., ligation between the vector and the GP3 fragment).
- the expression vector can be introduced into a host, to further characterize the first nucleic acid sequence, including expressing a corresponding peptide in the host or subjecting the first nucleic acid sequence to sequencing.
- the providing, cleaving, religating and introducing steps can be completed in less than 12 hours, less than 8 hours, or less than 4 hours.
- a plurality of (e.g., hundreds or thousands) display vectors can be converted to a plurality of expression vectors in parallel, and the plurality of expression vectors can be introduced into a population of hosts.
- methods of the present invention can be performed in a high-throughput fashion.
- the conversion rate from the plurality of display vectors to the plurality of expression vectors can be higher than 80%, higher than 85%, higher than 90%, higher than 92%, higher than 95% or higher than 98%. Because of the near 100% conversion rate, parallel processing of hundreds or thousands or millions of individual colonies can be achieved (e.g., in individual 96-well plates), eliminating the need to further characterize each single colony and enhancing throughput dramatically.
- a method of identifying an affinity binder to a target is additionally provided by the present invention.
- the method can include screening a population of first hosts each containing the display vector described herein, to obtain a subpopulation of first hosts having binding affinity to a target. Each first host displays, on a surface thereof, an amino acid sequence encoded by a unique first nucleic acid sequence in the display vector.
- the subpopulation of first hosts each display an affinity binder to the target.
- the method also includes converting display vectors isolated from the subpopulation of first hosts to expression vectors by cleaving the first and second restriction site to remove the second nucleic acid sequence and religating the compatible sticky ends produced therefrom.
- the method additionally includes introducing the expression vectors into a population of second hosts, to further characterize the affinity binder.
- the method is performed in a high-throughput fashion.
- the conversion rate of the converting step can be higher than 80%, higher than 85%, higher than 90%, higher than 92%, higher than 95% or higher than 98%
- XbaI restriction sites were engineered to flank the GP3 C-terminal fragment (labeled “GP3CT”).
- One XbaI site (XbaI (4116)) is placed upstream of the GGT codon 253 (encoding Gly) of GP3, while the other XbaI site (XbaI (3645)) is positioned downstream of two stop codons (TGA TAA) placed after the TCT codon 406 (encoding Ser) of GP3.
- the XbaI restriction site is chosen because it fulfills the following criteria:
- the Fad22 phagemid contains two origins, one or phage replication (f1 ori) and the other for plasmid replication (pMB1 ori).
- Fad 22 also contains a Bla marker gene which encodes b-lactamasc and confers resistance against b-lactam antibiotics (penicillin, ampicillin, etc.).
- a lac promoter (Plac) is placed upstream of the fusion gene (of an antibody fragment encoding gene and the GP3CT gene).
- the antibody fragment (e.g., Fab) encoding gene can comprise various domains of interest, such as V ⁇ 1, CL, VH3 and CH1 domains as shown in FIG. 1 . These domains can be arranged in any suitable order for expression of the antibody fragment.
- Each domain has one or more mutations introduced therein, producing a library of mutants.
- Two ribosome binding sites (RBS1 and RBS2) and two secretion signal sequences (SS1 and SS2) are also included in Fad22, upstream of V ⁇ 1-CL and VH3-CH1, respectively, to facilitate translation and secretion thereof.
- DNA fragments after 20-fold overdigestion of Fad22 with XbaI, more than 95% of the DNA fragments can be ligated and recut.
- Additional enzymes that can be used include NcoI, SalI and XhoI.
- Conversion efficiency is further improved by using a combination of two restriction enzymes such as XbaI and XmnI.
- the XmnI (GAATAATTTC) site is present in the gene encoding GP3 C-terminal fragment, and digestion by XmnI restriction enzyme yields a blunt end that is more difficult to ligate than sticky ends (as produced by XbaI).
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Physics & Mathematics (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Ecology (AREA)
- Immunology (AREA)
- Gastroenterology & Hepatology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
- This application pertains to the construction and screening of display libraries, particularly the design and use of vectors therefor.
- High throughput screening for affinity binders, such as antibodies, that bind specifically to a target, such as antigens, was made possible by surface display technologies, including phage display, ribosomal/mRNA display, yeast display and mammalian display. Various display vectors have been developed such that upon expression, libraries of peptides or proteins can be displayed on the surface of phage, bacteria, yeast or mammalian cells. Affinity binders can then be identified through library screening and selection processes such as phage panning or fluorescence-activated cell sorting (FACS). Thereafter, clones corresponding to the affinity binders must be further characterized to ascertain the identity of the binders, as well as for downstream applications.
- Conventionally, inserts from binding clones are transferred or subcloned into expression vectors, either individually or en masse, such that the inserts can be expressed, purified and/or further characterized. However, this subcloning process for transferring inserts from display vectors to expression vectors is time consuming, laborious, and low throughput. In addition, the efficiency of subcloning is low and unpredictable, varying from about 50% to 80%. Thus, a need exists for a fast, inexpensive and high-throughput method to accelerate the affinity binder identification and characterization processes.
- Aspects of the invention relate to the design and use of vectors for high-throughput conversion, e.g., from display to expression. In some embodiments, the design requires judicious engineering of restriction enzyme sites at specific sequence locations, and employment of combinations of restriction enzymes and DNA ligases to facilitate the conversion. Various libraries can be constructed using vectors of the present invention to enable high-throughput library screening, binder selection/recovery, and binder characterization. Upon implementation of such high-throughput processes, it has been surprisingly discovered that a high conversion rate (e.g., over 90%, over 95%, or close to 100%) can be achieved, thereby greatly accelerating the binder discovery and characterization processes.
- In one aspect, a recombinant polynucleotide suitable for use to construct various vectors is provided. The recombinant polynucleotide comprises from 5′ to 3′: a first nucleic acid sequence encoding an amino acid sequence to be displayed on a surface; a first restriction site selected from the group consisting of XbaI, NcoI, SalI and XhoI sites; a second nucleic acid sequence encoding a surface peptide capable of being displayed on the surface; and a second restriction site selected from the group consisting of XbaI, NcoI, SalI and XhoI sites. In various embodiments, the first nucleic acid sequence is engineered in-frame with the second nucleic acid sequence. The first and second restriction sites, when cleaved by corresponding restriction endonuclease thereto, produce compatible sticky ends.
- In some embodiments, the first nucleic acid sequence encodes an antibody fragment such as Fab. The second nucleic acid sequence may encode a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, a cell surface tether domain, or an adapter, or a truncation or derivative thereof. For example, the second nucleic acid sequence can be gene III of filamentous phage M13, or a truncation or derivative thereof. The second nucleic acid sequence can also encode an adapter capable of binding to a binding partner, wherein said binding partner is expressed as a fusion and directly displayed on the surface. Correspondingly, the surface peptide can be for phage display, yeast display, bacterial display or mammalian display, or shuttling display between different hosts (e.g., via adapters). In various embodiments, when expressed, the surface peptide and the amino acid sequence encoded by the first nucleic acid sequence are displayed as a fusion protein on the surface.
- In certain embodiments, the first and second restriction site each encode amino acids that do not interfere with binding affinity of the amino acid sequence, and/or display of the surface peptide or fusion protein. In some embodiments, the first and/or second restriction site is XbaI site.
- In another aspect, a display vector for high-throughput conversion into expression vector is provided. The display vector comprises the recombinant polynucleotide described herein and a fusion tag sequence 5′ to the first restriction site or 3′ to the second restriction site. In some embodiments, when the fusion tag sequence is 3′ to the second restriction site, it is engineered such that upon (a) removal of the second nucleic acid sequence by cleaving the first and second restriction site, and (b) religation of the compatible sticky ends produced therefrom, the first nucleic acid sequence is in-frame with the fusion tag sequence. The fusion tag sequence can be selected from one or more of: an alkaline phosphatase tag, an AviTag, a cutinase tag, a halotag, a flag tag, a c-myc tag, a histidine tag, a GST tag, a green fluorescent protein tag, an HA tag, an E-tag, a Strep tag, a Strep tag II and a YoI 1/34 tag. Such display vector can be provided in a library of display vectors in which each display vector has a unique first nucleic acid sequence, thereby forming a library for selection.
- In a further aspect, a method of converting a display vector to an expression vector is provided. The method includes: providing the display vector described herein; cleaving the first and second restriction site with corresponding restriction endonuclease thereto, thereby producing said compatible sticky ends; and religating the compatible sticky ends to produce an expression vector in which the first nucleic acid sequence is in-frame with the fusion tag sequence.
- In some embodiments, the method can further include cleaving within the second nucleic acid sequence to increase religation efficiency in the religating step. In certain embodiments, before the religating step, a product from the cleaving step can be diluted to increase intramolecular ligation.
- In various embodiments, after the religating step, the expression vector produced therefrom can be introduced into a host, to further characterize the first nucleic acid sequence. Exemplary further characterization includes sequencing the first nucleic acid sequence and/or expressing the first nucleic acid sequence.
- In various embodiments, the method is a high-efficiency method. For example, the providing, cleaving, religating and introducing steps can be completed in less than 12 hours, in less than 8 hours, or in less than 4 hours.
- The method can be performed in a high-throughput manner where a plurality of display vectors can be converted to a plurality of expression vectors in parallel. Thereafter, the plurality of expression vectors can be introduced into a population of hosts for further characterization. Such high-throughput conversion can have a conversion rate (from the plurality of display vectors to the plurality of expression vectors) that is higher than 90%, higher than 95%, or higher than 98%.
- In yet another aspect, a method of identifying an affinity binder to a target is provided. The method includes: screening a population of first hosts each containing the display vector described herein, to obtain a subpopulation of first hosts having binding affinity to a target, wherein each first host displays, on a surface of said first host, the amino acid sequence encoded by a unique first nucleic acid sequence in the display vector, and wherein said subpopulation of first hosts each display an affinity binder to said target; converting display vectors isolated from said subpopulation of first hosts to expression vectors by cleaving the first and second restriction site to remove the second nucleic acid sequence and religating the compatible sticky ends produced therefrom; and introducing said expression vectors into a population of second hosts, to further characterize the affinity binder. In various embodiments, the method can be performed in a high-throughput fashion, wherein the converting step can have a conversion rate that is higher than 90%, higher than 95%, or higher than 98%.
- Also provided are libraries and kits constructed using the vectors described herein and for carrying out the methods described herein. Further provided are sublibraries generated during the screening process and affinity binders obtained from methods described herein.
-
FIG. 1 provides a schematic restriction map showing an exemplary vector of the present invention. -
FIG. 2 provides a gel electrophoresis image showing the conversion rate of an exemplary vector at different concentrations. One restriction enzyme, XbaI was used. -
FIG. 3 provides a gel electrophoresis image showing the conversion rate of an exemplary vector at different concentrations. Two restriction enzymes, XbaI and XmnI were used. - This invention relates to a high-throughput process which integrates affinity binder discovery with downstream characterization thereof. Such high-throughput process has been surprisingly shown to save tremendous amount of time and effort. As a result, the entire process from library screening, binder selection, to binder characterization can be streamlined, while dramatically accelerating the pace of affinity binder discovery and characterization. In practice, the high-throughput process of the present invention is commercially desirable in affinity reagents discovery, and can be applied to various display platforms, including phage display, bacterial display, yeast display, mammalian display, as well as other display systems.
- It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the compositions and methods described herein. In this application, the use of the singular includes the plural unless specifically stated otherwise. Also, the use of “or” means “and/or” unless state otherwise. Similarly, “comprise,” “comprises,” “comprising,” “include,” “includes” and “including” are not intended to be limiting. It is understood that aspects and embodiments of the invention described herein include “consisting” and/or “consisting essentially of” aspects and embodiments.
- For convenience, certain terms employed in the specification, examples, and appended claims are collected here. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
- As used herein, the following terms and phrases are intended to have the following meanings:
- The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.
- By “a population of hosts” is meant a group of hosts into which a library of polynucleotides can be introduced and displayed. The host can be phages, yeasts, bacteria or mammalian cells. In some embodiments, a population of cells from a monoculture, i.e., wherein each cell in the population is of the same cell type can be used. Alternatively, mixed cultures of cells can also be used. Cells may be adherent, i.e., cells which grow attached to a solid substrate, or, alternatively, the cells may be in suspension. Mammalian cells may be cells derived from primary tumors, cells derived from metastatic tumors, primary cells, cells which have lost contact inhibition, transformed primary cells, immortalized primary cells, cells which may undergo apoptosis, and cell lines derived there from.
- As used herein, the term “about” means within 20%, more preferably within 10% and most preferably within 5%.
- The terms “affinity binder,” “binder,” and “binding protein” are used interchangeably to refer to a peptidic chain having a specific or general affinity with another protein or molecule. Proteins are brought into contact and form a complex when binding is possible. The affinity binder of the invention expressed on the surface of a host can preferably be an antibody, a fragment or derivative of an antibody, a protein or a peptide. An antibody or a peptide “affinity binds,” “specifically binds” or “preferentially binds” to an antigen or a target if it binds with greater affinity, avidity, more readily, and/or with greater duration than it binds to other substances. It is understood by reading this definition that, for example, an antibody or a peptide that specifically or preferentially binds to a first target may or may not specifically or preferentially bind to a second target. As such, “affinity binding,” “specific binding” or “preferential binding” does not necessarily require (although it can include) exclusive binding. In some embodiments, “specific binding” means that the protein exhibits appreciable affinity for a particular antigen or epitope and, generally, does not exhibit significant cross-reactivity. An antigen binding to protein that “does not exhibit significant cross-reactivity” is one that will not appreciably bind to an entity other than its target (e.g., a different epitope or a different molecule). An antigen specific protein specific for a particular epitope will, for example, not significantly cross-react with remote epitopes on the same protein or peptide. Specific binding can be determined according to any art-recognized means for determining such binding. Preferably, specific binding is determined according to Scatchard analysis and/or competitive binding assays. “Affinity” binding includes binding with an affinity of at least 106, 107, 108, 109 M−1, or 1010 M−1. Antigen binding proteins with affinities greater than 107 M−1 or 108 M−1 typically bind with correspondingly greater specificity.
- As used herein, the term “amino acid sequence” refers to a sequence of contiguous amino acid residues of any length. The terms “polypeptide,” “peptide,” “oligopeptide,” or “protein” may be used interchangeably herein with the term “amino acid sequence.”
- An “antibody” is an immunoglobulin molecule capable of specific binding to a target, such as a carbohydrate, polynucleotide, lipid, polypeptide, etc., through at least one antigen recognition site, located in the variable region of the immunoglobulin molecule. As used herein, the term encompasses not only intact polyclonal or monoclonal antibodies, but also fragments thereof (such as Fab, Fab′, F(ab′)2, Fv), single chain (ScFv), mutants thereof, naturally occurring variants, fusion proteins comprising an antibody portion with an antigen recognition site of the required specificity, humanized antibodies, chimeric antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
- “Antibody fragments” comprise only a portion of an intact antibody, generally including an antigen binding site of the intact antibody and thus retaining the ability to bind antigen. Examples of antibody fragments encompassed by the present definition include: (i) the Fab fragment, having VL, CL, VH and CH1 domains; (ii) the Fab′ fragment, which is a Fab fragment having one or more cysteine residues at the C-terminus of the CH1 domain; (iii) the Fd fragment having VH and CH1 domains; (iv) the Fd′ fragment having VH and CH1 domains and one or more cysteine residues at the C-terminus of the CH1 domain; (v) the Fv fragment having the VL and VH domains of a single antibody; (vi) the dAb fragment which consists of a VH domain; (vii) isolated CDR regions; (viii) F(ab′)2 fragments, a bivalent fragment including two Fab′ fragments linked by a disulfide bridge at the hinge region; (ix) single chain antibody molecules (e.g. single chain Fv; scFv); (x) “diabodies” with two antigen binding sites, comprising a heavy chain variable domain (VH) connected to a light chain variable domain (VL) in the same polypeptide chain; (xi) “linear antibodies” comprising a pair of tandem Fd segments (VH-CH1-VH-CH1) which, together with complementary light chain polypeptides, form a pair of antigen binding regions.
- The term “antibody variant” as used herein refers to an antibody with single or multiple mutations in the heavy chains and/or light chains. In some embodiments, the mutations exist in the variable region. In some embodiments, the mutations exist in the constant region.
- The term “binding partner” is used interchangeably with “target” to refer to a molecule that is recognized and binds to a peptide or protein. The binding partner can be an antigen or epitope thereof when binding to an antibody or an antibody fragment. Binding partners can be used in a screen to identify affinity binders thereto.
- “Biopanning” is an affinity selection technique which selects for peptides that bind to a given target. Biopanning involves three major steps: capturing affinity binders with a target (“panning”) using a previously constructed library, washing, and elution. Details are provided hereunder.
- A “cell surface tether domain” as used herein, refers to an amino acid sequence that confers the ability of a polypeptide to be associated with a host cell outer membrane, and which is sometimes but not always naturally present in the protein of interest. As described herein, cell surface tether domains include, for example, transmembrane domains or glycosidylphosphatidylinositol signal sequences.
- “Chimeric antibodies” refers to those antibodies wherein one portion of each of the amino acid sequences of heavy and light chains is homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular class, while the remaining segment of the chains is homologous to corresponding sequences in another. Typically, in these chimeric antibodies, the variable region of both light and heavy chains mimics the variable regions of antibodies derived from one species of mammals, while the constant portions are homologous to the sequences in antibodies derived from another. One clear advantage to such chimeric forms is that, for example, the variable regions can conveniently be derived from presently known sources using readily available hybridomas or B cells from non human host organisms in combination with constant regions derived from, for example, human cell preparations. While the variable region has the advantage of ease of preparation, and the specificity is not affected by its source, the constant region being human, is less likely to elicit an immune response from a human subject when the antibodies are injected than would the constant region from a non-human source. However, the definition is not limited to this particular example.
- As generally understood, a “codon” is a series of three nucleotides (triplets) that encodes a specific amino acid residue in a polypeptide chain or for the termination of translation (stop codons). There are 64 different codons (61 codons encoding for amino acids plus 3 stop codons) but only 20 different translated amino acids. The overabundance in the number of codons allows many amino acids to be encoded by more than one codon. Different organisms (and organelles) often show particular preferences or biases for one of the several codons that encode the same amino acid.
- A “constant region” of an antibody refers to the constant region of the antibody light chain or the constant region of the antibody heavy chain, either alone or in combination. The constant regions of the light chain (CL) and the heavy chain (CH1, CH2 or CH3, or CH4 in the case of IgM and IgE) confer important biological properties such as secretion, transplacental mobility, Fc receptor binding, complement binding, and the like. By convention the numbering of the constant region domains increases as they become more distal from the antigen binding site or amino-terminus of the antibody.
- “Degenerate sequences” are nucleic acid sequences having a length of N1 nucleosides and comprises up to 4N1 different sequences. In general, a sequence is called “degenerate” if some or all of its positions have several possible bases or substitutions. Assuming Σ={T, C, A, G} is the DNA alphabet, a sequence (e.g. a primer) can be shown as S=x1x2 . . . xl, where xi ⊂Σ, xi≠Ø and l is the length of S. The degeneracy of a sequence is the number of unique sequence combinations it contains, which can be calculated as d(S)=Πl i=1|xi|. In some embodiments, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues. The degeneracy can also be selective or constrained where only a selected subset of positions is subject to substitutions, and/or one or more or all positions are substituted with constraints by optimizing the composition and ratio of the mixed-base and/or deoxyinosine residues.
- “Display” refers to presentation of different recombinant polypeptides on the surface of a host such as phages, yeasts, bacteria and mammalian cells.
- As used herein, the term “display vector” refers to a plasmid or phage DNA or other DNA sequence which is able to replicate autonomously in a host, and capable of expressing and displaying an insert in the vector as part of a fusion protein on the surface of the host.
- “Diversity” of a library refers to the number of different recombinant polypeptides encoded by the polynucleotides in the library.
- The term “expression vector” refers to a vector capable of expressing of a gene or any open reading frame that has been cloned into it. Such expression can occur after transformation into a host cell, or in in vitro systems. The cloned DNA or insert is usually operably linked to one or more regulatory sequences, such as promoters, activator/repressor binding sites, terminators, enhancers and the like.
- The term “fuse,” “fusion” or “link” refers to the covalent linkage between two polypeptides in a single protein. The polypeptides are typically joined via a peptide bond, either directly to each other or via an amino acid linker. Optionally, the peptides can be joined via non-peptide covalent linkages known to those of skill in the art.
- As used herein, the term “fusion tag” is a peptide or protein located either on the C- or N-terminal of the target protein, which improves one or more of: solubility, detection, purification, expression of the target protein. The fusion tag is generally engineered in-frame with the target protein. Commonly used fusion tags include but are not limited to: alkaline phosphatase tag, AviTag, cutinase tag, halotag, flag tag, c-myc tag, histidine (His) tag, GST tag, green fluorescent protein (GFP) tag, HA tag, E-tag, Strep tag, Strep tag II and YoI 1/34 tag.
- The term “heavy chain” as used herein refers to the larger immunoglobulin subunit which associates, through its amino terminal region, with the immunoglobulin light chain. The heavy chain comprises a variable region (VH) and a constant region (CH). The constant region further comprises the CH1, hinge, CH2, and CH3 domains. In the case of IgE, IgM, and IgY, the heavy chain comprises a CH4 domain but does not have a hinge domain. Those skilled in the art will appreciate that heavy chains are classified as gamma, mu, alpha, delta, or epsilon (γ, μ, α, δ, ε), with some subclasses among them (e.g., γ1-γ4). It is the nature of this chain that determines the “class” of the antibody as IgG, IgM, IgA IgG, or IgE, respectively. The immunoglobulin subclasses (isotypes), e.g., IgG1, IgG2, IgG3, IgG4, IgA1, etc. are well characterized and are known to confer functional specialization.
- A “host” is intended to include any individual virus or cell or culture thereof that can be or has been a recipient for vectors or for the incorporation of exogenous nucleic acid molecules, polynucleotides, and/or proteins. It also is intended to include progeny of a single virus or cell. The progeny may not necessarily be completely identical (in morphology or in genomic or total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation. The virus can be phage. The cells may be prokaryotic or eukaryotic, and include but are not limited to bacterial cells, yeast cells, insect cells, animal cells, and mammalian cells, e.g., murine, rat, simian, or human cells.
- “Humanized” antibodies refer to a molecule having an antigen-binding site that is substantially derived from an immunoglobulin from a non-human species and the remaining immunoglobulin structure of the molecule based upon the structure and/or sequence of a human immunoglobulin. The antigen-binding site may comprise either complete variable domains fused onto constant domains or only the complementarity determining regions (CDRs) grafted onto appropriate framework regions in the variable domains. Antigen binding sites may be wild type or modified by one or more amino acid substitutions, e.g., modified to resemble human immunoglobulin more closely. Some forms of humanized antibodies preserve all CDR sequences (for example, a humanized mouse antibody which contains all six CDRs from the mouse antibodies). Other forms of humanized antibodies have one or more CDRs (one, two, three, four, five, six) which are altered with respect to the original antibody, which are also termed one or more CDRs “derived from” one or more CDRs.
- A gene or reading frame is “in-frame” with another when the two genes or reading frames are cloned together to generate a contiguous reading frame, without disrupting the codons therein, such that when expressed, a fusion protein expressing both genes or reading frames are produced. Generally the upstream gene or reading frame is an open reading frame. A reading frame is a way of dividing the sequence of nucleotides in a nucleic acid into a set of consecutive, non-overlapping codons. An open reading frame (ORF) is a reading frame that contains a start codon, and a subsequent region which usually has a length which is a multiple of 3 nucleotides, but does not contain a stop codon in a given reading frame.
- An “insert” as used herein, is a heterologous nucleic acid sequence that is ligated into a compatible site into a vector. An insert may comprise one or more nucleic acid sequences that encode a polypeptide or polypeptides. An insert may comprise regulatory regions or other nucleic acid elements.
- An “isolated” or “purified” polypeptide or polynucleotide, e.g., an “isolated polypeptide,” or an “isolated polynucleotide” is purified to a state beyond that in which it exists in nature. For example, the “isolated” or “purified” polypeptide or polynucleotide, can be substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the protein or polynucleotide is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized. The preparation of antigen binding protein having less than about 50% of non-antigen binding protein (also referred to herein as a “contaminating protein”), or of chemical precursors, is considered to be “substantially free.” 40%, 30%, 20%, 10% and more preferably 5% (by dry weight), of non-antigen binding protein, or of chemical precursors is considered to be substantially free.
- “Library” used herein refers to a diverse collection or mixture of polynucleotides comprising polynucleotides encoding different recombinant polypeptides. In certain embodiments, a library of polynucleotides may comprise at least 106, 107, 108, 109, 1010, 1011, 1012, 1013, 1014, 1015, or more or less different polynucleotides within a given collection of polynucleotides. Typically, the different polynucleotides in the library are related through, for example, their origin from a single animal species (for example, human, mouse, rabbit, goat, horse), tissue type, organ, or cell type. A “library” may comprise polynucleotides of a common genus. For example, the genus can be polynucleotides encoding an immunoglobulin subunit polypeptide of a certain type and class, e.g., a library might encode an antibody μ, γ1, γ2, γ3, γ4, α1, α2, δ, or ε heavy chain, or an antibody κ or λ light chain. Although each member of any one library described herein may encode the same heavy or light chain constant region, the library may collectively comprise at least 106, 107, 108, 109, 1010, 1011, 1012, 1013, 1014, 1015 or more or less different variable regions, i.e., a “plurality” of variable regions associated with the common constant region. The different polynucleotides in the library can also be related through, for example, their origin from a single animal species (for example, human, mouse, rabbit, goat, horse, etc.), tissue type, organ, or cell type.
- The term “light chain” as used herein refers to the smaller immunoglobulin subunit which associates with the amino terminal region of a heavy chain. As with a heavy chain, a light chain comprises a variable region (VL) and a constant region (CL). Light chains are classified as either kappa or lambda (κ, λ). A pair of these can associate with a pair of any of the various heavy chains to form an immunoglobulin molecule. Also encompassed in the meaning of light chain are light chains with a lambda variable region (V-lambda) linked to a kappa constant region (C-kappa) or a kappa variable region (V-kappa) linked to a lambda constant region (C-lambda).
- The terms “marker” or “reporter” refer to a gene or protein that can be attached to a regulatory sequence of another gene or protein of interest, so that upon expression in a host cell or organism, the reporter can confer certain characteristics that can be relatively easily selected, identified and/or measured. Reporter genes are often used as an indication of whether a certain gene has been introduced into or expressed in the host cell or organism. Examples of commonly used reporters include: antibiotic resistance genes, auxotropic markers, β-galactosidase (encoded by the bacterial gene lacZ), luciferase (from lightning bugs), chloramphenicol acetyltransferase (CAT; from bacteria), GUS (β-glucuronidase; commonly used in plants) and green fluorescent protein (GFP; from jelly fish). Reporters or markers can be selectable or screenable. A selectable marker (e.g., antibiotic resistance gene, auxotropic marker) is a gene confers a trait suitable for artificial selection; typically host cells expressing the selectable marker is protected from a selective agent that is toxic or inhibitory to cell growth. A screenable marker (e.g., gfp, lacZ) generally allows researchers to distinguish between wanted cells (expressing the marker) and unwanted cells (not expressing the marker or expressing at insufficient level).
- “Nucleic acid,” “nucleic acid sequence,” “oligonucleotide,” “polynucleotide” or other grammatical equivalents as used herein means at least two nucleotides, either deoxyribonucleotides or ribonucleotides, or analogs thereof, covalently linked together. Polynucleotides are polymers of any length, including, e.g., 20, 50, 100, 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A polynucleotide described herein generally contains phosphodiester bonds, although in some cases, nucleic acid analogs are included that may have at least one different linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages, and peptide nucleic acid backbones and linkages. Mixtures of naturally occurring polynucleotides and analogs can be made; alternatively, mixtures of different polynucleotide analogs, and mixtures of naturally occurring polynucleotides and analogs may be made. The following are non-limiting examples of polynucleotides: a gene or gene fragment, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, cRNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component. The term also includes both double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of this invention that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form. A polynucleotide is composed of a specific sequence of four nucleotide bases: adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U) for thymine when the polynucleotide is RNA. Thus, the term “polynucleotide sequence” is the alphabetical representation of a polynucleotide molecule. Unless otherwise indicated, a particular polynucleotide sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues.
- “Operably linked” refers to a juxtaposition of two or more components, wherein the components so described are in a relationship permitting them to function in their intended manner. For example, a promoter and/or enhancer is operably linked to a coding sequence if it acts in cis to control or modulate the transcription of the linked sequence. Generally, but not necessarily, the DNA sequences that are “operably linked” are contiguous and, where necessary to join two protein coding regions or in the case of a secretory leader, contiguous and in frame. However, although an operably linked promoter is generally located upstream of the coding sequence, it is not necessarily contiguous with it. A polyadenylation site is operably linked to a coding sequence if it is located at the downstream end of the coding sequence such that transcription proceeds through the coding sequence into the polyadenylation sequence. Linking is accomplished by recombinant methods known in the art, e.g., using PCR methodology, by annealing, or by ligation at convenient restriction sites. If convenient restriction sites do not exist, then synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice.
- The terms “peptide,” “polypeptide” and “protein” used herein refer to polymers of amino acid residues. These terms also apply to amino acid polymers in which one or more amino acid residues is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those containing modified residues, and non-naturally occurring amino acid polymers. In the present case, the term “polypeptide” encompasses an antibody or a fragment thereof.
- By the term “recombinant” polynucleotide or nucleic acid herein is meant a polynucleotide or nucleic acid not normally found in its natural environment, e.g., any nucleic acid comprising at least two sequences that are not present together in nature. A recombinant nucleic acid may be generated in vitro, for example by using the methods of molecular biology, or in vivo, for example by insertion of a nucleic acid at a novel chromosomal location by homologous or non-homologous recombination. It is understood that once a recombinant polynucleotide is made and reintroduced into a host cell or organism, it will replicate non-recombinantly, e.g., using the in vivo cellular machinery of the host cell rather than in vitro manipulations; however, such nucleic acids, once produced recombinantly, although subsequently replicated non-recombinantly, are still considered recombinant for the purposes of the invention.
- “Recovering” is used herein to mean a crude separation of a desired species from the rest of the pool which are not desired.
- “Restriction endonucleases” or “restriction enzymes” are enzymes that bind to a double-stranded nucleic acid (e.g., DNA) at one site, referred to as the recognition site, and make a single double stranded cut outside of the recognition site. The double stranded cut, referred to as the restriction site or cleavage site, is generally situated within or at short distances away from the recognition site. Some restriction sites occur frequently in DNA (e.g., every several hundred base pairs, others much less frequently (rare-cutter; e.g., every 10,000 base pairs). The recognition site is generally about 4-8 bp long. Cleavage generally produces 1-6 nucleotide single-stranded overhangs, with 5′ or 3′ termini, although some enzymes produce blunt ends. Such enzymes and information regarding their recognition and cleavage sites are available from commercial suppliers such as New England Biolabs. As used herein, restriction sites are “compatible” if, once cleaved by appropriate restriction enzymes, can be ligated by a DNA ligase. In some embodiments, the compatible restriction sites include those double-stranded sequences that, once cleaved by appropriate restriction enzymes, generate “sticky ends” with complementary overhang sequences that can be joined by a DNA ligase.
- “Screening” used herein refers to the method in which a pool comprising the desired species is subject to an assay in which the desired species can be detected, and subsequently an aliquot of the pool in which the desired species is detected and optionally enriched is recovered or obtained.
- “Surface protein” or “surface peptide” refers to an amino acid sequence that confers the ability of a polypeptide to be associated with and/or presented on the surface of a host, such as phages, yeasts, bacteria and mammalian cells. The surface protein can be a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, a cell surface tether domain, or an adapter, or a truncation or derivative thereof. Surface proteins can be used for phage display, yeast display, bacterial display or mammalian display, or shuttling display between different hosts.
- As used herein, unless otherwise stated, the term “transcription” refers to the synthesis of RNA from a DNA template; the term “translation” refers to the synthesis of a polypeptide from an mRNA template. Transcription and translation collectively are known as “expression.”
- The term “transfected” or “transformed” or “transduced” as used herein refers to a process by which exogenous nucleic acid is transferred or introduced into the host cell. A transformed cell includes the primary subject cell and its progeny. The host cell can be bacteria, yeasts, mammalian cells, and plant cells.
- A “variable region” of an antibody refers to the variable region of the antibody light chain or the variable region of the antibody heavy chain, either alone or in combination. The variable regions of both the light (VL) and heavy (VH) chain portions determine antigen recognition and specificity. VL and VH each consist of four framework regions (FR) connected by three complementarity determining regions (CDRs) also known as hypervariable regions. The CDRs complement an antigen's shape and determine the antibody's affinity and specificity for the antigen. There are six CDRs in both VL and VII, The CDRs in each chain are held together in close proximity by the FRs and, with the CDRs from the other chain, contribute to the formation of the antigen-binding site of antibodies. There are at least two techniques for determining CDRs: (1) an approach based on cross-species sequence variability (the Kabat numbering scheme; see Kabat et al., Sequences of Proteins of Immunological Interest (5th ed., 1991, National Institutes of Health, Bethesda Md.)); and (2) an approach based on crystallographic studies of antigen-antibody complexes (the Chothia numbering scheme which corrects the sites of insertions and deletions (indels) in CDR-L1 and CDR-H1 suggested by Kabat; see Al-lazikani et al. (1997) J. Molec. Biol. 273:927-948)). Other numbering approach or scheme can also be used. As used herein, a CDR may refer to CDRs defined by either approach or by a combination of both approaches or by other desirable approaches. In addition, a new definition of highly conserved core, boundary and hyper-variable regions can be used.
- As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. A vector includes any genetic element, such as a plasmid, phage vector, phagemid, transposon, cosmid, chromosome, artificial chromosome, episome, virus, virion, etc., capable of replication when associated with the proper control elements and which can transfer gene sequences into or between hosts. One type of vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Another type of vector is an integrative vector that is designed to recombine with the genetic material of a host cell. Vectors may be both autonomously replicating and integrative, and the properties of a vector may differ depending on the cellular context (i.e., a vector may be autonomously replicating in one host cell type and purely integrative in another host cell type). Vectors generally contain one or a small number of restriction endonuclease recognition sites and/or sites for site-specific recombination. A foreign DNA fragment may be cleaved and ligated into the vector at these sites. The vector may contain a marker suitable for use in the identification of transformed or transfected cells. For example, markers may provide antibiotic resistant, fluorescent, enzymatic, as well as other traits. As a second example, markers may complement auxotrophic deficiencies or supply critical nutrients not in the culture media.
- Other terms used in the fields of recombinant nucleic acid technology, microbiology, immunology, antibody engineering, and molecular and cell biology as used herein will be generally understood by one of ordinary skill in the applicable arts.
- Display technology is used to present different recombinant polypeptides on the surface of a host such as phages, yeasts, bacteria and mammalian cells. Various display vectors are used to express and display an insert in the vector as part of a fusion protein on the surface of the host. By exposing a plurality of such fusion proteins to a target, various in vitro selection processes generally known in the art can then be used to identify affinity binders to the target.
- One commonly selection process is known as biopanning. The first step of biopanning is to construct display libraries. The diversity or size of the library can be 106, 107, 108, 109, 1010, 1011, 1012, 1013, 1014, 1015, or more or less different members (e.g., polynucleotides). Library construction involves inserting desired genes or reading frames into display vectors described herein. The inserts can encode an antibody fragment. For example, the inserts can be a plurality of random mutations or degenerate oligonucleotides designed to introduce variations/mutations into an antibody fragment (for example, each CDR region). The inserts can also be flanked by two restriction sites to facilitate cloning. The restriction sites can be selected to achieve high efficiency cloning (e.g., above about 80%, above about 85%, above about 90%, or above about 9%).
- In some embodiments, the display library can be constructed using a vector containing a recombinant polynucleotide of the present invention. The recombinant polynucleotide can include from 5′ to 3′: a first nucleic acid sequence (or insert) encoding an amino acid or polypeptide sequence to be displayed on a surface; a first pre-selected restriction site; a second nucleic acid sequence encoding a surface protein for display; and a second pre-selected restriction site. These components can be operably linked together and to the vector. In some embodiments, when cleaved by corresponding restriction endonuclease thereto, the first and second restriction sites produce compatible sticky ends.
- In certain embodiments, the first and second restriction site can be selected to each encode amino acids that do not interfere with binding affinity of the amino acid or polypeptide sequence to a target and/or display of the surface protein. Where the first nucleic acid sequence encodes an antibody fragment, the first and second restriction sites can be selected according to one or more of the following criteria:
-
- (1) The restriction site is rarely present in human germline Vκ, Vλ or VH;
- (2) The restriction site is not present in the library (e.g., the six CDR regions);
- (3) After cleavage, a sticky end (e.g., 4-base sticky end) is exposed. In some instances, a 4-base sticky end is expected to increase self-ligation efficiency than other sticky ends (e.g., 2-base sticky end) or blunt ends;
- (4) The restriction site does not encode any in-frame stop codons, and it encodes amino acids that are compatible with the structure of antibody fragment (for example, no Cys is allowed);
- (5) The sites are unique in the vector;
- (6) The corresponding restriction enzyme can be inactivated (e.g., at higher temperature such as 65° C.); and/or
- (7) The corresponding the restriction enzyme is inexpensive yet robust, and lacks star activity.
- For example, the first and second restriction site can be selected to meet all of the criteria above. Such sites, in the case of phagemid Fad22 exemplified herein, include XbaI, NcoI, SalI and XhoI sites. In one embodiment, the first restriction site is XbaI site. In some embodiments, the second restriction site is XbaI site. For other vectors, one of ordinary skill in the art would be able to follow the criteria above to select appropriate restriction sites.
- In certain embodiments, the first nucleic acid sequence can be an open reading frame and can be engineered in-frame with the second nucleic acid sequence such that a fusion protein expressing both sequences can be produced upon expression. Such fusion protein can then be presented on a surface of a host. To facilitate surface presentation, the second nucleic acid sequence can be selected to encode a surface protein such as a phage coat protein, a yeast outer wall protein, a bacterial outer membrane protein, or a cell surface tether domain, or a truncation or derivative thereof. For example, gene III of filamentous phage M13, or a truncation or derivative thereof, can be used. Accordingly, the surface protein enables phage display, yeast display, bacterial display or mammalian display, or shuttling display therebetween.
- In various embodiments, a display vector for high-throughput conversion into expression vector is provided. The display vector can contain the recombinant polynucleotide described above, as well as a fusion tag sequence 3′ to the second restriction site. In some embodiments, the fusion tag sequence can be engineered such that upon (a) removal of the second nucleic acid sequence by cleaving the first and second restriction site, and (b) religation of the compatible sticky ends produced therefrom, the first nucleic acid sequence is in-frame with the fusion tag sequence. Exemplary fusion tag sequence include one or more of: an alkaline phosphatase tag, an AviTag, a cutinase tag, a halotag, a flag tag, a c-myc tag, a histidine tag, a GST tag, a green fluorescent protein tag, an HA tag, an E-tag, a Strep tag, a Strep tag II, a YoI 1/34 tag, and other tags known by one of ordinary skill in the art. Fusion tag sequences can be operably linked to other elements in the vector. In some embodiments, specific restriction sites can be selected to flank the fusion tag sequences, to facilitate exchange and/or cloning. For example, as shown in
FIG. 1 , a FlagHis6 tag is used in the Fad22 phagemid. The FlaHis6 tag is flanked by a SalI site (e.g., upstream) and an XbaI site (e.g., downstream). The sites can be engineered through silent mutagenesis. In addition, the SalI site is placed immediately downstream of and is the closest restriction site to the C-terminus of the CH1 domain. These sites can facilitate the convenient exchange of tags downstream of CH1. Many other restriction sites, in addition to SalI or in replacement of SalI, can also be introduced downstream and outside of the CH1 ORF to facilitate introduction and removal of various vector elements. - During display library construction, a library of display vectors can be used, in which each display vector can contain a unique first nucleic acid sequence that encode a unique peptide or antibody fragment.
- Once a library is constructed, the next step of biopanning is the capturing or panning step where the displayed peptide binds to a desired target. Panning utilizes the binding interactions between an affinity binder presented by the host with its target, so that only specific peptides that have a binding affinity are bound to the target. The target is usually attached to or fixed in a solid phase, either directly or indirectly. For example, antibodies presented by bacteriophage can be selected with coated antigen in microtiter plates. After the capturing step, a mild washing step is generally used to wash away the unbound hosts that do not present affinity binders on their surfaces; only the bound hosts with desired affinity remain. The final step involves an elution or recovering step where the bound hosts are eluted through protease cleavage, changing of pH or other environmental conditions. The end result is that specific peptides displayed on the bound hosts are collected and analyzed. The cycle can occur many times (e.g., with increasingly stringent wash conditions) to screen for strong affinity binders to the target.
- Other in vitro selection processes can also be used. For example, various markers or reporters can facilitate selection. One commonly used reporter is the GFP protein, where affinity binders can be selected through fluorescence-activated cell sorting (FACS).
- In some embodiments, after an affinity binder is recovered, the corresponding polynucleotide needs to be expressed, sequenced, and/or further characterized. Using the vectors of the present invention, display vectors can be converted to expression vectors in a simple cleavage-religation step, eliminating the need to subclone the polynucleotide of interest. Thus, the present invention provides an attractive alternative for more efficient and faster downstream evaluation of affinity binders.
- In the sections below, a detailed description is provided with regard to phagemid vectors. It should be understood by a person or ordinary skill in the art, however, that the description is equally applicable to other types of vectors using standard molecular cloning techniques. For example, discussions about the phage (e.g., M13 phage) coat proteins (e.g., GP3, GP8 or GP7) are equally applicable to the other surface proteins such as yeast outer wall proteins, bacterial outer membrane proteins, cell surface tether domains or adapters. It should also be understood that description below are applicable to various antibodies and antibody fragments (including, for example, Fab, Fab′ Fd, Fd′, Fv, dAb, isolated CDR regions, F(ab′)2, and scFv). Furthermore, the antibody fragments identified using the vectors and methods of the present invention can be used to construct various forms of antibodies, such as humanized antibodies, chimeric antibodies, and any other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
- Similarly, although the description provided below focuses on isolation of polynucleotide encoding an antibody or antibody fragment specifically recognizing an antigen, the methods are equally applicable to isolation of polynucleotides encoding a polypeptide (such as an antibody) with other desired properties, which include, for example, specific binding to a partner, higher binding affinity to a binding partner, antibody-dependent cellular cytotoxicity (ADCC), complement-dependent cytotoxicity (CDC), agonist or antagonist functions, induction or inhibition of apoptosis, angiogenesis, proliferation, activation of inhibition of signaling pathway. Multiple properties may be screened simultaneously or individually. Assay methods for these desired properties are known in the art.
- The phage display method is a technology used for studying interactions of proteins. This technology is based on expressing on the surface of a phage (display) the binding protein of interest and selecting said binding protein on its capacity to form a complex with a binding partner. In general, there are two types of vectors for displaying the binding proteins, phage vector and phagemid vector, resulting in multivalent phage or monovalent phage, respectively.
- In the first method involving phage vector, the principle relies on the genetic recombination of the phage genome: a sequence encoding a binding protein of interest is inserted into said phage genome. The sequence insertion is localized next to a gene encoding a protein forming the coat protein complex of the phage. Said coat is composed of different proteins, such as GP3 and GP8 proteins which are the most commonly used. The insertion of a sequence of interest next to the gene encoding these coat proteins enables the fusion of the binding protein of interest to the coat protein of the phage. The recombinant phage (i.e., a phage vector) then infects bacteria, and its genome is replicated therein. The expression of the recombinant phage genome leads to the production of phages expressing on their surface the heterologous binding protein to be screened, in which all copies of the coat protein display the heterologous protein (i.e., the heterologous protein is displayed in the multiple on a given phage, or multivalently). During the steps of screening, different proteins or molecules, referred to as targets or binding partners, are brought into contact with said protein of interest. When a complex is formed between the binding protein on the surface of the phage and a binding partner, the complex is purified and the nucleotidic sequence encoding the binding protein of interest can then be determined from the recombinant phage genome.
- Exemplary phage vectors include M13IX30, M13IX11, M13IX34, M13IX13 or M13IX60 described in International Patent Publication No. WO92/06204, and any derived phage vectors such as the vector 668-4 used in a method for generating multivalent display library described in the U.S. Pat. No. 6,057,098. Although most phage display methods have used filamentous phage, lambdoid phage display systems (WO 95/34683; U.S. Pat. No. 5,627,024), T4 phage display systems (Ren et al., Gene, 215: 439 (1998); Zhu et al., Cancer Research, 58(15): 3209-3214 (1998); Jiang et al., Infection & Immunity, 65(11): 4770-4777 (1997); Ren et al., Gene, 195(2):303-311 (1997); Ren, Protein Sci., 5: 1833 (1996); Efimov et al., Virus Genes, 10: 173 (1995)) and T7 phage display systems (Smith and Scott, Methods in Enzymology, 217: 228-257 (1993); U.S. Pat. No. 5,766,905) are also known.
- The second phage display method combines the use of phagemid (display) vectors and helper phages for the generation of libraries of phages expressing on their surface a binding protein of interest. Phagemid vectors comprise the sequence encoding the binding protein of interest and phage sequences, especially the sequence encoding the coat protein to be fused with the binding protein of interest (i.e., the displayed protein fusion gene), which is cloned into a small plasmid. Phagemid vectors are also constituted of different functional sequences for the replication of the phage genome (e.g., phage replication origin such as Ff or f1 origin) and the maintenance of the vector in the host cell (e.g., plasmid replication origin such as pBR322 or pMB1). The phagemid vectors do not contain the whole phage genome, which is why this method is combined with the use of a helper phage for the production of phages expressing binding proteins as a fusion on their surface. The helper phage, upon infection of the host cell (e.g., E. coli), enables the replication and packaging of the phage genome by complementing proteins from the complete phage genome that is absent in the phagemid. Commercially available helper phage includes M13K07 from New England Biolabs and R408 and VCSM13 from Stratagene, which typically have mutations that reduce packaging efficiency, to ensure phagemid genomes are preferentially packaged. In addition, engineered helper phage with complement adapter or adapters to prevent contamination from unrelated phage or phagmid particles. As a result of the use of helper phages, all wide-type phage proteins from the helper phage genome, as well as a small amount of the fusion protein encoded by the phagemid are expressed, so that phage particles extruded by the cells contain both proteins, usually with the wild-type in considerable excess. A typical preparation of phage particles from E. coli harboring a GP3 phagemid display vector and infected with helper phage will exhibit a Poisson distribution of fusion protein expression: 10% or less of the particles will display one copy of the fusion protein; a very small percentage will display two copies; and the remaining majority of the particles will display only wild-type GP3 and no fusion protein. Thus, the main displaying species is monovalent, while most particles do not display at all.
- The valency of display is important primarily due to its impact on the ability to discriminate binders of differing affinities. Early work showed that multivalent display prevented the highest-affinity clones in a selection form being identified, because multivalency conferred a high apparent affinity (avidity) on weakObinding clones. Monovalent display allows selection based on pure affinity, and is therefore generally preferred for the many studies where the aim is to identify the tightest binding variants from a library. Conversely, in applications where the initial selectants are of law affinity for example, the de novo selection of peptides that bind a given target multivalency increases the changes of isolating rare and weakly binding clones. A frequently used strategy is to start with multivalent display, and then move to monovalent display as the affinity of the displayed peptides matures.
- The use of a helper phage can be eliminated by using the bacterial packaging cell line technology, which is described in detail in U.S. Pat. No. 8,227,242, the entire disclosure incorporated herein by reference.
- In addition to selecting suitable vectors for constructing a library of mutants, sorting phage libraries of mutants also requires a strategy for constructing and propagating a large number of variants, a procedure for affinity purification using the target receptor, and a means of evaluating the results of binding enrichments. See U.S. Pat. Nos. 5,223,409; 5,403,484; 5,571,689; and 5,663,143.
- Many other improvements and variations of the basic phage display concept have now been developed. These improvements enhance the ability of display systems to screen peptide libraries for binding to selected target molecules and to display functional proteins with the potential of screening these proteins for desired properties. Combinatorial reaction devices for phage display reactions have been developed (WO 98/14277) and phage display libraries have been used to analyze and control bimolecular interactions (WO 98/20169; WO 98/20159) and properties of constrained helical peptides (WO 98/20036). WO 97/35196 describes a method of isolating an affinity ligand in which a phage display library is contacted with one solution in which the ligand will bind to a target molecule and a second solution in which the affinity ligand will not bind to the target molecule, to selectively isolate binding ligands. WO 97/46251 describes a method of biopanning a random phage display library with an affinity purified antibody and then isolating binding phage, followed by a micropanning process using microplate wells to isolate high affinity binding phage. WO 97/47314 describes the use of substrate subtraction libraries to distinguish enzyme specificities using a combinatorial library which may be a phage display library. A method for selecting enzymes suitable for use in detergents using phage display is described in WO 97/09446. Additional methods of selecting specific binding proteins are described in U.S. Pat. Nos. 5,498,538 and 5,432,018, and WO 98/15833. In addition, adaptor directed display and cross-species display, including but not limited to those described in Wang et al, Adapter-Directed Display: A Modular Design for Shuttling Display on Phage Surfaces, J. Mol. Biol (2010) 395, 1088-1101 and Wang et al., Yeast surface display of antibodies via the heterodimeric interaction of two coiled-coil adapters, Journal of Immunological Methods (2010) 354, 11-19, can also be used in the present invention.
- Methods of generating peptide libraries and screening these libraries are also disclosed in U.S. Pat. Nos. 5,723,286; 5,432,018; 5,580,717; 5,427,908; 5,498,530; 5,770,434; 5,734,018; 5,698,426; 5,763,192; and 5,723,323.
- Typically, phage display is achieved through fusions of an antibody fragment to one of the coat proteins on M13 phage, such as GP3, GP8, or GP7. In one embodiment of the present invention, a phagemid named Fad22 is constructed, where the GP3 C-terminal (“CT”) fragment is used instead of full-length GP3 protein. The GP3 CT fragment has been shown to be less toxic than full-length GP3, while equally efficient in displaying antibody fragment on phage surface.
- Using recombinant polynucleotides and vectors described herein, phage display libraries can be constructed. For example, restriction sites can be selected to flank the GP3 CT fragment. Such sites, in the case of phagemid Fad22 exemplified herein, include XbaI, NcoI, SalI and XhoI sites. In one embodiment, the first restriction site is XbaI site. In some embodiments, the second restriction site is XbaI site.
- A desired affinity binder (e.g., an antibody fragment) to a target can be identified through screening and recovering steps such as the biopanning process described above. Next, it is often necessary to express the antibody fragment without the GP3 CT fragment in a host (e.g., E. coli) in order to further purify and characterize the antibody fragment. There are generally two approaches to achieve this goal.
- The first approach is a genetic approach, in which an amber stop codon (TAG) is inserted between the gene encoding the antibody fragment and that encoding the GP3 fragment. In a bacterial suppressor strain (such as TG1 or ER2738) used for phage screening, a genetic allele SupE suppresses the amber stop codon and insert an amino acid (usually a glutamine) instead, thus allowing surface display of the antibody fragment through the GP3 fragment. In a non-suppressing bacterial strain (such as HB2151) commonly used to for expression, however, the bacterial translation machinery ribosome stops at the amber stop codon between the antibody fragment and the GP3 fragment, which results in the synthesis and export to bacterial periplasm of the antibody fragment, but not its fusion with GP3 fragment. The advantage of this approach is that no further manipulation of display vector is required. Instead, the antibody fragment can be expressed in E. coli by simply moving the same display vector from an amber-suppressing strain to a non-suppressing strain. However, this approach also has several disadvantages: (a) The suppression efficiency is strain dependent, and usually is less than 30%, which reduces phage display efficiency; (b) the amber stop codon is not a strong termination codon and thus, there is leaky read-through even in a non-suppressing strain, resulting in lower yield of soluble antibody fragments in bacterial periplasm; and (c) because antibody fragments containing internal amber stop codons are also displayed and selected in suppressing strains, these internal amber stop codons must be replaced with other codons through a laborious and expensive site-directed mutagenesis process, in order for these antibody fragments to be expressed in non-suppressing strains.
- The second “cleavage-religation” approach differs from the above-mentioned approach in that, after the display vector is extracted from E. coli, the vector is digested with a specific restriction enzyme to release the GP3 fragment, and then the vector is self-ligated using T4 DNA ligase, followed by transformation into specific competent E. coli strains for expression. Pershad et al. (Anal. Biochem. 412 (2011): 210-216) used MfeI, an infrequently used and relatively expensive restriction enzyme to excise GP3 coding sequence from phage display vector. MfeI enzyme is also known to display star activity (NEB), and its High Fidelity version MfeI-HF reduces but does not eliminate the star activity. In addition, MfeI restriction site, CAATTG encodes Gln and Leu, and it is known that Gln is susceptible to posttranslational modifications such as deamidation, which may adversely affect the structure and function of the affinity binder.
- In contrast to MfeI, the vector of the present invention utilizes restriction site having one or more of the following advantages:
-
- (1) The restriction site is rarely present in human germline Vκ, Vλ or VH;
- (2) The restriction site is not present in the library (e.g., the six CDR regions);
- (3) The restriction site encodes amino acids that are compatible with the structure of affinity binder such as antibody fragment;
- (4) The sites are unique in the vector; and/or
- (5) The corresponding the restriction enzyme is inexpensive yet robust, and lacks star activity.
- Using recombinant polynucleotides and vectors of the present invention, display vectors can be converted to expression vectors in a high-throughput, high-efficiency fashion. In some embodiments, a method of converting a display vector to an expression vector. Such method includes: providing the display vector described herein; cleaving the first and second restriction site with corresponding restriction endonuclease thereto, thereby producing compatible sticky ends; and religating the compatible sticky ends to produce an expression vector in which the first nucleic acid sequence is in-frame with the fusion tag sequence.
- The method can further include additionally cleaving within the second nucleic acid sequence to increase religation efficiency in the religating step. Such cleaving may produce blunt ends that are less likely to ligate than sticky ends.
- In some embodiments, the product from the cleaving step can be diluted before the religating step, to increase religation efficiency. The dilution can be with water, e.g., 1:10 or 1:20 or 1:100 with water, to facilitate intramolecular interaction (i.e., vector religation) over intermolecular interactions (i.e., ligation between the vector and the GP3 fragment).
- After the religating step, the expression vector can be introduced into a host, to further characterize the first nucleic acid sequence, including expressing a corresponding peptide in the host or subjecting the first nucleic acid sequence to sequencing. In some embodiments, the providing, cleaving, religating and introducing steps can be completed in less than 12 hours, less than 8 hours, or less than 4 hours.
- In the context of library screening and selection, a plurality of (e.g., hundreds or thousands) display vectors can be converted to a plurality of expression vectors in parallel, and the plurality of expression vectors can be introduced into a population of hosts. Thus, methods of the present invention can be performed in a high-throughput fashion. In some embodiments, the conversion rate from the plurality of display vectors to the plurality of expression vectors can be higher than 80%, higher than 85%, higher than 90%, higher than 92%, higher than 95% or higher than 98%. Because of the near 100% conversion rate, parallel processing of hundreds or thousands or millions of individual colonies can be achieved (e.g., in individual 96-well plates), eliminating the need to further characterize each single colony and enhancing throughput dramatically.
- A method of identifying an affinity binder to a target is additionally provided by the present invention. The method can include screening a population of first hosts each containing the display vector described herein, to obtain a subpopulation of first hosts having binding affinity to a target. Each first host displays, on a surface thereof, an amino acid sequence encoded by a unique first nucleic acid sequence in the display vector. The subpopulation of first hosts each display an affinity binder to the target. The method also includes converting display vectors isolated from the subpopulation of first hosts to expression vectors by cleaving the first and second restriction site to remove the second nucleic acid sequence and religating the compatible sticky ends produced therefrom. The method additionally includes introducing the expression vectors into a population of second hosts, to further characterize the affinity binder. In various embodiments, the method is performed in a high-throughput fashion. For example, the conversion rate of the converting step can be higher than 80%, higher than 85%, higher than 90%, higher than 92%, higher than 95% or higher than 98%.
- Various aspects of the present invention may be used alone, in combination, or in a variety of arrangements not specifically discussed in the embodiments described in the foregoing and is therefore not limited in its application to the details and arrangement of components set forth in the foregoing description or illustrated in the drawings. For example, aspects described in one embodiment may be combined in any manner with aspects described in other embodiments.
- In an exemplary design of Fad22 (
FIG. 1 ), two XbaI restriction sites were engineered to flank the GP3 C-terminal fragment (labeled “GP3CT”). One XbaI site (XbaI (4116)) is placed upstream of the GGT codon 253 (encoding Gly) of GP3, while the other XbaI site (XbaI (3645)) is positioned downstream of two stop codons (TGA TAA) placed after the TCT codon 406 (encoding Ser) of GP3. The XbaI restriction site is chosen because it fulfills the following criteria: -
- (1) It is rarely present in human germline Vκ, Vλ or VH;
- (2) It is not present in the library (e.g., the six CDR regions);
- (3) After cleavage, a 4-base sticky end, instead of a 2-base sticky end or a blunt end, is exposed. A 4-base sticky end in some embodiments can increase self-ligation efficiency;
- (4) The XbaI site (TCTAGA) encodes amino acid Serine and Arginine that are compatible with the structure of antibody fragment;
- (5) The two XbaI sites are the only two XbaI sites present in Fad22;
- (6) The restriction enzyme XbaI can be inactivated at 65° C.;
- (7) The restriction enzyme XbaI is inexpensive yet robust, and lacks star activity.
- Still referring to
FIG. 1 , the Fad22 phagemid contains two origins, one or phage replication (f1 ori) and the other for plasmid replication (pMB1 ori). Fad 22 also contains a Bla marker gene which encodes b-lactamasc and confers resistance against b-lactam antibiotics (penicillin, ampicillin, etc.). A lac promoter (Plac) is placed upstream of the fusion gene (of an antibody fragment encoding gene and the GP3CT gene). The antibody fragment (e.g., Fab) encoding gene can comprise various domains of interest, such as Vκ1, CL, VH3 and CH1 domains as shown inFIG. 1 . These domains can be arranged in any suitable order for expression of the antibody fragment. Each domain has one or more mutations introduced therein, producing a library of mutants. Two ribosome binding sites (RBS1 and RBS2) and two secretion signal sequences (SS1 and SS2) are also included in Fad22, upstream of Vκ1-CL and VH3-CH1, respectively, to facilitate translation and secretion thereof. - In one example, after 20-fold overdigestion of Fad22 with XbaI, more than 95% of the DNA fragments can be ligated and recut. Additional enzymes that can be used include NcoI, SalI and XhoI.
- It was demonstrated that with XbaI digestion, followed by self-ligation with T4 DNA ligase, almost 100% conversion of display vector into expression vector can be achieved when the starting phagemid concentration is equal to or lower than 2 ng/μl (
FIG. 2 ). However, when the starting phagemid concentration is at or higher than 10 ng/μl, the conversion rate is lower than 50% (FIG. 2 ). This result may be explained by the fact that higher concentration favors intermolecular ligation, instead of intramolecular self-ligation. - Conversion efficiency is further improved by using a combination of two restriction enzymes such as XbaI and XmnI. The XmnI (GAATAATTTC) site is present in the gene encoding GP3 C-terminal fragment, and digestion by XmnI restriction enzyme yields a blunt end that is more difficult to ligate than sticky ends (as produced by XbaI). Experiments demonstrated that with the combination of XbaI and XmnI, followed by self-ligation with T4 DNA ligase, almost 100% conversion of display vector into expression vector were achieved even when the starting phagemid concentration is about 10 ng/μl (
FIG. 3 ), which is significantly higher than the results at 10 ng/μl when only XbaI is used (FIG. 2 ). Therefore, the combination of XbaI and XmnI dramatically extends the range of starting phagemid concentration, and improves the robustness of this cleavage-religation approach, especially in high-throughput settings where hundreds of, or even thousands of phagemid samples are processed in parallel, and where the starting phagemid concentrations routinely vary by several fold. - The present invention provides among other things novel methods and vectors for high-throughput screening and selection. While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification. The full scope of the invention should be determined by reference to the claims, along with their full scope of equivalents, and the specification, along with such variations.
- All publications, patents and sequence database entries mentioned herein are hereby incorporated by reference in their entirety as if each individual publication or patent was specifically and individually indicated to be incorporated by reference.
Claims (25)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2013/072631 WO2014139130A1 (en) | 2013-03-14 | 2013-03-14 | An integrated system for library construction, affinity binder screening and expression thereof |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2013/072631 A-371-Of-International WO2014139130A1 (en) | 2013-03-14 | 2013-03-14 | An integrated system for library construction, affinity binder screening and expression thereof |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/540,159 Division US20220090053A1 (en) | 2013-03-14 | 2021-12-01 | Integrated system for library construction, affinity binder screening and expression thereof |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20160145604A1 true US20160145604A1 (en) | 2016-05-26 |
Family
ID=51535816
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/774,722 Abandoned US20160145604A1 (en) | 2013-03-14 | 2013-03-14 | An Integrated System for Library Construction, Affinity Binder Screening and Expression Thereof |
| US17/540,159 Pending US20220090053A1 (en) | 2013-03-14 | 2021-12-01 | Integrated system for library construction, affinity binder screening and expression thereof |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/540,159 Pending US20220090053A1 (en) | 2013-03-14 | 2021-12-01 | Integrated system for library construction, affinity binder screening and expression thereof |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US20160145604A1 (en) |
| EP (1) | EP2970952B1 (en) |
| CN (1) | CN105247050B (en) |
| CA (1) | CA2905529C (en) |
| WO (1) | WO2014139130A1 (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019036842A1 (en) | 2017-08-21 | 2019-02-28 | Adagene Inc. | Dynamic human heavy chain antibody libraries |
| US11242395B2 (en) | 2017-08-21 | 2022-02-08 | Adagene Inc. | Anti-CD137 molecules and use thereof |
| US11359016B2 (en) | 2018-02-02 | 2022-06-14 | Adagene Inc. | Anti-CTLA4 antibodies and methods of making and using the same |
| US11585014B2 (en) | 2017-08-21 | 2023-02-21 | Adagene Inc. | Dynamic human antibody light chain libraries |
| US11952681B2 (en) | 2018-02-02 | 2024-04-09 | Adagene Inc. | Masked activatable CD137 antibodies |
| WO2024263761A1 (en) | 2023-06-22 | 2024-12-26 | Genentech, Inc. | Antibodies and uses thereof |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11485977B2 (en) | 2014-12-19 | 2022-11-01 | Adagene, Inc. | Methods and systems for autoinduction of protein expression |
| WO2018078167A1 (en) | 2016-10-31 | 2018-05-03 | Universität Zürich | Protein screening and detection method |
| CN106749530B (en) * | 2017-02-14 | 2020-04-24 | 东华理工大学 | Uranyl ion binding heptapeptides |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090111126A1 (en) * | 2006-11-01 | 2009-04-30 | Pdl Biopharma, Inc. | Mammalian cell-based immunoglobulin display libraries |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002088315A2 (en) * | 2001-04-27 | 2002-11-07 | Alexion Pharmaceuticals, Inc. | Phagemid vectors |
| CN1217000C (en) * | 2003-07-30 | 2005-08-31 | 中国药科大学 | Exhibiting method of preparing antibody by antigen utilizing bacteriophage exhibiting technique |
| CA2614304A1 (en) * | 2005-07-07 | 2007-01-18 | Ribovax Biotechnologies S.A. | Novel phage display technologies |
| JP2007124915A (en) * | 2005-11-01 | 2007-05-24 | Tokai Univ | Recombinering construct and gene targeting construct production vector |
| JP5061346B2 (en) * | 2006-12-26 | 2012-10-31 | 国立大学法人山口大学 | Method for producing DNA fragment |
| WO2010036860A2 (en) * | 2008-09-26 | 2010-04-01 | Wyeth Llc | Compatible display vector systems |
-
2013
- 2013-03-14 CN CN201380074656.1A patent/CN105247050B/en active Active
- 2013-03-14 EP EP13877452.6A patent/EP2970952B1/en active Active
- 2013-03-14 CA CA2905529A patent/CA2905529C/en active Active
- 2013-03-14 US US14/774,722 patent/US20160145604A1/en not_active Abandoned
- 2013-03-14 WO PCT/CN2013/072631 patent/WO2014139130A1/en not_active Ceased
-
2021
- 2021-12-01 US US17/540,159 patent/US20220090053A1/en active Pending
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090111126A1 (en) * | 2006-11-01 | 2009-04-30 | Pdl Biopharma, Inc. | Mammalian cell-based immunoglobulin display libraries |
Non-Patent Citations (3)
| Title |
|---|
| Hoet et al Genbank NC_003287 .2 * |
| Hoet et al., Nature Biotechnology, 2005, vol 23 (3) pages 344-8 * |
| Kim et al., AEM, 2000, vol 66, pages 788-793 * |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019036842A1 (en) | 2017-08-21 | 2019-02-28 | Adagene Inc. | Dynamic human heavy chain antibody libraries |
| US11242395B2 (en) | 2017-08-21 | 2022-02-08 | Adagene Inc. | Anti-CD137 molecules and use thereof |
| US11578426B2 (en) | 2017-08-21 | 2023-02-14 | Adagene Inc. | Dynamic human heavy chain antibody libraries |
| US11585014B2 (en) | 2017-08-21 | 2023-02-21 | Adagene Inc. | Dynamic human antibody light chain libraries |
| US11859003B2 (en) | 2017-08-21 | 2024-01-02 | Adagene Inc. | Method for treating cancer using anti-CD137 antibody |
| US12378319B2 (en) | 2017-08-21 | 2025-08-05 | Adagene Inc. | Anti-CD137 molecules and use thereof |
| US12385162B2 (en) | 2017-08-21 | 2025-08-12 | Adagene Inc. | Dynamic human heavy chain antibody libraries |
| US11359016B2 (en) | 2018-02-02 | 2022-06-14 | Adagene Inc. | Anti-CTLA4 antibodies and methods of making and using the same |
| US11692036B2 (en) | 2018-02-02 | 2023-07-04 | Adagene Inc. | Anti-CTLA4 antibodies and methods of making and using the same |
| US11952681B2 (en) | 2018-02-02 | 2024-04-09 | Adagene Inc. | Masked activatable CD137 antibodies |
| WO2024263761A1 (en) | 2023-06-22 | 2024-12-26 | Genentech, Inc. | Antibodies and uses thereof |
Also Published As
| Publication number | Publication date |
|---|---|
| CN105247050A (en) | 2016-01-13 |
| CN105247050B (en) | 2020-06-16 |
| EP2970952A1 (en) | 2016-01-20 |
| EP2970952B1 (en) | 2018-07-11 |
| CA2905529C (en) | 2023-03-14 |
| CA2905529A1 (en) | 2014-09-18 |
| EP2970952A4 (en) | 2016-10-05 |
| WO2014139130A1 (en) | 2014-09-18 |
| US20220090053A1 (en) | 2022-03-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20220090053A1 (en) | Integrated system for library construction, affinity binder screening and expression thereof | |
| JP4629787B2 (en) | Protein / (poly) peptide library | |
| JP3344584B2 (en) | Recombinant library screening method | |
| JP6073038B2 (en) | Stabilized immunoglobulin variable domain selection method and application of selected domains | |
| JP2015531597A5 (en) | ||
| JP2019526618A (en) | Antibody phage display library | |
| JP2023081910A (en) | Polypeptide expression system | |
| WO2020014143A1 (en) | Antibody libraries with maximized antibody developability characteristics | |
| EP2619578B1 (en) | Method of analyzing binding interactions | |
| US20140038842A1 (en) | Cell surface display using pdz domains | |
| Siegel | Antibody affinity optimization using yeast cell surface display | |
| US8946128B2 (en) | Signal sequence-independent pIX phage display | |
| KR102216032B1 (en) | Synthetic antibody library generation method, the library and its application(s) | |
| CN111201239A (en) | Methods and compositions for developing antibodies specific for epitope post-translational modification states | |
| EP3717682A1 (en) | An antibody fragment library, and uses thereof | |
| US20230295271A1 (en) | Methods and compositions for in vitro affinity maturation of monoclonal antibodies | |
| Leow et al. | Monoclonal IgY antibodies | |
| JP6293829B2 (en) | Stabilized immunoglobulin variable domain selection method and application of selected domains |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: ADAGENE, INC., CAYMAN ISLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DU, FANGYONG;LUO, PETER PEIZHI;REEL/FRAME:039626/0778 Effective date: 20151113 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |