WO2025133253A2 - Protein-based conjugation carriers for intranuclear delivery - Google Patents
- Publication number
- WO2025133253A2 (PCT/EP2024/088112)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- amino acid
- cysteine
- side chain
- reactive group
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/395—Antibodies; Immunoglobulins; Immune serum, e.g. antilymphocytic serum
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/395—Antibodies; Immunoglobulins; Immune serum, e.g. antilymphocytic serum
- A61K39/42—Antibodies; Immunoglobulins; Immune serum, e.g. antilymphocytic serum viral
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/32—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against translation products of oncogenes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
- C12N15/625—DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/30—Immunoglobulins specific features characterized by aspects of specificity or valency
- C07K2317/31—Immunoglobulins specific features characterized by aspects of specificity or valency multispecific
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/50—Immunoglobulins specific features characterized by immunoglobulin fragments
- C07K2317/56—Immunoglobulins specific features characterized by immunoglobulin fragments variable (Fv) region, i.e. VH and/or VL
- C07K2317/569—Single domain, e.g. dAb, sdAb, VHH, VNAR or nanobody®
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/60—Immunoglobulins specific features characterized by non-natural combinations of immunoglobulin fragments
- C07K2317/62—Immunoglobulins specific features characterized by non-natural combinations of immunoglobulin fragments comprising only variable region components
- C07K2317/622—Single chain antibody (scFv)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
Definitions
- the present technology provides molecules comprising or consisting of at least one protein-based carrier building block, wherein the protein-based carrier building block comprises at least one, preferably at least two attachment point(s) or conjugation site(s), and at least one nuclear localization sequence (NLS), covalently linked to at least one conjugation site or attachment point comprised in the protein-based carrier building block, directly or by means of a linker.
- the present technology further relates to nucleic acids encoding such molecules or part of such molecules; to host cells comprising such nucleic acids and/or expressing or capable of expressing such molecules or part of such molecules; to compositions, and in particular to pharmaceutical compositions that comprise such molecules, nucleic acids and/or host cells; and to uses of such molecules, nucleic acids, host cells and/or compositions, in particular for labelling, prophylactic, therapeutic and/or diagnostic purposes.
- Protein-based therapeutics are creating new therapeutic strategies which are hard to achieve via the classic, small molecule-based therapeutics.
- One fast-growing field comprises conjugation-based targeted therapeutics, such as antibody-drug conjugates (ADCs).
- Targeted drug delivery is critical for improving the therapeutic benefits of the drugs, such as anticancer drugs.
- the active delivery strategy increases the bioavailability of drugs not only to the diseased tissue and subsequent individual target cells but also to the active sites inside the target organelles where certain drugs carry out their desired pharmacological activities.
- the nucleus has a double membrane called the nuclear envelope.
- NPC nuclear pore complex
- the NPC is a large, multimeric structure that generally acts as a permeability barrier between the cytoplasm and nucleoplasm.
- the main structural components of the NPC include the central channel, the cytoplasmic ring moiety and cytoplasmic filaments, and the nuclear ring moiety and nuclear basket.
- the NPC has eightfold rotational symmetry.
- Each NPC is connected to the inner and outer nuclear membranes by eight symmetrically arranged molecular spoke proteins, which together enclose a central channel with an outer diameter of 122 nm and an inner diameter of 70 nm.
- Diverse proteins, such as transcription factors, histones, and cell cycle regulators need to be transported into the nucleus through the NPC after their synthesis, which necessitates the presence of a nuclear localization signal or nuclear localization sequence (NLS) on these proteins.
- NLS is recognized by the corresponding nuclear transporters, which can interact with nucleoporins to help NLS-containing proteins reach the nucleus through NPCs (Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60). NLSs have been used to deliver molecules such as nucleic acids, proteins, and nanoparticles intranuclearly.
- the current technology aims at simplifying the generation of conjugation-based therapeutics, in particular of targeted conjugation-based therapeutics, and/or at creating a plug-and-play strategy that can yield alternative formats which allow for versatile conjugation of different cargos.
- the present technology employs a non-targeting protein-based carrier building block which solely serves as a site-specific conjugation vehicle (protein-based carrier building block).
- This protein-based carrier building block can be contained within a genetic construct, and thus be the product of one manufacturing campaign, or, alternatively, can be produced separately (e.g., recombinantly or by alternative means such as solid-phase peptide synthesis, SPPS) and later connected to the active and/or targeting moiety (i.e., the cargo), as explained in detail below.
- the molecules of the present technology comprise at least one NLS, which allows the molecule to reach the intranuclear space, for targeted delivery of drugs into the cell nucleus.
- the present technology provides molecules comprising or consisting of at least one protein-based carrier building block, wherein the protein-based carrier building block comprises at least one attachment point or conjugation site, preferably at least two attachment points or conjugation sites.
- the protein-based building block comprises at least one nuclear localization sequence (NLS), covalently linked to at least one conjugation site or attachment point comprised in it.
- the conjugation sites or attachment points are suitable for conjugation or attachment of NLS and optionally further cargos to the protein-based carrier building block, as described herein.
- a “cargo” is any molecule which is/may be attached or conjugated to the protein-based carrier building block through the attachment point(s) or conjugation site(s) present therein.
- cargos which may be attached or conjugated to the protein-based carrier building block comprised in the molecule of the present technology are proteins such as targeting proteins, peptides such as NLSs and cell-penetrating peptides (CPPs), polyethylene glycol (PEG), small molecules, glycans, lipids, chelators, fluorophores, radioisotopes, vitamins such as folic acid or biotin, nucleic acids such as antisense oligonucleotides (ASOs), etc.
- the present technology provides the protein-based carrier building block attached to at least one NLS, as described above, and further attached to another cargo, as described herein, i.e., the molecule of the present technology comprises (or, alternatively, consists of) at least one protein-based building block comprising at least one NLS, as described herein, and at least one further cargo attached or conjugated to it through at least one conjugation site or attachment point.
- the further cargo may preferably be a cell-penetrating peptide or a targeting molecule, such as a cell-targeting moiety.
- the protein-based carrier building block comprised in the molecule of the present technology comprises (and, preferably, consists of) at least part of a protein, preferably a whole protein.
- the protein-based carrier building block is a polypeptide.
- the protein-based carrier building block has a globular 3D structure and is soluble.
- the protein-based carrier building block comprised in the molecule of the present technology has a size (molecular mass or molecular weight, MW) of about 2.5 to about 70 kDa, preferably of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa, such as about 6 kDa, or about 7 kDa, or about 16 kDa.
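The size ranges above can be checked against a candidate sequence with a rough mass estimate. The following is a minimal sketch, assuming an unmodified, linear polypeptide written in one-letter code and standard average residue masses; the function names `molecular_mass_kda` and `size_tier` are hypothetical, and "about" is treated as an exact bound purely for illustration.

```python
# Average residue masses (Da) of the 20 standard amino acids. One water is
# added per chain to account for the free N- and C-termini.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_mass_kda(sequence: str) -> float:
    """Estimate the average molecular mass (kDa) of an unmodified polypeptide."""
    seq = sequence.upper()
    unknown = set(seq) - set(AVERAGE_RESIDUE_MASS)
    if not seq or unknown:
        raise ValueError(f"empty sequence or unsupported residues: {sorted(unknown)}")
    return (sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS) / 1000.0


def size_tier(mass_kda: float) -> str:
    """Return the narrowest size range named in the text that contains mass_kda.

    Assumption: the 'about' bounds are applied as exact cut-offs.
    """
    if 2.5 <= mass_kda <= 16:
        return "about 2.5 to about 16 kDa"
    if 2.5 <= mass_kda <= 30:
        return "about 2.5 to about 30 kDa"
    if 2.5 <= mass_kda < 50:
        return "about 2.5 to less than 50 kDa"
    if 2.5 <= mass_kda <= 70:
        return "about 2.5 to about 70 kDa"
    return "outside the stated range"
```

A conjugated cargo or a non-natural amino acid would change the mass, so this estimate only applies to the bare building block.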
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein, although it may show non-specific binding to one or more human proteins, as explained in detail herein.
- the protein-based carrier building block may bind to human proteins with low specificity and/or low selectivity, as defined herein.
- the protein-based carrier building block also does not specifically bind to any non-protein (preferably human) molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans.
- the protein-based carrier building block may derive from a target-binding protein (such as an immunoglobulin single variable domain (ISVD), a DARPin, an affibody or an affitin), as described below. It may also derive from other proteins which show specific binding towards, e.g., human proteins, such as small globular human proteins. This is the so-called "protein-based carrier building block precursor". In these cases, preferably, the protein-based building block does also not specifically bind to any molecule (including non-human proteins) to which the protein-based carrier building block precursor specifically binds (if any).
- If the precursor of the protein-based carrier building block is an anti-RSV (respiratory syncytial virus) ISVD, the protein-based carrier building block preferably does not specifically bind RSV. Hence, preferably, the protein-based carrier building block also does not specifically bind to the precursor's target, should the precursor have a target and should this be a non-human molecule, such as a non-human protein, or a human non-protein molecule, such as human DNA, RNA, glycans, lipids, etc.
- the protein-based carrier building block does not specifically bind any human protein, non-human protein and/or non-protein molecule when a cargo is conjugated to the at least one, preferably at least two, attachment points or conjugation sites on the protein-based carrier building block.
- the at least one protein-based carrier building block comprised in the molecule of the present technology a) Has at least one attachment point (also referred to as conjugation site in the present description), preferably at least two attachment points or conjugation sites, wherein an attachment point or conjugation site is a reactive group in the side chain of a non-natural or natural amino acid (e.g., Cys, Lys, Tyr, Orn, etc.) preferably located at a solvent-accessible position in the protein-based carrier building block, and/or the N-terminal primary amine, and/or the C-terminal carboxylic group of the protein-based carrier building block, if these are available.
- the at least one, preferably at least two, attachment point(s) or conjugation site(s) is(are) thus preferably located at solvent-accessible positions in the protein-based carrier building block; b) Has a size (molecular mass) of from about 2.5 to about 70 kDa, preferably from about 2.5 to about 50 kDa, such as from about 2.5 to less than 50 kDa, more preferably from about 2.5 to about 30 kDa, even more preferably from about 2.5 to about 16 kDa; c) Has a solubility of about 10 mg/mL or more, measured in an aqueous solution at room temperature (RT), preferably measured in a buffer or water at RT, more preferably measured in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (
- the present technology relates to a molecule comprising at least one protein-based carrier building block, wherein the at least one protein-based carrier building block: a) comprises at least one conjugation site or attachment point, preferably at least two attachment points or conjugation sites; b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as from about 2.5 kDa to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa; c) has a globular 3D structure; d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at RT, preferably measured in a buffer or water at RT, more preferably in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT
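Criteria a) to d), as far as they are stated above, lend themselves to a simple screening check. The sketch below is illustrative only: the `CarrierCandidate` record and the `unmet_criteria` function are hypothetical names, the property values are assumed to come from separate measurements or predictions, and only the broadest stated bounds are applied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CarrierCandidate:
    """Hypothetical record of measured or predicted candidate properties."""
    name: str
    conjugation_sites: int        # number of attachment points / conjugation sites
    mass_kda: float               # molecular mass in kDa
    is_globular: bool             # globular 3D structure
    solubility_mg_per_ml: float   # measured in an aqueous solution at RT


def unmet_criteria(candidate: CarrierCandidate) -> list[str]:
    """List which of criteria a) to d) a candidate does not meet."""
    failures = []
    if candidate.conjugation_sites < 1:
        failures.append("a) at least one conjugation site or attachment point")
    if not 2.5 <= candidate.mass_kda <= 70:
        failures.append("b) molecular mass of about 2.5 to about 70 kDa")
    if not candidate.is_globular:
        failures.append("c) globular 3D structure")
    if candidate.solubility_mg_per_ml < 10:
        failures.append("d) solubility of 10 mg/mL or more")
    return failures
```

An empty list means the candidate passes all four checks; the preferred narrower ranges (for example, at least two conjugation sites) would need additional conditions.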
- the at least one NLS attached to at least one attachment point or conjugation site comprised in the protein-based carrier building block is not particularly limited.
- the skilled person is aware of NLSs which may be comprised in the molecule of the present technology. Suitable examples comprise NLSs as described, e.g., in Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60, e.g., in Table 1, page 3 or page 4 of this document.
- the at least one NLS comprises or consists of the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221), see, e.g., Dang CV and Lee WM., "Identification of the human c-myc protein nuclear translocation signal", Mol Cell Biol. 1988, 8(10):4048-5.
- the at least one NLS comprises or consists of the SV40mono NLS (SEQ ID NO.: 256, PKKKRKV).
- the at least one NLS comprises or consists of the SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV).
- the at least one NLS comprises or consists of the NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD) (Ray et al. 2015, Bioconj. Chem. 26(6): 1004-1007, Quantitative tracking of protein trafficking to the nucleus using cytosolic protein delivery by nanoparticle-stabilized nanocapsules).
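The four NLS sequences listed above can be collected in a small lookup table, which also makes it easy to confirm that the SV40tri sequence is three tandem copies of SV40mono. This is a minimal sketch; the dictionary keys and the helper names `basic_fraction` and `find_nls` are hypothetical, and only the sequences themselves come from the text.

```python
# NLS sequences exactly as given in the text (one-letter amino acid code).
NLS_SEQUENCES = {
    "cMyc": "PAAKRVKLD",                 # SEQ ID NO.: 221
    "SV40mono": "PKKKRKV",               # SEQ ID NO.: 256
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",  # SEQ ID NO.: 304
    "NLP": "AVKRPAATKKAGQAKKKKLD",       # SEQ ID NO.: 305
}


def basic_fraction(sequence: str) -> float:
    """Fraction of residues in a non-empty sequence that are Lys or Arg."""
    return sum(aa in "KR" for aa in sequence) / len(sequence)


def find_nls(sequence: str) -> dict[str, int]:
    """Return the 0-based start index of each listed NLS found in sequence."""
    hits = {}
    for name, motif in NLS_SEQUENCES.items():
        index = sequence.find(motif)
        if index != -1:
            hits[name] = index
    return hits
```

Exact substring matching only finds these four sequences; it is not a general NLS predictor.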
- the at least one protein-based carrier building block does not specifically bind to any non-protein molecule, e.g., to any human non-protein molecule, such as human DNA, human RNA, human lipids or human glycans.
- the molecule of the present technology may comprise more than one protein-based carrier building block, such as, e.g., two, three, four, five, six or more protein-based carrier building blocks. These protein-based carrier building blocks may be directly linked to each other, or linked to each other through a linker, as described herein.
- the at least one protein-based carrier building block comprised in the molecule of the present technology comprises more than one conjugation site or attachment point, preferably at least two conjugation sites or attachment points, such as two conjugation sites or attachment points, or at least three conjugation sites or attachment points, such as three, four, five, six, seven, eight or nine conjugation sites or attachment points, which are preferably reactive groups present in the side chains of natural or non-natural amino acids comprised in the protein-based carrier building block, or which may be (additionally or alternatively) the N-terminal primary amine and/or the C-terminal carboxylic acid group of the protein-based building block.
- the at least one NLS may be attached or conjugated to any of the conjugation sites present in the protein-based building block, directly or by means of a linker, as described herein.
- the protein-based building block comprises one NLS covalently linked, by means of a peptide linker, to the C-terminal carboxylic acid group of the protein-based building block.
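When the NLS is joined to the C-terminus through a peptide linker, the result can be written as one linear sequence. The sketch below assumes a simple end-to-end fusion; the function name `fuse_c_terminal_nls` and the default linker `GGGGS` are hypothetical placeholders, not taken from the text.

```python
def fuse_c_terminal_nls(building_block: str, nls: str, linker: str = "GGGGS") -> str:
    """Return building block + linker + NLS, written N- to C-terminus.

    The default linker is a hypothetical placeholder. An empty linker
    models direct attachment of the NLS to the C-terminus.
    """
    for label, part in (("building block", building_block), ("NLS", nls)):
        if not part or not part.isalpha():
            raise ValueError(f"{label} must be a non-empty one-letter sequence")
    if linker and not linker.isalpha():
        raise ValueError("linker must be a one-letter sequence or empty")
    return (building_block + linker + nls).upper()
```

For instance, `fuse_c_terminal_nls("MKT", "PKKKRKV")` yields a sequence that ends in the SV40mono NLS; `MKT` is a made-up stand-in for a real building block sequence.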
- the protein-based building block comprises one NLS covalently linked, by means of a peptide linker, to another attachment point or conjugation site, such as a reactive group present in the side chain of a natural or non-natural amino acid comprised in the protein-based carrier building block.
- the protein-based building block comprises more than one NLS covalently linked, by means of a peptide linker, to more than one attachment point or conjugation site present in the protein-based building block.
- one of the NLSs may be covalently linked, directly or by means of a peptide linker, to the C-terminal carboxylic acid group of the protein-based building block.
- one or more NLSs may be covalently linked, directly or by means of a peptide linker, to other attachment points or conjugation sites, such as reactive groups present in the side chains of natural or non-natural amino acids comprised in the protein-based carrier building block and/or to the N-terminal primary amine of the protein-based building block.
- the at least one conjugation site or attachment point comprised in the at least one protein-based carrier building block is a free or capped thiol group, a free or capped hydroxyl group and/or a free or capped primary amine.
- the at least one conjugation site or attachment point comprised in the at least one protein-based carrier building block is a reactive group present in the side chain of a cysteine and/or in the side chain of a tyrosine, and/or in the side chain of a lysine, and/or in the side chain of an ornithine.
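Candidate side-chain conjugation sites of this kind can be listed directly from a one-letter sequence. This is a minimal sketch with hypothetical names (`REACTIVE_SIDE_CHAINS`, `candidate_sites`); it covers Cys, Tyr and Lys only, because ornithine has no standard one-letter code, and it does not assess solvent accessibility.

```python
# Side-chain reactive groups named in the text, keyed by one-letter code.
# Ornithine has no standard one-letter code and is therefore not covered.
REACTIVE_SIDE_CHAINS = {"C": "thiol", "Y": "hydroxyl", "K": "primary amine"}


def candidate_sites(sequence: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, residue, reactive group) for each Cys, Tyr, Lys.

    Positions are plain sequence indices, not Kabat numbers, and solvent
    accessibility is not assessed; that would need a 3D structure.
    """
    return [
        (position, aa, REACTIVE_SIDE_CHAINS[aa])
        for position, aa in enumerate(sequence.upper(), start=1)
        if aa in REACTIVE_SIDE_CHAINS
    ]
```

The N-terminal primary amine and the C-terminal carboxylic acid group, which the text also names as possible attachment points, would be added separately.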
- the at least one protein-based building block comprises an N- and/or a C-terminal Cys and/or an N- and/or a C-terminal Tyr, preceded or followed by a (GG) or (G₄S₁)₁₋₃GG sequence, such as CGG-, -GGC, YGG-, -GGY, -(G₄S₁)₁₋₃GGY, Y(G₄S₁)₁₋₃GG-, YGG(S₁G₄)₁₋₃-, or YGG(G₄S₁)₁₋₃-.
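The terminal Cys and Tyr tags with GG or G4S-repeat spacers can be generated programmatically. The sketch below uses hypothetical function names and covers the N-terminal pattern residue + (G4S)n + GG and the C-terminal pattern (G4S)n + GG + residue, with n = 0 giving the plain GG forms; the variants in which the repeats follow the GG are not covered.

```python
def _check(residue: str, g4s_repeats: int) -> None:
    """Validate inputs against the options named in the text."""
    if residue not in ("C", "Y"):
        raise ValueError("the text lists Cys or Tyr as terminal residues")
    if not 0 <= g4s_repeats <= 3:
        raise ValueError("use 0 (plain GG) or 1 to 3 G4S repeats")


def n_terminal_tag(residue: str, g4s_repeats: int = 0) -> str:
    """Build residue + (G4S)n + GG, e.g. 'CGG' (n = 0) or 'YGGGGSGG' (n = 1)."""
    _check(residue, g4s_repeats)
    return residue + "GGGGS" * g4s_repeats + "GG"


def c_terminal_tag(residue: str, g4s_repeats: int = 0) -> str:
    """Build (G4S)n + GG + residue, e.g. 'GGC' (n = 0) or 'GGGGSGGY' (n = 1)."""
    _check(residue, g4s_repeats)
    return "GGGGS" * g4s_repeats + "GG" + residue


def add_tags(core: str, n_tag: str = "", c_tag: str = "") -> str:
    """Place optional N- and C-terminal tags around a core sequence."""
    return n_tag + core + c_tag
```

A call such as `add_tags(core, c_tag=c_terminal_tag("Y", 2))` would append a two-repeat spacer and a C-terminal tyrosine to a core sequence supplied by the user.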
- the at least one protein-based carrier building block present in the molecule of the present technology is (i) a building block based on small globular non-human proteins, such as an ISVD-based building block, a DARPin-based building block, an affibody-based building block or an affitin-based building block or (ii) a building block based on small globular human proteins, such as cyclin-dependent kinase subunit 1 (CDK-1).
- the at least one protein-based building block is derived from a heavy chain ISVD, preferably from a VH or VHH, including a camelized VH or humanized VHH.
- the at least one protein-based building block is derived from an ISVD belonging to the "VH3 class", preferably wherein the resulting building block comprises at least one (preferably engineered) cysteine, at least one (preferably engineered) lysine, at least one non-natural amino acid and/or at least one (preferably engineered) tyrosine at one or more solvent-accessible positions of the protein-based building block.
- the at least one protein-based building block is derived from RSV001A04, SEQ ID NO.: 179.
- the at least one protein-based building block is an ISVD-based building block which comprises a Leu or a Gln, preferably a Leu, at position 108, according to Kabat numbering, preferably wherein the ISVD-based building block comprises a Val or a Leu, preferably a Val, at position 11 and/or a Val, a Thr or a Leu, preferably a Leu, at position 89, according to Kabat numbering.
- the at least one protein-based building block comprises or, alternatively, consists of SEQ ID NO.: 186:
- X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
- X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
- X8 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
- X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X15 (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X16 (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X17 (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X18 (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X19 (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X20 (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X23 (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X24 (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X25 (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X26 (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X27 (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X28 (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X29 (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X30 (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X31 (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
- X32 (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X33 (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X34 (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X35 (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X36 (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X37 (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X38 (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X39 (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X40 (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X41 (position 64 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X42 (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X43 (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X44 (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X45 (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X46 (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X47 (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X48 (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X49 (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X50 (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X51 (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X52 (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X53 (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X54 (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X55 (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X56 (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X57 (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X58 (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X59 (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X60 (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
- X61 (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X62 (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- Xes (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X64 (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- Xes (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- Xee (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- Xe?: (positionlOOd according to Kabat numbering) can be lie or any amino acid with a reactive group in its side chain, such as cysteine;
- Xes (positionlOOe according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- Xeg (position lOOf according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X70 (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
- X71 (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X72 (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X73 (position 103 according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
- X74 (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X75 (position 106 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X76 (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His, preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
- X77 (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X78 (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X79 (position 113 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X80 is absent or Gly;
- X81 is absent or Gly;
- X82 is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 186, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 186, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described herein.
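The per-position rules above can be read as a lookup table from Kabat position to permitted residues. The following is a minimal sketch of such a check, not the applicant's method. It covers only a handful of the listed positions, and it assumes that cysteine is the only "reactive" residue of interest, although the text allows any amino acid with a reactive side-chain group. The names `ALLOWED`, `REACTIVE` and `check_positions` are hypothetical.

```python
# Sketch only: Kabat position -> residues the text lists for that position.
# Only a few of the positions listed above are included for illustration.
ALLOWED = {
    "70": {"S"},
    "71": {"R"},
    "72": {"D"},
    "89": set("LVSMWFTQEARGKYNPI"),
    "108": set("QLRPEKSTMAH"),
}

# Assumption: cysteine is the only reactive residue considered here.
REACTIVE = {"C"}


def check_positions(residues):
    """Check a {kabat_position: one_letter_residue} mapping.

    Returns (violations, conjugation_sites): positions whose residue is
    neither listed nor reactive, and positions carrying a reactive residue.
    Positions absent from ALLOWED are ignored.
    """
    violations, sites = [], []
    for pos, aa in residues.items():
        if pos not in ALLOWED:
            continue
        if aa in REACTIVE:
            sites.append(pos)
        elif aa not in ALLOWED[pos]:
            violations.append(pos)
    return violations, sites
```

For example, a candidate with Cys at position 71 and Glu at position 72 would report position 71 as a conjugation site and position 72 as a violation.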
- the at least one protein-based building block comprises or consists of SEQ ID NO.: 225: EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS
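The size proviso (about 2.5 to about 70 kDa) can be checked approximately from the primary sequence alone. The sketch below sums standard average residue masses and adds one water per chain. It ignores disulfide bonds, post-translational modifications and any conjugated cargo, so the result is an estimate. The function names are hypothetical.

```python
# Average residue masses in Da (standard values for the 20 canonical residues).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.015  # one water per chain (free N- and C-termini)


def molecular_mass_kda(seq):
    """Approximate average molecular mass of an unmodified chain, in kDa."""
    seq = "".join(seq.split()).upper()  # tolerate whitespace from line wrapping
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


def in_size_range(seq, low_kda=2.5, high_kda=70.0):
    """True if the estimated mass lies in the broadest range given in the text."""
    return low_kda <= molecular_mass_kda(seq) <= high_kda


# SEQ ID NO.: 225 as printed above.
SEQ_ID_225 = (
    "EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI"
    "SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS"
)
```

Calling `in_size_range(SEQ_ID_225)` checks the broadest range; the narrower ranges can be tested by passing other bounds, for example `in_size_range(SEQ_ID_225, 10.0, 16.0)`.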
- the at least one protein-based building block is a DARPin-based building block, preferably derived from the DARPin K27 as defined in SEQ ID NO.: 187.
- the protein-based building block is a DARPin-based building block which comprises, or alternatively, consists of, SEQ ID NO.: 188:
- X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X13 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X14 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X15 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X16 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X17 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X18 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X19 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X20 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X23 can be Phe or any amino acid with a reactive group in its side chain, such
- X26 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X27 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X28 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X29 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X31 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X32 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X33 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X34 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X35 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X36 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X37 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X38 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
- X41 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X42 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X43 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X44 can be Leu or any amino acid with a reactive group in its side
- X54 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X55 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X57 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X58 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X59 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X60 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X61 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X62 can be Glu or any amino acid with a reactive group in its side chain,
- X65 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X66 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X67 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X68 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X69 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
- X70 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X71 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X72 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X73 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
- X74 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X75 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X76 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
- X77 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X78 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X79 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X80 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X81 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X82 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X83 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X84 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X86 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X87 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X88 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X89 is absent or Leu;
- X90 is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described herein, in particular does not specifically bind to human KRAS protein (GTPase KRas, EC:3.6.5.2, primary accession number P01116, see also
- the at least one protein-based building block is a small globular human protein-based building block, preferably derived from the polypeptide as defined in SEQ ID NO.: 190.
- the protein-based building block is a small globular human protein-based building block which comprises, or alternatively, consists of, SEQ ID NO.: 191:
- X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X2 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X4 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X5 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X6 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X20 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X23 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X24 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X25 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X23b can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X24b can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X25b can be His or any amino acid with a reactive group in its side
- X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
- X41 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X42 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X43 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X46 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X47 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
- X48 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X49 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X50 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X51 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X52 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X53 can be Lys or any amino acid with a reactive group in its side chain, such as cyste
- X54 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X55 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X57 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine, or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as from about 2.5 to about 50 kDa, such as from about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such
- the at least one protein-based building block may be selected from SEQ ID NO.: 80-105, 175, 199, 208, 222-225.
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least one NLS, as described herein, covalently linked to a conjugation site or attachment point comprised in the at least one protein-based building block.
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least two NLS, as described herein, covalently linked to at least two conjugation sites or attachment points comprised in the at least one protein-based building block. More preferably, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least three NLS, as described herein, covalently linked to at least three conjugation sites or attachment points comprised in the at least one protein-based building block.
- the at least one NLS comprises or consists of the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221), see, e.g., Dang CV and Lee WM., "Identification of the human c-myc protein nuclear translocation signal", Mol Cell Biol. 1988, 8(10):4048-5.
- the at least one NLS comprises or consists of the SV40mono NLS (SEQ ID NO.: 256, PKKKRKV).
- the at least one NLS comprises or consists of the SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV).
- the at least one NLS comprises or consists of the NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
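The four NLS sequences quoted above can be compared directly. The sketch below is only an illustration: it stores the sequences as printed, reports the fraction of basic residues (Lys/Arg) in each, and tests whether one sequence is an exact tandem repeat of another, as the SV40tri and SV40mono sequences appear to be. The helper names are hypothetical.

```python
# NLS sequences exactly as quoted in the text above.
NLS = {
    "cMyc": "PAAKRVKLD",                 # SEQ ID NO.: 221
    "SV40mono": "PKKKRKV",               # SEQ ID NO.: 256
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",  # SEQ ID NO.: 304
    "NLP": "AVKRPAATKKAGQAKKKKLD",       # SEQ ID NO.: 305
}


def basic_fraction(seq):
    """Fraction of residues that are Lys or Arg."""
    return sum(seq.count(aa) for aa in "KR") / len(seq)


def repeat_count(seq, unit):
    """Number of exact tandem copies of `unit` that make up `seq`, else 0."""
    n, remainder = divmod(len(seq), len(unit))
    return n if remainder == 0 and unit * n == seq else 0


if __name__ == "__main__":
    for name, seq in NLS.items():
        print(f"{name}: length {len(seq)}, basic fraction {basic_fraction(seq):.2f}")
    print("SV40tri copies of SV40mono:", repeat_count(NLS["SV40tri"], NLS["SV40mono"]))
```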
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, and at least one cell-targeting moiety, as defined herein, attached to at least one attachment point or conjugation site.
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, and at least one cell-penetrating peptide (CPP), preferably more than one CPP, such as two, three, four, five or more CPPs, as defined herein, attached to at least one, preferably to more than one, attachment points or conjugation sites.
- the at least one protein-based carrier building block and the at least one cell-targeting moiety and/or the at least one CPP are directly linked to each other.
- they are linked to each other through a peptide linker, preferably wherein the peptide linker is selected from the linkers depicted in Table A-1, such as SEQ ID NO.: 158-169 or 193-196, or 298, or GGG.
- Other linkers may be used, such as APN-maleimide linkers, as defined below and exemplified in the examples.
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, (i) one or more NLS, as described herein, (ii) one or more cell-targeting moieties, as described herein and, optionally, (iii) one or more CPPs, as described herein.
- the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, preferably one or more cell-targeting moieties, as described herein, preferably one or more CPPs, as described herein and at least one further moiety or cargo attached to the attachment point or conjugation site, wherein the at least one further moiety or cargo is selected from: a) a half-life extending (HLE) moiety, such as PEG, and/or an albumin binding ISVD; b) a further targeting moiety, such as an EGFR-targeting moiety, e.g., GE11 peptide or an anti-EGFR ISVD (such as an anti-EGFR VHH); and/or other cell specific binding moieties; c) a therapeutic moiety or precursor therefrom, preferably a therapeutic moiety whose target is in the cell nucleus, e.g., a CDK inhibitor; d) an imaging moiety, such as
- the at least one further cargo is at least one therapeutic moiety whose target is in the cell nucleus.
- the at least one further cargo is a half-life extending moiety.
- the at least one half-life extending moiety is an albumin-binding ISVD, wherein the albumin-binding ISVD is preferably selected from SEQ ID NOs: 50-64 and 106, more preferably SEQ ID NO.: 63 or SEQ ID NO.: 106, or a sequence with at least 70%, preferably at least 80%, more preferably at least 90% and even more preferably at least 95% identity with SEQ ID NOs: 50-64 and/or 106.
- the at least one half-life extending moiety is a linear or branched polyethylene glycol moiety with a molecular weight of about 1-60 kDa, preferably with a weight of about 1-15 kDa, such as about 14 or 15 kDa, or of about 1-10 kDa, such as 5 or 10 kDa.
- the at least one protein-based carrier building block and the at least one further moiety or cargo are directly linked to each other.
- they are linked to each other through a peptide linker, preferably wherein the peptide linker is selected from the linkers depicted in Table A-1, such as SEQ ID NO.: 158-169 or 193-196, or 298, more preferably SEQ ID NO.: 163.
- Other linkers may be used, such as APN-maleimide linkers, as defined below and exemplified in the examples.
- the molecule of the present technology may comprise any one of SEQ ID NOs.: 107-127, SEQ ID NOs.: 170-174, 176, 200 or 306.
- the molecule of the present technology may comprise SEQ ID NO.: 215.
- the molecule of the present technology may comprise:
- A0315024B02(L11V,A14P,D16G,S23A,M43K,Q64K,K83R,G85E,G89D-15GS- follows:
- the present technology also provides a nucleic acid encoding the molecule of the present technology (or part of the molecule of the present technology).
- the present technology provides a vector comprising the nucleic acid of the present technology, and a composition comprising the molecule of the present technology, such as a pharmaceutical composition.
- the present technology relates to the molecule or composition of the present technology for use in medicine, in particular for use in the (prophylactic or therapeutic) treatment of diseases and/or disorders, such as autoimmune/inflammatory diseases, cancer and/or infectious diseases.
- FIG. 4 Conjugation using an APN-maleimide 'bifunctional' linker.
- the carrier protein-based building block comprised in the molecule
- the APN-maleimide 'bifunctional' linker can be first attached to the conjugation site present in the carrier.
- the cargo represented as "DR5-SH” in the figure
- the cargo can be attached to the other side of the APN- maleimide 'bifunctional' linker.
- the cargo has been attached or conjugated to the carrier through an APN-maleimide 'bifunctional' linker.
- Figure 6 Non-reducing PAGE analysis of a partial CMA1 uploaded CKS-based carrier.
- sequence as used herein (for example in terms like “immunoglobulin sequence”, “antibody sequence”, “variable domain sequence”, “VHH sequence” or “protein sequence”), should generally be understood to include both the relevant amino acid sequence as well as nucleic acids or nucleotide sequences encoding the same, unless the context requires a more limited interpretation.
- Amino acid sequences are interpreted to mean a single amino acid or an unbranched sequence of two or more amino acids, depending on the context.
- Nucleotide sequences are interpreted to mean an unbranched sequence of 3 or more nucleotides.
- any reference to the amino acid sequences is meant to encompass post-translational modifications of these sequences occurring in mammalian cells such as CHO cells, including, but not limited to, N-glycosylation, O-glycosylation, deamidation, Asp isomerization/fragmentation, pyro-glutamate formation, removal of C-terminal lysine, and Met/Trp oxidation.
- when a nucleotide sequence or amino acid sequence is said to “comprise” another nucleotide sequence or amino acid sequence, respectively, or to “essentially consist of” another nucleotide sequence or amino acid sequence, this may mean that the latter nucleotide sequence or amino acid sequence has been incorporated into the first mentioned nucleotide sequence or amino acid sequence, respectively, but more usually this means that the first mentioned nucleotide sequence or amino acid sequence comprises within its sequence a stretch of nucleotides or amino acid residues, respectively, that has the same nucleotide sequence or amino acid sequence, respectively, as the latter sequence, irrespective of how the first mentioned sequence has actually been generated or obtained (which may for example be by any suitable method described herein).
- Amino acids are organic compounds that contain amino (-NH₃⁺) and carboxylate (-CO₂⁻) functional groups, along with a side chain (R group) specific to each amino acid.
- amino acids include those L-amino acids commonly found in naturally occurring proteins.
- Amino acids in the context of the present technology, also include D-amino acids and non-natural, unusual or unnatural amino acids, as described below.
- Amino acid residues will be indicated according to the standard three-letter or one-letter amino acid code. Reference is made to Table A-2 on page 48 of WO 08/020079. Examples of amino acids commonly found in proteins and represented in the genetic code are listed in Table 1 below. Other common amino acids (excluding those listed in Table 1 below) are described on the table on p. 624 of Pure & Appl. Chem., Vol. 56, No. 5, pp. 595-624, 1984, reproduced below as Table 2 for convenience.
- Table 1 Common amino acids (IUPAC)
- D-amino acids are also encompassed by the definition of "amino acid".
- D-amino acid refers to amino acids where the stereogenic carbon alpha to the amino group has the D-configuration.
- unusual, unnatural or non-natural amino acids are also encompassed by the definition of "amino acid".
- the term “unnatural amino acid” or “non-canonical amino acid” or “non-natural amino acid” or “novel amino acid” (or the like) refers to an amino acid that is not one of the twenty amino acids commonly found in peptides synthesized in nature, and known by the one letter abbreviations A, R, N, C, D, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y and V.
- Exemplary unnatural amino acids are described in Young et al., "Beyond the canonical 20 amino acids: expanding the genetic lexicon," J. of Biological Chemistry, 285(15): 11039-11044 (2010), the disclosure of which is incorporated herein by reference.
- Non-limiting examples of unnatural amino acids include: p-acetyl-phenylalanine, O-4-allyl-L-tyrosine, 4-propyl-L-tyrosine, L-Dopa, p-azido-phenylalanine, N6-(propargyloxy)-carbonyl-L-lysine (PrK), azido-lysine (N6-azidoethoxy-carbonyl-L-lysine, AzK).
- the unnatural amino acid comprises a selective reactive group, or a reactive group for site-selective labeling or conjugation of a moiety or cargo.
- the chemistry is a bioorthogonal reaction (e.g., biocompatible and selective reactions).
- the chemistry is a Cu(I)-catalyzed or "copper-free" alkyne-azide triazole-forming reaction, the Staudinger ligation, inverse-electron-demand Diels-Alder (IEDDA) reaction, "photo-click" chemistry, or a metal-mediated process such as olefin metathesis and Suzuki-Miyaura or Sonogashira cross-coupling.
- the terms “protein”, “peptide”, “protein/peptide”, and “polypeptide” are used interchangeably throughout the present disclosure, and each has the same meaning for purposes of this disclosure.
- Each term refers to an organic compound made of a linear chain of two or more amino acids.
- the compound may have ten or more amino acids; twenty-five or more amino acids; fifty or more amino acids; one hundred or more amino acids, two hundred or more amino acids, and even three hundred or more amino acids.
- polypeptides generally comprise fewer amino acids than proteins, although there is no art-recognized cut-off in the number of amino acids that distinguishes a polypeptide from a protein; polypeptides may be made by chemical synthesis or recombinant methods; and proteins are generally made in vitro or in vivo by recombinant methods as known in the art.
- the amide bond in the primary structure of polypeptides is in the order that the amino acids are written, in which the amine end (N-terminus) of a polypeptide is always on the left, while the acid end (C-terminus) is on the right.
- Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated using the symbols shown in Table 1 with the modified positions; e.g., hydroxylations or glycosylations, but these modifications shall not be shown explicitly in the amino acid sequence.
- binding specifically refers to the number of different target molecules, such as antigens, to which a particular binding unit can bind with sufficiently high affinity (see below).
- Specificity refers to the number of different target molecules, such as antigens, to which a particular binding unit can bind with sufficiently high affinity (see below).
- the terms “specificity” and “binding specifically” are used interchangeably herein with “selectivity”, “binding selectively” or “selective binding”.
- binding units such as binding ISVDs, specifically bind to their designated targets.
- the specificity/selectivity of a binding unit can be determined based on affinity.
- the affinity denotes the strength or stability of a molecular interaction.
- the affinity is commonly given by the KD, or dissociation constant, which has units of mol/litre (or M).
- the affinity can also be expressed as an association constant, KA, which equals 1/KD and has units of (mol/litre)⁻¹ (or M⁻¹).
- the affinity is a measure for the binding strength between a moiety and a binding site on a target molecule: the lower the value of the KD, the stronger the binding strength between a target molecule and a targeting moiety.
- the off-rate koff has units s⁻¹ (where s is the SI unit notation of second).
- the on-rate kon has units M⁻¹s⁻¹.
- the on-rate may vary between 10² M⁻¹s⁻¹ and about 10⁷ M⁻¹s⁻¹, approaching the diffusion-limited association rate constant for bimolecular interactions.
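The units quoted above are consistent with the standard kinetic relation KD = koff/kon. That relation is general textbook knowledge and is not spelled out in this passage. A minimal sketch under that assumption, with hypothetical function names and purely illustrative rate constants:

```python
def dissociation_constant(k_off, k_on):
    """KD in M, from k_off in s^-1 and k_on in M^-1 s^-1 (KD = k_off / k_on)."""
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on


def association_constant(k_d):
    """KA in M^-1, defined in the text as 1/KD."""
    if k_d <= 0:
        raise ValueError("KD must be positive")
    return 1.0 / k_d


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the document.
    kd = dissociation_constant(k_off=1e-3, k_on=1e5)
    print(f"KD = {kd:.1e} M, KA = {association_constant(kd):.1e} M^-1")
```

A lower KD (or, equivalently, a higher KA) corresponds to stronger binding, in line with the definition above.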
- the measured KD may correspond to the apparent KD if the measuring process somehow influences the intrinsic binding affinity of the implied molecules, for example by artefacts related to the coating on the biosensor of one molecule. Also, an apparent KD may be measured if one molecule contains more than one recognition site for the other molecule or molecules. In such a situation the measured affinity may be affected by the avidity of the interaction between the two molecules.
- the dissociation constant (KD) may be the actual or apparent dissociation constant, as will be clear to the skilled person. Methods for determining the KD will be clear to the skilled person, and for example include the techniques mentioned below. In this respect, it will also be clear that it may not be possible to measure dissociation constants of more than 10⁻⁴ moles/litre or 10⁻³ moles/litre (e.g., of 10⁻² moles/litre).
- a temperature specified in °C with no decimal place shall have an error margin of ±1°C (e.g., a temperature value of about 50°C means 50°C ± 1°C); a time indicated in hours shall have an error margin of 0.1 hours irrespective of the decimal places (e.g., a time value of about 1.0 hours means 1.0 hours ± 0.1 hours; a time value of about 0.5 hours means 0.5 hours ± 0.1 hours).
- any parameter indicated with the term “about” is also contemplated as being disclosed without the term “about”.
- embodiments referring to a parameter value using the term “about” shall also describe an embodiment directed to the numerical value of said parameter as such.
- an embodiment specifying a pH of “about pH 2.7” shall also disclose an embodiment specifying a pH of “pH 2.7” as such; an embodiment specifying a pH range of “between about pH 2.7 and about pH 2.1” shall also describe an embodiment specifying a pH range of “between pH 2.7 and pH 2.1”, etc.
- the percentage of "sequence identity" between a first nucleotide sequence and a second nucleotide sequence may be calculated by dividing [the number of nucleotides in the first nucleotide sequence that are identical to the nucleotides at the corresponding positions in the second nucleotide sequence] by [the total number of nucleotides in the first nucleotide sequence] and multiplying by [100%], in which each deletion, insertion, substitution or addition of a nucleotide in the second nucleotide sequence - compared to the first nucleotide sequence - is considered as a difference at a single nucleotide (position).
- the degree of sequence identity between two or more nucleotide sequences may be calculated using a known computer algorithm for sequence alignment such as NCBI Blast v2.0, using standard settings.
- Some other techniques, computer algorithms and settings for determining the degree of sequence identity are for example described in WO 04/037999, EP 0967284, EP 1085089, WO 00/55318, WO 00/78972, WO 98/49185 and GB 2357768.
- the nucleotide sequence with the greatest number of nucleotides will be taken as the "first" nucleotide sequence, and the other nucleotide sequence will be taken as the "second" nucleotide sequence.
- the percentage of "sequence identity" between a first amino acid sequence and a second amino acid sequence may be calculated by dividing [the number of amino acid residues in the first amino acid sequence that are identical to the amino acid residues at the corresponding positions in the second amino acid sequence] by [the total number of amino acid residues in the first amino acid sequence] and multiplying by [100%], in which each deletion, insertion, substitution or addition of an amino acid residue in the second amino acid sequence - compared to the first amino acid sequence - is considered as a difference at a single amino acid residue (position), i.e., as an "amino acid difference" as defined herein.
- the degree of sequence identity between two amino acid sequences may be calculated using a known computer algorithm, such as those mentioned above for determining the degree of sequence identity for nucleotide sequences, again using standard settings.
- the amino acid sequence with the greatest number of amino acid residues will be taken as the "first" amino acid sequence, and the other amino acid sequence will be taken as the "second" amino acid sequence.
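The identity calculation defined above can be sketched in a few lines of Python. This is a minimal illustration, not the method used by NCBI Blast or any other alignment tool: it assumes the two sequences have already been aligned to the same length, with `-` marking gaps, and the function name `percent_identity` and its parameters are hypothetical. Following the definition, the sequence with more residues is taken as the "first" sequence and supplies the denominator.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences (equal aligned length).

    Assumption: the alignment was produced elsewhere; gaps are marked with `gap`.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")

    # Count residues (or nucleotides) in each sequence, ignoring gap characters.
    len_a = sum(ch != gap for ch in seq_a)
    len_b = sum(ch != gap for ch in seq_b)

    # The sequence with the greatest number of residues is the "first" sequence.
    first_len = max(len_a, len_b)
    if first_len == 0:
        raise ValueError("sequences contain no residues")

    # Identical residues at corresponding (aligned) positions; gaps never match.
    identical = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != gap)

    return 100.0 * identical / first_len


if __name__ == "__main__":
    # One substitution (E -> Q) and one deletion (I) relative to the longer sequence.
    print(round(percent_identity("ACDEFGHIK", "ACDQFGH-K"), 2))
```

Because the denominator is the length of the longer sequence, a deletion in the shorter sequence lowers the identity in the same way as a substitution does.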
- conservative amino acid substitutions can generally be described as amino acid substitutions in which an amino acid residue is replaced with another amino acid residue of similar chemical structure and which has little or essentially no influence on the 3D structure, function, activity, or other biological properties of the polypeptide.
- Such conservative amino acid substitutions are well known in the art, for example from WO 04/037999, GB 335768, WO 98/49185, WO 00/46383, and WO 01/09300; and (preferred) types and/or combinations of such substitutions may be selected on the basis of the pertinent teachings from WO 04/037999 as well as WO 98/49185 and from the further references cited therein.
- Such conservative substitutions preferably are substitutions in which one amino acid within the following groups (a) - (e) is substituted by another amino acid residue within the same group: (a) small aliphatic, nonpolar or slightly polar residues: Ala, Ser, Thr, Pro and Gly; (b) polar, negatively charged residues and their (uncharged) amides: Asp, Asn, Glu and Gln; (c) polar, positively charged residues: His, Arg and Lys; (d) large aliphatic, nonpolar residues: Met, Leu, Ile, Val and Cys; and (e) aromatic residues: Phe, Tyr and Trp.
- Particularly preferred conservative substitutions are as follows: Ala into Gly or into Ser; Arg into Lys; Asn into Gln or into His; Asp into Glu; Cys into Ser; Gln into Asn; Glu into Asp; Gly into Ala or into Pro; His into Asn or into Gln; Ile into Leu or into Val; Leu into Ile or into Val; Lys into Arg, into Gln or into Glu; Met into Leu, into Tyr or into Ile; Phe into Met, into Leu or into Tyr; Ser into Thr; Thr into Ser; Trp into Tyr; Tyr into Trp; and/or Phe into Val, into Ile or into Leu.
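The group-based rule in (a) to (e) above lends itself to a simple lookup. The following is a minimal sketch, assuming three-letter residue codes; the names `GROUPS`, `RESIDUE_TO_GROUP` and `is_conservative` are hypothetical. It encodes only groups (a) to (e), not the list of particularly preferred substitutions, some of which (for example, Cys into Ser) cross group boundaries.

```python
# Groups (a)-(e) as listed in the text, using three-letter residue codes.
GROUPS = {
    "a": {"Ala", "Ser", "Thr", "Pro", "Gly"},   # small aliphatic, nonpolar or slightly polar
    "b": {"Asp", "Asn", "Glu", "Gln"},          # negatively charged residues and their amides
    "c": {"His", "Arg", "Lys"},                 # polar, positively charged
    "d": {"Met", "Leu", "Ile", "Val", "Cys"},   # large aliphatic, nonpolar
    "e": {"Phe", "Tyr", "Trp"},                 # aromatic
}

# Reverse lookup: residue -> group label.
RESIDUE_TO_GROUP = {res: label for label, members in GROUPS.items() for res in members}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if replacing `original` by `replacement` stays within one group."""
    if original == replacement:
        return False  # not a substitution at all
    group_orig = RESIDUE_TO_GROUP.get(original)
    group_repl = RESIDUE_TO_GROUP.get(replacement)
    return group_orig is not None and group_orig == group_repl


if __name__ == "__main__":
    print(is_conservative("Asp", "Glu"))  # both in group (b)
    print(is_conservative("Ala", "Trp"))  # group (a) vs group (e)
```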
- amino acid sequences and nucleic acid sequences are said to be “exactly the same” if they have 100% sequence identity (as defined herein) over their entire length.
- amino acid difference refers to an insertion, deletion or substitution of a single amino acid residue on a position of the first sequence, compared to the second sequence; it being understood that two amino acid sequences may contain one, two or more such amino acid differences.
- protein solubility is a thermodynamic parameter defined as the concentration of protein in a saturated solution that is in equilibrium with a solid phase, either crystalline or amorphous, under a given set of conditions (see, e.g., Kramer RM. et al., "Toward a molecular understanding of protein solubility: increased negative surface charge correlates with increased solubility", Biophys J., 2012, 102(8):1907-15).
- the molecule of the present technology comprises at least one protein-based carrier building block (also referred herein as “carrier building block”, “protein-based building block”, or simply “building block” or “carrier”), as defined herein.
- the molecule of the present technology may comprise a single protein-based carrier building block.
- the molecule comprises more than one protein-based building blocks, such as two, three, four, five, six or more carrier building blocks.
- the protein-based carrier building block comprises (and, preferably, consists of) at least part of a protein or a whole structured protein, i.e., the protein-based carrier building block is preferably a polypeptide.
- the protein-based carrier building block is designed as a "carrier” or “delivery” moiety, with at least one attachment point or conjugation site, preferably with at least two attachment points or conjugation sites, for conjugation or attachment of cargos, as defined in detail below.
- Suitable cargos include proteins, peptides, toxic payloads, nucleic acids, oligonucleotides, fluorophores, glycans, chelators for/and radio-isotopes, polyethylene glycol (PEG) molecules, vitamins (such as biotin or folate), etc. Specific non-limiting examples of suitable cargos are depicted below in the present description.
- An attachment point or conjugation site in the context of the present technology, refers to any group comprised in the protein-based building block which is suitable for attaching or conjugating a cargo to it.
- the attachment point or conjugation site is preferably present at a solvent-accessible position in the protein-based building block, as explained in detail below.
- An attachment point or conjugation site may be a reactive group present in the side chain of any amino acid in the protein-based carrier building block, preferably an amino acid present at a solvent-accessible position in the protein-based carrier building block, or may be the N-terminal primary amine, and/or the C-terminal carboxylic group of the protein-based building block.
- the attachment point/conjugation site allows the formation of a covalent bond with a group present in the cargo to be conjugated and/or attached to the protein-based carrier building block.
- the attachment point or conjugation site is a reactive group present in the side chain of an amino acid in the protein-based carrier building block, preferably present at a solvent-accessible position in the protein-based carrier building block, which allows the formation of a covalent bond with a group present in the cargo to be conjugated and/or attached to the protein-based carrier building block.
- two of the conjugation sites or attachment points of the protein-based building block are reactive groups present in the side chain of two amino acids present in the protein-based carrier building block, preferably two amino acids present at solvent-accessible positions in the protein-based carrier building block.
- all of the conjugation sites or attachment points of the protein-based building block are reactive groups present in the side chain of amino acids present in the protein-based carrier building block, preferably amino acids present at solvent-accessible positions in the protein-based carrier building block.
- the protein-based carrier building block comprised in the molecule of the present technology has a globular three-dimensional (3D) structure, i.e., it is or comprises a structured protein with a globular 3D structure.
- Globular proteins have an approximately spherical shape. Nearly all globular proteins contain substantial numbers of α-helices and/or β-sheets folded into a compact structure that is stabilized by both polar and nonpolar interactions.
- the globular 3D structure forms naturally and often involves interactions mediated by the side chains of the amino acids. Most often, the hydrophobic amino acid side chains are buried, closely packed, in the interior of a globular protein, out of contact with water. Hydrophilic amino acid side chains lie on the surface of the globular proteins exposed to the water.
- tertiary structure can be defined as the level of protein structure at which an entire polypeptide chain has folded into a 3D structure.
- tertiary structure applies to the individual chains. See Smith, A.D., et al., eds. 1997, Oxford Dictionary of Biochemistry and Molecular Biology, New York: Oxford University Press.
- CD can be used to estimate the structure of unknown proteins and monitor conformational changes due to temperature, mutations, heat, denaturants or binding interactions.
- α-helical proteins have negative bands at 222 nm and 208 nm and a positive band at 193 nm.
- Proteins with well-defined antiparallel β-pleated sheets (β-helices) have negative bands at 218 nm and positive bands at 195 nm, while disordered proteins have very low ellipticity above 210 nm and negative bands near 195 nm. See Greenfield NJ., "Using circular dichroism spectra to estimate protein secondary structure", Nat Protoc., 2006, 1(6):2876-90 for further details.
- the protein-based carrier building block comprised in the molecule of the present technology comprises at least one α-helix and/or at least one β-sheet as part of its secondary structure, preferably more than one α-helix and/or more than one β-sheet as part of its secondary structure, leading to a globular 3D tertiary structure.
- This allows the engineering of site- and stereospecific-conjugation sites or attachment points, as described in detail in this specification.
- the presence of at least one α-helix and/or at least one β-sheet in a certain polypeptide or protein can be determined by known techniques, as explained above, such as, e.g., CD.
- the protein-based carrier building block comprised in the molecule of the present technology is soluble.
- a soluble building block means that the building block has a solubility of 10 mg/mL or more, preferably of 20 mg/mL, preferably of 50 mg/mL or more, and even more preferably of 100 mg/mL or more, measured in water or a suitable buffer or solvent (e.g., an aqueous solution, or a physiological buffer, such as a buffer which is amenable for parenteral administration) at room temperature (RT).
- DPBS Dulbecco's phosphate buffered saline
- histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%))
- phosphate buffer pH 7.0, at RT comprising Na
- solubility measurements can be performed as follows.
- the protein solution e.g., in citrate buffer 5 mM, pH 7.0, or in PBS pH 7.4, or in water, or in any of the suitable buffers described above
- ultrafiltration e.g., via tangential flow filtration (TFF)
- the solution is spun at high speed or filtered through a 0.22 µm filter to remove any non-soluble material, and the OD280 of the supernatant is measured.
- the protein concentration of the supernatant (and, thus, the concentration of the protein in a saturated solution that is in equilibrium with a solid phase, i.e., the protein solubility) is obtained.
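Converting the measured OD280 into a protein concentration is commonly done with the Beer-Lambert law (A = ε · c · l). The sketch below is illustrative only: the source does not state which conversion is used, and the function names, the extinction coefficient, the molar mass and the dilution factor in the usage example are hypothetical values. The 10 mg/mL default threshold is the lowest solubility limit given in the definition of a soluble building block above.

```python
def concentration_mg_per_ml(
    od280: float,
    extinction_coeff_m1_cm1: float,  # molar extinction coefficient, M^-1 cm^-1 (assumed known)
    molar_mass_da: float,            # molar mass of the protein in Da (g/mol)
    path_length_cm: float = 1.0,
    dilution_factor: float = 1.0,    # how many fold the supernatant was diluted before reading
) -> float:
    """Protein concentration from absorbance at 280 nm via the Beer-Lambert law."""
    if extinction_coeff_m1_cm1 <= 0 or path_length_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    # c (mol/L) = A / (epsilon * l), corrected for any dilution of the sample.
    molar = od280 * dilution_factor / (extinction_coeff_m1_cm1 * path_length_cm)
    # mol/L * g/mol = g/L, which is numerically equal to mg/mL.
    return molar * molar_mass_da


def is_soluble(solubility_mg_per_ml: float, threshold_mg_per_ml: float = 10.0) -> bool:
    """Compare a measured solubility with a threshold (10 mg/mL by default)."""
    return solubility_mg_per_ml >= threshold_mg_per_ml


if __name__ == "__main__":
    # Hypothetical 15 kDa protein, supernatant diluted 100-fold before measurement.
    conc = concentration_mg_per_ml(0.5, 30000.0, 15000.0, dilution_factor=100.0)
    print(round(conc, 2), is_soluble(conc))
```

Saturated supernatants usually need dilution to bring the reading into the linear range of the spectrophotometer, which is why the sketch carries an explicit dilution factor.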
- the protein-based carrier building block comprised in the molecule of the present technology is soluble in reduced state, i.e., it is soluble when the -SH groups (e.g., in the side chain of one or more Cys) present at solvent accessible positions in its amino acid sequence, if any, is(are) in a reduced form (as "-SH"), and not oxidized.
- a protein-based carrier building block may be reduced when subjected to reducing conditions for enough time.
- reducing conditions may mean using beta-mercaptoethanol (2-ME), dithiothreitol (DTT) or TCEP (tris(2-carboxyethyl)phosphine).
- the building block comprised in the molecule of the present technology has a size of about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, or such as about 2.5, 3, 5, 6.5, 7, 10, 11, 12, 13, 14, 15 or 16 kDa.
- the protein-based building block may have a size (molecular mass) of about 6 kDa, or of about 7 kDa, or of about 15 kDa, or of about 16 kDa.
- the protein-based carrier building block has a size of about 15 kDa.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein. If the building block shows any interaction with one or more human proteins, such interaction is characterized by low specificity and/or low affinity, as defined herein.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins or Fc-sensors.
- the protein-based carrier building block does not specifically bind C-type lectin receptors (CLRs).
- All antibodies possess two functional domains — one that confers antigen specificity, known as the antigen-binding fragment (Fab), and another that drives antibody function, known as the crystallizable fragment (Fc).
- the specific effector functions that are triggered by antibodies are determined by the receptors to which the antibody Fc domain binds and the specific innate immune cells on which these FcRs are expressed.
- These sensors include both classical FcRs and non-classical C-type lectin receptors (CLRs), see Lu, L. et al., "Beyond binding: antibody effector functions in infectious diseases", Nat Rev Immunol, 2018, 18, 46-61.
- Table 1 of Lu, L. et al. provides non-limiting examples of Fc domain sensors (e.g., Fcγ or FcRn) to which the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind. Consequently, the protein-based carrier building block comprised in the molecule of the present technology does not show effector functions of conventional antibodies mediated by the Fc domain.
- the protein-based carrier building block and/or the molecule does not specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins or Fc-sensors.
- the protein-based carrier building block and/or the molecule does not specifically bind C-type lectin receptors (CLRs).
- none of the components comprised in the molecule of the present technology (e.g., at least one protein-based carrier building block and at least one NLS (cargo) attached or conjugated to it) specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins, Fc-sensors and/or CLRs.
- the protein-based building block and/or the molecule of the present technology does not show effector functions of conventional antibodies mediated by the Fc domain, i.e., none of the components comprised in the molecule of the present technology show effector functions of conventional antibodies mediated by the Fc domain.
- the molecule of the present technology does not include conventional VH-VL pairing/interaction and/or does not include CL-CH1 pairing such as CL-CH1 binding disulphide bridges.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind the variable domain of the light chain (VL) and/or the variable domain of the heavy chain (VH) of an antibody, such as the VL and/or the VH of a monoclonal antibody (mAb).
- the protein-based carrier building block does not specifically bind the first constant domain of the heavy chain (CH1) of an antibody, such as the CH1 of a mAb.
- the protein-based carrier building block does not specifically bind the constant domain of the light chain (CL) of an antibody, such as the CL of a mAb.
- the protein-based carrier building block does not specifically bind the third constant domain of the heavy chain (CH3) of an antibody, such as the CH3 of a mAb. In another embodiment, the protein-based carrier building block does not specifically bind the second constant domain of the heavy chain (CH2) of an antibody, such as the CH2 of a mAb.
- the building block and/or the molecule of the present technology is not a Fab fragment from an antibody, such as from a mAb. In one embodiment, the building block and/or the molecule of the present technology is not a CH, preferably is not a CH1 fragment from an antibody, such as from a mAb. The building block and/or the molecule of the present technology is not an antibody, such as a mAb, is not a Fc fragment, or a Fv fragment.
- the protein-based carrier building block comprised in the molecule of the present technology may derive from a target-binding protein (such as an ISVD, a DARPin, an affibody or an affitin) (the "protein-based carrier building block precursor").
- a “protein-based carrier building block precursor” or “building block precursor” is a protein-based moiety which may be modified to generate the protein-based carrier building block comprised in the molecule of the present technology.
- the "protein-based carrier building block precursor” is a protein which is modified (e.g., by point mutations and/or by addition/deletion of amino acids to its sequence) to generate the protein-based carrier building block comprised in the molecule of the present technology.
- the "protein-based carrier building block precursor” is modified so that it no longer specifically binds any human protein, preferably so that it also does not specifically bind any (non-human) molecule (including non-human biomolecule) and/or any non-protein (human) molecule (including biomolecule), in particular any molecule (including biomolecule) to which the precursor specifically binds.
- the "protein-based carrier building block precursor” is modified so that it incorporates one or more attachment points or conjugation sites as described herein.
- the "protein-based carrier building block precursor” has a sequence identity of at least 60%, such as at least 70%, or at least 75%, preferably of at least 80% with the protein-based carrier building block derived from it.
- the "protein-based carrier building block precursor” has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with the protein-based carrier building block derived from it.
- the "protein-based carrier building block precursor” may share the whole amino acid sequence with the protein-based carrier building block derived from it with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, twenty or more amino acids.
- the protein-based carrier building block derived from a protein-based carrier building block precursor has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, does not specifically bind any human protein and preferably does not specifically bind any protein or non-protein molecule to which the precursor specifically binds.
- the carrier building block comprised in the molecule of the present technology does also not specifically bind to any non-protein molecule (including non-protein biomolecules, such as nucleic acids, e.g., DNA and/or RNA, lipids (e.g., phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecule), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., such as phosphatidylserine (PS)) or human glycans, e.g., human glycolipids.
- the carrier building block does also not specifically bind to any non-protein molecule (including biomolecules) (such as nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecules), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., phosphatidylserine (PS)) or human glycans, e.g., human glycolipids to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule (including biomolecules) or a non-human protein).
- the carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein or part thereof present on the surface of human cells.
- the carrier building block comprised in the molecule of the present technology does not specifically bind to any human molecule (such as protein, lipid, sugar, etc.) or part thereof present on the surface of human cells.
- the protein-based building block comprised in the molecule of the present technology does also not specifically bind to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule (including biomolecules)), or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule) with a KD value greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block is an anti-RSV (respiratory syncytial virus) ISVD (i.e., it specifically binds one or more proteins of RSV, such as protein F of RSV)
- the protein-based carrier building block derived from it preferably does not specifically bind those RSV proteins (or binds those proteins, such as protein F of RSV, preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the precursor of the protein-based carrier building block specifically binds to viruses (e.g., it is an anti-viral ISVD, an anti-viral DARPin, an anti-viral affitin, an anti-viral affibody, or the like) and/or to viral molecules (e.g., it specifically binds one or more viral biomolecules, e.g., viral proteins, viral nucleic acids, viral lipids or viral glycans), the protein-based carrier building block derived from it preferably does not specifically bind those viruses and/or viral molecules (or binds those viruses and/or viral molecules preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to virus (e.g., it is an anti-viral ISVD, an anti-viral DARPin, an anti-viral affitin, an anti-viral affibody, or the like) and/or to viral molecules (e.g., it specifically binds one or more viral biomolecules, e.g., viral proteins, viral nucleic acids, viral lipids or viral glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- viruses which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: RSV, influenza virus, rabies virus, potyvirus, bacteriophage, rotavirus, HIV protein, Hepatitis B virus, Hepatitis C virus, norovirus, Shiga toxins from lambdoid prophages, Herpes simplex virus, Grapevine fanleaf virus (GFLV), Ebola, Middle East respiratory syndrome (MERS) virus, severe acute respiratory syndrome (SARS) virus, SARS-CoV-2, Vibrio or a White Spot Syndrome virus, cytomegalovirus, parvovirus, ZIKA virus, Chikungunya Virus (CHIKV).
- the protein-based building block precursor (e.g., an ISVD)
- the protein-based building block precursor may specifically bind to one or more of these viruses (or molecules, including biomolecules, comprised therein).
- the resulting protein-based building block may not specifically bind to the virus (or molecules, including biomolecules, comprised therein) to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these viruses (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the precursor of the protein-based carrier building block specifically binds to protozoa (a microorganism, unicellular eukaryote) (e.g., it is an anti-protozoa ISVD, an anti-protozoa DARPin, an anti-protozoa affitin, an anti-protozoa affibody, or the like) and/or to protozoa molecules (e.g., it specifically binds one or more protozoa biomolecules, e.g., protozoa proteins, protozoa nucleic acids, protozoa lipids or protozoa glycans), the protein-based carrier building block derived from it preferably does not specifically bind those protozoa and/or protozoa molecules (or binds those protozoa and/or protozoa molecules preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to protozoa (e.g., it is an anti-protozoa ISVD, an anti-protozoa DARPin, an anti-protozoa affitin, an anti-protozoa affibody, or the like) and/or to protozoa molecules (e.g., it specifically binds one or more protozoa biomolecules, e.g., protozoa proteins, protozoa nucleic acids, protozoa lipids or protozoa glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- protozoa and protozoa molecules to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: Trypanosoma evansi, Eimeria sitesdae, Variant surface glycoprotein (VSG).
- the protein-based building block precursor may specifically bind to one or more of these protozoa (or molecules, including biomolecules, comprised therein).
- the resulting protein-based building block may not specifically bind to the protozoa (or molecules, including biomolecules, comprised therein) to which the precursor binds.
- the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these protozoa (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to mammalian proteins (e.g., it is an anti-mammalian protein ISVD, an anti-mammalian protein DARPin, an anti-mammalian protein affitin, an anti-mammalian protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those mammalian proteins (or binds those mammalian proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to mammalian proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- a mammalian protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is bovine serum albumin.
- the resulting protein-based building block may not specifically bind to the mammalian protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this mammalian protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to avian proteins (e.g., it is an anti-avian protein ISVD, an anti-avian protein DARPin, an anti-avian protein affitin, an anti-avian protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those avian proteins (or binds those avian proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to avian proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- an example of an avian protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is ovalbumin (chicken).
- the protein-based building block precursor may specifically bind to this avian protein.
- the resulting protein-based building block may not specifically bind to the avian protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this avian protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the precursor of the protein-based carrier building block specifically binds to yeast and/or moulds proteins (e.g., it is an anti-yeast and/or anti-moulds protein ISVD, an anti-yeast and/or moulds protein DARPin, an anti-yeast and/or moulds protein affitin, an anti-yeast and/or moulds protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those yeast and/or moulds proteins (or binds those yeast and/or moulds proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to yeast and/or moulds proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- Examples of yeast and moulds proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are yeast extract, inactivated yeast and Candida.
- the protein-based building block precursor e.g., an ISVD
- the resulting protein-based building block may not specifically bind to at least one of these yeast and/or moulds proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards these yeast and/or moulds proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to plant proteins (e.g., it is an anti-plant protein ISVD, an anti-plant protein DARPin, an anti-plant protein affitin, an anti-plant protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those plant proteins (or binds those plant proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to plant proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- the protein-based building block precursor e.g., an ISVD
- the protein-based building block precursor may specifically bind to one or more of these plant proteins.
- the resulting protein-based building block may not specifically bind to at least one of these plant proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards these plant proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to fungi proteins (e.g., it is an anti-fungi protein ISVD, an anti-fungi protein DARPin, an anti-fungi protein affitin, an anti-fungi protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those fungi proteins (or binds those fungi proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to fungi proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- Examples of fungi proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are cutinase, chitin and fungus sphingolipids.
- the protein-based building block precursor e.g., an ISVD
- the protein-based building block precursor may specifically bind to at least one of these fungi proteins.
- the resulting protein-based building block may not specifically bind to the at least one of these fungi proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards at least one of these fungi proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the precursor of the protein-based carrier building block specifically binds to bacteria (e.g., it is an anti-bacterial ISVD, an anti-bacterial DARPin, an anti-bacterial affitin, an anti-bacterial affibody, or the like) and/or to bacterial molecules (e.g., it specifically binds one or more bacterial biomolecules, e.g., bacterial proteins, bacterial nucleic acids, bacterial lipids or bacterial glycans), the protein-based carrier building block derived from it preferably does not specifically bind those bacteria and/or bacterial molecules (or binds those bacteria and/or bacterial molecules preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to bacteria (e.g., it is an anti-bacterial ISVD, an anti-bacterial DARPin, an anti-bacterial affitin, an anti-bacterial affibody, or the like) and/or to bacterial molecules (e.g., it specifically binds one or more bacterial biomolecules, e.g., bacterial proteins, bacterial nucleic acids, bacterial lipids or bacterial glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- Examples of bacteria and bacterial molecules to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: Beta-lactamase, tetanus toxin, Lactate Oxidase, Salmonella typhimurium, Helicobacter pylori, Mycobacterium tuberculosis, Clostridium difficile (toxin A and B), Pseudomonas aeruginosa, Bacillus anthracis, Botulinum Neurotoxin, Treponema pallidum, Chlamydia trachomatis, Escherichia coli, Campylobacter jejuni (flagella), Salmonella enterica, Bordetella pertussis (toxin), Shigella spp, Streptomyces venezuelae, chloramphenicol.
- the protein-based building block precursor e.g., an ISVD
- the protein-based building block precursor may specifically bind to one or more of these bacteria (or their molecules, including biomolecules).
- the resulting protein-based building block may not specifically bind to the bacteria (or their molecules, including biomolecules) to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these bacteria (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to non-human animal proteins, such as snake proteins (e.g., it is an anti-snake protein ISVD, an anti-snake protein DARPin, an anti-snake protein affitin, an anti-snake protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those snake proteins (or binds those snake proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to snake proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- An example of a snake protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is cobra toxin.
- the protein-based building block precursor e.g., an ISVD
- the protein-based building block precursor may specifically bind to this snake protein.
- the resulting protein-based building block may not specifically bind to the snake protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this snake protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the precursor of the protein-based carrier building block specifically binds to green fluorescent protein (GFP), which is a protein from jellyfish (sea jellies) and corals, sea anemones, zoanthids, copepods and lancelets (e.g., it is an anti-GFP ISVD, an anti-GFP DARPin, an anti-GFP affitin, an anti-GFP affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind GFP (or binds GFP preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to GFP, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- the protein-based building block precursor e.g., an ISVD
- the resulting protein-based building block may not specifically bind to GFP to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards GFP, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to insect proteins (e.g., it is an anti-insect protein ISVD, an anti-insect protein DARPin, an anti-insect protein affitin, an anti-insect protein affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind those insect proteins (or binds those insect proteins preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to insect proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- Examples of insect proteins include Androctonus australis hector toxins, chitin, chitin binding domain (CBD), V-ATPase subunit C, trehalase, cytochrome P450 monooxygenase, chitin deacetylase, chitin synthase and NPC1 sterol transporter.
- the protein-based building block precursor e.g., an ISVD
- the resulting protein-based building block may not specifically bind to at least one of these insect proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards at least one of these insect proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block specifically binds to chitin, which is a crustaceans protein (e.g., it is an anti-chitin ISVD, an anti-chitin DARPin, an anti-chitin affitin, an anti-chitin affibody, or the like)
- the protein-based carrier building block derived from it preferably does not specifically bind chitin (or binds chitin preferably with a KD value greater than 5×10⁻⁴ mol/litre, as described herein).
- the protein-based carrier building block specifically binds to chitin, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block.
- the protein-based building block precursor e.g., an ISVD
- the resulting protein-based building block may not specifically bind to chitin to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards chitin, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
- the protein-based carrier building block does not specifically bind to the precursor's target, should the protein-based carrier building block precursor have a target and should this be a non-human molecule (including biomolecules), such as a non-human protein.
- the at least one protein-based building block comprised in the molecule of the present technology does not specifically bind any RSV protein, such as protein F of RSV, or binds any RSV protein, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- Tables A-1 and A-2 provide examples of F-protein binding sequences.
- the at least one protein-based building block comprised in the molecule of the present technology does not comprise/does not consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656.
- the at least one protein-based building block comprised in the molecule of the present technology does not comprise/does not consist of the amino acid sequence as defined in SEQ ID NO.: 214.
- the protein-based building block comprised in the molecule of the present technology when it has at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) attached to it (via at least one conjugation sites or attachment points comprised therein) does not specifically bind to any molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with at least a cargo attached to it, preferably does not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule (including biomolecules)), or binds to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with a cargo attached to it, preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a
- the protein based carrier building block comprised in the molecule of the present technology shows any specific binding towards a specific target, such as towards a molecule (including biomolecules, e.g., human, non-human animal, plant, microbial, viral, etc.), or towards a cell (e.g., animal, human, plant cell), microorganisms, virus, etc., that specific binding is eliminated when at least a cargo is attached to at least one attachment point or conjugation site comprised in the protein-based building block.
- the cargo attached to the protein-based building block may of course show specific binding towards a target (including biomolecules, as described herein), but the protein-based building block no longer specifically binds its target.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human or non-human (e.g., non-human animal, plant, yeast, etc.) cell and/or cell type (such as the ones exemplified in Example 6). If the building block shows any interaction with one or more human or non-human cells and/or cell types, such interaction is characterized by low specificity and/or low affinity, as defined herein.
- the carrier building block does also not specifically bind to any human or non-human cell and/or cell type to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule or a protein present on the surface of a human cell).
- the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule or a protein present on the surface of a human cell.
- the lack of binding to any human or non-human cell and/or cell type can for example be assessed with the "cell binding assay" as described below (see also, e.g., Hunter SA and Cochran JR, "Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations", Methods Enzymol., 2016, 580:21-44).
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any microorganisms such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules). If the building block shows any interaction with one or more microorganisms and/or virus, or with any microbial or viral molecule (including biomolecules), such interaction is characterized by low specificity and/or low affinity, as defined herein.
- the carrier building block does also not specifically bind to any microorganism and/or virus (or to any microbial or viral molecule (including biomolecules)) to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a virus, a microorganism, a non-protein molecule (including biomolecules) or a protein present on the surface of a microorganism and/or virus).
- the lack of binding to any microorganism, or virus, or microbial molecule, or viral molecule can for example be assessed with the "cell binding assay" and/or SPR as described herein.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any microorganisms such as bacteria, fungi, protists, yeast and/or virus (and/or to any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.) when it has at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) attached or conjugated to it (via at least one attachment point or conjugation sites comprised therein).
- the building block comprising the cargo attached to it shows any interaction with one or more microorganisms and/or virus (or with any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.), such interaction is characterized by low specificity and/or low affinity, as defined herein.
- the carrier building block does also not specifically bind to any microorganism and/or virus (or to any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.) to which the protein-based carrier building block precursor specifically binds when the protein-based carrier building block has at least one cargo attached or conjugated to it (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule or biomolecule or a protein present on the surface of a microorganism and/or virus, or present in the microorganism or virus, when it has at least one cargo attached or conjugated to it).
- the protein-based building block comprised in the molecule of the present technology does not specifically bind to any viruses and/or viral proteins, such as RSV and/or one or more proteins of RSV, such as protein F of RSV, when the building block has at least a cargo attached or conjugated to it.
- the protein-based carrier building block e.g., a DARPin-based carrier building block
- the protein-based carrier building block may show specific binding towards a microorganism and/or a virus, such as RSV and/or RSV proteins, such as protein F of RSV, but the specific binding (as defined herein), if any, is lost when at least one cargo is attached or conjugated to the protein-based building block.
- the lack of specific binding to any microorganism can for example be assessed with the "cell binding assay" as described herein.
- the lack of specific binding to viruses, microbial and/or viral molecules or biomolecules can, for example, be assessed by surface plasmon resonance, as described herein.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any human and/or non-human animal biomolecule (e.g., human and/or non-human animal proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any human and/or non-human animal biomolecule with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any bacterial molecule (including bacterial biomolecules, e.g., bacterial proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any bacterial molecule, as defined above, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any viral molecule (including biomolecules, e.g., viral proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any viral molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any fungi molecule (including biomolecules, e.g., fungi proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any fungi molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any yeast molecule (including biomolecules, e.g., yeast proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any yeast molecule, as described herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any plant molecule (including biomolecules, e.g., plant proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any plant molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the protein-based carrier building block does not specifically bind to any mammalian molecule (including mammalian biomolecules, e.g., mammalian proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any mammalian molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the term "biomolecule" refers to molecules present in organisms, including animals, plants and microorganisms, that play a role in one or more biological processes, such as cell division, morphogenesis, or development.
- Biomolecules are the building blocks of life and perform important functions in living organisms. Biomolecules include the primary metabolites which are large macromolecules such as proteins, carbohydrates (glycans), lipids (e.g., such as PS), and nucleic acids (such as DNA, RNA), as well as small molecules such as vitamins and hormones.
- the four major types of biomolecules are carbohydrates (glycans), lipids, nucleic acids, and proteins.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind any non-human protein and/or any non-protein molecule (including biomolecule) when at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, preferably it does not specifically bind any non-human protein and/or any non-protein molecule (including biomolecule) to which the protein-based carrier building block precursor specifically binds, or binds to them with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the present technology provides a molecule comprising at least one protein-based carrier building block as described in the present technology, wherein the protein-based carrier building block has at least a cargo (such as a "model cargo", e.g. a maleimide-modified alanine or a NLS) attached or conjugated to it (via at least one attachment point or conjugation site comprised in the protein-based carrier building block), and wherein the protein-based carrier building block does not specifically bind to any molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.).
- the protein-based building block when at least a cargo is conjugated to it, loses its target binding specificity.
- the protein-based building block comprising a cargo attached to it, does not specifically bind to any molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.) which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block, with at least a cargo attached to it, preferably does not specifically bind to the precursor's target, or binds to any (non-human) molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with a cargo attached to it, preferably does also not specifically bind to the precursor's target) with a KD value greater than 5×10⁻⁴ mol/litre, as described herein.
- the skilled person is aware of means for reducing and/or eliminating specific binding of a protein-based carrier building block precursor to proteins and/or non-protein molecules (including biomolecules). For instance, mutations may be performed in the amino acid sequence of the precursor building block so that it no longer specifically binds to human proteins, or to any non-human protein, or to non-protein molecules (including biomolecules), or binds to them with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the affinity of a molecular interaction between two molecules can be measured via different techniques known per se, such as the well-known surface plasmon resonance (SPR) biosensor technique (see for example Ober et al. 2001, Intern. Immunology 13: 1551-1559, in particular section “Surface plasmon resonance (SPR) experiments” starting on p. 1552, which describes conditions for measuring the affinity of a molecular interaction between two molecules, or the explanations provided herein in this description).
- surface plasmon resonance (SPR) refers to an optical phenomenon that allows for the analysis of real-time biospecific interactions by detection of alterations in protein concentrations within a biosensor matrix, where one molecule is immobilized on the biosensor chip and the other molecule is passed over the immobilized molecule under flow conditions, yielding kon and koff measurements and hence KD (or KA) values.
- This can for example be performed using the well-known BIAcore® system (BIAcore International AB, a Cytiva lifesciences company, Uppsala, Sweden and Piscataway, NJ).
- the affinity (KD) of a molecular interaction between two molecules can be determined via SPR on a ProteOn XPR36 instrument (Bio-Rad Laboratories). The experiment can be performed at 25°C, and PBS pH 7.4 containing 0.005% Tween 20 (Bio-Rad Laboratories) can be used as assay buffer.
- Targets such as human proteins or non-protein molecules (biomolecules), or non-human biomolecules, as described herein, such as nucleic acids (e.g., DNA, RNA), lipids (e.g., such as phosphatidylserine (PS)) or glycans can be immobilized onto different ligand lanes from a GLC sensorchip (Bio-Rad Laboratories), e.g., with the ProteOn Amine Coupling Kit (Bio-Rad Laboratories) according to the manufacturer's instructions.
- the protein-based building block can be captured on the target immobilized ligand lanes.
- One ligand lane can serve as a reference surface and no protein-based building block is captured on the surface.
- Different concentrations (e.g., ranging from 300 nM to 1.2 nM) diluted in running buffer can be flowed over the respective protein-based building blocks and reference surface in multi-cycle kinetics for 2 minutes, followed by a constant flow of the assay buffer for 15 minutes. Between the different injections, the surfaces can be regenerated with 3 M MgCl2 (Cytiva), or with 10 mM Glycine pH 1.5 (Cytiva). Several buffer blanks can be injected for double referencing. Data can be analyzed, e.g., with the ProteOn Manager 3.1.0 software (Bio-Rad Laboratories).
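The injection scheme and the double-referencing step described above can be illustrated with a minimal Python sketch. The function names, the three-fold dilution factor and the six-point series are assumptions made for illustration only; the description states the 300 nM to 1.2 nM range but not the dilution step. Double referencing is shown in its usual form: the reference-lane signal is subtracted first, then the equally referenced buffer-blank injection.

```python
def serial_dilution(top_nM, fold, n_points):
    """Return analyte concentrations (nM), starting at top_nM and diluting
    by `fold` at each step. The fold and point count are assumed values."""
    return [top_nM / fold**i for i in range(n_points)]


def double_reference(active, reference, blank_active, blank_reference):
    """Double-reference one sensorgram (lists of response units, same length).

    Step 1: subtract the reference surface from the active surface.
    Step 2: subtract the buffer-blank injection, itself reference-subtracted.
    """
    lengths = {len(active), len(reference), len(blank_active), len(blank_reference)}
    if len(lengths) != 1:
        raise ValueError("all traces must have the same number of time points")
    return [
        (a - r) - (ba - br)
        for a, r, ba, br in zip(active, reference, blank_active, blank_reference)
    ]


# Hypothetical three-fold series covering roughly 300 nM down to about 1.2 nM.
concentrations_nM = serial_dilution(300.0, 3, 6)

# Toy traces with two time points each (arbitrary response units).
corrected = double_reference([10.0, 20.0], [1.0, 2.0], [0.5, 0.5], [0.25, 0.25])
```

With these assumed settings the lowest concentration is 300/243 nM, which rounds to the 1.2 nM lower bound mentioned in the text.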
- the kinetic rate constants (ka and kd) can be calculated by fitting the sensorgrams via the Langmuir 1:1 interaction ligand binding model.
- the equilibrium dissociation constant KD can be calculated as the kd/ka ratio. See also, e.g., https://nicoyalife.com/wp- content/uploads/2023/02/characterization-of-lnfl uenza-using-Alto.pdf.
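As a hedged illustration of the calculation above, the following Python sketch evaluates the Langmuir 1:1 model for a single analyte concentration and derives KD as the kd/ka ratio. The function names and the example rate constants are hypothetical; the 2-minute association and 15-minute dissociation phases and the 5×10⁻⁴ mol/litre threshold are taken from the description. It is a forward model only, not the fitting routine of any instrument software.

```python
import math

KD_THRESHOLD_M = 5e-4  # mol/litre, threshold used throughout the description


def equilibrium_dissociation_constant(ka, kd):
    """KD (M) from the association rate ka (1/(M*s)) and dissociation rate kd (1/s)."""
    if ka <= 0 or kd < 0:
        raise ValueError("ka must be positive and kd non-negative")
    return kd / ka


def langmuir_response(t, conc_M, ka, kd, rmax, t_assoc):
    """Response at time t (s) under the 1:1 Langmuir model.

    Association runs from 0 to t_assoc; buffer flow (dissociation) follows.
    """
    kobs = ka * conc_M + kd               # observed association rate
    req = rmax * ka * conc_M / kobs       # plateau response at this concentration
    if t <= t_assoc:
        return req * (1.0 - math.exp(-kobs * t))
    r_end = req * (1.0 - math.exp(-kobs * t_assoc))
    return r_end * math.exp(-kd * (t - t_assoc))


def exceeds_kd_threshold(kd_M, threshold_M=KD_THRESHOLD_M):
    """True when KD is greater than the threshold, i.e. binding is that weak."""
    return kd_M > threshold_M


# Hypothetical rate constants, for illustration only.
ka_example, kd_example = 1e5, 1e-3
kd_equilibrium = equilibrium_dissociation_constant(ka_example, kd_example)

# 120 s association, then sampled 80 s into the 900 s dissociation phase.
r_peak = langmuir_response(120.0, 300e-9, ka_example, kd_example, 100.0, 120.0)
r_later = langmuir_response(200.0, 300e-9, ka_example, kd_example, 100.0, 120.0)
```

In practice ka and kd are obtained by fitting sensorgrams recorded at several concentrations simultaneously; the sketch only shows how the fitted constants relate to the curve shape and to KD.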
- bio-layer interferometry refers to a label-free optical technique that analyzes the interference pattern of light reflected from two surfaces: an internal reference layer (reference beam) and a layer of immobilized protein on the biosensor tip (signal beam).
- BLI can for example be performed using the well-known Octet® Systems (ForteBio, a division of Pall Life Sciences, Menlo Park, USA).
- affinities can be measured in Kinetic Exclusion Assay (KinExA) (see for example Drake et al., "Characterizing high-affinity antigen/antibody complexes by kinetic- and equilibrium-based methods", Anal. Biochem., 2004, 328: 35-43), using the KinExA® platform (Sapidyne Instruments Inc, Boise, USA).
- Equilibrated solutions of a binding unit/target complex such as an antibody/antigen complex, are passed over a column with beads precoated with antigen (or antibody), allowing the free antibody (or antigen) to bind to the coated molecule. Detection of the antibody (or antigen) thus captured is accomplished with a fluorescently labeled protein binding the antibody (or antigen).
- the GYROLAB® immunoassay system provides a platform for automated bioanalysis and rapid sample turnaround (Fraley et al., "The Gyrolab™ immunoassay system: a platform for automated bioanalysis and rapid sample turnaround", Bioanalysis 2013, 5: 1765-74).
- the affinity of a molecular interaction between two molecules can be measured using flow cytometry to analyze ligand binding to antigens such as proteins, lipids (e.g., such as phosphatidylserine (PS)), sugars, etc. presented on the surface of a cell ("cell binding assay").
- The skilled person is familiar with cell-binding assays to determine the affinity between a certain soluble molecule (such as the molecule of the present technology) and a binding partner present on the surface of a cell, such as a human cell. For instance, Hunter S. A. and Cochran J. R. ("Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations") present a practical guide for measuring binding events between soluble ligands and binding partners expressed on the surface of, inter alia, mammalian cells.
- the cell binding assay can be carried out as follows: a. adding a fixed number of (human or non-human) cells to a 96-well V-bottom plate (e.g., 50 µL human cell suspension (e.g., 5E+04/96-well)), or to tubes, such as Eppendorf tubes, in cold fluorescence-activated cell sorting (FACS) buffer (e.g., consisting of D-PBS, 2% heat inactivated fetal bovine serum (HI FBS) and 0.05% sodium azide); b.
- the cell-binding assay can be performed by adding a number of cells (human or non-human, such as non-human animal, plant, microorganism, etc.) to a receptacle (e.g., a 96-well V-bottom plate or tubes, as described above), preferably in a physiological buffer; adding the molecule whose binding is to be assessed (e.g., the molecule of the present technology, or the protein-based building block), which is preferably labeled; incubating it with the cells until the reaction has come to equilibrium, generally a number of hours, e.g., about 3 h, preferably at low temperature, typically 4°C, and preferably while shaking; and finally evaluating the binding of the soluble molecule to the cells by flow cytometry, e.g., by FACS.
- the soluble molecule in step c. above may be added to each well/tube at varying concentrations, spanning two orders of magnitude above and below the anticipated KD and/or EC50.
- Binding values can be determined from the average signal value (e.g., average fluorescence value) of each sample, plotting the fraction bound vs. ligand concentration (log scale) and fitting a sigmoidal curve using nonlinear regression analysis.
- the ligand concentration at half the fraction bound, also referred to as EC50, will be a first approximation of the equilibrium dissociation constant (KD).
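The titration design and the EC50 estimate described above can be sketched as follows. This is a simplified illustration, not the regression procedure of any particular analysis package: it assumes a one-site binding isotherm (Hill slope fixed at 1, baseline 0, plateau 1) and replaces a general nonlinear regression with a golden-section search over log10(EC50). All function names and the synthetic data are hypothetical.

```python
import math


def concentration_series(anticipated_kd: float, decades: int = 2, points_per_decade: int = 2) -> list[float]:
    """Log-spaced concentrations spanning `decades` orders of magnitude
    below and above the anticipated KD (same unit as `anticipated_kd`)."""
    steps = 2 * decades * points_per_decade
    return [
        anticipated_kd * 10.0 ** (-decades + i / points_per_decade)
        for i in range(steps + 1)
    ]


def fraction_bound(conc: float, ec50: float) -> float:
    """One-site binding isotherm (sigmoidal on a log-concentration axis)."""
    return conc / (conc + ec50)


def fit_ec50(concs: list[float], fractions: list[float], iterations: int = 200) -> float:
    """Least-squares EC50 estimate via golden-section search over log10(EC50)."""

    def sse(log_ec50: float) -> float:
        ec50 = 10.0 ** log_ec50
        return sum((fraction_bound(c, ec50) - f) ** 2 for c, f in zip(concs, fractions))

    # Search only within the tested concentration range.
    lo, hi = math.log10(min(concs)), math.log10(max(concs))
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(iterations):
        left = hi - ratio * (hi - lo)
        right = lo + ratio * (hi - lo)
        if sse(left) < sse(right):
            hi = right
        else:
            lo = left
    return 10.0 ** ((lo + hi) / 2.0)


# Synthetic, noise-free data generated from an assumed EC50 of 10 nM.
concs = concentration_series(anticipated_kd=1.0e-8)
fractions = [fraction_bound(c, 1.0e-8) for c in concs]
ec50_estimate = fit_ec50(concs, fractions)
```

With real flow cytometry data, the fraction bound would first be obtained by normalizing the average fluorescence values, and a four-parameter fit may be more appropriate when the baseline or plateau is not well defined.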
- the skilled person is able to determine whether a molecule is able to specifically bind human proteins, as defined in the context of the present technology. For instance, the skilled person may make use of commercially available protein arrays to determine the binding affinity of a certain molecule (protein) towards human proteins, such as the commercially available Proteome Profiler™ Antibody Arrays, which allow for the semiquantitative measurement of more than 100 proteins in a single sample.
- the skilled person may also make use of the HuProt™ assay, such as version v4.0, which consists of >21,000 unique human proteins, isoform variants, and protein fragments, covering 16,794 unique genes. This includes 15,889 of the 19,613 canonical human proteins described in the Human Protein Atlas, with broad coverage across protein subclasses.
- the skilled person can also use commercially available cell arrays, such as human, non-human animal, plant, bacteria, yeast, etc. arrays to determine the binding affinity (e.g., KD and/or EC50) of a certain molecule (protein) towards human cells. See also, e.g., Example 6.
- the skilled person is able to determine whether a molecule is able to specifically bind a non-human protein, such as a bacterial or viral protein.
- the skilled person may make use of protein-binding assays to determine the binding affinity of a certain molecule (e.g., a protein) towards non-human (such as bacterial or viral) proteins.
- the binding affinity of a molecular interaction between two molecules can be measured by SPR.
- SPR allows for the determination of the KD of a potential interaction between two molecules, as described in detail above.
- the at least one carrier building block comprised in the molecule of the present technology may show non-specific binding with one or more human proteins (and/or with one or more non-human proteins, and/or with one or more non-protein molecules, such as human non-protein molecules, and/or with one or more human cell types as described above). This is because there may be molecular forces between the at least one carrier building block and one or more human proteins (and/or one or more non-human proteins, and/or one or more non-protein molecules, such as human non-protein molecules, and/or one or more human cells, as described above), e.g., in the form of hydrophobic interactions, hydrogen bonding, Van der Waals interactions, and other non-specific interactions.
- the at least one carrier building block comprised in the molecule of the present technology may non-specifically bind to one or more human proteins (and/or to one or more non-human proteins, and/or to one or more non-protein molecules, such as human non-protein molecules, and/or to one or more human cells, if this is the case, as described above).
- any KD value greater than 5×10⁻⁴ mol/litre is generally considered to represent "non-specific binding".
- the building block may bind to any human protein (or non-human protein, and/or to any human cell, if this is the case, as explained above) with a KD (KD value) greater than 5×10⁻⁴ mol/litre (or with a KA value lower than 2×10³ litres/mol), such as with a KD (KD value) greater than 5.5×10⁻⁴ mol/litre (or with a KA value lower than 1.8×10³ litres/mol), or with a KD (KD value) greater than 6×10⁻⁴ mol/litre (or with a KA value lower than 1.7×10³ litres/mol).
- the carrier building block may bind to any non-protein molecule, such as to any human non-protein molecule (e.g., DNA, RNA, lipids (e.g., such as phosphatidylserine (PS)), glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre (or with a KA value lower than 2×10³ litres/mol), such as with a KD (KD value) greater than 5.5×10⁻⁴ mol/litre (or with a KA value lower than 1.8×10³ litres/mol), or with a KD (KD value) greater than 6×10⁻⁴ mol/litre (or with a KA value lower than 1.7×10³ litres/mol).
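The paired KD and KA thresholds quoted above are reciprocals of each other, which can be checked with a short Python sketch. The threshold constant is taken from the text; the function names are hypothetical and serve only as illustration.

```python
NON_SPECIFIC_KD_THRESHOLD = 5e-4  # mol/litre, threshold stated in the text


def association_constant(kd: float) -> float:
    """Return KA (litres/mol) as the reciprocal of KD (mol/litre)."""
    if kd <= 0:
        raise ValueError("KD must be positive")
    return 1.0 / kd


def is_non_specific(kd: float, threshold: float = NON_SPECIFIC_KD_THRESHOLD) -> bool:
    """True when KD exceeds the threshold, i.e. binding counts as non-specific."""
    return kd > threshold


# The three KD values named in the text, printed with their reciprocal KA.
for kd in (5e-4, 5.5e-4, 6e-4):
    print(f"KD = {kd:.1e} mol/litre -> KA = {association_constant(kd):.1e} litres/mol")
```

A KD of exactly the threshold value is treated as not exceeding it, in line with the "greater than" wording used above.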
- the protein-based carrier building block comprised in the molecule of the present technology is not derived from the crystallizable fragment of an antibody (Fc, which contains two CH2 and two CH3 domains) such as the Fc fragment of a monoclonal antibody (mAb).
- the protein-based carrier building block comprised in the molecule of the present technology is not derived from the CH2 and/or the CH3 domains of the Fc fragment.
- the protein-based carrier building block comprised in the molecule of the present technology is not derived from the CH1 and/or the CL domains comprised in the antigen-binding fragment (Fab) of an antibody, such as the CH1 and/or the CL domains comprised in the Fab of a mAb.
- the molecule of the present technology is not (or is not derived from) a crystallizable fragment (Fc) of an antibody, such as a mAb. In another embodiment, the molecule of the present technology is not (or is not derived from) the Fab of an antibody, such as a mAb.
- the molecule of the present technology does not comprise VH-VL pairs, e.g., it does not comprise at least one VH and at least one VL which interact with (are bound to) each other, such as in an antibody.
- the molecule of the present technology does not comprise CL-CH1 conjugates, e.g., it does not comprise at least one CL and at least one CH1 which are linked to each other, e.g., through a disulphide bridge.
- the binding properties of the at least one protein-based carrier building block comprised in the molecule of the present technology are not affected or altered (or essentially not affected or altered) when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein. If the building block shows any interaction with one or more human proteins, such interaction is characterized by low specificity and/or low affinity, as defined herein.
- when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any human protein, and/or, if the building block shows any interaction with one or more human proteins, such interaction is still characterized by low specificity and/or low affinity, as defined herein.
- when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to.
- the protein-based carrier building block does not specifically bind to the precursor's target, should the protein-based carrier building block precursor have a target and should this be a non-human molecule (including biomolecules), such as a non-human protein.
- when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to the precursor's target.
- the protein-based carrier building block does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules). If the building block shows any interaction with one or more microorganisms and/or virus, or with any microbial or viral molecule (including biomolecules), such interaction is characterized by low specificity and/or low affinity, as defined herein.
- when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules), or binds to it with low specificity and/or low affinity, as described herein.
- the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre.
- the protein-based carrier building block does not specifically bind to any human and/or non-human animal biomolecule (e.g., human and/or non-human animal proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any human and/or non-human animal biomolecule with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
- the carrier building block present in the molecule of the present technology has at least one attachment point (also referred to as conjugation site in the present disclosure), preferably at a solvent-accessible position, as defined further below.
- the at least one protein-based carrier building block comprises more than one attachment points or conjugation sites, preferably at solvent-accessible positions.
- the protein-based carrier building block comprises at least two attachment points or conjugation sites, preferably at solvent-accessible positions.
- the protein-based building block comprises three conjugation sites or more, such as six or nine conjugation sites, preferably at solvent-accessible positions.
- the protein-based building block comprises four conjugation sites, preferably located at solvent-accessible positions in the protein-based carrier building block.
- the protein-based building block comprises five conjugation sites, preferably located at solvent-accessible positions in the protein-based carrier building block.
- the protein-based carrier building block may have two, three, four, five, six, seven, eight, nine, ten conjugation sites or more, preferably at solvent-accessible positions.
- the conjugation sites present in the carrier building block are different from each other.
- when the carrier building block comprises two conjugation sites, these conjugation sites may be functionally/chemically different from each other, i.e., each conjugation site or attachment point is chemically different from the other (e.g., if there are two conjugation sites, one conjugation site may be a -SH group present in the side chain of a cysteine located in a solvent-accessible position, and the other conjugation site may be a -NH2 group present in the side chain of a lysine located in a solvent-accessible position, or the N-terminal NH2 group).
- when the building block has more than two conjugation sites (e.g., at least three conjugation sites, such as three, four, five, six, seven, eight, nine, ten, etc.), there may be at least two types of conjugation sites among the at least three conjugation sites present in the building block.
- each conjugation site is functionally different from each other.
- two conjugation sites are the same and one conjugation site is functionally different from the other two conjugation sites.
- all conjugation sites present in the building block are functionally different from each other.
- all conjugation sites present in the carrier building block are the same.
- the protein-based building block may comprise one, two, three, four, five, six, seven, eight, nine, ten or more conjugation sites which are all the same, e.g., which are all -SH groups present in the side chain of cysteines located at solvent-accessible positions in the protein-based building block.
- the conjugation sites are spatially distant from each other (spatially separated from each other).
- the minimal distance between conjugation sites will be dictated by the nature of the cargos (and linkers, if used) which are to be attached or conjugated to the attachment points or conjugation sites in the protein-based carrier building block.
- the minimal distance can still be kept small when used in combination with long linkers, which add the needed flexibility and the envisaged target binding.
- a short distance between conjugation sites, combined with short linkers, if any, will likely limit the target binding of larger cargos, and result in restricted engagement (e.g., increased cell specificity).
- the solubility of the molecule may be decreased (i.e., the molecule may be more prone to aggregation).
- when the cargos to be attached are rather small (e.g., radioactive isotopes), the minimal distance can be kept small even in the absence of linkers.
- the skilled person will be able to select the location of the specific conjugation sites and the length and flexibility of the linkers, if any, depending on the nature of the cargos which are to be attached or conjugated to the protein-based carrier building block.
- a “conjugation site” or “attachment point” may be a reactive group in the side chain of a natural or a non-natural (also referred to as “noncanonical”, “unnatural” or “unusual”, as described above) amino acid preferably located at a solvent-accessible position in the protein-based carrier building block. It may also be the C-terminal and/or N-terminal reactive group (-COOH and -NH2 groups, respectively) of the protein-based carrier building block.
- a “reactive group in the side chain of an amino acid” refers to any chemical group present in the side chain of an amino acid which is capable of forming a covalent bond.
- when the amino acid is lysine, the reactive group present on its side chain is a primary amine.
- when the amino acid is cysteine, the reactive group present on its side chain is a thiol group.
- when the amino acid is aspartic or glutamic acid, the reactive group present in its side chain is a carboxylic group.
- when the amino acid is tyrosine, the reactive group present on its side chain is a phenolic hydroxyl group.
- when the amino acid is arginine, the reactive group present on its side chain is a guanidino group.
- when the amino acid is methionine, the reactive group present on its side chain is a thioether group.
- the C-terminal or N-terminal reactive group of the protein-based carrier building block refers to the -COOH and -NH2 reactive groups present in the C- and N-terminal amino acids, respectively, of the protein-based carrier building block.
- if the carrier building block does not have a free C- and/or N-terminal end (e.g., because the carrier building block is C- and/or N-terminally linked to another protein-based building block, or to another peptide or protein, or because the N-terminus is acetylated, or because the C-terminus is amidated, etc.), then the N- and C-terminal ends of the carrier building block are not suitable as attachment points or conjugation sites as defined herein. In some embodiments the "conjugation site" or "attachment point" is not the C-terminal or N-terminal reactive group of the protein-based carrier building block.
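The correspondence between amino acids and their reactive side-chain groups can be captured in a small lookup table. The following Python sketch is illustrative only: the mapping reflects the groups named in this disclosure (lysine primary amine, cysteine thiol, aspartic/glutamic acid carboxylic group, tyrosine phenolic hydroxyl, arginine guanidino, methionine thioether), while the function name, the `free_termini` flag and the example sequence are hypothetical. The sketch does not assess solvent accessibility, which would require structural information.

```python
# One-letter amino acid code -> reactive side-chain group named in the text.
SIDE_CHAIN_REACTIVE_GROUPS = {
    "K": "primary amine",
    "C": "thiol",
    "D": "carboxylic group",
    "E": "carboxylic group",
    "Y": "phenolic hydroxyl",
    "R": "guanidino",
    "M": "thioether",
}


def candidate_conjugation_sites(sequence: str, free_termini: bool = True) -> list[tuple[int, str, str]]:
    """List (position, residue, reactive group) candidates in a sequence.

    Positions are 1-based. Solvent accessibility is NOT assessed here.
    """
    sites = []
    if free_termini and sequence:
        sites.append((1, sequence[0], "N-terminal -NH2"))
    for position, residue in enumerate(sequence, start=1):
        group = SIDE_CHAIN_REACTIVE_GROUPS.get(residue.upper())
        if group is not None:
            sites.append((position, residue, group))
    if free_termini and sequence:
        sites.append((len(sequence), sequence[-1], "C-terminal -COOH"))
    return sites


# Hypothetical five-residue sequence, used only to exercise the function.
example_sites = candidate_conjugation_sites("GKCAY", free_termini=False)
```

Setting `free_termini=False` mirrors the case in which the termini are linked or capped and therefore not available as conjugation sites.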
- the protein-based building block precursor may be modified to introduce one or more attachment points or conjugation sites, as described in detail below.
- a non-limiting example of an engineered attachment point or conjugation site is a reactive group present in the side chain of an amino acid in the protein-based carrier building block which amino acid was not present at the same or equivalent position in the building block precursor.
- for example, if the building block precursor has a serine at a certain position X (which is preferably a solvent-accessible position), and that serine is mutated to a cysteine in the carrier building block, the -SH group of that cysteine would be an engineered attachment point or conjugation site.
- likewise, if an amino acid (e.g., a Cys or a Tyr) is newly added to the carrier building block, the reactive group present in the side chain of that newly added amino acid would be an engineered attachment point or conjugation site.
- the at least one conjugation site present in the carrier building block comprised in the molecule of the present technology may be free (i.e., ready for reaction) or capped/protected.
- the α-amino group, the carboxylic acid terminus, or the reactive groups present in the side chain of one or more amino acids of the carrier building block (e.g., amines, carboxylic acids, alcohols, thiols) may be capped with a protecting group, e.g., to prevent polymerization of the amino acids, to minimize undesirable side reactions during the synthesis of the building block, or to selectively attach different cargos.
- if the at least one conjugation site is capped or protected, it has to be de-capped or deprotected before attaching or conjugating a cargo to it, as described in detail below.
- the at least one conjugation site present in the protein-based carrier building block comprised in the molecule of the present technology may be a primary amine present in the side chain of a lysine (or ornithine (Orn), or Diaminopropionic acid (Dap), or Diaminobutyric acid (Dab)) in the protein-based building block, preferably located at a solvent-accessible position.
- the conjugation site is a thiol group present in the side chain of a cysteine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block.
- the conjugation site is a carboxylic group present in the side chain of an aspartic or glutamic acid in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block.
- the conjugation site is a guanidino group present in the side chain of an arginine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block.
- the conjugation site is a thioether group present in the side chain of a methionine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block.
- the conjugation sites may be free or protected.
- when the conjugation site is a thiol group (e.g., from a cysteine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block), the thiol group may be free (-SH) or protected/capped.
- the protein-based carrier building block comprised in the molecule of the present technology comprises at least two attachment points or conjugation sites which are two reactive groups present in the side chain of two amino acids (which may be natural or a non-natural) in the protein-based building block, preferably located at solvent-accessible positions in the protein-based carrier building block.
- the protein-based carrier building block comprises at least two attachment points or conjugation sites which are two reactive groups present in the side chain of two natural amino acids (e.g., two Cys) in the protein-based building block, preferably located at solvent-accessible positions in the protein-based carrier building block.
- the protein-based carrier building block comprises four conjugation sites located at solvent-accessible positions, wherein the four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block.
- the protein-based carrier building block comprises five conjugation sites located at solvent-accessible positions, wherein four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block.
- the fifth conjugation site is the N-terminal amine of the protein-based carrier building block.
- the cargo may be a cell-penetrating peptide (CPP), as described in detail below.
- the molecule of the present technology may comprise more than one CPP, such as two, three, four, five or more CPPs. These can each be covalently linked to the attachment points or conjugation sites comprised in the protein-based building block (directly or by means of a linker, as described herein). They can also be covalently linked to one attachment point or conjugation site in tandem, i.e., two or more CPPs are covalently linked to each other (e.g., via their N- and C-terminal parts) and then, all of them, covalently linked to the protein-based carrier building block through one attachment point or conjugation site comprised therein.
- the cargo may be a cell-penetrating peptide (CPP), as described in detail below, and a cell-targeting moiety, as described in detail below.
- These can each be covalently linked to the attachment points or conjugation sites comprised in the protein-based building block (directly or by means of a linker, as described herein). They can also be covalently linked to one attachment point or conjugation site in tandem, i.e., two or more cell-targeting moieties or CPPs are covalently linked to each other (e.g., via their N- and C-terminal parts) and then, all of them, covalently linked to the protein-based carrier building block through one attachment point or conjugation site comprised therein.
- the at least one cargo may be a half-life extending (HLE) molecule, a targeting molecule, a therapeutic molecule or precursor thereof (including mRNA), an imaging molecule, a toxic molecule, an agonist, a T-cell engagement molecule, a sweeping/degrader molecule, a cell-penetrating molecule, a nuclear localization molecule, a blood brain barrier (BBB) shuttle, a radiotherapeutic molecule or an imaging probe.
- the molecule of the present technology comprises at least one further cargo, wherein the further cargo is also attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site, and wherein the further cargo is a HLE molecule, such as an albumin-binding ISVD (as described herein, e.g., as defined in Table 8, such as SEQ ID NO.: 63 or 106) or a PEG molecule, or ELNN polypeptides, as described herein.
- the molecule of the present technology comprises at least one further cargo, wherein the further cargo is also attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site, and wherein the further cargo is a targeting moiety and/or a therapeutic moiety as described herein.
- the molecule of the present technology comprises at least two further cargos, wherein the further cargos are attached or conjugated to the at least one protein-based carrier building block through at least two attachment points or conjugation sites, wherein the at least two further cargos are one HLE molecule, as described herein, and one therapeutic and/or targeting moiety, as described herein.
- the at least one protein-based carrier building block comprised in the molecule of the present technology comprises at least two cysteines, preferably located at solvent accessible positions, such as three cysteines, or four cysteines, or six cysteines, or nine cysteines, preferably located at solvent accessible positions, with free or capped thiol groups that are the at least two, such as three, or four, or six, or nine, conjugation sites as defined herein.
- the at least one protein-based carrier building block comprised in the molecule of the present technology comprises three cysteines, preferably located at solvent accessible positions, with free or capped thiol groups that are the three conjugation sites as defined herein.
- the protein-based carrier building block does not comprise any other cysteine at solvent-accessible positions besides the three cysteines at solvent-accessible positions which bear the three conjugation sites (free or capped thiol groups) (but may comprise one or more cysteines at positions which are not solvent-accessible).
- the at least one protein-based carrier building block comprised in the molecule of the present technology comprises four cysteines, preferably located at solvent accessible positions, with free or capped thiol groups that are the four conjugation sites as defined herein.
- the protein-based carrier building block does not comprise any other cysteine at solvent-accessible positions besides the four cysteines at solvent-accessible positions which bear the four conjugation sites (free or capped thiol groups) (but may comprise one or more cysteines at positions which are not solvent-accessible).
- the at least one protein-based carrier building block comprised in the molecule of the present technology comprises four, five, six, seven, eight, nine, ten or more cysteines, preferably located at solvent accessible positions, with free or capped thiol groups that are the four, five, six, seven, eight, nine, ten or more conjugation sites as defined herein.
- the protein-based carrier building block does not comprise any other cysteine at solvent accessible positions besides the four, five, six, seven, eight, nine, ten or more cysteines which bear the four, five, six, seven, eight, nine, ten or more conjugation sites (free or capped thiol groups) and which are located at solvent-accessible positions in the building block.
- the at least one protein-based building block comprised in the molecule of the present technology comprises at least one amino acid, such as one, two, three, four, five, six, seven, eight, nine, ten or more, which may be natural or non-natural, preferably located at solvent accessible positions, which comprises a reactive group on its side chain which is the conjugation site as defined herein.
- the at least one protein-based building block comprised in the molecule of the present technology comprises at least two conjugation sites, one of which is a (free or protected) thiol group from a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, and the other one is a -OH group from a tyrosine preferably located at a solvent-accessible position in the protein-based carrier building block, preferably from an N- or C-terminally exposed tyrosine.
- WO 2021/050554 the content of which is herewith incorporated by reference, describes in detail how to incorporate one or more unnatural amino acid(s) in a protein.
- the protein-based building block comprised in the molecule of the present technology comprises six attachment points or conjugation sites, wherein three of them are -SH groups present in the side chain of three Cys, preferably located at solvent-accessible positions, and wherein three of them are -NH2 groups present in the side chain of three Lys, preferably located at solvent-accessible positions in the protein-based building block.
- a conjugation site is a -SH group (free or capped) present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block
- the cargo can be attached or conjugated to the building block (directly or by means of a linker) by alkylation, metal-assisted arylation, disulphide exchange or addition to a maleimide Michael acceptor. It can also be attached or conjugated using the so-called "PODS-based conjugation", (see, e.g., Davydova M.
- an APN-maleimide 'bifunctional' linker (see Formula I in the examples), also known as 3-(4-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)phenyl)propiolonitrile, can be used to attach or conjugate a cargo to a -SH attachment point present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block.
- maleimide-modified cargos can be attached to the -SH attachment point present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, see also the examples.
- a conjugation site is an -OH group of a tyrosine preferably located at a solvent-accessible position in the protein-based carrier building block
- the cargo can be attached or conjugated to the building block (directly or by means of a linker) by several chemical methods such as cross-linking via catalytic tyrosine mono electronic oxidation, three-component Mannich-type tyrosine conjugation, conjugation via sulphur fluoride exchange chemistry (SuFEx), transition-metal complexes for tyrosine conjugation, diazonium coupling reaction, reactions with triazolinediones, etc. (for a review, see, e.g., D. Alvarez Dorta et al., Chem. Eur. J., 2020, 26, 14257).
- the cargo can be attached or conjugated to the building block (directly or by means of a linker) enzymatically as described, e.g., in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
- the protein-based building block may preferably be extended with flexible (GG) or (G4S)1-3GG tags (sequences) in order to facilitate the enzymatic addition, as described in Alan M.
- tyrosinase from Agaricus bisporus (abTYR)
- abTYR is a copper-dependent enzyme that functions to convert tyrosine into melanin via an o-quinone intermediate
- bmTYR: Bacillus megaterium tyrosinase
- the cargo may be attached or conjugated to the carrier building block (directly or by means of a linker) by reaction of a group present in the cargo/linker (e.g., isothiocyanates, isocyanates, acyl azides, NHS esters, sulfonyl chlorides, aldehydes, glyoxals, epoxides, oxiranes, carbonates, aryl halides, imidoesters, carbodiimides, anhydrides, or fluorophenyl esters) and the primary amine.
- the building block comprises at least two conjugation sites which include an -OH from a tyrosine and a -SH from a cysteine, preferably located at solvent-accessible positions in the protein-based carrier building block
- thiol nucleophiles can be conveniently capped through disulfide formation with Ellman's reagent.
- the thiol groups can be de-capped through brief exposure to an appropriate reducing agent, as described in Alan M. Marmelstein et al., mentioned above.
- Sortases allow functionalization of the N-, C-terminus and the creation of non-natural fusions (i.e., N-N or C-C chimeras) via the installation of click handles, see, e.g., Guimaraes C. P. et al. ("Site-specific C-terminal and internal loop labelling of proteins using sortase-mediated reactions", Nature Protocols, 2013, 8(9): 1787-1799).
- sortase-mediated reactions are applicable to any protein of interest (e.g., to the protein-based carrier building block), provided it contains (i) an LPXTG motif (where X can be any amino acid and glycine cannot be a free carboxylate) as the sortase target or (ii) a suitably exposed glycine residue to serve as the incoming nucleophile.
- the natural nucleophile of sortase can be replaced by any peptide/protein with an oligoglycine (Gly1-5) at the N-terminus (in many cases a single glycine suffices).
- the peptides can be decorated with any cargo molecule (e.g., fluorophores, biotin, cross-linkers, lipids, carbohydrates, nucleic acids), provided that a free N-terminal glycine remains available on the peptide used as the incoming nucleophile.
- combining sortase with the LPXTG-containing protein and the nucleophile leads to the covalent attachment of that nucleophile to the protein of interest in a site-specific manner.
- Guimaraes C. P. et al. provides a protocol that allows the functionalization of any given protein at its C-terminus.
- the target protein is engineered with a sortase-recognition motif (LPXTG).
- sortase cleaves the protein between the threonine and glycine residues, facilitating the attachment of an exogenously added oligoglycine (Gly1-5) peptide modified with the functional group of choice (e.g., the cargo to be attached to the protein-based carrier building block).
- Gly1-5: exogenously added oligoglycine
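The C-terminal labelling step described above can be sketched at the sequence level. This is only an illustration, not the protocol of Guimaraes C. P. et al.: the function names, the example sequences and the choice to cut at the last motif are assumptions, and the sketch ignores the requirement that the motif glycine must not be a free carboxylate.

```python
import re
from typing import List

# LPXTG sortase recognition motif; X is any amino acid (one-letter codes).
# A lookahead is used so that overlapping matches would also be reported.
_MOTIF = re.compile(r"(?=LP[A-Z]TG)")

def find_sortase_motifs(sequence: str) -> List[int]:
    """Return the 0-based start index of every LPXTG motif in the sequence."""
    return [m.start() for m in _MOTIF.finditer(sequence)]

def sortase_ligate(protein: str, nucleophile: str) -> str:
    """Model the product of C-terminal labelling (hypothetical helper).

    The protein is cut between the threonine and the glycine of its last
    LPXTG motif, and the LPXT-ending fragment is joined to a peptide that
    starts with glycine (the incoming oligoglycine nucleophile).
    """
    if not nucleophile.startswith("G"):
        raise ValueError("nucleophile needs an N-terminal glycine")
    starts = find_sortase_motifs(protein)
    if not starts:
        raise ValueError("no LPXTG motif found")
    cut = starts[-1] + 4  # index just after the threonine
    return protein[:cut] + nucleophile

# Made-up sequences, for illustration only.
product = sortase_ligate("MKTAYLPETGG", "GGGK")
```

Here `product` keeps everything up to and including LPET and continues with the glycine-initiated peptide, which mirrors the site-specific attachment described in the text.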
- Theile C. S. et al. ("Site-specific N-terminal labeling of proteins using sortase-mediated reactions", Nature Protocols, 2013, 8(9): 1800-1807) describes the use of sortase-mediated reactions to label the N-terminus of any given protein of interest.
- the protein to be labeled is engineered with an exposed stretch of glycines or alanines at its N-terminus when using sortase A from S. aureus or S. pyogenes, respectively.
- a peptide decorated with a functional group of choice (fluorophores, biotin, lipids, nucleic acids, carbohydrates and so on)
- a sortase recognition motif (LPXTG/A sequence, X being any amino acid, as stated above)
- Sortase A cleaves between the threonine and glycine/alanine residues, forming a thioester intermediate with the peptide probe.
- Nucleophilic attack by the N-terminally modified protein of interest resolves the intermediate, resulting in the formation of a covalent bond between the peptide probe (e.g., the cargo) and the N-terminus of the protein (see Fig. 1 of Theile C. S. et al., mentioned above).
- depsi-peptides can be used for N-terminal labeling, see Theile C. S. et al., mentioned above.
- peptides for creating C-to-C linked proteins are synthesized with an N-terminal triglycine motif and an azide or cyclooctyne (DIBAC) at the C-terminus (see also Fig. 2 of this document).
- DIBAC: cyclooctyne (the click partner of the azide)
- to create N-to-N linked proteins, peptides are synthesized containing the LPXTGG sortase A recognition sequence at the C-terminus (X can be any residue, but the authors prefer a polar residue, such as a glutamic acid, to aid precipitation of the peptide after cleavage from the resin and to increase the solubility of the peptide in water) and an azido or a cyclooctyne group at the N-terminus of the probe.
- the proteins to be linked should comprise 1-5 Gly at the N-terminus.
- the final step of the procedure is fusing the click handle-containing proteins, see Fig. 1 of Witte M. D. et al.
- a cargo may be attached or conjugated to it using sortase, provided that the C-terminal end of the building block comprises a sortase-recognition motif (LPXTG) and the cargo comprises an oligoglycine ((Gly)1-5)-modified peptide at the N-terminus (see Fig. 2 of Guimaraes C. P. et al.).
- the at least one conjugation site or attachment point present in the protein-based carrier building block is preferably located at a solvent-accessible position in the building block.
- Preferably all conjugation sites or attachment points present in the protein-based carrier building block are located at solvent-accessible positions in the building block.
- the skilled person is able to identify "solvent-accessible positions" in the carrier building block precursor. This can be performed in silico by means of computer modelling. For instance, the skilled person can make use of readily available software tools such as MAESTRO (Schrödinger, LLC, New York, NY, 2021), a multi-agent prediction system, based on statistical scoring functions (SSFs) and different machine learning approaches, see, e.g., Laimer et al.
- a protein is selected as starting point for developing the protein-based carrier building block (the so-called "building block precursor").
- residues in the building block precursor with a Solvent-Accessible Surface Area (SASA) greater than or equal to, e.g., 27 Å² (square angstrom) can be considered to be solvent-accessible.
- SASA: Solvent-Accessible Surface Area
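The SASA criterion above amounts to a simple threshold filter. The sketch below assumes that per-residue SASA values have already been computed by some modelling tool. The residue labels, the values and the function name are hypothetical, and 27 Å² is only the example cut-off given in the text.

```python
SASA_THRESHOLD_A2 = 27.0  # example cut-off from the text, in square angstroms

def solvent_accessible(sasa_by_residue, threshold=SASA_THRESHOLD_A2):
    """Return the residue labels whose SASA is >= threshold, ordered by position.

    Labels are assumed to be a one-letter amino acid code followed by the
    1-based position, e.g. "S7".
    """
    return sorted(
        (res for res, sasa in sasa_by_residue.items() if sasa >= threshold),
        key=lambda res: int(res[1:]),
    )

# Hypothetical per-residue SASA values, for illustration only.
example_sasa = {"S7": 64.2, "V12": 3.1, "S25": 27.0, "L45": 12.8, "K64": 101.5}
accessible = solvent_accessible(example_sasa)
```

With these made-up numbers, S25 is retained because the criterion is "greater than or equal to", while V12 and L45 are treated as buried.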
- the stability (ΔG in solvent) of the mutation of each of the identified residues (e.g., to a cysteine residue)
- Destabilizing mutations are generally not further considered as potential positions for conjugation sites or attachment points.
- the stability (ΔG in solvent) of the mutation of each of the identified residues is calculated.
- those residues with lower calculated ΔG in solvent would be preferably further selected as potential positions for conjugation sites or attachment points. For instance, ΔG values in the range of -20 to +5 kcal/mol can be considered as non-destabilizing mutations.
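The selection rule above (discard destabilizing mutations, then prefer lower calculated ΔG) can be sketched as a filter followed by a sort. The mutation names and ΔG values below are invented for illustration, the function name is hypothetical, and the window of -20 to +5 kcal/mol is the example range given in the text rather than a fixed rule.

```python
DG_MIN, DG_MAX = -20.0, 5.0  # kcal/mol, example window from the text

def rank_candidates(dg_by_mutation, dg_min=DG_MIN, dg_max=DG_MAX):
    """Keep mutations whose calculated ΔG lies within [dg_min, dg_max]
    and return them ordered from lowest to highest ΔG."""
    kept = {m: dg for m, dg in dg_by_mutation.items() if dg_min <= dg <= dg_max}
    return sorted(kept, key=kept.get)

# Hypothetical predicted ΔG values (kcal/mol) for serine/lysine/threonine
# to cysteine mutations; not taken from the source.
predicted_dg = {"S7C": -12.4, "S25C": 6.3, "K64C": 1.9, "T28C": -3.0}
ranked = rank_candidates(predicted_dg)
```

In this example S25C falls outside the window and is dropped, and the remaining candidates are listed with the most favourable one first.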
- HDX-MS: hydrogen/deuterium exchange mass spectrometry
- HDX-MS reports on the local chemical environment and solvent accessibility of the protein backbone by monitoring the exchange of peptide bond amide protons with the deuterons of a D2O solvent.
- the rate of hydrogen-deuterium exchange is dependent on the solvent accessibility and folded state of the protein (see Englander SW. et al., "Hydrogen exchange: the modern legacy of Linderstrøm-Lang", Protein Sci., 1997, 6(5): 1101-9).
- the in silico modelling (e.g., with MAESTRO)
- the reactive group of that amino acid (e.g., the -SH present in the side chain of the cysteine)
- other reactive groups present in the side chain of other amino acids present in the protein-based carrier building block (e.g., other -SH groups present in the protein, if any).
- the "solvent-accessible positions" can be identified and/or verified empirically.
- the "solvent-accessible positions" theoretically identified using available in silico software tools such as MAESTRO, as described above, may preferably be empirically confirmed by manufacturability.
- Formulation and process stability of potential building block candidates help narrow down lead candidates at an early stage, prior to large-scale manufacturing (see the examples and also, e.g., Ramachander, R., Rathore, N. (2013), "Molecule and manufacturability assessment leading to robust commercial formulation for therapeutic proteins" in: Kolhe, P., Shah, M., Rathore, N. (eds) Sterile Product Development, AAPS Advances in the Pharmaceutical Sciences Series, vol 6.
- protein expression of the selected variants may take place.
- the introduction of the specific amino acids at the theoretically-identified solvent-accessible positions (e.g., point mutations, addition of amino acids at the N- and/or C-terminus of the protein, etc.)
- the minimal required solubility and lack of specific binding to human proteins (and, optionally, to non-protein molecules and/or non-human proteins, preferably to the precursor's target), as described in detail above, can be assessed. Possible changes in 3D structure could be assessed, for example, by CD (circular dichroism) spectrum analysis, as described in detail above.
- the stability of the resulting variants can also be confirmed with a Thermal Shift Assay. This assay detects protein melting temperatures (Tm) and can thus be used to check protein stability. It can be used to characterize the stability/folding of a protein's 3D structure. SYPRO® Orange is a naturally quenched dye that interacts with the hydrophobic core of proteins which becomes visible following thermal denaturation. As a result, the temperature in the middle of the thermal denaturation process is labelled as melting temperature Tm. This is a way of assessing the stability of the resulting variants or mutants.
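As an illustration of the "temperature in the middle of the thermal denaturation process", the sketch below estimates Tm as the temperature at which the fluorescence signal first reaches the midpoint between its minimum and maximum, using linear interpolation between readings. This is one simple convention and an assumption of this sketch; instrument software often reports the maximum of the first derivative instead. The function name and the readings are made up.

```python
def melting_temperature(temps, fluorescence):
    """Estimate Tm from a thermal shift melt curve.

    Tm is taken as the temperature at which the signal first rises through
    the midpoint between its minimum and maximum, interpolated linearly
    between the two neighbouring readings.
    """
    lo, hi = min(fluorescence), max(fluorescence)
    if hi == lo:
        raise ValueError("flat curve: no unfolding transition")
    half = lo + (hi - lo) / 2.0
    points = list(zip(temps, fluorescence))
    for (t0, f0), (t1, f1) in zip(points, points[1:]):
        if f0 < half <= f1:
            return t0 + (half - f0) * (t1 - t0) / (f1 - f0)
    raise ValueError("signal never rises through the midpoint")

# Made-up readings: temperature in degrees Celsius, fluorescence in
# arbitrary units.
temps = [40, 45, 50, 55, 60, 65, 70]
signal = [100, 110, 150, 500, 900, 980, 1000]
tm = melting_temperature(temps, signal)  # 55.625 for these readings
```

Comparing the Tm of a variant with that of its precursor then gives a quick readout of whether the introduced mutations affected stability.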
- model cargos can be attached or conjugated to the selected variants, in order to quantify the extent of conjugation (conjugation efficiency), i.e., to ascertain whether the resulting protein-based building block with the conjugation sites at the selected solvent- accessible positions will in practice be suitable for the attachment or conjugation of the desired cargos.
- a “model cargo” may be any molecule with a molecular weight higher than, e.g., 100 Da.
- a "model cargo" may be a maleimide-modified alanine (e.g., N-Maleoyl-
- if conjugation of the "model cargo(s)" results in a stable conjugate (protein-based building block with one or more model cargos conjugated to it), with an acceptable extent of conjugation (to be decided on a case-by-case basis, for example >90% conjugation efficiency, such as 90% conjugation efficiency, or 95% conjugation efficiency, or 97% conjugation efficiency, or 99% conjugation efficiency or more), allowing a standard PK in vivo, preserving its globular 3D structure and the conjugation status in vivo, etc., those solvent-accessible positions should be preferred for cargo conjugation, and conjugation of the desired cargo(s) may take place, see also the examples below.
- the at least one conjugation site present in the building block may be generated by introducing specific point mutations at solvent-accessible positions in the peptide sequence of the building block precursor.
- point mutations may be introduced at solvent-accessible positions in the building block precursor in order to generate the protein-based building block comprised in the molecule of the present technology, which comprises at least one conjugation site or attachment point at defined solvent-accessible positions, as described herein.
- the building block precursor may be modified by adding one or more amino acids at the N- and/or C-terminal of the protein sequence, to introduce at least one conjugation site or attachment point preferably at a solvent-accessible position, as described herein, to generate the protein-based building block.
- the at least one conjugation site may be already present preferably at solvent-accessible positions in the protein-based building block precursor, and there is no need of generating it.
- the point mutations are non-destabilizing point mutations.
- Stability of mutants can be calculated with different methods which predict the impact of mutations on protein stability, e.g., based on artificial intelligence (AI).
- stability of mutants can be calculated with MAESTRO, as defined above and explained in detail in the examples, and can also be confirmed empirically by manufacturability (including but not limited to expression level and stability assessment, as described above).
- the point mutations are mutations of amino acids preferably located at solvent-accessible positions in the building block precursor to cysteines.
- the point mutation consists of the replacement of a serine residue preferably in a solvent-accessible position of the building block precursors by a cysteine.
- the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to lysines.
- the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to tyrosines.
- the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to a natural or non-natural amino acid, as described above.
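The point mutations listed above (for example, serine to cysteine at a solvent-accessible position) can be represented and applied at the sequence level as follows. The "S7C" notation, the helper name and the example sequence are assumptions made for this sketch, and only the twenty natural amino acids in one-letter code are handled.

```python
import re

def apply_point_mutation(sequence, mutation):
    """Apply a point mutation written as <original><1-based position><new>,
    e.g. "S7C" replaces the serine at position 7 by a cysteine."""
    m = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if m is None:
        raise ValueError("mutation must look like 'S7C'")
    old, pos, new = m.group(1), int(m.group(2)), m.group(3)
    if not 1 <= pos <= len(sequence):
        raise ValueError("position outside the sequence")
    if sequence[pos - 1] != old:
        raise ValueError(
            "expected %s at position %d, found %s" % (old, pos, sequence[pos - 1])
        )
    return sequence[: pos - 1] + new + sequence[pos:]

# Made-up precursor fragment, for illustration only.
precursor = "EVQLVESGGG"
variant = apply_point_mutation(precursor, "S7C")
```

Checking the original residue before substituting guards against off-by-one errors when positions come from a separate modelling step.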
- the conjugation sites may be generated by adding, in the building block precursor, one or more C- or N-terminal natural and/or one or more C- or N-terminal non-natural amino acid(s) with a reactive group in its side chain.
- the one or more terminal natural or non-natural amino acid is added at the C-terminus of the building block precursor.
- one or more of the conjugation sites is(are) generated by adding a N- or C- terminal cysteine, a N- or C- terminal tyrosine and/or a N- or C-terminal non-natural amino acid to the protein-based building block precursor.
- the at least one protein-based carrier building block comprised in the molecule of the present technology may comprise a N- and/or C-terminal conjugation site or attachment point suitable for conjugation with sortase, as described above.
- the protein-based carrier building block should be engineered to comprise a C-terminal sortase recognition motif (LPXTG, where X can be any amino acid), an N-terminal polyglycine ((Gly)1-5) tag or both.
- the protein-based carrier building block does not specifically bind any non-human protein and/or non-protein molecule, preferably the precursor's target, when a cargo is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, as described above.
- the molecule of the present technology which comprises at least one protein-based building block and at least one cargo attached to the at least one protein-based building block through the at least one conjugation site or attachment point, does not specifically bind any non-human protein or non-protein molecule, such as any human non-protein molecule, as described herein, in particular it does not specifically bind any protein or non-protein molecule to which the building block precursor binds, if any.
- if the protein-based building block or molecule of the present technology shows any interaction with one or more human protein (or non-human protein, or non-protein molecule, as described above), such interaction is characterized by low specificity and/or low affinity, as described in detail above.
- a human protein is a protein present in the human body.
- the Human Protein Atlas (HPA, https://www.proteinatlas.org) is a Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues, and organs.
- Small globular non-human proteins, in the context of the present technology, include proteins which are derived from human proteins, but which have been modified so that they are no longer human proteins.
- small globular non-human proteins are ISVDs, such as "human ISVDs” (e.g., VH, VL) and “non-human ISVDs” (e.g., VHH, non-human VH, VL or engineered ISVDs), DARPins (derived from ankyrin repeat proteins), affibodies or affitins.
- the small globular non-human proteins may have a therapeutic or targeting activity.
- ISVD: immunoglobulin single variable domain
- the protein-based carrier building block comprised in the molecule of the present technology has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa.
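The size criterion above can be checked from the amino acid sequence alone. The sketch below sums standard average residue masses and adds one water molecule for the free termini; it ignores post-translational modifications, capping groups and attached cargo. The function names are hypothetical, and treating "about 2.5 to about 70 kDa" as hard limits is a simplification.

```python
# Average residue masses in Da (amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # Da, accounts for the free N- and C-termini

def molecular_mass_kda(sequence):
    """Approximate average molecular mass of an unmodified peptide chain, in kDa."""
    mass_da = sum(AVG_RESIDUE_MASS[aa] for aa in sequence) + WATER
    return mass_da / 1000.0

def within_size_range(mass_kda, low=2.5, high=70.0):
    """True if the mass falls inside the broadest range named in the text."""
    return low <= mass_kda <= high

# A 120-residue poly-alanine chain as a stand-in for a small domain.
mass = molecular_mass_kda("A" * 120)
ok = within_size_range(mass)
```

The narrower preferred ranges (for example about 10 to about 16 kDa) can be tested by passing different `low` and `high` values.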
- the at least one building block comprised in the molecule of the present technology does not specifically bind to any human protein, as defined in this specification, preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), such as any human non-protein molecule (such as human DNA, human RNA, human glycans, human lipids (e.g., such as phosphatidylserine (PS)), etc.), preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor
- the protein-based carrier building block does not specifically bind any non-human protein and/or non-protein molecule, preferably the precursor's target, when a cargo is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, as described above.
- the molecule of the present technology which comprises at least one protein-based building block and at least one NLS attached to the at least one protein-based building block through the at least one conjugation site or attachment point, does not specifically bind any non-human protein or non-protein molecule, such as any human non-protein molecule, as described herein, in particular it does not specifically bind any protein or non-protein molecule to which the building block precursor binds, if any.
- if the protein-based building block or molecule of the present technology shows any interaction with one or more human protein (or non-human protein, or non-protein molecule, as described above), such interaction is characterized by low specificity and/or low affinity, as described in detail above.
- the resulting ISVD-based building block does not specifically bind to any human protein.
- the ISVD-based building block does not specifically bind to any non-protein molecule, such as any human non-protein molecule.
- the ISVD-based building block does not specifically bind to any non-human protein or non-protein molecule to which the protein-based carrier building block precursor specifically binds, if any, as described above.
- an "ISVD-based building block" refers to a protein-based building block which derives from an ISVD, i.e., which is structurally similar to an ISVD but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the ISVD specifically binds.
- the ISVD-based building block has a sequence identity of at least 60%, or 70%, or 80% with an ISVD, e.g., with its ISVD precursor.
- the ISVD-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with an ISVD, e.g., with its ISVD precursor.
- an ISVD-based building block may share the whole amino acid sequence with its ISVD precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids.
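The sequence identity thresholds above can be illustrated with a minimal calculation. The sketch assumes the building block and its precursor are already aligned and of equal length (point mutations only, no insertions or deletions); real comparisons would normally use an alignment tool. The function names and sequences are hypothetical.

```python
def percent_identity(block, precursor):
    """Percentage of identical positions between two pre-aligned,
    equal-length sequences."""
    if not block or len(block) != len(precursor):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(block, precursor))
    return 100.0 * matches / len(block)

def differing_positions(block, precursor):
    """List (1-based position, precursor residue, building block residue)
    for every position where the two sequences differ."""
    return [
        (i + 1, p, b)
        for i, (b, p) in enumerate(zip(block, precursor))
        if b != p
    ]

# Made-up fragments: one serine-to-cysteine substitution in ten residues.
identity = percent_identity("EVQLVECGGG", "EVQLVESGGG")
changes = differing_positions("EVQLVECGGG", "EVQLVESGGG")
```

A single substitution in a full-length domain of roughly 120 residues would give an identity above 99%, so the lower thresholds in the text leave room for many more changes.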
- the ISVD-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind any human protein and preferably does not specifically bind any protein or non-protein molecule to which the precursor specifically binds.
- the term "immunoglobulin single variable domain" (ISVD) defines immunoglobulin molecules wherein the antigen binding site is present on, and formed by, a single immunoglobulin domain. This sets ISVDs apart from "conventional" immunoglobulins (e.g., monoclonal antibodies) or their fragments (such as Fab, Fab', F(ab')2, scFv, di-scFv), wherein two immunoglobulin domains, in particular two variable domains, interact to form an antigen binding site.
- a heavy chain variable domain (VH) and a light chain variable domain (VL) interact to form an antigen binding site.
- the complementarity determining regions (CDRs) of both VH and VL will contribute to the antigen binding site, i.e., a total of 6 CDRs will be involved in antigen binding site formation.
- the antigen-binding domain of a conventional 4-chain antibody such as an IgG, IgM, IgA, IgD or IgE molecule; known in the art
- a Fab fragment, a F(ab')2 fragment, an Fv fragment such as a disulphide linked Fv or a scFv fragment, or a diabody (all known in the art) derived from such conventional 4-chain antibody would normally not be regarded as an ISVD as, in these cases, binding to the respective epitope of an antigen would normally not occur by one single immunoglobulin domain but by a pair of associating immunoglobulin domains such as light and heavy chain variable domains, i.e., by a VH-VL pair of immunoglobulin domains, which jointly bind to an epitope of the respective antigen.
- ISVDs are capable of specifically binding to an epitope of the antigen without pairing with an additional immunoglobulin variable domain.
- the binding site of an ISVD is formed by a single VH, a single VHH or single VL domain.
- the ISVD building block precursor may be a light chain variable domain sequence (e.g., a VL sequence) or a suitable fragment thereof; or a heavy chain variable domain sequence (e.g., a VH sequence or VHH sequence) or a suitable fragment thereof; as long as the resulting building block has a globular 3D structure, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and is soluble, as defined in detail above.
- a light chain variable domain sequence (e.g., a VL sequence)
- a heavy chain variable domain sequence (e.g., a VH sequence or VHH sequence)
- An ISVD which may preferably be the precursor of the protein-based building block comprised in the molecule of the present technology can for example be a heavy chain ISVD, such as a VH, VHH, including a camelized VH or humanized VHH.
- the protein-based building block precursor is a VHH, including a camelized VH or humanized VHH, as long as the resulting protein-based building block is soluble, has a globular 3D structure, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to human proteins.
- the resulting building block does not specifically bind to any non-protein molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans, e.g., glycolipids.
- the resulting building block does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above.
- Heavy chain ISVDs can be derived from a conventional four-chain antibody or from a heavy chain antibody.
- the ISVD precursor may be a single domain antibody (or an amino acid sequence that is suitable for use as a single domain antibody), a "dAb" or dAb (or an amino acid sequence that is suitable for use as a dAb) or a Nanobody® ISVD (as defined herein, and including but not limited to a VHH); other single variable domains, or any suitable fragment of any one thereof, as long as the resulting protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein (human) molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans, e.g., glycolipids, and, preferably, does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above.
- the ISVD precursor is a VH, a humanized VH, a human VH, a VHH, a humanized VHH or a camelized VH. More preferably, the ISVD precursor is a Nanobody® ISVD (such as a VHH, including a humanized VHH or camelized VH) or a suitable fragment thereof, as long as the protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein (human) molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans, e.g., glycolipids, and, preferably, does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above.
- Nanobody® is a registered trademark from Ablynx N.V.
- VHH domains, also known as VHHs, VHH antibody fragments, and VHH antibodies
- VHH domains have originally been described as the antigen binding immunoglobulin variable domain of "heavy chain antibodies”; i.e., of “antibodies devoid of light chains”, see Hamers-Casterman et al., Nature, 363: 446-448, 1993.
- the term “VHH domain” has been chosen in order to distinguish these variable domains from the heavy chain variable domains that are present in conventional 4-chain antibodies, which are referred to herein as "VH domains”, and from the light chain variable domains that are present in conventional 4-chain antibodies, which are referred to herein as "VL domains".
- VHH domains can be obtained from heavy chain-only antibodies (HCAbs) that are circulating in Camelidae, see e.g., Muyldermans S., "A guide to: generation and design of nanobodies", FEBS J., 2021, 288(7):2084-2102.
- HCAbs heavy chain-only antibodies
- the ISVD-based building block has a sequence identity of at least 80% with a VHH (such as a humanized VHH or camelized VH), e.g., its VHH precursor.
- the ISVD-based building block has a sequence identity of at least 60%, or at least 70%, or at least 80%, or at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with a VHH, e.g., its VHH precursor.
- the ISVD-based building block may share the whole amino acid sequence with its VHH precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids, which are different in the protein-based carrier building block.
- immunoglobulins typically involve the immunization of experimental animals, fusion of immunoglobulin producing cells to create hybridomas and screening for the desired specificities.
- immunoglobulins can be generated by screening of naive, immune, or synthetic libraries, e.g., by phage display.
- VHHs immunoglobulin sequences
- WO 94/04678, Hamers-Casterman et al. 1993 ("Naturally occurring antibodies devoid of light chains", Nature, 363: 446-448, 1993) and Muyldermans et al. 2001 ("Single domain camel antibodies: current status", J Biotechnol., 2001, 74: 277-302) can be exemplified.
- camelids are immunized with the target antigen in order to induce an immune response against said target antigen.
- the repertoire of VHHs obtained from said immunization is further screened for VHHs that bind (or not) a target antigen.
- immunoglobulin sequences of different origin may be used, comprising mouse, rat, rabbit, donkey, human and camelid immunoglobulin sequences.
- fully human, humanized or chimeric sequences are also included.
- a “camelized VH” comprises an amino acid sequence that corresponds to the amino acid sequence of a naturally occurring VH domain, but that has been “camelized”, i.e., by replacing one or more amino acid residues in the amino acid sequence of a naturally occurring VH domain from a conventional 4-chain antibody by one or more of the amino acid residues that occur at the corresponding position(s) in a VHH domain of a heavy chain antibody.
- This can be performed in a manner known per se, which will be clear to the skilled person, for example on the basis of the further description herein and the prior art (e.g. WO 2008/020079).
- the structure of an ISVD sequence can be considered to be comprised of four framework regions ("FRs"), which are referred to in the art and herein as "Framework region 1" ("FR1"); as "Framework region 2" ("FR2"); as "Framework region 3" ("FR3"); and as "Framework region 4" ("FR4"), respectively; which framework regions are interrupted by three complementarity determining regions ("CDRs"), which are referred to in the art and herein as "Complementarity Determining Region 1" ("CDR1"); as "Complementarity Determining Region 2" ("CDR2"); and as "Complementarity Determining Region 3" ("CDR3"), respectively.
- the determination of CDR regions may also be done according to different methods.
- FR1 of an ISVD comprises the amino acid residues at positions 1-30
- CDR1 of an ISVD comprises the amino acid residues at positions 31-35
- FR2 of an ISVD comprises the amino acids at positions 36-49
- CDR2 of an ISVD comprises the amino acid residues at positions 50-65
- FR3 of an ISVD comprises the amino acid residues at positions 66-94
- CDR3 of an ISVD comprises the amino acid residues at positions 95-102
- FR4 of an ISVD comprises the amino acid residues at positions 103-113.
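The region boundaries listed above can be expressed as a small lookup. The sketch below maps a Kabat position label to its framework or CDR region. It assumes that insertion positions such as 52a or 100f belong to the region of their numeric part; the function and variable names are hypothetical.

```python
import re

# Region boundaries as listed above (Kabat positions, inclusive).
KABAT_REGIONS = [
    ("FR1", 1, 30),
    ("CDR1", 31, 35),
    ("FR2", 36, 49),
    ("CDR2", 50, 65),
    ("FR3", 66, 94),
    ("CDR3", 95, 102),
    ("FR4", 103, 113),
]


def kabat_region(position: str) -> str:
    """Return the region name for a Kabat position label such as '11' or '82b'."""
    match = re.fullmatch(r"(\d+)([a-z]?)", position.strip().lower())
    if match is None:
        raise ValueError(f"not a Kabat position label: {position!r}")
    number = int(match.group(1))
    # Assumption: an insertion letter does not change the region.
    for name, start, end in KABAT_REGIONS:
        if start <= number <= end:
            return name
    raise ValueError(f"position {position!r} is outside 1-113")


for label in ["11", "52a", "89", "100f", "108"]:
    print(label, kabat_region(label))
```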
- the framework sequences are a suitable combination of immunoglobulin framework sequences or framework sequences that have been derived from immunoglobulin framework sequences, for example by humanization or camelization.
- the framework sequences may be framework sequences derived from a light chain variable domain (e.g., a VL-sequence) and/or from a heavy chain variable domain (e.g., a VH-sequence or VHH sequence).
- the framework sequences are either framework sequences that have been derived from a VHH-sequence in which said framework sequences may optionally have been partially or fully humanized or are conventional VH sequences that have been camelized (as defined herein).
- the ISVD sequence is a naturally occurring sequence (from any suitable species) or a synthetic or semi-synthetic sequence, including but not limited to "humanized” (as defined herein) immunoglobulin sequences (such as partially or fully humanized mouse or rabbit immunoglobulin sequences, and in particular partially or fully humanized VHH sequences), "camelized” (as defined herein) immunoglobulin sequences, as well as immunoglobulin sequences that have been obtained by techniques such as affinity maturation (for example, starting from synthetic, random or naturally occurring immunoglobulin sequences), CDR grafting, veneering, combining fragments derived from different immunoglobulin sequences, PCR assembly using overlapping primers, and similar techniques for engineering immunoglobulin sequences well known to the skilled person; or any suitable combination of any of the foregoing.
- nucleotide sequences may be naturally occurring nucleotide sequences or synthetic or semi-synthetic sequences, and may for example be sequences that are isolated by PCR from a suitable naturally occurring template, e.g., DNA or RNA isolated from a cell, nucleotide sequences that have been isolated from a library (and in particular, an expression library), nucleotide sequences that have been prepared by introducing mutations into a naturally occurring nucleotide sequence (using any suitable technique known per se, such as mismatch PCR), nucleotide sequences that have been prepared by PCR using overlapping primers, or nucleotide sequences that have been prepared using techniques for DNA synthesis known per se.
- the at least one protein-based carrier building block comprised in the molecule of the present technology is derived from a Nanobody® ISVD belonging to the so-called "VH3 class", i.e. Nanobody® ISVDs with a high degree of sequence homology to human germline sequences of the VH3 class such as DP-47, DP-51 or DP-29, as long as the protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein human molecule and preferably also does not specifically bind to any non-human protein to which the ISVD precursor specifically binds, as described above.
- Nanobody® ISVDs in particular VHH sequences, including (partially) humanized VHH sequences and camelized VH sequences
- a Nanobody® ISVD can be defined as an immunoglobulin sequence with the (general) structure
- FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4 in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which the framework sequences are as further defined herein.
- FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4 in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which: one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to the Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 below.
- Nanobody® ISVD can be defined as an amino acid sequence with the (general) structure
- FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4 in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to the Kabat numbering are chosen from the Hallmark residues mentioned in Table 3.
- the protein-based building block comprised in the molecule of the present technology may derive from anti-viral ISVDs, such as from anti-viral VHH or Nanobody® ISVDs.
- the building block comprised in the molecule of the present technology may derive from a functional ISVD (i.e., an ISVD which specifically binds to human proteins, and/or to non-human proteins, such as viral proteins and/or bacterial proteins, and/or to non-protein molecules, such as human non-protein molecules) which has been engineered/modified so that it no longer specifically binds to any human protein, preferably which has been engineered/modified so that it also no longer specifically binds to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably which has been engineered/modified so that it also no longer specifically binds to any non-protein molecule to which it originally bound, if any.
- the ISVD-based building block comprised in the molecule of the present technology derives from an ISVD, such as from a heavy-chain ISVD, preferably from a Nanobody® ISVD, which has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325.
- ISVD precursor residues preferably located at solvent-accessible positions should be identified to generate the at least one conjugation site, as described in detail above in this description. Additionally or alternatively, one or more conjugation site(s) may already be in the ISVD precursor, either as reactive groups in the side chain of amino acids preferably located at solvent-accessible positions or as free N-terminal primary amine and/or free C-terminal carboxylic acid.
- the at least one protein-based building block carrier comprised in the molecule of the present technology is derived from an ISVD, such as an ISVD belonging to the so-called "VH3 class", wherein the resulting building block comprises at least one cysteine, at least one lysine, at least one non-natural amino acid and/or at least one tyrosine, preferably located at one or more solvent-accessible positions.
- the at least one protein-based building block comprised in the molecule of the present technology is derived from an ISVD, such as an ISVD belonging to the so-called "VH3 class", wherein the resulting building block comprises at least one engineered cysteine, at least one engineered lysine, at least one non-natural amino acid and/or at least one engineered tyrosine, preferably located at one or more solvent-accessible positions.
- the protein-based carrier building block comprised in the molecule of the present technology, when it is derived from an ISVD, preferably from a VHH (including humanized VHH or camelized VH) or Nanobody® ISVD, as described above, comprises a leucine at position 108 (according to Kabat numbering).
- the protein-based building block carrier comprised in the molecule of the present technology when it is derived from an ISVD, as described above, comprises a valine at position 11, a leucine at position 89 and/or a leucine at position 108 (according to Kabat numbering).
- the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186:
- X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
- X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any amino acid with a reactive group in its side chain, such as cysteine;
- X8 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
- X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X15 (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X16 (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X17 (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X19 (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X20 (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X23 (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X24 (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X25 (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X26 (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X27 (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X28 (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X29 (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X30 (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X31 (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
- X32 (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X33 (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X34 (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X35 (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X36 (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X37 (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X38 (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X39 (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X40 (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X42 (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X43 (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X44 (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X45 (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X46 (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- X47 (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X48 (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X51 (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X52 (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X53 (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X54 (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X55 (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X56 (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X57 (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X58 (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X59 (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X60 (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
- X61 (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X62 (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X63 (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
- X64 (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X65 (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
- X66 (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X67 (position 100d according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
- X68 (position 100e according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X69 (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X70 (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
- X71 (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X72 (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
- X76 (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
- X77 (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X78 (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X80 is absent or Gly
- X81 is absent or Gly
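The scheme used in the list above, where each variable position carries either its default residue or a residue with a reactive side chain such as cysteine, can be sketched as a lookup from Kabat position to default residue. The example below covers only the first few positions of the list and generates conventional substitution labels (default residue, position, replacement). The names `DEFAULT_RESIDUES` and `substitution_labels` are hypothetical, and the full sequence of SEQ ID NO.: 186 is not reproduced here.

```python
# Subset of the variable positions listed above: Kabat position -> default residue.
DEFAULT_RESIDUES = {
    "1": "Glu",
    "3": "Gln",
    "5": "Val",
    "7": "Ser",
    "8": "Gly",
    "10": "Gly",
}

# Standard one-letter codes for the residues used in this example.
THREE_TO_ONE = {"Glu": "E", "Gln": "Q", "Val": "V", "Ser": "S", "Gly": "G", "Cys": "C"}


def substitution_labels(defaults, replacement="Cys"):
    """List one single-substitution label per variable position, e.g. 'S7C'."""
    new = THREE_TO_ONE[replacement]
    return [f"{THREE_TO_ONE[residue]}{position}{new}"
            for position, residue in defaults.items()]


print(substitution_labels(DEFAULT_RESIDUES))
```

Each label describes one candidate conjugation site; whether a given site is suitable still depends on its solvent accessibility, as discussed in the description.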
- the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
- the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186 as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
- the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186 as defined above, wherein SEQ ID NO.: 186 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 186 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 186 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or SEQ ID NO.: 186 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
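Because the preferences above are joined by "and/or", each can be checked independently. The sketch below reports, for a hypothetical set of residues keyed by Kabat position, which of the stated preferences are met. The data layout and the function name are assumptions for illustration only.

```python
from typing import Dict

# Preferred residues per Kabat position, as stated above.
PREFERRED = {
    "11": {"Val"},
    "89": {"Thr", "Leu"},
    "110": {"Lys", "Gln"},
    "112": {"Lys", "Gln"},
}


def preferred_matches(residues: Dict[str, str],
                      c_terminal_extension: str = "") -> Dict[str, bool]:
    """Report which of the independent ('and/or') preferences are satisfied."""
    result = {position: residues.get(position) in allowed
              for position, allowed in PREFERRED.items()}
    # The extension preference is a length of 1 to 5 amino acids.
    result["c_terminal_extension"] = 1 <= len(c_terminal_extension) <= 5
    return result


# Hypothetical candidate: residues given as three-letter codes, one-residue extension.
candidate = {"11": "Val", "89": "Leu", "110": "Thr", "112": "Ser"}
print(preferred_matches(candidate, c_terminal_extension="A"))
```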
- the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 186 as defined above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 186 as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 186 as defined above, wherein SEQ ID NO.: 186 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 186 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 186 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or SEQ ID NO.: 186 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
- the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 206:
- X1a (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X1 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- Z1 (position 11 according to Kabat numbering)
- X2 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X3 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X4 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X5 (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X6 (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X7 (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X7b (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X7c (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X8 (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X9 (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X10 (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X11 (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X12 (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X14 (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X15 (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X16 (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X17 (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X18 (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X19 (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- Z2 (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
- X20 (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- Z3 (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
- X23 (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X24 is absent or Gly
- X25 is absent or Gly
- X26 is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 206, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 206, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind
- the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
- the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 206 as defined above, wherein SEQ ID NO.: 206 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 206 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 206 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or SEQ ID NO.: 206 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
- the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 206, as defined above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 206, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 206 as defined above, wherein SEQ ID NO.: 206 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 206 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 206 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or SEQ ID NO.: 206 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
- the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185:
- X1 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- Z1 (position 11 according to Kabat numbering)
- X2 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
- X3 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X4 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X5 (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X6 (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X7 (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X8 (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X9 (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X10 (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
- X11 (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X12 (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
- X13 (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X14 (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
- X15 (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X16 (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X17 (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
- X18 (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
- X19 (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- Z2 (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
- X20 (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
- X21 (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
- X22 (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
- Z3 (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
- X23 (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
- X24 is absent or Gly;
- X25 is absent or Gly;
- X26 is absent or Cys, or a sequence which has 80% or more sequence identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind
- the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
- the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
- the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185 as defined above, wherein SEQ ID NO.: 185 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 185 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 185 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or SEQ ID NO.: 185 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
- the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 185, as defined above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 185, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
- the polypeptide and/or molecule comprise SEQ ID NO.: 185 as defined above, wherein SEQ ID NO.: 185 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors.
- the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 185 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 185 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or SEQ ID NO.: 185 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
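For orientation, the variable X positions defined for SEQ ID NO.: 185 can be held in a small lookup table. The following is a minimal, illustrative Python sketch and not part of the disclosure: the names `X_POSITIONS` and `conjugation_sites` are hypothetical, the Kabat positions and default residues are copied from the definitions of X1 to X23 given for SEQ ID NO.: 185, and position 7 for X1 is taken from the embodiments in which X1 is stated with its position.

```python
# Illustrative sketch only; names are hypothetical, data copied from the
# definitions of X1 to X23 for SEQ ID NO.: 185 (label -> (Kabat position, default residue)).
X_POSITIONS = {
    "X1": ("7", "Ser"),    "X2": ("13", "Gln"),   "X3": ("17", "Ser"),
    "X4": ("19", "Ser"),   "X5": ("21", "Ser"),   "X6": ("23", "Ala"),
    "X7": ("25", "Ser"),   "X8": ("31", "Asn"),   "X9": ("43", "Lys"),
    "X10": ("44", "Glu"),  "X11": ("55", "Asp"),  "X12": ("62", "Asn"),
    "X13": ("65", "Gly"),  "X14": ("68", "Thr"),  "X15": ("70", "Ser"),
    "X16": ("72", "Asp"),  "X17": ("74", "Ala"),  "X18": ("75", "Lys"),
    "X19": ("82b", "Ser"), "X20": ("100a", "Gly"), "X21": ("100f", "Asp"),
    "X22": ("105", "Arg"), "X23": ("112", "Ser"),
}


def conjugation_sites(variant):
    """Return (label, Kabat position, residue) for each X position whose
    residue differs from the default residue listed above."""
    sites = []
    for label, residue in variant.items():
        position, default = X_POSITIONS[label]
        if residue != default:
            sites.append((label, position, residue))
    return sites


# Hypothetical variant: cysteines at X9 and X21, default residue at X2.
example_variant = {"X9": "Cys", "X21": "Cys", "X2": "Gln"}
print(conjugation_sites(example_variant))
```

The table deliberately omits Z1 to Z3 and X24 to X26, which are defined differently (framework residues and optional C-terminal residues) and would need their own handling.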
- the protein-based carrier building block comprises at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as three amino acids with a reactive group in their side chains, such as three cysteines, or three lysines, or three tyrosines, or three non-natural amino acids, preferably three cysteines, in the following solvent-accessible positions in SEQ ID NO.: 179 according to Kabat numbering:
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) is Thr;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) is Gly;
- X21 (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) is Thr;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X21 (position 100f according to Kabat numbering) is Asp;
- X22 (position 105 according to Kabat numbering) is Arg;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) is Lys;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) is Gly;
- X21 (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X22 (position 105 according to Kabat numbering) is Arg;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X1 (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X7 (position 25 according to Kabat numbering) is Ser;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) is Lys;
- X10 (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X11 (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) is Thr;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) is Gly;
- X21 (position 100f according to Kabat numbering) is Asp;
- X22 (position 105 according to Kabat numbering) is Arg;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X3 (position 17 according to Kabat numbering) is Ser;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) is Lys;
- X10 (position 44 according to Kabat numbering) is Glu;
- X14 (position 68 according to Kabat numbering) is Thr;
- X17 (position 74 according to Kabat numbering) is Ala;
- X21 (position 100f according to Kabat numbering) is Asp;
- X22 (position 105 according to Kabat numbering) is Arg;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X2 (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X9 (position 43 according to Kabat numbering) is Lys;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- X21 (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X22 (position 105 according to Kabat numbering) is Arg;
- X25 is absent
- X26 is absent
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) is Lys;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) is Thr;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) is Gly;
- X22 (position 105 according to Kabat numbering) is Arg;
- X26 is Cys.
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X8 (position 31 according to Kabat numbering) is Asn;
- X9 (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X12 (position 62 according to Kabat numbering) is Asn;
- X14 (position 68 according to Kabat numbering) is Thr;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) is Gly;
- X21 (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X22 (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X25 is absent
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) is Ser;
- X6 (position 23 according to Kabat numbering) is Ala;
- X9 (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X10 (position 44 according to Kabat numbering) is Glu;
- X12 (position 62 according to Kabat numbering) is Asn;
- X13 (position 65 according to Kabat numbering) is Gly;
- X14 (position 68 according to Kabat numbering) is Thr;
- X15 (position 70 according to Kabat numbering) is Ser;
- X16 (position 72 according to Kabat numbering) is Asp;
- X17 (position 74 according to Kabat numbering) is Ala;
- X18 (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- Z2 (position 89 according to Kabat numbering) is Val or Leu;
- X20 (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
- X21 (position 100f according to Kabat numbering) is Asp;
- X22 (position 105 according to Kabat numbering) is Arg;
- Z3 (position 108 according to Kabat numbering) is Gln or Leu;
- X23 (position 112 according to Kabat numbering) is Ser;
- X24 is absent;
- X25 is absent;
- X26 is absent, or
- X1 (position 7 according to Kabat numbering) is Ser;
- X2 (position 13 according to Kabat numbering) is Gln;
- X5 (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Abstract
The present technology relates to the field of drug delivery and provides molecules comprising or consisting of at least one protein-based carrier building block, wherein the protein-based carrier building block comprises at least one, preferably at least two, attachment point(s) or conjugation site(s). In particular, the technology provides a molecule comprising at least one protein-based building block, wherein the at least one protein-based building block: a) comprises at least one conjugation site or attachment point; b) has a molecular mass of 2.5 to 70 kDa; c) has a globular three-dimensional (3D) structure; d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at room temperature; and e) does not specifically bind to any human protein or binds one or more human proteins with a KD value greater than 5 × 10⁻⁴ mol/litre, wherein the molecule further comprises at least one nuclear localization sequence (NLS), covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, wherein the NLS preferably comprises or consists of SEQ ID NO.: 221.
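The building-block criteria listed in the abstract can be read as a simple checklist. The following minimal Python sketch is illustrative only and not part of the disclosure: the class `BuildingBlock`, its field names and the function `meets_abstract_criteria` are hypothetical, and the thresholds are copied from the abstract (at least one conjugation site, 2.5 to 70 kDa, globular, solubility of 10 mg/mL or more, and either no specific binding to human proteins or a KD greater than 5 × 10⁻⁴ mol/litre). How each property is actually measured is outside the scope of the sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BuildingBlock:
    """Hypothetical record of measured properties of a candidate building block."""
    conjugation_sites: int
    mass_kda: float
    globular: bool
    solubility_mg_per_ml: float           # aqueous solution, room temperature
    kd_human_mol_per_l: Optional[float]   # None = no specific binding to any human protein


def meets_abstract_criteria(block: BuildingBlock) -> bool:
    """Check the five building-block criteria stated in the abstract."""
    weak_or_no_binding = (
        block.kd_human_mol_per_l is None or block.kd_human_mol_per_l > 5e-4
    )
    return (
        block.conjugation_sites >= 1
        and 2.5 <= block.mass_kda <= 70
        and block.globular
        and block.solubility_mg_per_ml >= 10
        and weak_or_no_binding
    )


# Hypothetical candidate values, chosen for illustration only.
candidate = BuildingBlock(
    conjugation_sites=2,
    mass_kda=14.0,
    globular=True,
    solubility_mg_per_ml=25.0,
    kd_human_mol_per_l=None,
)
print(meets_abstract_criteria(candidate))
```

The sketch covers only the building block; the additional requirement that the molecule carries at least one covalently linked NLS would be a separate check on the assembled molecule.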
Description
PROTEIN-BASED CONJUGATION CARRIERS FOR INTRANUCLEAR DELIVERY
FIELD OF THE TECHNOLOGY
The present technology provides molecules comprising or consisting of at least one protein-based carrier building block, wherein the protein-based carrier building block comprises at least one, preferably at least two attachment point(s) or conjugation site(s), and at least one nuclear localization sequence (NLS), covalently linked to at least one conjugation site or attachment point comprised in the protein-based carrier building block, directly or by means of a linker.
The present technology further relates to nucleic acids encoding such molecules or part of such molecules; to host cells comprising such nucleic acids and/or expressing or capable of expressing such molecules or part of such molecules; to compositions, and in particular to pharmaceutical compositions that comprise such molecules, nucleic acids and/or host cells; and to uses of such molecules, nucleic acids, host cells and/or compositions, in particular for labelling, prophylactic, therapeutic and/or diagnostic purposes.
TECHNOLOGICAL BACKGROUND
Protein-based therapeutics (referred to as "Biologicals") are creating new therapeutic strategies which are hard to achieve via the classic, small molecule-based therapeutics. One fast-growing field comprises conjugation-based targeted therapeutics, such as antibody-drug conjugates (ADCs). Targeted drug delivery is critical for improving the therapeutic benefits of drugs, such as anticancer drugs. An active delivery strategy increases the bioavailability of drugs not only in the diseased tissue and, subsequently, in individual target cells, but also at the active sites inside the target organelles where certain drugs carry out their desired pharmacological activities. Many drugs act within the nucleus; therefore, effective drug delivery to the nucleus of a target cell may be critical for their therapeutic action, see, e.g., Roy et al., "Overcoming the barriers of nuclear-targeted drug delivery using nanomedicine-based strategies for enhanced anticancer therapy", Journal of Drug Delivery Science and Technology, 2023, 83. The nucleus is enclosed by a double membrane called the nuclear envelope. To allow the exchange of proteins between the nucleus and cytoplasm, proteins must be transported efficiently through the nuclear pore complex (NPC), which penetrates the nuclear envelope. The NPC is a large, multimeric structure that generally acts as a permeability barrier between the cytoplasm and nucleoplasm. The main structural components of the NPC include the central channel, the cytoplasmic ring moiety and cytoplasmic filaments, and the nuclear ring moiety and nuclear basket. The NPC has eightfold rotational symmetry. Each NPC is connected to the inner and outer nuclear membranes by eight symmetrically arranged spoke proteins, which together surround a central channel with an outer diameter of 122 nm and an inner diameter of 70 nm. Diverse proteins, such as transcription factors, histones, and cell cycle regulators, need to be transported into the nucleus through the NPC after their synthesis, which necessitates the presence of a nuclear localization signal or nuclear localization sequence (NLS) on these proteins. The NLS is recognized by the corresponding nuclear transporters, which can interact with nucleoporins to help NLS-containing proteins reach the nucleus through NPCs (Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60). NLSs have been used to deliver molecules such as nucleic acids, proteins, and nanoparticles intranuclearly.
Currently, the manufacture of biologicals mainly relies on recombinant production and is hence restricted to polypeptide production which can combine naturally into one functional unit (i.e., IgGs).
There is a need for further targeted therapeutic strategies, in particular strategies targeted to the cell nucleus, which allow for versatile production and cargo conjugation.
SUMMARY OF THE TECHNOLOGY
The current technology aims at simplifying the generation of conjugation-based therapeutics, in particular of targeted conjugation-based therapeutics, and/or at creating a plug-and-play strategy that can create alternative formats which allow for versatile conjugation of different cargos.
Whereas classic conjugation strategies need to focus on preserving the functionality of the involved polypeptides, as described above, the present technology employs a non-targeting protein-based carrier building block which solely serves as a site-specific conjugation vehicle (protein-based carrier building block). This protein-based carrier building block can be contained within a genetic construct, and thus be the product of one manufacturing campaign or, alternatively, can be produced separately (e.g., recombinantly or by alternative means such as solid-phase peptide synthesis, SPPS) and later connected to the active and/or targeting moiety (i.e., the cargo), as will be explained in detail below. The latter strategy affords more freedom in the conditions used for conjugating cargo onto the protein-based carrier building block. Such freedom can translate into making use of site-specific conjugation onto common amino acids which are usually used in a stochastic way (e.g., lysines) or into conjugation conditions which might otherwise impair the functionality/quality of the targeting building block(s) or into alternative production platforms (e.g., chemical synthesis). The position and number of the conjugation sites or attachment points can be engineered/tuned to the specific application. Advantageously, the molecules of the present technology comprise at least one NLS, which allows the molecule to reach the intranuclear space, for targeted delivery of drugs into the cell nucleus.
Hence, the present technology provides molecules comprising or consisting of at least one protein-based carrier building block, wherein the protein-based carrier building block comprises at least one attachment point or conjugation site, preferably at least two attachment points or conjugation sites. The protein-based building block comprises at least one nuclear localization sequence (NLS), covalently linked to at least one conjugation site or attachment point comprised in it. The conjugation sites or attachment points are suitable for conjugation or attachment of NLS and optionally further cargos to the protein-based carrier building block, as described herein. A "cargo" is any molecule which is/may be attached or conjugated to the protein-based carrier building block through the attachment point(s) or conjugation site(s) present therein. For instance, cargos which may be attached or conjugated to the protein-based carrier building block comprised in the molecule of the present technology are proteins such as targeting proteins, peptides such as NLSs and cell-penetrating peptides (CPPs), polyethylene glycol (PEG), small molecules, glycans, lipids, chelators, fluorophores, radioisotopes, vitamins such as folic acid or biotin, nucleic acids such as Antisense Oligonucleotides (ASOs) etc. Hence, in a further embodiment, the present technology provides the protein-based carrier building block attached to at least one NLS, as described above, and further attached to another cargo, as described herein, i.e., the molecule of the present technology comprises (or, alternatively, consists of) at least one protein-based building block comprising at least one NLS, as described herein, and at least one further cargo attached or conjugated to it through at least one conjugation site or attachment point. The further cargo may preferably be a cell-penetrating peptide or a targeting molecule, such as a cell-targeting moiety.
The protein-based carrier building block comprised in the molecule of the present technology comprises (and, preferably, consists of) at least part of a protein, preferably a whole protein. Hence, preferably, the protein-based carrier building block is a polypeptide. The protein-based carrier building block has a globular 3D structure and is soluble. In addition, the protein-based carrier building block comprised in the molecule of the present technology has a size (molecular mass or molecular weight, MW) of about 2.5 to about 70 kDa, preferably of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa, such as about 6 kDa, or about 7 kDa, or about 16 kDa.
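The nested size preferences in the preceding paragraph can be made explicit with a short lookup. This is an illustrative Python sketch only: the names `SIZE_TIERS` and `narrowest_size_tier` and the tier labels are hypothetical, and the numeric bounds are treated as exact even though the text qualifies each of them with "about".

```python
# Illustrative sketch; tier labels are hypothetical shorthand for the wording
# "preferably", "more preferably" and "even more preferably" in the text.
# Each entry: (label, lower bound in kDa, upper bound in kDa, upper bound inclusive?)
SIZE_TIERS = [
    ("even more preferred", 2.5, 16.0, True),
    ("more preferred", 2.5, 30.0, True),
    ("preferred", 2.5, 50.0, False),   # "less than 50 kDa"
    ("broadest range", 2.5, 70.0, True),
]


def narrowest_size_tier(mass_kda):
    """Return the label of the narrowest range containing mass_kda, or None."""
    for label, low, high, high_inclusive in SIZE_TIERS:
        below_high = mass_kda <= high if high_inclusive else mass_kda < high
        if mass_kda >= low and below_high:
            return label
    return None


for mass in (7.0, 20.0, 50.0, 80.0):
    print(mass, narrowest_size_tier(mass))
```

Because the tiers are checked from narrowest to widest, a 7 kDa building block is reported against the 2.5 to 16 kDa range even though it also satisfies every wider range.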
Finally, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein, although it may show non-specific binding to one or more human proteins, as explained in detail herein. In this case, the protein-based carrier building block may bind to human proteins with low specificity and/or low selectivity, as defined herein. Preferably, the protein-based carrier building block also does not specifically bind to any non-protein (preferably human) molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans. The protein-based carrier building block may derive from a target-binding protein (such as an immunoglobulin single variable domain (ISVD), a DARPin, an affibody or an affitin), as described below. It may also derive from other proteins which show specific binding towards, e.g., human proteins, such as small globular human proteins. This is the so-called "protein-based carrier building block precursor". In these cases, preferably, the protein-based building block also does not specifically bind to any molecule (including non-human proteins) to which the protein-based carrier building block precursor specifically binds (if any). For example, if the precursor of the protein-based carrier building block is an anti-RSV (respiratory syncytial virus) ISVD, the protein-based carrier building block preferably does not specifically bind RSV. Hence, preferably, the protein-based carrier building block also does not specifically bind to the precursor's target, should the precursor have a target and should this be a non-human molecule, such as a non-human protein, or a human non-protein molecule, such as human DNA, RNA, glycans, lipids, etc. In a further preferred embodiment, the protein-based carrier building block does not specifically bind any human protein, non-human protein and/or non-protein molecule when a cargo is conjugated to the at least one, preferably at least two, attachment points or conjugation sites on the protein-based carrier building block.
Hence, the at least one protein-based carrier building block comprised in the molecule of the present technology:
a) Has at least one attachment point (also referred to as conjugation site in the present description), preferably at least two attachment points or conjugation sites, wherein an attachment point or conjugation site is a reactive group in the side chain of a non-natural or natural amino acid (e.g., Cys, Lys, Tyr, Orn, etc.) preferably located at a solvent-accessible position in the protein-based carrier building block, and/or the N-terminal primary amine, and/or the C-terminal carboxylic group of the protein-based carrier building block, if these are available. The at least one, preferably at least two, attachment point(s) or conjugation site(s) is(are) thus preferably located at solvent-accessible positions in the protein-based carrier building block;
b) Has a size (molecular mass) of from about 2.5 to about 70 kDa, preferably from about 2.5 to about 50 kDa, such as from about 2.5 to less than 50 kDa, more preferably from about 2.5 to about 30 kDa, even more preferably from about 2.5 to about 16 kDa;
c) Has a solubility of about 10 mg/mL or more, measured in an aqueous solution at room temperature (RT), preferably measured in a buffer or water at RT, more preferably measured in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), preferably wherein the buffer is citrate buffer 5 mM or PBS, at pH 7.0 or 7.4;
d) Has a globular 3D structure, as described below;
e) Does not specifically bind to any human protein (or binds one or more human proteins with a KD (KD value) greater than 5×10⁻⁴ mol/litre), preferably it also does not specifically bind to the precursor's target (or binds the precursor target, which may be a non-human protein or a non-protein molecule, with a KD (KD value) greater than 5×10⁻⁴ mol/litre), and preferably it also does not specifically bind to any non-protein molecule, such as nucleic acids (e.g., DNA, RNA), lipids or glycans, preferably it does not specifically bind to any human non-protein molecule (or binds any non-protein molecule with a KD (KD value) greater than 5×10⁻⁴ mol/litre), preferably it also does not specifically bind to any non-protein molecule (such as nucleic acids (e.g., DNA, RNA), glycans, lipids, etc.), to which the building block precursor binds specifically, if any, for instance as determined by cell-binding assay or by surface plasmon resonance (SPR), for instance as described herein and/or in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to a virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214,
wherein the protein-based carrier building block comprises at least one nuclear localization sequence (NLS), covalently linked (directly or by means of a linker) to the at least one conjugation site or attachment point.
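By way of illustration only, the mandatory criteria a) to e) above can be read as a simple screening rule. The following is a minimal sketch under stated assumptions: the record type `CarrierCandidate`, its field names and the function `failed_criteria` are hypothetical and not part of the present description, the "about" qualifiers are treated as hard bounds, and a missing KD value is taken to mean that no binding was detected.

```python
from dataclasses import dataclass
from typing import List, Optional

KD_THRESHOLD_M = 5e-4          # mol/litre; a KD above this counts as non-specific binding
MASS_RANGE_KDA = (2.5, 70.0)   # broadest size range given in criterion b)
MIN_SOLUBILITY_MG_ML = 10.0    # criterion c)


@dataclass
class CarrierCandidate:
    """Hypothetical record of measured properties for one candidate building block."""
    name: str
    mass_kda: float
    solubility_mg_ml: float
    is_globular: bool
    conjugation_sites: int
    kd_human_protein_m: Optional[float] = None  # None: no binding detected


def failed_criteria(c: CarrierCandidate) -> List[str]:
    """Return labels for those of criteria a) to e) that the candidate does not meet."""
    failures = []
    if c.conjugation_sites < 1:
        failures.append("a) conjugation site")
    if not MASS_RANGE_KDA[0] <= c.mass_kda <= MASS_RANGE_KDA[1]:
        failures.append("b) size")
    if c.solubility_mg_ml < MIN_SOLUBILITY_MG_ML:
        failures.append("c) solubility")
    if not c.is_globular:
        failures.append("d) globular structure")
    # A measured KD at or below the threshold indicates binding that is too tight.
    if c.kd_human_protein_m is not None and c.kd_human_protein_m <= KD_THRESHOLD_M:
        failures.append("e) human-protein binding")
    return failures


# Illustrative, made-up property values.
ok = CarrierCandidate("example-carrier", 13.4, 25.0, True, 2)
binder = CarrierCandidate("example-binder", 13.4, 25.0, True, 2, kd_human_protein_m=1e-9)
print(failed_criteria(ok))
print(failed_criteria(binder))
```

The optional criteria f) to l) could be added in the same way, one check per criterion, once the corresponding measurements are available.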
In a first aspect, the present technology relates to a molecule comprising at least one protein-based carrier building block, wherein the at least one protein-based carrier building block:
a) comprises at least one conjugation site or attachment point, preferably at least two attachment points or conjugation sites;
b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as from about 2.5 kDa to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa;
c) has a globular 3D structure;
d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at RT, preferably measured in a buffer or water at RT, more preferably in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%));
e) does not specifically bind to any human protein or binds one or more human proteins with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by surface plasmon resonance (SPR), for instance as described herein and/or in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to a virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214 (EVQLQASGGGLAQPGGSLRLSVTVSGSIDVINNMAWYRQAPGNARELVATITSGFSTNYASSVKGRFTISRDNAKKAVYLQMNSLKPEDTADYYSKVHLIRLGAARAYDYWGQGTQVTVS),
wherein the protein-based carrier building block comprises at least one nuclear localization sequence (NLS), covalently linked (directly or by means of a linker) to the at least one conjugation site or attachment point.
The at least one NLS attached to at least one attachment point or conjugation site comprised in the protein-based carrier building block is not particularly limited. The skilled person is aware of NLSs which may be comprised in the molecule of the present technology. Suitable examples comprise NLSs as described, e.g., in Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60, e.g., on Table 1, page 3 or page 4 of this document. In one embodiment, the at least one NLS comprises or consists of the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221), see, e.g., Dang CV and Lee WM., "Identification of the human c-myc protein nuclear translocation signal", Mol Cell Biol. 1988, 8(10):4048-5. In one embodiment, the at least one NLS comprises or consists of the SV40mono NLS (SEQ ID NO.: 256, PKKKRKV). In one embodiment, the at least one NLS comprises or consists of the SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV). In one embodiment, the at least one NLS comprises or consists of the NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD) (Ray et al. 2015, Bioconj. Chem. 26(6): 1004-1007, Quantitative tracking of protein trafficking to the nucleus using cytosolic protein delivery by nanoparticle-stabilized nanocapsules).
Preferably, in the molecule of the present technology, the at least one protein-based carrier building block does not specifically bind to any non-protein molecule, e.g., to any human non-protein molecule, such as human DNA, human RNA, human lipids or human glycans.
The molecule of the present technology may comprise more than one protein-based carrier building block, such as, e.g., two, three, four, five, six or more protein-based carrier building blocks. These protein-based carrier building blocks may be directly linked to each other, or linked to each other through a linker, as described herein.
Preferably, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises more than one conjugation site or attachment point, preferably at least two conjugation sites or attachment points, such as two conjugation sites or attachment points, or at least three conjugation sites or attachment points, such as three, four, five, six, seven, eight or nine conjugation sites or attachment points, which are preferably reactive groups present in the side chains of natural or non-natural amino acids comprised in the protein-based carrier building block, or which may be (additionally or alternatively) the N-terminal primary amine and/or the C-terminal carboxylic acid group of the protein-based building block.
The at least one NLS may be attached or conjugated to any of the conjugation sites present in the protein-based building block, directly or by means of a linker, as described herein. In one embodiment, the protein-based building block comprises one NLS covalently linked, by means of a peptide linker, to the C-terminal carboxylic acid group of the protein-based building block. In one embodiment, the protein-based building block comprises one NLS covalently linked, by means of a peptide linker, to another attachment point or conjugation site, such as a reactive group present in the side chain of a natural or non-natural amino acid comprised in the protein-based carrier building block. In another embodiment, the protein-based building block comprises more than one NLS covalently linked, by means of a peptide linker, to more than one attachment point or conjugation site present in the protein-based building block. For instance, one of the NLSs may be covalently linked, directly or by means of a peptide linker, to the C-terminal carboxylic acid group of the protein-based building block, and one or more NLSs may be covalently linked, directly or by means of a peptide linker, to other attachment points or conjugation sites, such as reactive groups present in the side chains of natural or non-natural amino acids comprised in the protein-based carrier building block and/or to the N-terminal primary amine of the protein-based building block.
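As a purely illustrative sketch of the embodiment in which one NLS is linked to the C-terminal carboxylic acid group by means of a peptide linker, the resulting linear sequence can be assembled as shown below. The NLS sequences are those quoted in the present description; the function name `fuse_c_terminal_nls` and the default GGGGS linker are assumptions chosen for illustration, not a linker prescribed herein.

```python
# NLS sequences quoted in the description (one-letter amino acid code).
NLS_SEQUENCES = {
    "cMyc": "PAAKRVKLD",                  # SEQ ID NO.: 221
    "SV40mono": "PKKKRKV",                # SEQ ID NO.: 256
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",   # SEQ ID NO.: 304
    "NLP": "AVKRPAATKKAGQAKKKKLD",        # SEQ ID NO.: 305
}

# SEQ ID NO.: 179 (RSV001A04), used here as an example building block sequence.
SEQ_179 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def fuse_c_terminal_nls(carrier: str, nls_name: str, linker: str = "GGGGS") -> str:
    """Return carrier + linker + NLS; pass linker="" for a direct link.

    The default linker is a hypothetical choice for this sketch.
    """
    if nls_name not in NLS_SEQUENCES:
        raise KeyError(f"unknown NLS: {nls_name}")
    return carrier + linker + NLS_SEQUENCES[nls_name]


construct = fuse_c_terminal_nls(SEQ_179, "SV40mono")
print(construct[-12:])  # linker followed by the SV40mono NLS
```

Attachment of an NLS to a side-chain conjugation site is a chemical conjugation rather than a linear fusion and is not captured by simple string concatenation.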
For instance, the at least one conjugation site or attachment point comprised in the at least one protein-based carrier building block is a free or capped thiol group, a free or capped hydroxyl group and/or a free or capped primary amine. In a further embodiment, the at least one conjugation site or attachment point comprised in the at least one protein-based carrier building block is a reactive group present in the side chain of a cysteine and/or in the side chain of a tyrosine, and/or in the side chain of a lysine, and/or in the side chain of an ornithine. In another further embodiment, the at least one protein-based building block comprises an N- and/or a C-terminal Cys and/or an N- and/or a C-terminal Tyr, preceded or followed by a (GG) or (G4S1)1-3GG sequence, such as CGG-, -GGC, YGG-, -GGY, -(G4S1)1-3GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-.
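The terminal tag patterns listed above can be spelled out as explicit sequences. The sketch below assumes that the repeat unit denotes GGGGS (four glycines and one serine) repeated one to three times; the helper names are hypothetical, and only the CGG-/YGG-, -GGC/-GGY, Y(G4S)nGG- and -(G4S)nGGY patterns are covered.

```python
G4S = "GGGGS"  # assumed reading of the repeat unit: four Gly followed by one Ser


def _check(residue: str, repeats: int) -> None:
    if residue not in ("C", "Y"):
        raise ValueError("terminal residue must be Cys (C) or Tyr (Y)")
    if not 0 <= repeats <= 3:
        raise ValueError("repeats must be between 0 and 3")


def n_terminal_tag(residue: str, repeats: int = 0) -> str:
    """CGG- / YGG- when repeats == 0, otherwise residue + (G4S)n + GG."""
    _check(residue, repeats)
    return residue + G4S * repeats + "GG"


def c_terminal_tag(residue: str, repeats: int = 0) -> str:
    """-GGC / -GGY when repeats == 0, otherwise (G4S)n + GG + residue."""
    _check(residue, repeats)
    return G4S * repeats + "GG" + residue


for n in range(0, 4):
    print(n_terminal_tag("Y", n), c_terminal_tag("Y", n))
```

A tagged building block would then be obtained by prepending the N-terminal tag or appending the C-terminal tag to the building block sequence.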
Preferably, the at least one protein-based carrier building block present in the molecule of the present technology is (i) a building block based on small globular non-human proteins, such as an ISVD-based building block, a DARPin-based building block, an affibody-based building block or an affitin-based building block or (ii) a building block based on small globular human proteins, such as cyclin-dependent kinase subunit 1 (CDK-1).
In one embodiment, the at least one protein-based building block is derived from a heavy chain ISVD, preferably from a VH or a VHH, including a camelized VH or humanized VHH. In another embodiment, the at least one protein-based building block is derived from an ISVD belonging to the "VH3 class", preferably wherein the resulting building block comprises at least one (preferably engineered) cysteine, at least one (preferably engineered) lysine, at least one non-natural amino acid and/or at least one (preferably engineered) tyrosine at one or more solvent-accessible positions of the protein-based building block.
In another embodiment, the at least one protein-based building block is derived from RSV001A04, SEQ ID NO.: 179.
SEQ ID NO.: 179:
EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTISRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS
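To illustrate how a sequence such as SEQ ID NO.: 179 relates to the size criterion (about 2.5 to about 70 kDa), the molecular mass of an unmodified polypeptide can be estimated from its sequence. The sketch below uses standard average residue masses and adds one water molecule for the free termini; the function name is hypothetical, and the estimate ignores disulfide bonds, post-translational modifications and any conjugated cargo.

```python
# Average residue masses in Da (amino acid minus one water), standard values.
AVERAGE_RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER_MASS = 18.02  # Da, accounts for the free N- and C-termini

SEQ_179 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def estimate_mass_kda(sequence: str) -> float:
    """Estimate the average molecular mass (kDa) of a linear, unmodified polypeptide."""
    residues = "".join(sequence.split()).upper()  # tolerate whitespace in listings
    total = sum(AVERAGE_RESIDUE_MASS[aa] for aa in residues) + WATER_MASS
    return total / 1000.0


mass = estimate_mass_kda(SEQ_179)
print(f"{len(SEQ_179)} residues, estimated mass {mass:.1f} kDa")
print("within 2.5 to 70 kDa:", 2.5 <= mass <= 70.0)
```

For a 126-residue ISVD such as this one, the estimate falls in the lower part of the claimed range, consistent with the preferred upper bound of about 16 kDa.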
In one embodiment, the at least one protein-based building block is an ISVD-based building block which comprises a Leu or a Gln, preferably a Leu at position 108, according to Kabat numbering, preferably wherein the ISVD-based building block comprises a Val or a Leu, preferably a Val at position 11 and/or a Val, a Thr or a Leu, preferably a Leu at position 89, according to Kabat numbering.
In another embodiment, the at least one protein-based building block comprises or, alternatively, consists of SEQ ID NO.: 186:
X1VX2LX3EX4X5GX6X7X8X9X10X11GX12X13X14IX15CX16AX17X18X19X20LX21X22X23VLGWFRX24AX25X26X27X28X29X30FVAAINX31X32X33X34X35X36X37X38PX39X40VX41X42X43FX44IX45X46X47X48X49X50X51TGX52LX53MX54X55LX56X57X58DX59AX60YX61CGAGX62PX63X64X65X66AYX67X68X69X70SYX71X72X73GX74X75TX76VX77VX78X79X80X81X82, wherein
X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
X8 (position 12 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X20: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X23: (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X24: (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X25: (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X26: (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X27: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X28: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29: (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X30: (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31: (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X32: (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X33: (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X35: (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X36: (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X37: (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X38: (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X39: (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X40: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X41: (position 64 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X42: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43: (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X44: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X45: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X46: (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X47: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X48: (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X49: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X50: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X51: (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52: (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X53: (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X54: (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X55: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X56: (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X57: (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X58: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X59: (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X60: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
X61: (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X62: (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X63: (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X64: (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X65: (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X66: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X67: (position 100d according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X68: (position 100e according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X69: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X70: (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X71: (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X72: (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X73: (position 103 according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X74: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X75: (position 106 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X76: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His, preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
X77: (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X78: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X79: (position 113 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X80: is absent or Gly;
X81: is absent or Gly;
X82: is absent or Cys,
or a sequence which has 80% or more identity with SEQ ID NO.: 186, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 186, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described herein.
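The consensus formula of SEQ ID NO.: 186 can be read as a pattern with fixed framework residues and variable positions X1 to X82. As an illustrative sketch only, the formula may be converted into a regular expression and tested against a concrete sequence such as SEQ ID NO.: 179. The simplifications are assumptions of this sketch: every variable position X1 to X79 is allowed to be any residue (the description restricts each to the stated residue or an amino acid with a reactive side chain), and X80 to X82 are modelled as optional Gly, Gly and Cys.

```python
import re

# Consensus formula of SEQ ID NO.: 186, split across lines for readability.
TEMPLATE_186 = (
    "X1VX2LX3EX4X5GX6X7X8X9X10X11GX12X13X14IX15CX16AX17X18X19X20LX21X22X23"
    "VLGWFRX24AX25X26X27X28X29X30FVAAINX31X32X33X34X35X36X37X38PX39X40V"
    "X41X42X43FX44IX45X46X47X48X49X50X51TGX52LX53MX54X55LX56X57X58DX59A"
    "X60YX61CGAGX62PX63X64X65X66AYX67X68X69X70SYX71X72X73GX74X75TX76V"
    "X77VX78X79X80X81X82"
)
# X80 and X81 are "absent or Gly", X82 is "absent or Cys".
OPTIONAL_TAIL = {"X80": "G?", "X81": "G?", "X82": "C?"}

SEQ_179 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def template_to_regex(template: str) -> "re.Pattern[str]":
    """Turn the X-numbered formula into a regex with literal framework residues."""
    tokens = re.findall(r"X\d+|[A-Z]", template)
    parts = []
    for token in tokens:
        if token in OPTIONAL_TAIL:
            parts.append(OPTIONAL_TAIL[token])
        elif token.startswith("X"):
            parts.append("[A-Z]")  # simplification: any residue at a variable position
        else:
            parts.append(token)    # fixed framework residue
    return re.compile("".join(parts))


pattern = template_to_regex(TEMPLATE_186)
print(bool(pattern.fullmatch(SEQ_179)))
print(bool(pattern.fullmatch(SEQ_179 + "GGC")))
```

A match of this kind only confirms that the fixed positions agree with the formula; it says nothing about the functional provisos (globular structure, solubility, size, absence of specific binding), which require experimental determination.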
In another embodiment, the at least one protein-based building block comprises or consists of SEQ ID NO.: 225:
EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS
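To make the relationship between SEQ ID NO.: 179 and SEQ ID NO.: 225 explicit, the two sequences can be compared position by position. The sketch below is a simplified, ungapped comparison that is only valid for sequences of equal length; sequence identity in general is determined after alignment. The function names are hypothetical, and the positions reported use sequential numbering, not Kabat numbering.

```python
from typing import List, Tuple

SEQ_179 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)
SEQ_225 = (
    "EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI"
    "SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS"
)


def substitutions(reference: str, variant: str) -> List[Tuple[int, str, str]]:
    """List (sequential position, reference residue, variant residue) for each mismatch."""
    if len(reference) != len(variant):
        raise ValueError("ungapped comparison requires sequences of equal length")
    return [
        (pos, ref, var)
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


def percent_identity(reference: str, variant: str) -> float:
    """Percentage of identical positions in an ungapped, equal-length comparison."""
    mismatches = len(substitutions(reference, variant))
    return 100.0 * (len(reference) - mismatches) / len(reference)


for pos, ref, var in substitutions(SEQ_179, SEQ_225):
    print(f"{ref}{pos}{var}")
print(f"{percent_identity(SEQ_179, SEQ_225):.1f}% identity")
```

In this comparison, SEQ ID NO.: 225 differs from SEQ ID NO.: 179 at five positions, four of which introduce a cysteine, in line with the use of engineered cysteines as conjugation sites.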
In another embodiment, the at least one protein-based building block is a DARPin-based building block, preferably derived from the DARPin K27 as defined in SEQ ID NO.: 187.
SEQ ID NO.: 187:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGRTPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKL
In one embodiment, the protein-based building block is a DARPin-based building block which comprises, or alternatively, consists of, SEQ ID NO.: 188:
X1X2GX3X4LLX5AAX6X7X8X9X10X11X12VX13X14LMX15X16X17AX18VX19AX20X21X22X23GX24TPLHLAAX25X26X27X28X29X30IVX31VLLX32X33X34AX35VX36AX37DX38X39GATPLHLAAX40X41X42X43X44X45IVX46VLLX47X48X49AX50VX51AX52DX53X54GATPLHX55AAX56X57X58X59X60X61IVX62X63LX64X65X66X67AX68X69X70AX71DX72X73X74X75TAX76X77ISX78X79X80X81X82X83X84LAX85X86LX87X88X89X90, wherein:
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X58 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X59 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X60 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X61 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X62 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X63 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X64 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X65 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X66 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X67 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X68 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X69 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X70 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X71 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X72 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X73 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X74 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X75 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X76 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X77 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X78 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X79 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X80 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X81 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X82 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X83 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X84 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X85 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X86 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X87 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X88 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X89 is absent or Leu;
X90 is absent or Cys;
or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described herein, in particular does not specifically bind to human KRAS protein (GTPase KRas, EC:3.6.5.2, primary accession number P01116, see also Lim S., et al., "Exquisitely specific anti-KRAS biodegraders inform on the cellular prevalence of nucleotide-loaded states", ACS Cent. Sci. 2021, 7, 2, 274-291).
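The size condition above can be checked numerically. The following is a minimal sketch, not part of the disclosure, that estimates the average molecular mass of the unmodified SEQ ID NO.: 187 chain from standard average residue masses and tests it against the broadest stated window (about 2.5 to about 70 kDa). The function names `average_mass_da` and `in_window` are illustrative assumptions, and conjugated cargo, tags or post-translational modifications are ignored.

```python
# Standard average residue masses in daltons (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water is added back for the free N- and C-termini

# SEQ ID NO.: 187 (DARPin K27) as printed in the text, line-wrap spaces removed.
SEQ_187 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGR"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFD"
    "ISIDNGNEDLAEILQKL"
)


def average_mass_da(sequence: str) -> float:
    """Average mass (Da) of an unmodified linear chain of canonical residues."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


def in_window(mass_da: float, low_kda: float = 2.5, high_kda: float = 70.0) -> bool:
    """True if the mass lies inside the given kDa window (illustrative helper)."""
    return low_kda * 1000 <= mass_da <= high_kda * 1000


mass = average_mass_da(SEQ_187)
print(f"{len(SEQ_187)} residues, {mass / 1000:.2f} kDa, in window: {in_window(mass)}")
```

The narrower preferred windows can be tested by passing other bounds, for example `in_window(mass, 2.5, 30.0)`.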
In another embodiment, the at least one protein-based building block is a small globular human protein-based building block, preferably derived from the polypeptide as defined in SEQ ID NO.: 190.
SEQ ID NO.: 190
SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK
In one embodiment, the protein-based building block is a small globular human protein-based building block which comprises, or alternatively, consists of, SEQ ID NO.: 191:
X1X2X3X4IX5X6SX7X8X9X10X11X12X13X14X15X16X17X18VX19LPX20X21X22AX23X24VX25X23bX24bX25bX26MX27X28X29X30WX31X32LX33VX34QX35X36X37WX38HX39X40X41X42X43X44X45X46X47ILLFX48X49X50X51X52X53X54X55X56X57, wherein:
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X23b can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24b can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X25b can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as from about 2.5 to about 50 kDa, such as from about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described herein.
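The relationship between the SEQ ID NO.: 191 template and its parent SEQ ID NO.: 190 can be illustrated with a short sketch. It splits the template into variable tokens (X1, X23b, and so on) and fixed one-letter residues, then reads off which parent residue occupies each variable slot. The template string used here is a repaired reading of the printed formula and is an assumption: garbled indices are read as X10, X11, X16 and X20, the character after X4 is read as Ile, and the second "X55" near the C-terminus is read as X56, which the list above defines. The helper names are hypothetical.

```python
import re

# Repaired reading of the SEQ ID NO.: 191 template (see assumptions in the lead-in).
TEMPLATE_191 = (
    "X1X2X3X4I" "X5X6S" "X7X8X9X10X11X12X13X14X15X16X17X18V" "X19LP"
    "X20X21X22A" "X23X24V" "X25X23bX24bX25bX26M" "X27X28X29X30W" "X31X32L"
    "X33V" "X34Q" "X35X36X37W" "X38H" "X39X40X41X42X43X44X45X46X47ILLF"
    "X48X49X50X51X52X53X54X55X56X57"
)

# SEQ ID NO.: 190 as printed in the text, line-wrap space removed.
SEQ_190 = (
    "SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFR"
    "RPLPKKPKK"
)


def tokenize(template: str) -> list[str]:
    """Split a template into variable tokens (X<n> or X<n>b) and fixed residues."""
    return re.findall(r"X\d+b?|[A-Z]", template)


def assign_slots(template: str, sequence: str) -> dict[str, str]:
    """Map each variable token to the residue found at that position.

    Raises ValueError if the lengths differ or a fixed position does not match.
    """
    tokens = tokenize(template)
    if len(tokens) != len(sequence):
        raise ValueError("template and sequence differ in length")
    slots = {}
    for position, (token, residue) in enumerate(zip(tokens, sequence), start=1):
        if len(token) > 1:            # variable slot such as X9 or X23b
            slots[token] = residue
        elif token != residue:        # fixed residue must be conserved
            raise ValueError(f"fixed residue mismatch at position {position}")
    return slots


slots = assign_slots(TEMPLATE_191, SEQ_190)
print(len(tokenize(TEMPLATE_191)), "positions,", len(slots), "variable slots")
```

With this reading, every fixed position of the template agrees with SEQ ID NO.: 190, and each variable slot holds the default residue named in the list (for example X9 is Tyr and X36 is Gln).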
In another embodiment, the at least one protein-based building block comprises or consists of 3C_hCKS1 (SEQ ID NO.: 101):
SHKQIYYSDKCDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSCGWVHYMIHEPEPHILLFRRPLPKKPKC
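As an illustration of how the cysteine-engineered variant relates to its precursor, the sketch below compares SEQ ID NO.: 101 with SEQ ID NO.: 190 position by position, lists the substitutions, and reports a simple percent identity. This ungapped, equal-length comparison is an assumption made for illustration; it is not necessarily the identity calculation intended elsewhere in the specification, which may rely on an alignment. Function names are hypothetical.

```python
# Sequences as printed in the text, with line-wrap spaces removed.
SEQ_190 = (
    "SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFR"
    "RPLPKKPKK"
)
SEQ_101 = (
    "SHKQIYYSDKCDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSCGWVHYMIHEPEPHILLFR"
    "RPLPKKPKC"
)


def substitutions(parent: str, variant: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, parent residue, variant residue) for each difference."""
    if len(parent) != len(variant):
        raise ValueError("this sketch only handles equal-length, ungapped sequences")
    return [
        (pos, p, v)
        for pos, (p, v) in enumerate(zip(parent, variant), start=1)
        if p != v
    ]


def percent_identity(a: str, b: str) -> float:
    """Identical positions divided by length, as a percentage (ungapped)."""
    return 100.0 * (len(a) - len(substitutions(a, b))) / len(a)


subs = substitutions(SEQ_190, SEQ_101)
print(", ".join(f"{p}{pos}{v}" for pos, p, v in subs))
print(f"{percent_identity(SEQ_190, SEQ_101):.2f}% identity")
```

For the two sequences as printed, the differences are three substitutions to cysteine (Y11C, Q51C and K78C), so 75 of 78 positions are identical, which is above the 95% threshold mentioned above.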
For instance, the at least one protein-based building block may be selected from SEQ ID NO.: 80-105, 175, 199, 208, 222-225.
In one embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least one NLS, as described herein, covalently linked to a conjugation site or attachment point comprised in the at least one protein-based building block. In a preferred embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least two NLS, as described herein, covalently linked to at least two conjugation sites or attachment points comprised in the at least one protein-based building block. More preferably, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least three NLS, as described herein, covalently linked to at least three conjugation sites or attachment points comprised in the at least one protein-based building block. In another preferred embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein and at least four NLS, as described herein, covalently linked to at least four conjugation sites or attachment points comprised in the at least one protein-based building block. The NLS are preferably as described, e.g., in Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60, e.g., in Table 1, page 3 or page 4 of this document. In one embodiment, the at least one NLS comprises or consists of the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221), see, e.g., Dang CV and Lee WM., "Identification of the human c-myc protein nuclear translocation signal", Mol Cell Biol. 1988, 8(10):4048-5. In one embodiment, the at least one NLS comprises or consists of the SV40mono NLS (SEQ ID NO.: 256, PKKKRKV). In one embodiment, the at least one NLS comprises or consists of the SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV).
In one embodiment, the at least one NLS comprises or consists of the NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
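The four NLS sequences named above can be compared with a small sketch. It collects them in a dictionary, counts the basic residues (Lys and Arg) in each, and checks that the SV40tri sequence is three tandem copies of the SV40mono sequence. The dictionary keys and helper names are illustrative assumptions, and the count of basic residues is only a descriptive statistic, not a criterion taken from the specification.

```python
# NLS sequences exactly as given in the text (SEQ ID NOs.: 221, 256, 304, 305).
NLS = {
    "cMyc": "PAAKRVKLD",
    "SV40mono": "PKKKRKV",
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",
    "NLP": "AVKRPAATKKAGQAKKKKLD",
}


def basic_residue_count(peptide: str) -> int:
    """Number of Lys (K) and Arg (R) residues in the peptide."""
    return sum(peptide.count(aa) for aa in "KR")


def is_tandem_repeat(peptide: str, unit: str, copies: int) -> bool:
    """True if the peptide is exactly `copies` back-to-back copies of `unit`."""
    return peptide == unit * copies


for name, seq in NLS.items():
    print(f"{name}: {len(seq)} residues, {basic_residue_count(seq)} basic (K/R)")

print("SV40tri is 3 x SV40mono:", is_tandem_repeat(NLS["SV40tri"], NLS["SV40mono"], 3))
```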
In another embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, and at least one cell-targeting moiety, as defined herein, attached to at least one attachment point or conjugation site.
In another embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, and at least one cell-penetrating peptide (CPP), preferably more than one CPP, such as two, three, four, five or more CPPs, as defined herein, attached to at least one, preferably to more than one, attachment points or conjugation sites.
In one embodiment, the at least one protein-based carrier building block and the at least one cell-targeting moiety and/or the at least one CPP are directly linked to each other. In a further embodiment, they are linked to each other through a peptide linker, preferably wherein the peptide linker is selected from the linkers depicted in Table A-1, such as SEQ ID NO.: 158-169 or 193-196, or 298, or GGG. Other linkers may be used, such as APN-maleimide linkers, as defined below and exemplified in the examples.
In one embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, (i) one or more NLS, as described herein, (ii) one or more cell-targeting moieties, as described herein and, optionally, (iii) one or more CPPs, as described herein.
In one embodiment, the molecule of the present technology comprises at least one protein-based carrier building block as defined herein, one or more NLS, as described herein, preferably one or more cell-targeting moieties, as described herein, preferably one or more CPPs, as described herein, and at least one further moiety or cargo attached to the attachment point or conjugation site, wherein the at least one further moiety or cargo is selected from:
a) a half-life extending (HLE) moiety, such as PEG, and/or an albumin binding ISVD;
b) a further targeting moiety, such as an EGFR-targeting moiety, e.g., GE11 peptide or an anti-EGFR ISVD (such as an anti-EGFR VHH); and/or other cell specific binding moieties;
c) a therapeutic moiety or precursor thereof, preferably a therapeutic moiety whose target is in the cell nucleus, e.g., a CDK inhibitor;
d) an imaging moiety, such as deferoxamine (DFO);
e) nucleic acids such as DNA or ASOs;
f) vitamins, such as folate;
g) a tumor-associated carbohydrate; and/or
h) a Toll-like receptor targeting moiety.
In a preferred embodiment, the at least one further cargo is at least one therapeutic moiety whose target is in the cell nucleus.
In one embodiment, the at least one further cargo is a half-life extending moiety. In one embodiment, the at least one half-life extending moiety is an albumin-binding ISVD, wherein the albumin-binding ISVD is preferably selected from SEQ ID NOs: 50-64 and 106, more preferably SEQ ID NO.: 63 or SEQ ID NO.: 106, or a sequence with at least 70%, preferably at least 80%, more preferably at least 90% and even more preferably at least 95% identity with SEQ ID NOs: 50-64 and/or 106. In a further embodiment, the at least one half-life extending moiety is a linear or branched polyethylene glycol moiety with a molecular weight of about 1-60 kDa, preferably with a weight of about 1-15 kDa, such as about 14 or 15 kDa, or of about 1-10 kDa, such as 5 or 10 kDa.
In one embodiment, the at least one protein-based carrier building block and the at least one further moiety or cargo are directly linked to each other. In a further embodiment, they are linked to each other through a peptide linker, preferably wherein the peptide linker is selected from the linkers depicted in Table A-1, such as SEQ ID NO.: 158-169 or 193-196, or 298, more preferably SEQ ID NO.: 163. Other linkers may be used, such as APN-maleimide linkers, as defined below and exemplified in the examples.
For instance, the molecule of the present technology may comprise any one of SEQ ID NOs.: 107-127, SEQ ID NOs.: 170-174, 176, 200 or 306.
SEQ ID NO: 306:
DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS
The molecule of the present technology may comprise SEQ ID NO.: 215. The molecule of the present technology may comprise:
SEQ ID NO.: 226 (T028501899:
A0315007E07(E1D,L11V,S76N,N82bS,E83R,V89L,H91Y,N101H)-15GS-
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGR FTISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYAD SVKGRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGG GGSGGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGP PNyECRFTISRDNAKNTGYLQMNCLAPDDTAyYYCGAGTPLNPGAYlYDWSYDYW PAAKRVKLP
The present technology also provides a nucleic acid encoding the molecule of the present technology (or part of the molecule of the present technology). In addition, the present technology provides a vector comprising the nucleic acid of the present technology, and a composition comprising the molecule of the present technology, such as a pharmaceutical composition.
Furthermore, the present technology relates to the molecule or composition of the present technology for use in medicine, in particular for use in the (prophylactic or therapeutic) treatment of diseases and/or disorders, such as autoimmune/inflammatory diseases, cancer and/or infectious diseases.
BRIEF DESCRIPTION OF THE FIGURES
Figure 1. Amino acid sequence of ISVD RSV001A04 (SEQ ID NO.: 179).
Figure 2. Amino acid sequence of K27m (without the C-terminal L), SEQ ID NO.: 68.
Figure 3. Amino acid sequence of the CKS1-building block precursor (SEQ ID NO.: 190).
Figure 4. Conjugation using an APN-maleimide 'bifunctional' linker. The carrier (protein-based building block comprised in the molecule) comprises at least one attachment point or conjugation site (represented as "-SH" in the figure). The APN-maleimide 'bifunctional' linker can be first attached to the conjugation site present in the carrier. Then, the cargo (represented as "DR5-SH" in the figure) can be attached to the other side of the APN- maleimide 'bifunctional' linker. Hence, the cargo has been attached or conjugated to the carrier through an APN-maleimide 'bifunctional' linker.
Figure 5. Conjugability check of cysteine-engineered ISVD-based carrier building blocks using mass spectrometry. Deconvoluted mass spectrum of Mal-APN conjugation onto the molecule T028100075, comprising an ISVD-based carrier building block with one attachment point or conjugation site ("ISVD179-APN", SEQ ID NO.: 176, Figure 5A) and onto the molecule T028100069, comprising an ISVD-based carrier building block with three attachment points or conjugation sites ("ISVD107-APN", SEQ ID NO.: 107, Figure 5B). Mass spectrometry analysis was carried out using electrospray ionization (ESI) with an online reverse phase column (RPC) for clean-up of the sample.
Figure 6. Non-reducing PAGE analysis of a partial CMA-1 uploaded CKS-based carrier.
Figure 7. Mass Spectrometry indicated the right mass with intact NLS.
Figure 8. Internalization of 7D12 EGFR ISVD in NCI-H226 (A) and BxPC-3 (B) cell lines, both of which express EGFR (not shown).
Figure 9. Evaluation of possible cytotoxic effects of CMA-1 on NCI-H226 (A) and BxPC-3 (B) cells using IncuCyte™ Cytotox Green Reagent.
Figure 10. Internalization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1. Blue = DAPI staining (nucleus) / Red = AF647 (ISVD compound).
Figure 11. Co-localization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 (AF647 in Cy5 channel) and endosomes using anti-EEA1 polyclonal antibody (AF488 in FITC channel). Blue = DAPI staining (nucleus) / Red = AF647 (ISVD compound) / Green = AF488 (endosomal marker) / Yellow = co-localization of AF647 + AF488.
Figure 12. Co-localization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 (AF647 in Cy5 channel) and nuclei (DAPI channel) using Z-stacks. Blue = DAPI staining (nucleus) / Red = AF647 (ISVD compound) / Pink = co-localization of AF647 + DAPI.
Figure 13. T023800001, EGFR007D12(Q1E,K3Q)-20GS-ALB11-GGC
DEFINITIONS
Unless indicated or defined otherwise, all terms used have their usual meaning in the art, which will be clear to the skilled person. Reference is, for example, made to the standard handbooks, such as Sambrook et al., 1989 (Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory Press), Ausubel et al., 1987 (Current protocols in molecular biology, Green Publishing and Wiley Interscience, New York), Lewin 1985 (Genes II, John Wiley & Sons, New York, N.Y.), Old et al., 1981 (Principles of Gene Manipulation: An Introduction to Genetic Engineering, 2nd Ed., University of California Press, Berkeley, CA), Roitt et al., 2001 (Immunology, 6th Ed., Mosby/Elsevier, Edinburgh), Roitt et al., 2001 (Roitt's Essential Immunology, 10th Ed., Blackwell Publishing, UK), and Janeway et al., 2005 (Immunobiology, 6th Ed., Garland Science Publishing/Churchill Livingstone, New York), as well as to the general background art cited herein.
Unless indicated otherwise, all methods, steps, techniques and manipulations that are not specifically described in detail herein can be performed and have been performed in a manner known per se, as will be clear to the skilled person. Reference is, for example, again made to the standard handbooks and the general background art mentioned herein and to the further references cited therein; as well as to for example the following reviews: Presta 2006 (Adv. Drug Deliv. Rev., 58: 640), Levin and Weiss 2006 (Mol. Biosyst., 2: 49), Irving et al., 2001 (J. Immunol. Methods, 248: 31), Schmitz et al., 2000 (Placenta 21 Suppl. A: S106), Gonzales et al., 2005 (Tumour Biol., 26: 31), which describe techniques for protein engineering, such as affinity maturation and other techniques for improving the specificity and other desired properties of proteins such as immunoglobulins.
It must be noted that as used herein, the singular forms "a", "an", and "the", include plural references unless the context clearly indicates otherwise. Thus, for example, reference to "a reagent" includes one or more of such different reagents and reference to "the method" includes reference to equivalent steps and methods known to those of ordinary skill in the art that could be modified or substituted for the methods described herein.
Unless otherwise indicated, the term "at least" preceding a series of elements is to be understood to refer to every element in the series. Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the technology described herein. Such equivalents are intended to be encompassed by the present technology.
The term "and/or" wherever used herein includes the meaning of "and", "or" and "all or any other combination of the elements connected by said term".
Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps. When used herein, the term "comprising" can be substituted with the term "containing" or "including" or sometimes, when used herein, with the term "having".
The term "sequence" as used herein (for example in terms like "immunoglobulin sequence", "antibody sequence", "variable domain sequence", "VHH sequence" or "protein sequence"), should generally be understood to include both the relevant amino acid sequence as well as nucleic acids or nucleotide sequences encoding the same, unless the context requires a more limited interpretation. Amino acid sequences are interpreted to mean a single amino acid or an unbranched sequence of two or more amino acids, depending on the context. Nucleotide sequences are interpreted to mean an unbranched sequence of 3 or more nucleotides.
It is understood that any reference to the amino acid sequences is meant to encompass post-translational modifications of these sequences occurring in mammalian cells such as CHO cells, including, but not limited to, N-glycosylation, O-glycosylation, deamidation, Asp isomerization/fragmentation, pyro-glutamate formation, removal of C-terminal lysine, and Met/Trp oxidation.
When a nucleotide sequence or amino acid sequence is said to "comprise" another nucleotide sequence or amino acid sequence, respectively, or to "essentially consist of" another nucleotide sequence or amino acid sequence, this may mean that the latter nucleotide sequence or amino acid sequence has been incorporated into the first mentioned nucleotide sequence or amino acid sequence, respectively, but more usually this generally means that the first mentioned nucleotide sequence or amino acid sequence comprises within its sequence a stretch of nucleotides or amino acid residues, respectively, that has the same nucleotide sequence or amino acid sequence, respectively, as the latter sequence, irrespective of how the first mentioned sequence has actually been generated or obtained (which may for example be by any suitable method described herein).
Amino acids are organic compounds that contain amino (-NH3+) and carboxylate (-CO2-) functional groups, along with a side chain (R group) specific to each amino acid. For instance, amino acids include those L-amino acids commonly found in naturally occurring proteins. "Amino acids", in the context of the present technology, also include D-amino acids and non-natural, unusual or unnatural amino acids, as described below. Amino acid residues will be indicated according to the standard three-letter or one-letter amino acid code. Reference is made to Table A-2 on page 48 of WO 08/020079. Examples of amino acids commonly found in proteins and represented in the genetic code are listed in Table 1 below. Other common amino acids (excluding those listed in Table 1 below) are described in the table on p. 624 of Pure & Appl. Chem., Vol. 56, No. 5, pp. 595-624, 1984, reproduced below as Table 2 for convenience.
Table 1: Common amino acids (IUPAC)
D-amino acids are also encompassed by the definition of "amino acid". As used herein, the term "D-amino acid" refers to amino acids where the stereogenic carbon alpha to the amino group has the D-configuration.
Unusual, unnatural or non-natural amino acids are also encompassed by the definition of "amino acid". As used herein, the term "unnatural amino acid" or "non-canonical amino acid" or "non-natural amino acid" or "novel amino acid" (or the like) refers to an amino acid that is not one of the twenty amino acids commonly found in peptides synthesized in nature, and known by the one letter abbreviations A, R, N, C, D, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y and V. Exemplary unnatural amino acids are described in Young et al., "Beyond the canonical 20 amino acids: expanding the genetic lexicon," J. of Biological Chemistry, 285(15): 11039-11044 (2010), the disclosure of which is incorporated herein by reference.
Alexander R. Nödling et al. ("Using genetically incorporated unnatural amino acids to control protein functions in mammalian cells", Essays Biochem, 3 July 2019; 63 (2): 237-266), the disclosure of which is incorporated herein by reference, provides an overview of unnatural amino acids that have been successfully incorporated into proteins in mammalian cells, see, e.g., Table 1 starting on p. 240.
Non-limiting examples of unnatural amino acids include: p-acetyl-phenylalanine, O-4-allyl-L-tyrosine, 4-propyl-L-tyrosine, L-Dopa, p-azido-phenylalanine, N6-(propargyloxy)-carbonyl-L-lysine (PrK), azido-lysine (N6-azidoethoxy-carbonyl-L-lysine, AzK). In some embodiments, the unnatural amino acid comprises a selective reactive group, or a reactive group for site-selective labeling or conjugation of a moiety or cargo. In some instances, the chemistry is a bioorthogonal reaction (e.g., biocompatible and selective reactions). In some cases, the chemistry is a Cu(I)-catalyzed or "copper-free" alkyne-azide triazole-forming reaction, the Staudinger ligation, inverse-electron-demand Diels-Alder (IEDDA) reaction, "photo-click" chemistry, or a metal-mediated process such as olefin metathesis and Suzuki-Miyaura or Sonogashira cross-coupling. For further examples of unnatural amino acids, we refer to WO 2021/072167, the disclosure of which is incorporated herein by reference.
The terms "protein", "peptide", "protein/peptide", and "polypeptide" are used interchangeably throughout the present disclosure, and each has the same meaning for purposes of this disclosure. Each term refers to an organic compound made of a linear chain of two or more amino acids. The compound may have ten or more amino acids; twenty-five or more amino acids; fifty or more amino acids; one hundred or more amino acids, two hundred or more amino acids, and even three hundred or more amino acids. The skilled artisan will appreciate that polypeptides generally comprise fewer amino acids than proteins, although there is no art-recognized cut-off point of the number of amino acids that distinguish a polypeptide and a protein; that polypeptides may be made by chemical synthesis or
recombinant methods; and that proteins are generally made in vitro or in vivo by recombinant methods as known in the art.
By convention, the amide bond in the primary structure of polypeptides is in the order that the amino acids are written, in which the amine end (N-terminus) of a polypeptide is always on the left, while the acid end (C-terminus) is on the right.
Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated using the symbols shown in Table 1 with the modified positions; e.g., hydroxylations or glycosylations, but these modifications shall not be shown explicitly in the amino acid sequence. Any peptide or protein that can be expressed as a sequence modified linkages, cross links and end caps, non-peptidyl bonds, etc., is embraced by this definition.
In the context of the present technology, the terms "specificity", "binding specifically" or "specific binding" refer to the number of different target molecules, such as antigens, to which a particular binding unit can bind with sufficiently high affinity (see below). "Specificity", "binding specifically" or "specific binding" are used interchangeably herein with "selectivity", "binding selectively" or "selective binding". Generally, binding units, such as binding ISVDs, specifically bind to their designated targets.
The specificity/selectivity of a binding unit can be determined based on affinity. The affinity denotes the strength or stability of a molecular interaction. The affinity is commonly given by the KD, or dissociation constant, which has units of mol/litre (or M). The affinity can also be expressed as an association constant, KA, which equals 1/KD and has units of (mol/litre)⁻¹ (or M⁻¹).
The affinity is a measure for the binding strength between a moiety and a binding site on a target molecule: the lower the value of the KD, the stronger the binding strength between a target molecule and a targeting moiety.
The KD-value characterizes the strength of a molecular interaction also in a thermodynamic sense as it is related to the change of free energy (ΔG) of binding by the well-known relation ΔG = RT·ln(KD) (equivalently ΔG = -RT·ln(KA)), where R equals the gas constant, T equals the absolute temperature and ln denotes the natural logarithm.
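By way of non-limiting illustration, the relation between KD and the free energy of binding may be evaluated numerically as sketched below. The function name, the default temperature of 298.15 K and the use of a 1 M standard state (so that KD can be entered as a plain number in mol/litre) are assumptions made for this example only.

```python
import math

# Molar gas constant in J/(mol*K).
R_GAS = 8.314462618


def binding_free_energy(kd_molar, temperature_k=298.15):
    """Return the free energy of binding in J/mol from a KD given in mol/litre.

    Uses the relation dG = R*T*ln(KD); KD is treated as dimensionless
    relative to an assumed 1 M standard state.
    """
    if kd_molar <= 0:
        raise ValueError("KD must be a positive number")
    return R_GAS * temperature_k * math.log(kd_molar)


# A lower KD (stronger binding) gives a more negative free energy of binding.
for kd in (1e-6, 1e-9, 1e-12):
    print(f"KD = {kd:.0e} M -> dG = {binding_free_energy(kd) / 1000:.1f} kJ/mol")
```

The sketch reflects the statement above that a lower KD corresponds to stronger binding: each 1000-fold decrease in KD lowers the computed free energy by the same increment.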
The KD may also be expressed as the ratio of the dissociation rate constant of a complex, denoted as koff, to the rate constant of its association, denoted kon (so that KD = koff/kon and KA = kon/koff). The off-rate koff has units s⁻¹ (where s is the SI unit notation of second). The on-rate kon has units M⁻¹s⁻¹. The on-rate may vary between 10² M⁻¹s⁻¹ and about 10⁷ M⁻¹s⁻¹, approaching the diffusion-limited association rate constant for bimolecular interactions. The off-rate is related to the half-life of a given molecular interaction by the relation t1/2 = ln(2)/koff. The off-rate may vary between 10⁻⁶ s⁻¹ (near irreversible complex with a t1/2 of multiple days) and 1 s⁻¹ (t1/2 = 0.69 s).
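The kinetic relations described above can be sketched as follows. This is a minimal example; the function names and the example rate constants are hypothetical and serve only to reproduce the half-life figures mentioned in the preceding paragraph.

```python
import math


def dissociation_constant(k_off, k_on):
    """KD in mol/litre from the off-rate (1/s) and the on-rate (1/(M*s))."""
    return k_off / k_on


def half_life_seconds(k_off):
    """Half-life of the complex in seconds, t1/2 = ln(2)/koff."""
    return math.log(2) / k_off


# Example values chosen for illustration only.
print(dissociation_constant(k_off=1e-3, k_on=1e6))   # KD in mol/litre
print(half_life_seconds(1.0))                        # fastest off-rate quoted above
print(half_life_seconds(1e-6) / 86400)               # slowest off-rate, in days
```

With an off-rate of 1 s⁻¹ the half-life is about 0.69 s, and with an off-rate of 10⁻⁶ s⁻¹ it is about eight days, consistent with the ranges given above.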
The measured KD may correspond to the apparent KD if the measuring process somehow influences the intrinsic binding affinity of the implied molecules, for example by artefacts related to the coating on the biosensor of one molecule. Also, an apparent KD may be measured if one molecule contains more than one recognition site for the other molecule or molecules. In such a situation the measured affinity may be affected by the avidity of the interaction between the two molecules.
The dissociation constant (KD) may be the actual or apparent dissociation constant, as will be clear to the skilled person. Methods for determining the KD will be clear to the skilled person, and for example include the techniques mentioned below. In this respect, it will also be clear that it may not be possible to measure dissociation constants of more than 10⁻⁴ moles/litre or 10⁻³ moles/litre (e.g., of 10⁻² moles/litre). Optionally, as will also be clear to the skilled person, the (actual or apparent) KD may be calculated on the basis of the (actual or apparent) association constant (KA), by means of the relationship KD = 1/KA, where KA = [AB]/([A]·[B]).
The term "about" used in the context of the parameters or parameter ranges provided herein shall have the following meanings. Unless indicated otherwise, where the term "about"
is applied to a particular value or to a range, the value or range is interpreted as being as accurate as the method used to measure it. If no error margins are specified in the application, the last decimal place of a numerical value indicates its degree of accuracy. Where no other error margins are given, the maximum margin is ascertained by applying the rounding-off convention to the last decimal place, e.g., for a pH value of about pH 2.7, the error margin is 2.65-2.74. However, for the following parameters, the specific margins shall apply: a temperature specified in °C with no decimal place shall have an error margin of ± 1°C (e.g., a temperature value of about 50°C means 50°C ± 1°C); a time indicated in hours shall have an error margin of 0.1 hours irrespective of the decimal places (e.g., a time value of about 1.0 hours means 1.0 hours ± 0.1 hours; a time value of about 0.5 hours means 0.5 hours ± 0.1 hours).
In the present application, any parameter indicated with the term "about" is also contemplated as being disclosed without the term "about". In other words, embodiments referring to a parameter value using the term "about" shall also describe an embodiment directed to the numerical value of said parameter as such. For example, an embodiment specifying a pH of "about pH 2.7" shall also disclose an embodiment specifying a pH of "pH 2.7" as such; an embodiment specifying a pH range of "between about pH 2.7 and about pH 2.1" shall also describe an embodiment specifying a pH range of "between pH 2.7 and pH 2.1", etc.
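The general rounding-off convention for "about" can be sketched as follows. This is an illustrative example only: the function name is hypothetical, the value is passed as text so that its last decimal place is preserved, and the specific margins stated above for temperatures and times are deliberately not covered.

```python
from decimal import Decimal


def about_margin(value_text):
    """Return (lower, upper) bounds implied by the rounding-off convention.

    The bounds are expressed with one more decimal place than the value,
    e.g. "2.7" gives 2.65 and 2.74, as in the pH example in the text.
    """
    value = Decimal(value_text)
    last_place = value.as_tuple().exponent       # -1 for "2.7"
    step = Decimal(1).scaleb(last_place - 1)     # 0.01 for "2.7"
    return value - 5 * step, value + 4 * step


lower, upper = about_margin("2.7")
print(lower, upper)
```

Passing the value as a string rather than a float is a design choice of this sketch: it keeps trailing zeros, so "1.0" and "1" yield different margins, in line with the statement that the last decimal place indicates the degree of accuracy.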
For the purposes of comparing two or more nucleotide sequences, the percentage of "sequence identity" between a first nucleotide sequence and a second nucleotide sequence may be calculated by dividing [the number of nucleotides in the first nucleotide sequence that are identical to the nucleotides at the corresponding positions in the second nucleotide sequence] by [the total number of nucleotides in the first nucleotide sequence] and multiplying by [100%], in which each deletion, insertion, substitution or addition of a nucleotide in the second nucleotide sequence - compared to the first nucleotide sequence - is considered as a difference at a single nucleotide (position). Alternatively, the degree of sequence identity between two or more nucleotide sequences may be calculated using a known computer algorithm for sequence alignment such as NCBI Blast v2.0, using standard settings. Some other techniques, computer algorithms and settings for determining the
degree of sequence identity are for example described in WO 04/037999, EP 0967284, EP 1085089, WO 00/55318, WO 00/78972, WO 98/49185 and GB 2357768. Usually, for the purpose of determining the percentage of "sequence identity" between two nucleotide sequences in accordance with the calculation method outlined hereinabove, the nucleotide sequence with the greatest number of nucleotides will be taken as the "first" nucleotide sequence, and the other nucleotide sequence will be taken as the "second" nucleotide sequence.
For the purposes of comparing two or more amino acid sequences, the percentage of "sequence identity" between a first amino acid sequence and a second amino acid sequence (also referred to herein as "amino acid identity") may be calculated by dividing [the number of amino acid residues in the first amino acid sequence that are identical to the amino acid residues at the corresponding positions in the second amino acid sequence] by [the total number of amino acid residues in the first amino acid sequence] and multiplying by [100%], in which each deletion, insertion, substitution or addition of an amino acid residue in the second amino acid sequence - compared to the first amino acid sequence - is considered as a difference at a single amino acid residue (position), i.e., as an "amino acid difference" as defined herein. Alternatively, the degree of sequence identity between two amino acid sequences may be calculated using a known computer algorithm, such as those mentioned above for determining the degree of sequence identity for nucleotide sequences, again using standard settings. Usually, for the purpose of determining the percentage of "sequence identity" between two amino acid sequences in accordance with the calculation method outlined hereinabove, the amino acid sequence with the greatest number of amino acid residues will be taken as the "first" amino acid sequence, and the other amino acid sequence will be taken as the "second" amino acid sequence.
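As a non-limiting sketch of the calculation method outlined above, the percentage of sequence identity may be computed as follows. The example assumes that the two sequences have already been aligned to equal length, that gaps are written as "-", and that the first sequence is the longer one; the function name and the example sequences are hypothetical. It is a simplification in that residues inserted in the second sequence opposite a gap in the first do not change the result.

```python
def percent_identity(aligned_first, aligned_second, gap="-"):
    """Percentage of residues in the first sequence that are identical to the
    residue at the corresponding position in the second sequence."""
    if len(aligned_first) != len(aligned_second):
        raise ValueError("sequences must be aligned to the same length")
    # Total number of residues in the first sequence (gaps excluded).
    first_length = sum(1 for residue in aligned_first if residue != gap)
    if first_length == 0:
        raise ValueError("first sequence contains no residues")
    # Identical residues at corresponding positions.
    matches = sum(
        1
        for a, b in zip(aligned_first, aligned_second)
        if a == b and a != gap
    )
    return 100.0 * matches / first_length


# One substitution (F -> Y) and one deletion (K) relative to the first sequence.
print(percent_identity("ACDEFGHIKL", "ACDEYGHI-L"))
```

The same function applies to aligned nucleotide sequences, since it only compares characters position by position.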
Also, in determining the degree of sequence identity between two amino acid sequences, the skilled person may take into account so-called "conservative" amino acid substitutions, which can generally be described as amino acid substitutions in which an amino acid residue is replaced with another amino acid residue of similar chemical structure and which has little or essentially no influence on the 3D structure, function, activity, or other biological properties of the polypeptide. Such conservative amino acid substitutions are well known in the art, for
example from WO 04/037999, GB 335768, WO 98/49185, WO 00/46383, and WO 01/09300; and (preferred) types and/or combinations of such substitutions may be selected on the basis of the pertinent teachings from WO 04/037999 as well as WO 98/49185 and from the further references cited therein.
Such conservative substitutions preferably are substitutions in which one amino acid within the following groups (a) - (e) is substituted by another amino acid residue within the same group: (a) small aliphatic, nonpolar or slightly polar residues: Ala, Ser, Thr, Pro and Gly; (b) polar, negatively charged residues and their (uncharged) amides: Asp, Asn, Glu and Gln; (c) polar, positively charged residues: His, Arg and Lys; (d) large aliphatic, nonpolar residues: Met, Leu, Ile, Val and Cys; and (e) aromatic residues: Phe, Tyr and Trp. Particularly preferred conservative substitutions are as follows: Ala into Gly or into Ser; Arg into Lys; Asn into Gln or into His; Asp into Glu; Cys into Ser; Gln into Asn; Glu into Asp; Gly into Ala or into Pro; His into Asn or into Gln; Ile into Leu or into Val; Leu into Ile or into Val; Lys into Arg, into Gln or into Glu; Met into Leu, into Tyr or into Ile; Phe into Met, into Leu or into Tyr; Ser into Thr; Thr into Ser; Trp into Tyr; Tyr into Trp; and/or Phe into Val, into Ile or into Leu.
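The grouping (a) - (e) above lends itself to a simple lookup. The following sketch, in which the dictionary and function names are hypothetical, checks whether a substitution stays within one of the five groups; it uses the standard three-letter codes and does not encode the list of particularly preferred substitutions.

```python
# Groups (a)-(e) as listed in the text, using standard three-letter codes.
CONSERVATIVE_GROUPS = {
    "a": {"Ala", "Ser", "Thr", "Pro", "Gly"},
    "b": {"Asp", "Asn", "Glu", "Gln"},
    "c": {"His", "Arg", "Lys"},
    "d": {"Met", "Leu", "Ile", "Val", "Cys"},
    "e": {"Phe", "Tyr", "Trp"},
}


def is_conservative(original, replacement):
    """True if both residues differ and belong to the same group (a)-(e)."""
    if original == replacement:
        return False  # not a substitution at all
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )


print(is_conservative("Leu", "Ile"))  # both in group (d)
print(is_conservative("Asp", "Lys"))  # groups (b) and (c)
```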
Amino acid sequences and nucleic acid sequences are said to be "exactly the same" if they have 100% sequence identity (as defined herein) over their entire length. When comparing two amino acid sequences, the term "amino acid difference" refers to an insertion, deletion or substitution of a single amino acid residue on a position of the first sequence, compared to the second sequence; it being understood that two amino acid sequences may contain one, two or more such amino acid differences.
According to the present description, "protein solubility" is a thermodynamic parameter defined as the concentration of protein in a saturated solution that is in equilibrium with a solid phase, either crystalline or amorphous, under a given set of conditions (see, e.g., Kramer RM. et al., "Toward a molecular understanding of protein solubility: increased negative surface charge correlates with increased solubility", Biophys J., 2012, 102(8):1907-15).
PROTEIN-BASED BUILDING BLOCK
The molecule of the present technology comprises at least one protein-based carrier building block (also referred to herein as "carrier building block", "protein-based building block", or simply "building block" or "carrier"), as defined herein. For instance, the molecule of the present technology may comprise a single protein-based carrier building block. In other embodiments, the molecule comprises more than one protein-based building block, such as two, three, four, five, six or more carrier building blocks. The protein-based carrier building block comprises (and, preferably, consists of) at least part of a protein or a whole structured protein, i.e., the protein-based carrier building block is preferably a polypeptide.
The protein-based carrier building block is designed as a "carrier" or "delivery" moiety, with at least one attachment point or conjugation site, preferably with at least two attachment points or conjugation sites, for conjugation or attachment of cargos, as defined in detail below. Suitable cargos include proteins, peptides, toxic payloads, nucleic acids, oligonucleotides, fluorophores, glycans, chelators for/and radio-isotopes, polyethylene glycol (PEG) molecules, vitamins (such as biotin or folate), etc. Specific non-limiting examples of suitable cargos are depicted below in the present description.
An attachment point or conjugation site, in the context of the present technology, refers to any group comprised in the protein-based building block which is suitable for attaching or conjugating a cargo to it. The attachment point or conjugation site is preferably present at a solvent-accessible position in the protein-based building block, as explained in detail below. An attachment point or conjugation site may be a reactive group present in the side chain of any amino acid in the protein-based carrier building block, preferably an amino acid present at a solvent-accessible position in the protein-based carrier building block, or may be the N-terminal primary amine, and/or the C-terminal carboxylic group of the protein-based building block. The attachment point/conjugation site allows the formation of a covalent bond with a group present in the cargo to be conjugated and/or attached to the protein-based carrier building block. In a preferred embodiment, the attachment point or conjugation site is a reactive group present in the side chain of an amino acid in the protein-based carrier building block, preferably present at a solvent-accessible position in the protein-based carrier building block, which allows the formation of a covalent bond with a group present in the cargo to be
conjugated and/or attached to the protein-based carrier building block. In another embodiment, two of the conjugation sites or attachment points of the protein-based building block are reactive groups present in the side chains of two amino acids present in the protein-based carrier building block, preferably two amino acids present at solvent-accessible positions in the protein-based carrier building block. In another embodiment, all of the conjugation sites or attachment points of the protein-based building block are reactive groups present in the side chains of amino acids present in the protein-based carrier building block, preferably amino acids present at solvent-accessible positions in the protein-based carrier building block.
Globular three dimensional (3D) structure
The protein-based carrier building block comprised in the molecule of the present technology has a globular three-dimensional (3D) structure, i.e., it is or comprises a structured protein with a globular 3D structure. Globular proteins have approximately spherical shape. Nearly all globular proteins contain substantial numbers of α-helices and/or β-sheets folded into a compact structure that is stabilized by both polar and nonpolar interactions. The globular 3D structure forms naturally and often involves interactions mediated by the side chains of the amino acids. Most often, the hydrophobic amino acid side chains are buried, closely packed, in the interior of a globular protein, out of contact with water. Hydrophilic amino acid side chains lie on the surface of the globular proteins exposed to the water. Consequently, globular proteins are usually very soluble in aqueous solutions (from "Gene Expression: Translation of the Genetic Code", Chang-Hui Shen, in Diagnostic Molecular Biology, 2019). In the context of the present technology, a protein or part of a protein with globular 3D structure can be defined as a protein or part of it which comprises at least one α-helix and/or at least one β-sheet as part of its secondary structure. From a simple sequence of amino acids to its final 3D structure, a protein passes through four levels of structuring known as primary, secondary, tertiary, and quaternary. At the end of these stages the protein begins to fold up into a stable 3D structure that will allow it to fulfil its proper function. Hence, the amino acid sequence of a protein is known as the "primary structure" of that protein. The "secondary structure" can be defined as the arrangement of a polypeptide chain into more or less regular hydrogen-bonded structures, and it has two basic elements:
o Alpha helix - spiral configuration of a polypeptide chain with 3.6 residues (amino acids) per turn. The helix may be left-handed or right-handed, and the latter is more common.
o Beta strand (or beta-sheet) - two adjacent polypeptide strands that are bonded together. Two or more strands may interact to form a beta sheet.
Finally, the "tertiary structure" can be defined as the level of protein structure at which an entire polypeptide chain has folded into a 3D structure. In multi-chain proteins, the term tertiary structure applies to the individual chains. See Smith, A.D., et al., eds. 1997, Oxford Dictionary of Biochemistry and Molecular Biology, New York: Oxford University Press.
The three-dimensional structure of a protein can be determined by techniques such as X-ray crystallography, nuclear magnetic resonance (NMR), cryo-electron microscopy (cryo-EM) or circular dichroism (CD). X-ray crystallography is a common technique used to determine 3D protein structure, but also NMR (suited for small proteins) and cryo-EM (suited for large proteins) can provide information about a protein's tertiary structure. Circular dichroism is an excellent method for rapidly evaluating the secondary structure, folding and binding properties of proteins, see, e.g., Jones, C. ("Circular dichroism of biopharmaceutical proteins in a quality-regulated environment", J Pharm Biomed Anal., 2022, 219:114945). Because the CD spectra of proteins are so dependent on their conformation, CD can be used to estimate the structure of unknown proteins and monitor conformational changes due to temperature, mutations, heat, denaturants or binding interactions. For instance, α-helical proteins have negative bands at 222 nm and 208 nm and a positive band at 193 nm. Proteins with well-defined antiparallel β-pleated sheets (β-helices) have negative bands at 218 nm and positive bands at 195 nm, while disordered proteins have very low ellipticity above 210 nm and negative bands near 195 nm. See Greenfield NJ., "Using circular dichroism spectra to estimate protein secondary structure", Nat Protoc., 2006, 1(6):2876-90 for further details.
Hence, the protein-based carrier building block comprised in the molecule of the present technology comprises at least one α-helix and/or at least one β-sheet as part of its secondary structure, preferably more than one α-helix and/or more than one β-sheet as part of its secondary structure, leading to a globular 3D tertiary structure. This allows the engineering
of site- and stereospecific conjugation sites or attachment points, as described in detail in this specification. The presence of at least one α-helix and/or at least one β-sheet in a certain polypeptide or protein can be determined by known techniques, as explained above, such as, e.g., CD.
Solubility
The protein-based carrier building block comprised in the molecule of the present technology is soluble. In the context of the present technology, a soluble building block means that the building block has a solubility of 10 mg/mL or more, preferably of 20 mg/mL or more, preferably of 50 mg/mL or more, and even more preferably of 100 mg/mL or more, measured in water or a suitable buffer or solvent (e.g., an aqueous solution, or a physiological buffer, such as a buffer which is amenable for parenteral administration) at room temperature (RT). In a preferred embodiment, the solubility of the protein-based carrier building block is measured in water or in a suitable buffer at RT, more preferably in a buffer such as citrate buffer (e.g., citrate buffer 5 mM) or PBS, at pH 7.0 or 7.4, at RT. Other preferred buffers which are suitable for measuring the solubility of the protein-based carrier building block are Dulbecco's phosphate buffered saline (DPBS, which is a balanced salt solution containing potassium chloride, monobasic potassium phosphate, sodium chloride, and dibasic sodium phosphate, e.g., 2.7 mM KCl, 1.5 mM KH2PO4, 136.9 mM NaCl, 8.9 mM Na2HPO4·7H2O, pH 7.0-7.3, commercially available from GIBCO (Nr. 14190-094)), preferably pH 7.0 or 7.3 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)).
The skilled person is aware of methods to measure the solubility of a protein. For instance, the supplementary material of Kramer RM. et al., "Toward a molecular understanding of protein solubility: increased negative surface charge correlates with increased solubility", Biophys J., 2012, 102(8):1907-15, describes solubility measurements of folded proteins.
Additionally or alternatively, solubility measurements can be performed as follows. The protein solution (e.g., in citrate buffer 5 mM, pH 7.0, or in PBS pH 7.4, or in water, or in any of the suitable buffers described above) is concentrated by ultrafiltration (e.g., via tangential flow filtration (TFF)) until some cloudiness appears in the solution. Then, the solution is spun at high speed or filtered through a 0.22 µm filter to remove any insoluble material, and the OD280 of the supernatant is measured. Using the molar extinction coefficient of the specific protein, the protein concentration of the supernatant (and, thus, the concentration of the protein in a saturated solution that is in equilibrium with a solid phase, i.e., the protein solubility) is obtained.
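The final conversion from absorbance to concentration can be sketched as follows, assuming the Beer-Lambert law (absorbance = extinction coefficient × path length × concentration). The function name, the default 1 cm path length, the optional dilution factor and all numerical values in the example are hypothetical and are given for illustration only.

```python
def solubility_mg_per_ml(od280, molar_extinction, molecular_mass_da,
                         path_length_cm=1.0, dilution_factor=1.0):
    """Protein concentration of the supernatant in mg/mL.

    od280             measured absorbance at 280 nm (of the diluted sample)
    molar_extinction  molar extinction coefficient in 1/(M*cm)
    molecular_mass_da molecular mass in Da (g/mol)
    dilution_factor   factor by which the supernatant was diluted before reading
    """
    if molar_extinction <= 0 or path_length_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    molar_concentration = od280 * dilution_factor / (molar_extinction * path_length_cm)
    # mol/litre multiplied by g/mol gives g/litre, which equals mg/mL.
    return molar_concentration * molecular_mass_da


# Hypothetical 15 kDa building block, supernatant diluted 100-fold before reading.
print(solubility_mg_per_ml(od280=0.5, molar_extinction=30000,
                           molecular_mass_da=15000, dilution_factor=100))
```

A dilution factor is included because a saturated supernatant will typically exceed the linear range of a spectrophotometer; whether and how the sample is diluted is not specified in the text above.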
For instance, in the context of the present technology, physiological buffers suitable for parenteral administration can include the following components: Glutamate, Tartrate, Lactate, Citrate, Malate, Gluconate, Ascorbate, Maleate, Phosphate, Succinate, Acetate, Bicarbonate, Aspartate, Histidine, Benzoate, Tromethamine, Diethanolamine, Ammonium or Glycine. The most common buffers used in parenteral formulations are based on histidine, citrate, phosphate, and acetate (see, e.g., Broadhead J, Gibson M., "Parenteral dosage forms", in: Gibson M., editor, "Pharmaceutical preformulation and formulation", New York: Informa healthcare; 2009, p. 325-47).
Preferably, the protein-based carrier building block comprised in the molecule of the present technology is soluble in reduced state, i.e., it is soluble when the -SH groups (e.g., in the side chain of one or more Cys) present at solvent-accessible positions in its amino acid sequence, if any, are in a reduced form (as "-SH"), and not oxidized. For instance, a protein-based carrier building block may be reduced when subjected to reducing conditions for enough time. For instance, reducing conditions may mean using beta-mercaptoethanol (2-ME), dithiothreitol (DTT) or TCEP (tris(2-carboxyethyl)phosphine).
Size (molecular mass)
The protein-based carrier building block comprised in the molecule of the present technology has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5, 5, 10, 15, 20, 25, 30, 35, 40, 50, 60, 65 or about 70 kDa. Preferably, the building block is a small building block with a size of about 2.5 to about 50 kDa, such as of about 2.5 to about less than 50 kDa, such
as about 2.5 to about 40 kDa, or about 2.5 to about 35 kDa, more preferably of about 2.5 to about 30 kDa, such as about 5 to about 30 kDa, or about 7 to about 30 kDa, or about 10 to about 30 kDa, or about 2.5 to about 25 kDa, or about 5 to about 25 kDa, or about 7 to about 25 kDa, or about 10 to about 25 kDa, or about 2.5 to about 20 kDa, or about 5 to about 20 kDa, or about 7 to about 20 kDa, or about 10 to about 20 kDa, or about 2.5 to about 18 kDa, or about 5 to about 18 kDa, or about 7 to about 18 kDa, or about 10 to about 18 kDa. More preferably, the building block comprised in the molecule of the present technology has a size of about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, or such as about 2.5, 3, 5, 6.5, 7, 10, 11, 12, 13, 14, 15 or 16 kDa. For instance, the protein-based building block may have a size (molecular mass) of about 6 kDa, or of about 7 kDa, or of about 15 kDa, or of about 16 kDa. In an even more preferred embodiment, the protein-based carrier building block has a size of about 15 kDa.
Non-functionality
The protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein. If the building block shows any interaction with one or more human proteins, such interaction is characterized by low specificity and/or low affinity, as defined herein.
For instance, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins or Fc-sensors. For instance, the protein-based carrier building block does not specifically bind C-type lectin receptors (CLRs). All antibodies possess two functional domains: one that confers antigen specificity, known as the antigen-binding fragment (Fab), and another that drives antibody function, known as the crystallizable fragment (Fc). The specific effector functions that are triggered by antibodies are determined by the receptors to which the antibody Fc domain binds and the specific innate immune cells on which these FcRs are expressed. These sensors include both classical FcRs and non-classical C-type lectin receptors (CLRs), see Lu, L. et al., "Beyond binding: antibody effector functions in infectious diseases", Nat Rev Immunol, 2018, 18, 46-61. Table 1 of Lu, L. et al. provides non-limiting examples of Fc domain sensors (e.g., Fcγ or FcRn) to which the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind. Consequently,
the protein-based carrier building block comprised in the molecule of the present technology does not show effector functions of conventional antibodies mediated by the Fc domain. In another embodiment, the protein-based carrier building block and/or the molecule does not specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins or Fc-sensors. For instance, the protein-based carrier building block and/or the molecule does not specifically bind C-type lectin receptors (CLRs). Hence, in one embodiment, none of the components comprised in the molecule of the present technology (e.g., at least one protein-based carrier building block and at least one NLS (cargo) attached or conjugated to it) specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins, Fc-sensors and/or CLRs. In another embodiment, the protein-based building block and/or the molecule of the present technology does not show effector functions of conventional antibodies mediated by the Fc domain, i.e., none of the components comprised in the molecule of the present technology show effector functions of conventional antibodies mediated by the Fc domain. In one embodiment, the molecule of the present technology does not include conventional VH-VL pairing/interaction and/or does not include CL-CH1 pairing such as CL-CH1 binding disulphide bridges.
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind the variable domain of the light chain (VL) and/or the variable domain of the heavy chain (VH) of an antibody, such as the VL and/or the VH of a monoclonal antibody (mAb). In another embodiment, the protein-based carrier building block does not specifically bind the first constant domain of the heavy chain (CH1) of an antibody, such as the CH1 of a mAb. In another embodiment, the protein-based carrier building block does not specifically bind the constant domain of the light chain (CL) of an antibody, such as the CL of a mAb. In another embodiment, the protein-based carrier building block does not specifically bind the third constant domain of the heavy chain (CH3) of an antibody, such as the CH3 of a mAb. In another embodiment, the protein-based carrier building block does not specifically bind the second constant domain of the heavy chain (CH2) of an antibody, such as the CH2 of a mAb. In one embodiment, the building block and/or the molecule of the present technology is not a Fab fragment from an antibody, such as from a mAb. In one embodiment, the building block and/or the molecule of the present technology is not a CH, preferably is not a CH1 fragment from an antibody, such as from a mAb. The
building block and/or the molecule of the present technology is not an antibody, such as a mAb, is not a Fc fragment, or a Fv fragment.
The protein-based carrier building block comprised in the molecule of the present technology may derive from a target-binding protein (such as an ISVD, a DARPin, an affibody or an affitin) (the "protein-based carrier building block precursor"). In the context of the present technology, a "protein-based carrier building block precursor" or "building block precursor" is a protein-based moiety which may be modified to generate the protein-based carrier building block comprised in the molecule of the present technology.
In the context of the present technology, the "protein-based carrier building block precursor" is a protein which is modified (e.g., by point mutations and/or by addition/deletion of amino acids to its sequence) to generate the protein-based carrier building block comprised in the molecule of the present technology. For instance, the "protein-based carrier building block precursor" is modified so that it no longer specifically binds any human protein, preferably so that it also does not specifically bind any (non-human) molecule (including non-human biomolecule) and/or any non-protein (human) molecule (including biomolecule), in particular any molecule (including biomolecule) to which the precursor specifically binds. In addition, if necessary, the "protein-based carrier building block precursor" is modified so that it incorporates one or more attachment points or conjugation sites as described herein. The "protein-based carrier building block precursor" has a sequence identity of at least 60%, such as at least 70%, or at least 75%, preferably of at least 80% with the protein-based carrier building block derived from it. For instance, the "protein-based carrier building block precursor" has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with the protein-based carrier building block derived from it. For instance, the "protein-based carrier building block precursor" may share the whole amino acid sequence with the protein-based carrier building block derived from it with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, twenty or more amino acids. 
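The sequence identity thresholds above can be illustrated with a short calculation. The following is a minimal sketch only: it assumes the two sequences have already been aligned pairwise (gaps written as "-") and that identity is counted as identical positions divided by alignment columns, which is one common convention and not necessarily the definition used elsewhere in this specification. The function names and the two toy sequences are hypothetical and are not taken from any SEQ ID NO.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: gaps are written as '-' and identity is defined as
    identical residues divided by the number of alignment columns
    (columns where both sequences carry a gap are ignored).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if the identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 20-residue toy sequences differing at two positions.
precursor = "EVQLVESGGGLVQPGGSLRL"
building_block = "EVQLVESGGGLVQAGGSLRC"

identity = percent_identity(precursor, building_block)
print(f"identity = {identity:.1f}%")
print("at least 80%:", meets_identity_threshold(precursor, building_block))
print("at least 95%:", meets_identity_threshold(precursor, building_block, 95.0))
```

In this toy case two substitutions in twenty residues give 90% identity, so the pair would satisfy the "at least 80%" and "at least 90%" levels but not the "at least 95%" level.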
Of course, the protein-based carrier building block derived from a protein-based carrier building block precursor has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, does not specifically bind any human protein and preferably does not specifically bind any protein or non-protein molecule to which the precursor specifically binds.
Preferably, the carrier building block comprised in the molecule of the present technology does also not specifically bind to any non-protein molecule (including non-protein biomolecules, such as nucleic acids, e.g., DNA and/or RNA, lipids (e.g., phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecule), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., phosphatidylserine (PS)) or human glycans, e.g., human glycolipids. In particular, preferably, the carrier building block does also not specifically bind to any non-protein molecule (including biomolecules) (such as nucleic acids such as DNA and/or RNA, lipids (e.g., phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecules), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., phosphatidylserine (PS)) or human glycans, e.g., human glycolipids, to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule (including biomolecules) or a non-human protein).
In another embodiment, the carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein or part thereof present on the surface of human cells. Preferably, the carrier building block comprised in the molecule of the present technology does not specifically bind to any human molecule (such as protein, lipid, sugar, etc.) or part thereof present on the surface of human cells.
In another preferred embodiment, the protein-based building block comprised in the molecule of the present technology does also not specifically bind to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule (including biomolecules)), or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule) with a KD value greater than 5x10^-4 mol/litre, as described herein. For example, if the precursor of the protein-based carrier building block is an anti-RSV (respiratory syncytial virus) ISVD (i.e., it specifically binds one or more proteins of RSV, such as protein F of RSV), the protein-based carrier building block derived from it preferably does not specifically bind those RSV proteins (or binds those proteins, such as protein F of RSV, preferably with a KD value greater than 5x10^-4 mol/litre, as described herein).
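The KD criterion above amounts to a simple numerical comparison against the stated threshold of 5x10^-4 mol/litre. The following is a minimal sketch of that comparison only; the function name and the example KD values are hypothetical, and the sketch does not reproduce the full definition of specific binding given elsewhere in this specification (for example, how the KD is measured).

```python
# Threshold stated in the text: a KD greater than 5x10^-4 mol/litre.
KD_THRESHOLD_MOL_PER_LITRE = 5e-4


def exceeds_kd_threshold(kd_mol_per_litre: float) -> bool:
    """True if a measured KD is greater than 5x10^-4 mol/litre.

    A larger KD means weaker binding, so values above the threshold
    correspond to the weak binding tolerated in the text.
    """
    if kd_mol_per_litre <= 0:
        raise ValueError("KD must be a positive concentration")
    return kd_mol_per_litre > KD_THRESHOLD_MOL_PER_LITRE


# Hypothetical example values (not measured data).
examples = {
    "precursor-like binder, KD = 1 nM": 1e-9,
    "derived building block, KD = 2 mM": 2e-3,
}
for label, kd in examples.items():
    print(label, "->", "above threshold" if exceeds_kd_threshold(kd) else "below threshold")
```

Because KD is a dissociation constant, the nanomolar example falls below the threshold (tight binding), while the millimolar example lies above it.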
For example, if the precursor of the protein-based carrier building block specifically binds to viruses (e.g., it is an anti-viral ISVD, an anti-viral DARPin, an anti-viral affitin, an anti-viral affibody, or the like) and/or to viral molecules (e.g., it specifically binds one or more viral biomolecules, e.g., viral proteins, viral nucleic acids, viral lipids or viral glycans), the protein-based carrier building block derived from it preferably does not specifically bind those viruses and/or viral molecules (or binds those viruses and/or viral molecules preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to viruses (e.g., it is an anti-viral ISVD, an anti-viral DARPin, an anti-viral affitin, an anti-viral affibody, or the like) and/or to viral molecules (e.g., it specifically binds one or more viral biomolecules, e.g., viral proteins, viral nucleic acids, viral lipids or viral glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of viruses which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: RSV, influenza virus, rabies virus, potyvirus, bacteriophage, rotavirus, HIV protein, Hepatitis B virus, Hepatitis C virus, norovirus, Shiga toxins from lambdoid prophages, Herpes simplex virus, Grapevine fanleaf virus (GFLV), Ebola, Middle East respiratory syndrome (MERS) virus, severe acute respiratory syndrome (SARS) virus, SARS-CoV-2, Vibrio or a White Spot Syndrome virus, cytomegalovirus, parvovirus, Zika virus, Chikungunya virus (CHIKV). Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to one or more of these viruses (or molecules, including biomolecules, comprised therein). The resulting protein-based building block may not specifically bind to the virus (or molecules, including biomolecules, comprised therein) to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these viruses (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to protozoa (a microorganism, unicellular eukaryote) (e.g., it is an anti-protozoa ISVD, an anti-protozoa DARPin, an anti-protozoa affitin, an anti-protozoa affibody, or the like) and/or to protozoa molecules (e.g., it specifically binds one or more protozoa biomolecules, e.g., protozoa proteins, protozoa nucleic acids, protozoa lipids or protozoa glycans), the protein-based carrier building block derived from it preferably does not specifically bind those protozoa and/or protozoa molecules (or binds those protozoa and/or protozoa molecules preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to protozoa (e.g., it is an anti-protozoa ISVD, an anti-protozoa DARPin, an anti-protozoa affitin, an anti-protozoa affibody, or the like) and/or to protozoa molecules (e.g., it specifically binds one or more protozoa biomolecules, e.g., protozoa proteins, protozoa nucleic acids, protozoa lipids or protozoa glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of protozoa and protozoa molecules to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: Trypanosoma evansi, Eimeria stiedae, Variant surface glycoprotein (VSG). Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to one or more of these protozoa (or molecules, including biomolecules, comprised therein). The resulting protein-based building block may not specifically bind to the protozoa (or molecules, including biomolecules, comprised therein) to which the precursor binds.
If the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these protozoa (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to mammalian proteins (e.g., it is an anti-mammalian protein ISVD, an anti-mammalian protein DARPin, an anti-mammalian protein affitin, an anti-mammalian protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those mammalian proteins (or binds those mammalian proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to mammalian proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. An example of a mammalian protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is bovine serum albumin. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to this mammalian protein. The resulting protein-based building block may not specifically bind to the mammalian protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this mammalian protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to avian proteins (e.g., it is an anti-avian protein ISVD, an anti-avian protein DARPin, an anti-avian protein affitin, an anti-avian protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those avian proteins (or binds those avian proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to avian proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. An example of an avian protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is ovalbumin (chicken). Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to this avian protein. The resulting protein-based building block may not specifically bind to the avian protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this avian protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to yeast and/or moulds proteins (e.g., it is an anti-yeast and/or anti-moulds protein ISVD, an anti-yeast and/or moulds protein DARPin, an anti-yeast and/or moulds protein affitin, an anti-yeast and/or moulds protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those yeast and/or moulds proteins (or binds those yeast and/or moulds proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to yeast and/or moulds proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of yeast and moulds proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are yeast extract, inactivated yeast, Candida. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to one or more of these yeast and/or moulds proteins. The resulting protein-based building block may not specifically bind to at least one of these yeast and/or moulds proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards these yeast and/or moulds proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to plant proteins (e.g., it is an anti-plant protein ISVD, an anti-plant protein DARPin, an anti-plant protein affitin, an anti-plant protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those plant proteins (or binds those plant proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to plant proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of plant proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are Starch Branching Enzyme II (maize), polyphenol, linoleic acid (sunflower, maize), plant seed. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to one or more of these plant proteins. The resulting protein-based building block may not specifically bind to at least one of these plant proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards these plant proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to fungi proteins (e.g., it is an anti-fungi protein ISVD, an anti-fungi protein DARPin, an anti-fungi protein affitin, an anti-fungi protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those fungi proteins (or binds those fungi proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to fungi proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of fungi proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are cutinase, chitin, fungus sphingolipids. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to at least one of these fungi proteins. The resulting protein-based building block may not specifically bind to the at least one of these fungi proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards at least one of these fungi proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to bacteria (e.g., it is an anti-bacterial ISVD, an anti-bacterial DARPin, an anti-bacterial affitin, an anti-bacterial affibody, or the like) and/or to bacterial molecules (e.g., it specifically binds one or more bacterial biomolecules, e.g., bacterial proteins, bacterial nucleic acids, bacterial lipids or bacterial glycans), the protein-based carrier building block derived from it preferably does not specifically bind those bacteria and/or bacterial molecules (or binds those bacteria and/or bacterial molecules preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to bacteria (e.g., it is an anti-bacterial ISVD, an anti-bacterial DARPin, an anti-bacterial affitin, an anti-bacterial affibody, or the like) and/or to bacterial molecules (e.g., it specifically binds one or more bacterial biomolecules, e.g., bacterial proteins, bacterial nucleic acids, bacterial lipids or bacterial glycans), as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of bacteria and bacterial molecules to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are the following: Beta-lactamase, tetanus toxin, Lactate Oxidase, Salmonella typhimurium, Helicobacter pylori, Mycobacterium tuberculosis, Clostridium difficile (toxin A and B), Pseudomonas aeruginosa, Bacillus anthracis, Botulinum Neurotoxin, Treponema pallidum, Chlamydia trachomatis, Escherichia coli, Campylobacter jejuni (flagella), Salmonella enterica, Bordetella pertussis (toxin), Shigella spp, Streptomyces venezuelae, chloramphenicol. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to one or more of these bacteria (or their molecules, including biomolecules). The resulting protein-based building block may not specifically bind to the bacteria (or their molecules, including biomolecules) to which the precursor binds.
If the protein-based building block comprised in the molecule of the present technology shows specific binding towards one or more of these bacteria (or molecules, including biomolecules, comprised therein), that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to non-human animal proteins, such as snake proteins (e.g., it is an anti-snake protein ISVD, an anti-snake protein DARPin, an anti-snake protein affitin, an anti-snake protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those snake proteins (or binds those snake proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to snake proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. An example of a snake protein to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind is cobra toxin. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to this snake protein. The resulting protein-based building block may not specifically bind to the snake protein to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards this snake protein, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to green fluorescent protein (GFP), which is a protein from jellyfish (sea jellies) and corals, sea anemones, zoanthids, copepods and lancelets (e.g., it is an anti-GFP ISVD, an anti-GFP DARPin, an anti-GFP affitin, an anti-GFP affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind GFP (or binds GFP preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to GFP, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to GFP. The resulting protein-based building block may not specifically bind to the GFP to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards GFP, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to insect proteins (e.g., it is an anti-insect protein ISVD, an anti-insect protein DARPin, an anti-insect protein affitin, an anti-insect protein affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind those insect proteins (or binds those insect proteins preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to insect proteins, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Examples of insect proteins to which the protein-based building block precursor (and/or protein-based building block comprised in the molecule of the present technology) may specifically bind are Androctonus australis hector toxins, chitin, chitin binding domain (CBD), V-ATPase subunit C, trehalase, cytochrome p450 monooxygenase, chitin deacetylase, chitin synthase and NPC1 sterol transporter. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to at least one of these insect proteins. The resulting protein-based building block may not specifically bind to the at least one of these insect proteins to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards at least one of these insect proteins, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
For example, if the precursor of the protein-based carrier building block specifically binds to chitin, which is a crustacean protein (e.g., it is an anti-chitin ISVD, an anti-chitin DARPin, an anti-chitin affitin, an anti-chitin affibody, or the like), the protein-based carrier building block derived from it preferably does not specifically bind chitin (or binds chitin preferably with a KD value greater than 5x10^-4 mol/litre, as described herein). In other embodiments, the protein-based carrier building block specifically binds to chitin, as its precursor does, but the specific binding is eliminated when at least a cargo is attached to the protein-based building block. Hence, in one embodiment, the protein-based building block precursor, e.g., an ISVD, may specifically bind to chitin. The resulting protein-based building block may not specifically bind to the chitin to which the precursor binds. If the protein-based building block comprised in the molecule of the present technology shows specific binding towards chitin, that specific binding as described herein is lost when at least a cargo is attached to the at least one conjugation site comprised therein.
Hence, preferably, the protein-based carrier building block does not specifically bind to the precursor's target, should the protein-based carrier building block precursor have a target and should this be a non-human molecule (including biomolecules), such as a non-human protein. Hence, in one embodiment, the at least one protein-based building block comprised in the molecule of the present technology does not specifically bind any RSV protein, such as protein F of RSV, or binds any RSV protein, such as protein F of RSV, with a KD (KD value) greater than 5x10^-4 mol/litre, as described herein. WO 2009/147248, Tables A-1 and A-2, provide examples of F-protein binding sequences. In one embodiment, the at least one protein-based building block comprised in the molecule of the present technology does not comprise/does not consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656. In another embodiment, the at least one protein-based building block comprised in the molecule of the present technology does not comprise/does not consist of the amino acid sequence as defined in SEQ ID NO.: 214.
In a further preferred embodiment, the protein-based building block comprised in the molecule of the present technology, when it has at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) attached to it (via at least one conjugation site or attachment point comprised therein) does not specifically bind to any molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with at least a cargo attached to it, preferably does not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule (including biomolecules)), or binds to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with a cargo attached to it, preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule) with a KD value greater than 5x10^-4 mol/litre, as described herein. Hence, in a preferred embodiment, if the protein-based carrier building block comprised in the molecule of the present technology shows any specific binding towards a specific target, such as towards a molecule (including biomolecules, e.g., human, non-human animal, plant, microbial, viral, etc.), or towards a cell (e.g., animal, human, plant cell), microorganisms, virus, etc., that specific binding is eliminated when at least a cargo is attached to at least one attachment point or conjugation site comprised in the protein-based building block. In this specific embodiment, the cargo attached to the protein-based building block may of course show specific binding towards a target (including biomolecules, as described herein), but the protein-based building block no longer specifically binds its target.
In a further preferred embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human or non-human (e.g., non-human animal, plant, yeast, etc.) cell and/or cell type (such as the ones exemplified in Example 6). If the building block shows any interaction with one or more human or non-human cells and/or cell types, such interaction is characterized by low specificity and/or low affinity, as defined herein. In particular, preferably, the carrier building block does also not specifically bind to any human or non-human cell and/or cell type to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule or a protein present on the surface of a human cell). The lack of binding to any human or non-human cell and/or cell type can for example be assessed with the "cell binding assay" as described below (see also, e.g., Hunter SA and Cochran JR, "Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations", Methods Enzymol., 2016, 580:21-44).
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any microorganisms such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules). If the building block shows any interaction with one or more microorganisms and/or virus, or with any microbial or viral molecule (including biomolecules), such interaction is characterized by low specificity and/or low affinity, as defined herein. In particular, preferably, the carrier building block does also not specifically bind to any microorganism and/or virus (or to any microbial or viral molecule (including biomolecules)) to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a virus, a microorganism, a non-protein molecule (including biomolecules) or a protein present on the surface of a microorganism and/or virus). The lack of binding to any microorganism, or virus, or microbial molecule, or viral molecule can for example be assessed with the "cell binding assay" and/or SPR as described herein.
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any microorganisms such as bacteria,
fungi, protists, yeast and/or virus (and/or to any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.) when it has at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) attached or conjugated to it (via at least one attachment point or conjugation site comprised therein). If the building block comprising the cargo attached to it shows any interaction with one or more microorganisms and/or virus (or with any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.), such interaction is characterized by low specificity and/or low affinity, as defined herein. In particular, preferably, the carrier building block does also not specifically bind to any microorganism and/or virus (or to any microbial or viral molecule or biomolecule, such as microbial or viral proteins, nucleic acids, lipids, glycans, etc.) to which the protein-based carrier building block precursor specifically binds when the protein-based carrier building block has at least one cargo attached or conjugated to it (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule or biomolecule or a protein present on the surface of a microorganism and/or virus, or present in the microorganism or virus, when it has at least one cargo attached or conjugated to it). For instance, the protein-based building block comprised in the molecule of the present technology does not specifically bind to any viruses and/or viral proteins, such as RSV and/or one or more proteins of RSV, such as protein F of RSV, when the building block has at least one cargo attached or conjugated to it.
Hence, for instance, the protein-based carrier building block (e.g., a DARPin-based carrier building block) may show specific binding towards a microorganism and/or a virus, such as RSV and/or RSV proteins, such as protein F of RSV, but the specific binding (as defined herein), if any, is lost when at least one cargo is attached or conjugated to the protein-based building block.
The lack of specific binding to any microorganism can for example be assessed with the "cell binding assay" as described herein. The lack of specific binding to viruses, microbial and/or viral molecules or biomolecules can, for example, be assessed by surface plasmon resonance, as described herein.
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any molecule, including biomolecules,
including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any human and/or non-human animal biomolecule (e.g., human and/or non-human animal proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any human and/or non-human animal biomolecule with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any bacterial molecule (including bacterial biomolecules, e.g., bacterial proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any bacterial molecule, as defined above, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any viral molecule (including biomolecules, e.g., viral proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any viral molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
For instance, the protein-based carrier building block does not specifically bind to any fungal molecule (including biomolecules, e.g., fungal proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any fungal molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any yeast molecule (including biomolecules, e.g., yeast proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any yeast molecule, as described herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any plant molecule (including biomolecules, e.g., plant proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as
phosphatidylserine (PS)) or glycans), or binds to any plant molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any mammalian molecule (including mammalian biomolecules, e.g., mammalian proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), or binds to any mammalian molecule, as defined herein, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
In the context of the present technology, the term "biomolecule" or "biological molecule" refers to molecules present in organisms, including animals, plants, microorganisms that play a role in one or more biological processes, such as cell division, morphogenesis, or development. Biomolecules are the building blocks of life and perform important functions in living organisms. Biomolecules include the primary metabolites which are large macromolecules such as proteins, carbohydrates (glycans), lipids (e.g., such as PS), and nucleic acids (such as DNA, RNA), as well as small molecules such as vitamins and hormones. The four major types of biomolecules are carbohydrates (glycans), lipids, nucleic acids, and proteins.
In a further preferred embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind any non-human protein and/or any non-protein molecule (including biomolecule) when at least one cargo (such as a "model cargo", e.g. a maleimide-modified alanine) is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, preferably it does not specifically bind any non-human protein and/or any non-protein molecule (including biomolecule) to which the protein-based carrier building block precursor specifically binds, or binds to them with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
Hence, in one embodiment, the present technology provides a molecule comprising at least one protein-based carrier building block as described in the present technology, wherein the protein-based carrier building block has at least a cargo (such as a "model cargo", e.g. a maleimide-modified alanine or a NLS) attached or conjugated to it (via at least one attachment point or conjugation site comprised in the protein-based carrier building block), and wherein the protein-based carrier building block does not specifically bind to any molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.). Hence, in one
embodiment, the protein-based building block, when at least one cargo is conjugated to it, loses its target binding specificity. For instance, the protein-based building block, comprising a cargo attached to it, does not specifically bind to any molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.) which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block, with at least one cargo attached to it, preferably does not specifically bind to the precursor's target), or binds to any (non-human) molecule (including biomolecules) and/or organisms (such as cells, microorganisms, virus, etc.) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block, with a cargo attached to it, preferably does also not specifically bind to the precursor's target) with a KD value greater than 5×10⁻⁴ mol/litre, as described herein.
The skilled person is aware of means for reducing and/or eliminating specific binding of a protein-based carrier building block precursor to proteins and/or non-protein molecules (including biomolecules). For instance, mutations may be introduced in the amino acid sequence of the precursor building block so that it no longer specifically binds to human proteins, or to any non-human protein, or to non-protein molecules (including biomolecules), or binds to them with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein.
The affinity of a molecular interaction between two molecules (e.g., of two biomolecules) can be measured via different techniques known per se, such as the well-known surface plasmon resonance (SPR) biosensor technique (see for example Ober et al. 2001, Intern. Immunology 13: 1551-1559, in particular section "Surface plasmon resonance (SPR) experiments" starting on p. 1552, which describes conditions for measuring the affinity of a molecular interaction between two molecules, or the explanations provided herein in this description). The term "surface plasmon resonance", as used herein, refers to an optical phenomenon that allows for the analysis of real-time biospecific interactions by detection of alterations in protein concentrations within a biosensor matrix, where one molecule is immobilized on the biosensor chip and the other molecule is passed over the immobilized molecule under flow conditions, yielding kon and koff measurements and hence KD (or KA) values. This can for example be performed using the well-known BIAcore® system (BIAcore International AB, a Cytiva lifesciences company, Uppsala, Sweden and Piscataway, NJ). For further descriptions, see
Jonsson et al. (1993, Ann. Biol. Clin. 51: 19-26), Jonsson et al. (1991 Biotechniques 11: 620-627), Johnsson et al. (1995, J. Mol. Recognit. 8: 125-131), and Johnsson et al. (1991, Anal. Biochem. 198: 268-277). For instance, the affinity (KD) of a molecular interaction between two molecules can be determined via SPR on a ProteOn XPR36 instrument (Bio-Rad Laboratories). The experiment can be performed at 25°C, and PBS pH 7.4 containing 0.005% Tween 20 (Bio-Rad Laboratories) can be used as assay buffer. Targets such as human proteins or non-protein molecules (biomolecules), or non-human biomolecules, as described herein, such as nucleic acids (e.g., DNA, RNA), lipids (e.g., such as phosphatidylserine (PS)) or glycans can be immobilized onto different ligand lanes of a GLC sensor chip (Bio-Rad Laboratories), e.g., with the ProteOn Amine Coupling Kit (Bio-Rad Laboratories) according to the manufacturer's instructions. The protein-based building block can be captured on the target immobilized ligand lanes. One ligand lane can serve as a reference surface, on which no protein-based building block is captured. Different concentrations (e.g., ranging from 300 nM to 1.2 nM) diluted in running buffer can be flowed over the respective protein-based building blocks and reference surface in multi-cycle kinetics for 2 minutes, followed by a constant flow of the assay buffer for 15 minutes. Between the different injections, the surfaces can be regenerated with 3 M MgCl2 (Cytiva), or with 10 mM Glycine pH 1.5 (Cytiva). Several buffer blanks can be injected for double referencing. Data can be analyzed, e.g., with the ProteOn Manager 3.1.0 software (Bio-Rad Laboratories). The kinetic rate constants (ka and kd) can be calculated by fitting the sensorgrams via the Langmuir 1:1 interaction ligand binding model. The equilibrium dissociation constant KD can be calculated as the kd/ka ratio.
See also, e.g., https://nicoyalife.com/wp- content/uploads/2023/02/characterization-of-lnfl uenza-using-Alto.pdf.
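The relationship between the fitted rate constants and KD described above can be illustrated with a minimal sketch. It assumes the standard 1:1 Langmuir binding equations; the function names, the maximal response and the rate constants used in any call are hypothetical illustration values and are not taken from the present description or from the ProteOn Manager software. Only the 2-minute association phase and the 15-minute dissociation phase correspond to the exemplary conditions given above.

```python
import math


def equilibrium_dissociation_constant(ka: float, kd: float) -> float:
    """Return KD in mol/litre as the kd/ka ratio.

    ka: association rate constant in litres/(mol*s)
    kd: dissociation rate constant in 1/s
    """
    if ka <= 0 or kd < 0:
        raise ValueError("ka must be positive and kd must be non-negative")
    return kd / ka


def langmuir_response(t: float, conc: float, ka: float, kd: float,
                      rmax: float, t_assoc: float = 120.0) -> float:
    """Simulated sensorgram response at time t (seconds) for a 1:1 interaction.

    conc is the analyte concentration in mol/litre, rmax the (hypothetical)
    maximal response, t_assoc the length of the association phase
    (120 s = 2 minutes in the exemplary protocol).
    """
    k_obs = ka * conc + kd              # observed rate during association
    r_eq = rmax * ka * conc / k_obs     # plateau response at this concentration
    if t <= t_assoc:
        return r_eq * (1.0 - math.exp(-k_obs * t))
    # Dissociation phase: buffer only, exponential decay with rate kd
    r_end = r_eq * (1.0 - math.exp(-k_obs * t_assoc))
    return r_end * math.exp(-kd * (t - t_assoc))
```

For example, hypothetical rate constants ka = 1e5 litres/(mol*s) and kd = 1e-3 1/s passed to `equilibrium_dissociation_constant` correspond to a KD of about 1e-8 mol/litre. In practice, ka and kd are obtained by fitting measured sensorgrams to this model rather than by simulation.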
Another well-known biosensor technique to determine affinities of biomolecular interactions is bio-layer interferometry (BLI) (see for example Abdiche et al. 2008, Anal. Biochem. 377: 209-217). The term "bio-layer interferometry" or "BLI", as used herein, refers to a label-free optical technique that analyzes the interference pattern of light reflected from two surfaces: an internal reference layer (reference beam) and a layer of immobilized protein on the biosensor tip (signal beam). A change in the number of molecules bound to the tip of the biosensor causes a shift in the interference pattern, reported as a wavelength shift (nm), the magnitude of which is a direct measure of the number of molecules bound to the biosensor tip surface.
Since the interactions can be measured in real-time, association and dissociation rates and affinities can be determined. BLI can for example be performed using the well-known Octet® Systems (ForteBio, a division of Pall Life Sciences, Menlo Park, USA).
Alternatively, affinities can be measured in a Kinetic Exclusion Assay (KinExA) (see for example Drake et al., "Characterizing high-affinity antigen/antibody complexes by kinetic- and equilibrium-based methods", Anal. Biochem., 2004, 328: 35-43), using the KinExA® platform (Sapidyne Instruments Inc, Boise, USA). The term "KinExA", as used herein, refers to a solution-based method to measure true equilibrium binding affinity and kinetics of unmodified molecules. Equilibrated solutions of a binding unit/target complex, such as an antibody/antigen complex, are passed over a column with beads precoated with antigen (or antibody), allowing the free antibody (or antigen) to bind to the coated molecule. Detection of the antibody (or antigen) thus captured is accomplished with a fluorescently labeled protein binding the antibody (or antigen).
Further, the GYROLAB® immunoassay system provides a platform for automated bioanalysis and rapid sample turnaround (Fraley et al., "The Gyrolab™ immunoassay system: a platform for automated bioanalysis and rapid sample turnaround", Bioanalysis 2013, 5: 1765-74).
Further, the affinity of a molecular interaction between two molecules (e.g., between two biomolecules such as between two proteins), or between one biomolecule such as one protein and one cell, can be measured using flow cytometry to analyze ligand binding to antigens such as proteins, lipids (e.g., such as phosphatidylserine (PS)), sugars, etc. presented on the surface of a cell ("cell binding assay"). The skilled person is familiar with cell-binding assays to determine the affinity of a certain soluble molecule (such as the molecule of the present technology) and a binding partner present on the surface of a cell, such as a human cell. For instance, Hunter S. A. and Cochran J. R. ("Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations", Methods Enzymol. 2016, 580: 21-44), present a practical guide for measuring binding events between soluble ligands and binding partners expressed on the surface of, inter alia, mammalian cells. For instance, as shown in the examples, the cell binding assay can be carried out as follows:
a. Adding a fixed number of (human or non-human) cells to a 96-well V-bottom plate (e.g., 50 µL human cell suspension (e.g., 5E+04/96-well)), or to tubes, such as Eppendorf tubes, in cold fluorescence-activated cell sorting (FACS) buffer (e.g., consisting of D-PBS, 2% heat inactivated fetal bovine serum (HI FBS) and 0.05% Sodium Azide);
b. Optionally performing a washing step;
c. Adding the soluble molecule, e.g., the molecule of the present technology, or the protein-based building block, preferably marked, such as with a fluorescent label or an epitope tag, and incubating for a certain amount of time, until the reaction has come to equilibrium, generally a number of hours, e.g., for about 3h, at low temperature, typically 4°C while shaking;
d. Evaluating the binding of the soluble molecule to the human cells by flow cytometry, e.g., by FACS.
Hence, the cell-binding assay can be performed by adding a number of cells (human or non-human, such as non-human animal, plant, microorganisms, etc.) to a container (e.g., 96-well V-bottom plate or tubes, as described above), preferably in a physiological buffer, adding the molecule whose binding is to be assessed (e.g., the molecule of the present technology, or the protein-based building block), which is preferably marked, and incubating it with the cells for an amount of time, e.g., until the reaction has come to equilibrium, generally a number of hours, e.g., for about 3h, preferably at low temperature, typically 4°C, preferably while shaking, and finally evaluating the binding of the soluble molecule to the cells by flow cytometry, e.g., by FACS.
In order to calculate the binding affinities (e.g., KD and/or EC50) between a soluble molecule, e.g., the molecule of the present technology, and a human cell, the soluble molecule in step c. above may be added to each well/tube at varying concentrations, spanning two orders of magnitude above and below the anticipated KD and/or EC50. Binding values can be determined from the average signal value (e.g., average fluorescence value) of each sample, plotting the fraction bound vs. ligand concentration (log scale) and fitting a sigmoidal curve using nonlinear regression analysis. The ligand concentration at half the fraction bound, also referred to as EC50, will be a first approximation of the equilibrium dissociation constant (KD).
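The titration and curve-fitting procedure described above can be sketched as follows. This is an illustrative sketch only: it assumes a simple one-site binding model (fraction bound = c / (c + EC50)), which gives a sigmoidal curve on a logarithmic concentration axis, and it uses a plain grid search over log10(EC50) in place of a dedicated nonlinear regression routine. The function names, the number of points per decade and the search range are hypothetical choices, not part of the described assay.

```python
def dilution_series(anticipated_kd: float, decades: int = 2,
                    points_per_decade: int = 2) -> list[float]:
    """Log-spaced concentrations (mol/litre) spanning `decades` orders of
    magnitude below and above the anticipated KD and/or EC50."""
    n_steps = 2 * decades * points_per_decade
    return [anticipated_kd * 10 ** (-decades + i / points_per_decade)
            for i in range(n_steps + 1)]


def fraction_bound(conc: float, ec50: float) -> float:
    """One-site binding model; equals 0.5 when conc == ec50."""
    return conc / (conc + ec50)


def fit_ec50(concs: list[float], fractions: list[float],
             lo_exp: float = -12.0, hi_exp: float = 0.0,
             steps: int = 1201) -> float:
    """Least-squares estimate of EC50 by grid search over log10(EC50).

    `fractions` are normalised signals (e.g., average fluorescence scaled
    to the 0..1 range), one per concentration in `concs`.
    """
    best_sse, best_ec50 = float("inf"), 10 ** lo_exp
    for i in range(steps):
        ec50 = 10 ** (lo_exp + (hi_exp - lo_exp) * i / (steps - 1))
        sse = sum((fraction_bound(c, ec50) - f) ** 2
                  for c, f in zip(concs, fractions))
        if sse < best_sse:
            best_sse, best_ec50 = sse, ec50
    return best_ec50
```

With an anticipated KD of 1e-7 mol/litre, `dilution_series(1e-7)` yields nine concentrations from about 1e-9 to about 1e-5 mol/litre. The EC50 returned by `fit_ec50` is, as stated above, only a first approximation of the equilibrium dissociation constant.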
The skilled person is able to determine whether a molecule is able to specifically bind human proteins, as defined in the context of the present technology. For instance, the skilled person may make use of commercially available protein arrays to determine the binding affinity of a certain molecule (protein) towards human proteins. For instance, the skilled person may make use of the commercially available Proteome Profiler™ Antibody Arrays, which allow for the semiquantitative measurement of more than 100 proteins in a single sample. Alternatively or additionally, the skilled person may make use of, for example, the HuProt™ assay, such as the version v4.0, which consists of >21,000 unique human proteins, isoform variants, and protein fragments, covering 16,794 unique genes. This includes 15,889 of the 19,613 canonical human proteins described in the Human Protein Atlas, with broad coverage across protein subclasses. The skilled person can also use commercially available cell arrays, such as human, non-human animal, plant, bacteria, yeast, etc. arrays to determine the binding affinity (e.g., KD and/or EC50) of a certain molecule (protein) towards human cells. See also, e.g., Example 6.
Similarly, the skilled person is able to determine whether a molecule is able to specifically bind a non-human protein, such as a bacterial or viral protein. For instance, the skilled person may make use of protein-binding assays to determine the binding affinity of a certain molecule (e.g., a protein) towards non-human (such as bacterial or viral) proteins. Similarly, the skilled person is able to determine whether a molecule (e.g., a protein) is able to specifically bind a non-protein molecule, e.g., a human non-protein molecule, such as human DNA, human RNA, human lipids (e.g., such as phosphatidylserine (PS)) or human glycans, see, e.g., Campanero-Rhodes MA et al., "Microarray strategies for exploring bacterial surface glycans and their interactions with glycan-binding proteins", Front Microbiol. 2020, 10:2909. For instance, as described above, the binding affinity of a molecular interaction between two molecules (such as two proteins, or a protein and a non-protein molecule) can be measured by SPR. SPR allows for the determination of the KD of a potential interaction between two molecules, as described in detail above.
As it will be evident to the skilled person, the at least one carrier building block comprised in the molecule of the present technology may show non-specific binding with one or more human proteins (and/or with one or more non-human proteins, and/or with one or more non-protein molecules, such as human non-protein molecules, and/or with one or more human cell types as described above). This is because there may be molecular forces between the at least one carrier building block and one or more human proteins (and/or one or more non-human proteins, and/or one or more non-protein molecules, such as human non-protein molecules, and/or one or more human cells, as described above), e.g., in the form of hydrophobic interactions, hydrogen bonding, Van der Waals interactions, and other non-specific interactions. Hence, if this happens, the at least one carrier building block comprised in the molecule of the present technology may non-specifically bind to one or more human proteins (and/or to one or more non-human proteins, and/or to one or more non-protein molecules, such as human non-protein molecules, and/or to one or more human cells, if this is the case, as described above). In the context of the present technology, any KD value greater than 5×10⁻⁴ mol/litre (or any KA value lower than 2×10³ litres/mol) is generally considered to represent "non-specific binding". Hence, the building block may bind to any human protein (or non-human protein, and/or to any human cell, if this is the case, as explained above) with a KD (KD value) greater than 5×10⁻⁴ mol/litre (or with a KA value lower than 2×10³ litres/mol), such as with a KD (KD value) greater than 5.5×10⁻⁴ mol/litre (or with a KA value lower than 1.8×10³ litres/mol), or with a KD (KD value) greater than 6×10⁻⁴ mol/litre (or with a KA value lower than 1.7×10³ litres/mol).
In addition, in the context of the present technology, in a preferred embodiment, the carrier building block may bind to any non-protein molecule, such as to any human non-protein molecule (e.g., DNA, RNA, lipids (e.g., such as phosphatidylserine (PS)), glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre (or with a KA value lower than 2×10³ litres/mol), such as with a KD (KD value) greater than 5.5×10⁻⁴ mol/litre (or with a KA value lower than 1.8×10³ litres/mol), or with a KD (KD value) greater than 6×10⁻⁴ mol/litre (or with a KA value lower than 1.7×10³ litres/mol). In addition, in the context of the present technology, the building block may bind to any human cell with a KD (KD value) greater than 5×10⁻⁴ mol/litre (or with a KA value lower than 2×10³ litres/mol), such as with a KD (KD value) greater than 5.5×10⁻⁴ mol/litre (or with a KA value lower than 1.8×10³ litres/mol), or with a KD (KD value) greater than 6×10⁻⁴ mol/litre (or with a KA value lower than 1.7×10³ litres/mol). In the context of the present technology, this binding affinity is considered to be "non-specific binding".
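The paired KD and KA thresholds recited above are related by the usual reciprocal relationship between the equilibrium dissociation constant and the equilibrium association constant. As a worked illustration (assuming this standard definition, with the KA values rounded to two significant figures):

```latex
\[
K_A = \frac{1}{K_D}
\]
\[
K_D > 5\times10^{-4}\ \mathrm{mol/litre}
\;\Longleftrightarrow\;
K_A < \frac{1}{5\times10^{-4}} = 2\times10^{3}\ \mathrm{litres/mol}
\]
\[
K_D > 5.5\times10^{-4}\ \mathrm{mol/litre}
\;\Longleftrightarrow\;
K_A < \frac{1}{5.5\times10^{-4}} \approx 1.8\times10^{3}\ \mathrm{litres/mol}
\]
\[
K_D > 6\times10^{-4}\ \mathrm{mol/litre}
\;\Longleftrightarrow\;
K_A < \frac{1}{6\times10^{-4}} \approx 1.7\times10^{3}\ \mathrm{litres/mol}
\]
```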
In one embodiment, the protein-based carrier building block comprised in the molecule of the present technology is not derived from the crystallizable fragment of an antibody (Fc, which contains two CH2 and two CH3 domains) such as the Fc fragment of a monoclonal antibody (mAb). In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology is not derived from the CH2 and/or the CH3 domains of the Fc fragment. In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology is not derived from the CH1 and/or the CL domains comprised in the antigen-binding fragment (Fab) of an antibody, such as the CH1 and/or the CL domains comprised in the Fab of a mAb. In one embodiment, the molecule of the present technology is not (or is not derived from) a crystallizable fragment (Fc) of an antibody, such as a mAb. In another embodiment, the molecule of the present technology is not (or is not derived from) the Fab of an antibody, such as a mAb.
In one embodiment, the molecule of the present technology does not comprise VH-VL pairs, e.g., it does not comprise at least one VH and at least one VL which interact with (are bound to) each other, such as in an antibody. In another embodiment, the molecule of the present technology does not comprise CL-CH1 conjugates, e.g., it does not comprise at least one CL and at least one CH1 which are linked to each other, e.g., through a disulphide bridge.
In a preferred embodiment, the binding properties of the at least one protein-based carrier building block comprised in the molecule of the present technology are not affected or altered (or essentially not affected or altered) when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block. As described herein, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any human protein. If the building block shows any interaction with one or more human proteins, such interaction is characterized by low specificity and/or low affinity, as defined herein. Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any human protein, and/or, if the building block shows any interaction with one or more human proteins, such interaction is still characterized by low specificity and/or low affinity, as defined herein. As also described herein, it is preferred that
the carrier building block comprised in the molecule of the present technology does also not specifically bind to any non-protein molecule (including non-protein biomolecules, such as nucleic acids, e.g., DNA and/or RNA, lipids (e.g., phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecule), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., such as phosphatidylserine (PS)) or human glycans, e.g., human glycolipids. In particular, preferably, the carrier building block does also not specifically bind to any non-protein molecule (including biomolecules) (such as nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans), e.g., to any non-protein human molecule (including biomolecules), such as human nucleic acids, e.g., human DNA and/or human RNA, human lipids (e.g., phosphatidylserine (PS)) or human glycans, e.g., human glycolipids to which the protein-based carrier building block precursor specifically binds (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-protein molecule (including biomolecules) or a non-human protein). Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any non-protein molecule, in particular it still does not specifically bind to any non-protein molecule (including biomolecules) to which the protein-based carrier building block precursor specifically binds.
As described above, in another preferred embodiment, the protein-based building block comprised in the molecule of the present technology does also not specifically bind to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule (including biomolecules)), or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to (i.e., the protein-based building block preferably does also not specifically bind to the precursor's target, e.g., a non-human protein or a non-protein molecule) with a KD value greater than 5×10⁻⁴ mol/litre, as described herein. Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any (non-human) molecule (including biomolecules) which the protein-based carrier building block precursor specifically binds to. As also described herein, in a preferred
embodiment, the protein-based carrier building block does not specifically bind to the precursor's target, should the protein-based carrier building block precursor have a target and should this be a non-human molecule (including biomolecules), such as a non-human protein. Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to the precursor's target. As also described above, in another preferred embodiment, the protein-based carrier building block does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules). If the building block shows any interaction with one or more microorganisms and/or virus, or with any microbial or viral molecule (including biomolecules), such interaction is characterized by low specificity and/or low affinity, as defined herein. Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or virus, or to any microbial or viral molecule (including biomolecules), or binds to it with low specificity and/or low affinity, as described herein. 
As further described above, in another preferred embodiment, the protein-based carrier building block comprised in the molecule of the present technology does not specifically bind to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any molecule, including biomolecules, including human molecules and non-human molecules (including human and non-human biomolecules, e.g., human and/or non-human proteins, nucleic acids such as DNA and/or RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans) with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described herein. For instance, the protein-based carrier building block does not specifically bind to any human and/or non-human animal biomolecule (e.g., human and/or non-human animal proteins, human and/or non-human nucleic acids such as DNA and/or RNA, human and/or non-human lipids (e.g., such as phosphatidylserine (PS)) or human and/or non-human glycans), or binds to any human and/or non-human animal biomolecule with a KD (KD value) greater than 5×10⁻⁴ mol/litre, as described
herein. Hence, in this preferred embodiment, when one or more cargos are attached to one or more attachment points or conjugation sites comprised in the protein-based carrier building block, the protein-based carrier building block still does not specifically bind to any molecule, as described herein, or binds to it with a KD (KD value) greater than 5xl0-4 mol/litre, as described herein.
It is thus preferred that the intrinsic binding properties of the protein-based building block do not change or do not essentially change when one or more cargos are attached or conjugated to it.
Attachment points or conjugation sites
As mentioned above, the carrier building block present in the molecule of the present technology has at least one attachment point (also referred to as conjugation site in the present disclosure), preferably at a solvent-accessible position, as defined further below. Preferably, the at least one protein-based carrier building block comprises more than one attachment point or conjugation site, preferably at solvent-accessible positions. In a preferred embodiment, the protein-based carrier building block comprises at least two attachment points or conjugation sites, preferably at solvent-accessible positions. In another embodiment, the protein-based building block comprises three conjugation sites or more, such as six or nine conjugation sites, preferably at solvent-accessible positions. In a preferred embodiment, the protein-based building block comprises four conjugation sites, preferably located at solvent-accessible positions in the protein-based carrier building block. In a preferred embodiment, the protein-based building block comprises five conjugation sites, preferably located at solvent-accessible positions in the protein-based carrier building block. For instance, the protein-based carrier building block may have two, three, four, five, six, seven, eight, nine, ten conjugation sites or more, preferably at solvent-accessible positions. In one embodiment, if more than one, the conjugation sites present in the carrier building block are different from each other. For instance, if the carrier building block comprises two conjugation sites, these conjugation sites may be functionally/chemically different from each other, i.e., each conjugation site or attachment point is chemically different from the other (e.g., if there are two conjugation sites, one conjugation site may be a -SH group present in the side chain of a cysteine located in a solvent-accessible position, and the other conjugation site may be a -NH2 group present in the side chain of a lysine located in a solvent-accessible position, or the N-terminal NH2 group). If the building block has more than two conjugation sites (e.g., at least three conjugation sites, such as three, four, five, six, seven, eight, nine, ten, etc.), there may be at least two types of conjugation sites among the at least three conjugation sites present in the building block. In another embodiment, if the building block has three conjugation sites, each conjugation site is functionally different from the others. In another embodiment, if the building block has three conjugation sites, two conjugation sites are the same and one conjugation site is functionally different from the other two conjugation sites. In another embodiment, all conjugation sites present in the building block are functionally different from each other. In another embodiment, all conjugation sites present in the carrier building block are the same. For instance, the protein-based building block may comprise one, two, three, four, five, six, seven, eight, nine, ten or more conjugation sites which are all the same, e.g., which are all -SH groups present in the side chain of cysteines located at solvent-accessible positions in the protein-based building block.
In another embodiment, alternatively or additionally, if there is more than one conjugation site, the conjugation sites are spatially distant from each other (spatially separated from each other). The skilled person will appreciate that the minimal distance between conjugation sites will be dictated by the nature of the cargos (and linkers, if used) which are to be attached or conjugated to the attachment points or conjugation sites in the protein-based carrier building block. For larger cargos (e.g., ISVDs), the minimal distance can still be kept small when used in combination with long linkers, which add the needed flexibility and the envisaged target binding. A short distance between conjugation sites, combined with short linkers, if any, will likely limit the target binding of larger cargos, and result in restricted engagement (e.g., increased cell specificity). In addition, the solubility of the molecule may be decreased (i.e., the molecule may be more prone to aggregation). On the other hand, if the cargos to be attached are rather small (e.g., radioactive isotopes), the minimal distance can be kept small even in the absence of linkers. Hence, the skilled person will be able to select the location of the specific conjugation sites and the length and flexibility of the linkers, if any, depending on the nature of the cargos which are to be attached or conjugated to the protein-based carrier building block.
A "conjugation site" or "attachment point" may be a reactive group in the side chain of a natural or a non-natural (also referred to as "noncanonical", "unnatural" or "unusual", as described above) amino acid preferably located at a solvent-accessible position in the protein-based carrier building block. It may also be the C-terminal and/or N-terminal reactive group (-COOH and -NH2 groups, respectively) of the protein-based carrier building block. In the context of the present technology, a "reactive group in the side chain of an amino acid" (either natural or non-natural, as defined above) refers to any chemical group present in the side chain of an amino acid which is capable of forming a covalent bond. For instance, if the amino acid is lysine (or ornithine (Orn), or diaminopropionic acid (Dap), or diaminobutyric acid (Dab)), the reactive group present on its side chain is a primary amine. For example, if the amino acid is cysteine, the reactive group present on its side chain is a thiol group. For example, if the amino acid is aspartic or glutamic acid, the reactive group present on its side chain is a carboxylic group. For example, if the amino acid is tyrosine, the reactive group present on its side chain is a phenolic hydroxyl group. For example, if the amino acid is arginine, the reactive group present on its side chain is a guanidino group. For example, if the amino acid is methionine, the reactive group present on its side chain is a thioether group.
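The residue-to-reactive-group examples given above can be summarised as a simple lookup. The following is a minimal illustrative sketch only; the dictionary name, the function name and the use of three-letter residue codes are assumptions made for this illustration and are not part of the disclosure.

```python
from typing import Optional

# Side-chain reactive groups for the example residues listed in the text.
# The three-letter codes used as keys are an illustrative choice.
SIDE_CHAIN_REACTIVE_GROUP = {
    "Lys": "primary amine",
    "Orn": "primary amine",
    "Dap": "primary amine",
    "Dab": "primary amine",
    "Cys": "thiol",
    "Asp": "carboxylic group",
    "Glu": "carboxylic group",
    "Tyr": "phenolic hydroxyl",
    "Arg": "guanidino",
    "Met": "thioether",
}


def reactive_group(residue: str) -> Optional[str]:
    """Return the side-chain reactive group for a residue, or None if the
    residue is not among the examples listed in the text."""
    return SIDE_CHAIN_REACTIVE_GROUP.get(residue)


print(reactive_group("Cys"))  # thiol
print(reactive_group("Gly"))  # None (not among the listed examples)
```

The lookup covers only the residues named in the paragraph above; other natural or non-natural amino acids with reactive side chains would have to be added explicitly.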
In the context of the present technology, the "C-terminal or N-terminal reactive group of the protein-based carrier building block" refers to the -COOH and -NH2 reactive groups present in the C- and N-terminal amino acid of the protein-based carrier building block, respectively. If the carrier building block does not have a free C- and/or N-terminal end (e.g., because the carrier building block is C- and/or N-terminally linked to another protein-based building block, or to another peptide or protein, or because the N-terminus is acetylated, or because the C-terminus is amidated, etc.), then the N- and/or C-terminal ends of the carrier building block are not suitable as attachment points or conjugation sites as defined herein. In some embodiments the "conjugation site" or "attachment point" is not the C-terminal or N-terminal reactive group of the protein-based carrier building block.
The at least one conjugation site or attachment point present in the building block comprised in the molecule of the present technology may be already present in the building block precursor (e.g., the -NH2 group in the side chain of a lysine present in the building block precursor, preferably at a solvent-accessible position) or may be engineered. Preferably, at least one or more of the attachment points or conjugation sites of the protein-based building block are engineered. In the context of the present technology, an "engineered" attachment point or conjugation site means a conjugation site or attachment point which is present in the protein-based carrier building block, but which was not present in its precursor at the same or corresponding position. For instance, the protein-based building block precursor may be modified to introduce one or more attachment points or conjugation sites, as described in detail below. A non-limiting example of an engineered attachment point or conjugation site is a reactive group present in the side chain of an amino acid in the protein-based carrier building block which amino acid was not present at the same or equivalent position in the building block precursor. For instance, if the building block precursor has a serine at a certain position X (which is preferably a solvent-accessible position) in the building block precursor, and that serine is mutated to a cysteine in the carrier building block, the -SH group of that cysteine would be an engineered attachment point or conjugation site. For instance, if an amino acid (e.g., a Cys, or a Tyr) is added at the N- or C-terminal end of the building block precursor, the reactive group present in the side chain of that newly added amino acid in the carrier building block would be an engineered attachment point or conjugation site.
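The serine-to-cysteine example above can be illustrated with a short sketch that lists cysteines present in a carrier sequence but absent from the precursor at the same position. This is a simplified illustration under stated assumptions: the function name and the toy sequences are hypothetical, the two sequences are assumed to have equal length (substitutions only), and solvent accessibility and terminal additions are not considered.

```python
from typing import List


def engineered_cysteine_sites(precursor: str, carrier: str) -> List[int]:
    """Return 1-based positions at which the carrier has a cysteine (C)
    and the precursor has a different residue at the same position.

    Assumes both sequences use one-letter codes and have equal length,
    i.e. the carrier differs from the precursor by substitutions only.
    """
    if len(precursor) != len(carrier):
        raise ValueError("sequences must have equal length in this sketch")
    return [
        position
        for position, (p, c) in enumerate(zip(precursor, carrier), start=1)
        if c == "C" and p != "C"
    ]


# Hypothetical toy sequences: the serine at position 3 is mutated to cysteine.
print(engineered_cysteine_sites("MKSAT", "MKCAT"))  # [3]
```

A cysteine that is already present in the precursor at the same position is not reported, consistent with the definition of an engineered conjugation site given above.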
Hence, in a preferred embodiment, the protein-based carrier building block comprised in the molecule of the present technology has at least two conjugation sites or attachment points, wherein at least one, preferably at least two of them are engineered attachment points or conjugation sites, i.e., they were not present in the building block precursor at the same or corresponding position. In another preferred embodiment, all of the conjugation sites or attachment points present in the protein-based building block are engineered attachment points or conjugation sites, i.e., they were not present in the building block precursor at the same or corresponding position. In one embodiment, the carrier building block has two or more engineered attachment points or conjugation sites, such as three, four, five, six, seven, eight, nine, ten or more engineered attachment points or conjugation sites.
As used herein, a residue position in one polypeptide sequence "corresponds to" a residue position in another polypeptide sequence if it exists in an equivalent position in the polypeptide sequence, as indicated, e.g., by primary sequence homology or functional equivalence or Kabat numbering. A corresponding position may be identified by alignment of the two polypeptide sequences. The alignment used to identify a corresponding position or corresponding region may be obtained using a conventional alignment algorithm such as BLAST (Altschul et al., "Basic local alignment search tool", J Mol Biol., 1990, 215(3):403-10).
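Once a pairwise alignment is available, corresponding positions can be read off column by column. The sketch below assumes that the alignment has already been produced by an external tool and is supplied as two gapped strings of equal length, with "-" as the gap character; the function name and the toy sequences are hypothetical and serve only as an illustration.

```python
from typing import Dict


def corresponding_positions(aligned_a: str, aligned_b: str) -> Dict[int, int]:
    """Map 1-based residue positions of sequence A to the corresponding
    1-based positions of sequence B, given two gapped alignment rows.

    Columns in which either row has a gap ("-") yield no correspondence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned rows must have the same length")
    mapping: Dict[int, int] = {}
    pos_a = pos_b = 0
    for a, b in zip(aligned_a, aligned_b):
        if a != "-":
            pos_a += 1  # advance in the ungapped numbering of A
        if b != "-":
            pos_b += 1  # advance in the ungapped numbering of B
        if a != "-" and b != "-":
            mapping[pos_a] = pos_b
    return mapping


# Hypothetical toy alignment: sequence B carries one inserted residue (G).
print(corresponding_positions("MK-SAT", "MKGCAT"))
# {1: 1, 2: 2, 3: 4, 4: 5, 5: 6}
```

In this toy case, position 3 of the first sequence (S) corresponds to position 4 of the second sequence (C), even though the raw position numbers differ.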
The at least one conjugation site present in the carrier building block comprised in the molecule of the present technology may be free (i.e., ready for reaction) or capped/protected. Hence, the α-amino group, the carboxylic acid terminus, or the reactive groups present in the side chain of one or more amino acids of the carrier building block (e.g., amines, carboxylic acids, alcohols, thiols) may be capped or protected with a protecting group, e.g., to prevent polymerization of the amino acids, to minimize undesirable side reactions during the synthesis of the building block or to selectively attach different cargos. Of course, if the at least one conjugation site is capped or protected, it has to be de-capped or deprotected before attaching or conjugating a cargo to it, as described in detail below.
Thus, the at least one conjugation site present in the protein-based carrier building block comprised in the molecule of the present technology may be (non-limiting) a primary amine, a thiol group, a hydroxyl group, a guanidino group, a carboxyl group or a thioether group. For instance, the conjugation site may be a free or capped (protected) thiol group.
Hence, in some embodiments, the at least one conjugation site present in the protein-based carrier building block comprised in the molecule of the present technology may be a primary amine present in the side chain of a lysine (or ornithine (Orn), or diaminopropionic acid (Dap), or diaminobutyric acid (Dab)) in the protein-based building block, preferably located at a solvent-accessible position. In other embodiments, the conjugation site is a thiol group present in the side chain of a cysteine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block. In other embodiments, the conjugation site is a carboxylic group present in the side chain of an aspartic or glutamic acid in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block. In other embodiments, the conjugation site is a guanidino group present in the side chain of an arginine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block. In other embodiments, the conjugation site is a thioether group present in the side chain of a methionine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block. In other embodiments, the conjugation site is the phenolic -OH group of a tyrosine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block. In one embodiment, the tyrosine is preferably located at the N- or C-terminal end of the protein-based carrier building block of the molecule. In other embodiments, the conjugation site is the N-terminal primary amine of the carrier building block, if this is free and preferably solvent-accessible. In other embodiments, the conjugation site is the C-terminal carboxyl group of the carrier building block, if this is free and preferably solvent-accessible.
As described above, the conjugation sites may be free or protected. For instance, as already described above, if the conjugation site is a thiol group (e.g., from a cysteine in the protein-based building block, preferably located at a solvent-accessible position in the protein-based building block), the thiol group may be free (-SH) or protected/capped. A capped thiol group refers to a thiol group which is (reversibly) protected with a protecting group (e.g., with another cysteine, with glutathione (GSH), with cysteamine or with protecting groups such as Benzyl (Bzl, Bn), Trityl (Trt), Diphenylmethyl (Dpm, Bzh, Bh), Tetrahydropyranyl (Thp), tert-Butyl (tBu), etc.). Spears, R., et al., ("Cysteine protecting groups: applications in peptide and protein science", Chem. Soc. Rev., 2021, 50, 11098) provides a review on the different cysteine protecting groups. In addition, Isidro-Llobet, A., et al., ("Amino acid-protecting groups", Chem Rev., 2009, 109(6):2455-504) provides a review of different amino acid protecting groups.
In one embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises at least two attachment points or conjugation sites which are two reactive groups present in the side chain of two amino acids (which may be natural or a non-natural) in the protein-based building block, preferably located at solvent-accessible positions in the protein-based carrier building block. For instance, in one embodiment, the protein-based carrier building block comprises at least two attachment points or conjugation sites which are two reactive groups present in the side chain of two natural amino acids (e.g., two Cys) in the protein-based building block, preferably located at solvent-accessible positions in the protein-based carrier building block.
In another embodiment, the protein-based carrier building block comprises four conjugation sites located at solvent-accessible positions, wherein the four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block.
In another embodiment, the protein-based carrier building block comprises five conjugation sites located at solvent-accessible positions, wherein four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block. The fifth conjugation site may be the N-terminus or the C-terminus of the protein-based carrier building block.
In another embodiment, the protein-based carrier building block comprises five conjugation sites located at solvent-accessible positions, wherein four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block. Preferably, the fifth conjugation site is the N-terminal amine of the protein-based carrier building block.
In another embodiment, the protein-based carrier building block comprises six conjugation sites located at solvent-accessible positions, wherein four conjugation sites are four -SH groups present in the side chain of four Cys located at solvent-accessible positions in the protein-based carrier building block. Preferably, the fifth conjugation site is the N-terminal amine of the protein-based carrier building block and the sixth conjugation site is the C-terminal carboxylic acid of the protein-based carrier building block, to which the at least one NLS is preferably conjugated, as described herein.
At least one of the attachment points or conjugation sites present in the protein-based building block is linked (directly or via a linker) to a cargo which is a nuclear localization sequence (NLS), as defined herein. More than one attachment point or conjugation site present in the protein-based building block may be linked (directly or via a linker) to more than one NLS.
In a further preferred embodiment, the molecule of the present technology comprises at least one further cargo, wherein the at least one further cargo is attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site. The cargo may be a cell-targeting moiety (also referred to herein as "targeting moiety"), as described in detail below. The molecule of the present technology may comprise more than one cell-targeting moiety, such as two, three, four, five or more cell-targeting moieties. These can each be covalently linked to the attachment points or conjugation sites comprised in the protein-based building block (directly or by means of a linker, as described herein). They can also be covalently linked to one attachment point or conjugation site in tandem, i.e., two or more cell-targeting moieties are covalently linked to each other (e.g., via their N- and C-terminal parts) and then, all of them, covalently linked to the protein-based carrier building block through one attachment point or conjugation site comprised therein. The one or more cell-targeting moieties may be attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site by genetic fusion.
Hence, in one embodiment, the molecule of the present technology comprises at least one protein-based carrier building block, at least one NLS and at least one cell-targeting moiety.
In a further embodiment, the molecule of the present technology comprises at least one further cargo, wherein the at least one further cargo is attached or conjugated to the at least one protein-based carrier building block through the at least one attachment point or conjugation site.
The cargo may be a cell-penetrating peptide (CPP), as described in detail below. The molecule of the present technology may comprise more than one CPP, such as two, three, four, five or more CPPs. These can each be covalently linked to the attachment points or conjugation sites comprised in the protein-based building block (directly or by means of a linker, as described herein). They can also be covalently linked to one attachment point or conjugation site in tandem, i.e., two or more CPPs are covalently linked to each other (e.g., via their N- and C-terminal parts) and then, all of them, covalently linked to the protein-based carrier building block through one attachment point or conjugation site comprised therein.
The cargos may also be a cell-penetrating peptide (CPP), as described in detail below, and a cell-targeting moiety, as described in detail below. These can each be covalently linked to the attachment points or conjugation sites comprised in the protein-based building block (directly or by means of a linker, as described herein). They can also be covalently linked to one attachment point or conjugation site in tandem, i.e., two or more cell-targeting moieties or CPPs are covalently linked to each other (e.g., via their N- and C-terminal parts) and then, all of them, covalently linked to the protein-based carrier building block through one attachment point or conjugation site comprised therein.
In the context of the present technology, a "cargo" may be any molecule which is/may be attached or conjugated to the protein-based carrier building block through the attachment point(s) or conjugation site(s) present therein. It is clear from the above that an NLS, such as PAAKRVKLD (SEQ ID NO.: 221), or "cell-targeting moieties", or CPPs are also cargos. For instance, cargos which may be attached or conjugated to the protein-based carrier building block are proteins, peptides, ISVDs (such as VHH, VL or VH), polyethylene glycol (PEG), small molecules (such as CDK inhibitors), glycans (such as, e.g., M6P), lipids, chelators, fluorophores, radioisotopes, vitamins (such as folic acid or biotin), nucleic acids such as DNA and/or Antisense Oligonucleotides (ASOs), etc. The cargo may have different functionalities. For instance, the at least one cargo may be a half-life extending (HLE) molecule, a targeting molecule, a therapeutic molecule or precursor thereof (including mRNA), an imaging molecule, a toxic molecule, an agonist, a T-cell engagement molecule, a sweeping/degrader molecule, a cell-penetrating molecule, a nuclear localization molecule, a blood brain barrier (BBB) shuttle, a radiotherapeutic molecule or an imaging probe.
Hence, in another embodiment, the molecule of the present technology comprises at least one further cargo, wherein the further cargo is also attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site, and wherein the further cargo is an HLE molecule, such as an albumin-binding ISVD (as described herein, e.g., as defined in Table 8, such as SEQ ID NO.: 63 or 106) or a PEG molecule, or ELNN polypeptides, as described herein. In another embodiment, the molecule of the present technology comprises at least one further cargo, wherein the further cargo is also attached or conjugated to the at least one protein-based carrier building block through at least one attachment point or conjugation site, and wherein the further cargo is a targeting moiety and/or a therapeutic moiety as described herein. In a further embodiment, the molecule of the present technology comprises at least two further cargos, wherein the further cargos are attached or conjugated to the at least one protein-based carrier building block through at least two attachment points or conjugation sites, wherein the at least two further cargos are one HLE molecule, as described herein, and one therapeutic and/or targeting moiety, as described herein.
In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises at least two cysteines, preferably located at solvent-accessible positions, such as three cysteines, or four cysteines, or six cysteines, or nine cysteines, preferably located at solvent-accessible positions, with free or capped thiol groups that are the at least two, such as three, or four, or six, or nine, conjugation sites as defined herein. In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises three cysteines, preferably located at solvent-accessible positions, with free or capped thiol groups that are the three conjugation sites as defined herein. In one embodiment, the protein-based carrier building block does not comprise any other cysteine at solvent-accessible positions besides the three cysteines at solvent-accessible positions which bear the three conjugation sites (free or capped thiol groups) (but may comprise one or more cysteines at positions which are not solvent-accessible). In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises four cysteines, preferably located at solvent-accessible positions, with free or capped thiol groups that are the four conjugation sites as defined herein. In one embodiment, the protein-based carrier building block does not comprise any other cysteine at solvent-accessible positions besides the four cysteines at solvent-accessible positions which bear the four conjugation sites (free or capped thiol groups) (but may comprise one or more cysteines at positions which are not solvent-accessible).
In another embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises four, five, six, seven, eight, nine, ten or more cysteines, preferably located at solvent-accessible positions, with free or capped thiol groups that are the four, five, six, seven, eight, nine, ten or more conjugation sites as defined herein. In one embodiment, the protein-based carrier building block does not comprise any other cysteine at solvent-accessible positions besides the four, five, six, seven, eight, nine, ten or more cysteines which bear the four, five, six, seven, eight, nine, ten or more conjugation sites (free or capped thiol groups) and which are located at solvent-accessible positions in the building block. In other embodiments, the at least one protein-based building block comprised in the molecule of the present technology comprises at least one amino acid, such as one, two, three, four, five, six, seven, eight, nine, ten or more, which may be natural or non-natural, preferably located at solvent-accessible positions, which comprises a reactive group on its side chain which is the conjugation site as defined herein. In another embodiment, the at least one protein-based building block comprised in the molecule of the present technology comprises at least two conjugation sites, one of which is a (free or protected) thiol group from a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, and the other one is a -OH group from a tyrosine preferably located at a solvent-accessible position in the protein-based carrier building block, preferably from an N- or C-terminally exposed tyrosine. In another embodiment, the at least one protein-based building block comprised in the molecule of the present technology comprises at least two conjugation sites, one of which is a (free or protected) thiol group from a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, and the other one is a reactive group from a non-natural amino acid preferably located at a solvent-accessible position in the protein-based carrier building block.
In one embodiment, a conjugation site or attachment point in the protein-based building block is a selenol (-SeH) group from a selenocysteine (Sec or U), which may be located, e.g., at the C-terminus of the protein-based carrier building block. In another embodiment, a conjugation site or attachment point in the protein-based building block is a keto group of a p-acetylphenylalanine (pAcPhe), which can be selectively coupled to an alkoxyamine-derivatized cargo, see, e.g., Jun Y. Axup et al., "Synthesis of site-specific antibody-drug conjugates using unnatural amino acids", PNAS, 2012, 109 (40) 16101-16106.
The skilled person is aware of ways of incorporating one or more unnatural amino acids in the at least one protein-based building block comprised in the molecule of the present technology, if this is the case. For instance, WO 2021/050554, the content of which is herewith incorporated by reference, describes in detail how to incorporate one or more unnatural amino acid(s) in a protein.
In one embodiment, the conjugation site is a free or capped thiol group in the side chain of a cysteine, preferably present at a solvent-accessible position in the building block. Cysteine is often the site of choice when it comes to the site-specific modification of proteins, also known as bioconjugation, owing to its favourable properties (nucleophilic profile of the thiol at neutral/near-neutral pH, low natural abundance, general ease of incorporation into proteins via site-directed mutagenesis) (from Spears R. J. et al., "Cysteine protecting groups: applications in peptide and protein science", Chem. Soc. Rev., 2021, 50, 11098-11155).
In a preferred embodiment, the at least one conjugation site or attachment point is selected from: a thiol group (-SH, free or capped) present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, -NH2 (primary amine, either from the N-terminal end of the protein-based building block or present in the side chain of an amino acid, such as lysine or ornithine), -OH present in the side chain of a tyrosine (either a C-terminal tyrosine, N-terminal tyrosine or a tyrosine preferably present at any other solvent-accessible position in the protein-based carrier building block), C-terminal -COOH, and an azido group present in the side chain of a non-natural amino acid (such as azidolysine). More preferably, the at least one conjugation site or attachment point is a thiol group (free or capped) present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises six attachment points or conjugation sites, wherein three of them are -SH groups present in the side chain of three Cys, preferably located at solvent-accessible positions, and wherein three of them are -NH2 groups present in the side chain of three Lys, preferably located at solvent-accessible positions in the protein-based building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises four attachment points or conjugation sites, wherein three of them are three -SH groups present in the side chain of three Cys, preferably located at solvent-accessible positions, and wherein one of them is the N-terminal primary amine of the protein-based carrier building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises four attachment points or conjugation sites that are four -SH groups present in the side chain of four Cys, preferably located at solvent-accessible positions.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises four attachment points or conjugation sites, wherein three of them are three -SH groups present in the side chain of three Cys, preferably located at solvent-accessible positions, and wherein one of them is the C-terminal carboxylic acid of the protein-based carrier building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises five attachment points or conjugation sites, wherein four of them are four -SH groups present in the side chain of four Cys, preferably located at solvent-accessible positions, and wherein one of them is the N-terminal primary amine of the protein-based carrier building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises five attachment points or conjugation sites, wherein four of them are four -SH groups present in the side chain of four Cys, preferably located at solvent-accessible positions, and wherein one of them is the C-terminal carboxylic acid of the protein-based carrier building block.
In one embodiment, the protein-based building block comprised in the molecule of the present technology comprises six attachment points or conjugation sites, wherein four of them are four -SH groups present in the side chain of four Cys, preferably located at solvent-accessible positions, wherein one of them is the N-terminal primary amine of the protein-based carrier building block and wherein one of them is the C-terminal carboxylic acid of the protein-based carrier building block.
Hence, the at least one conjugation site present in the at least one building block comprised in the molecule of the present technology allows for conjugation of different cargos (directly or by means of a linker, as will be clear to the skilled person and as described in detail below). The skilled person is aware of ways of attaching cargos to the conjugation site(s) present in the building block. For instance, Spicer C. D. et al. ("Achieving controlled biomolecule-biomaterial conjugation", Chem Rev. 2018, 118(16):7702-7743), the content of which is herewith incorporated by reference, reviews the chemistry of biomolecule conjugation and provides a comprehensive overview of the key strategies for achieving controlled functionalization.
For instance, if a conjugation site is a -SH group (free or capped) present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, the cargo can be attached or conjugated to the building block (directly or by means of a linker) by alkylation, metal-assisted arylation, disulphide exchange or addition to a maleimide Michael acceptor. It can also be attached or conjugated using the so-called "PODS-based conjugation" (see, e.g., Davydova M. et al., "Synthesis and bioconjugation of thiol-reactive reagents for the creation of site-selectively modified immunoconjugates", J Vis Exp., 2019, 145:10.3791/59063). These different methods provide a high level of chemoselectivity for cysteine (see, e.g., D. Alvarez Dorta, et al., Chem. Eur. J. 2020, 26, 14257). If at least one conjugation site is a -SH group (free or capped) present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, the cargo can be attached or conjugated to it through addition to a maleimide Michael acceptor. Maleimide present in the cargo will specifically react with the at least one free thiol to form a thioether bond, generally at pH 6.5 to 7.5. Of course, if the -SH group is capped or protected, it should first be decapped or deprotected (e.g., reduced with a reducing reagent, such as dithiothreitol (DTT) or tris(2-carboxyethyl)phosphine (TCEP)), and then the cargo can be attached to it. For instance, an APN-maleimide 'bifunctional' linker (see Formula I in the examples), also known as 3-(4-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)phenyl)propiolonitrile, can be used to attach or conjugate a cargo to a -SH attachment point present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block.
For instance, a bis-maleimido-PEG3-linker (1,11-bismaleimido-triethyleneglycol) can be used to attach or conjugate a cargo to a -SH attachment point present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block. In addition, maleimide-modified cargos (see, e.g., PEG-maleimide, N-ethylmaleimide, maleimido-PEG-acid, Resiquimod (R-848)-maleimide, cryptophycin-PEG-maleimide) can be attached to the -SH attachment point present in the side chain of a cysteine preferably located at a solvent-accessible position in the protein-based carrier building block, see also the examples.
For instance, if a conjugation site is a -OH group of a tyrosine preferably located at a solvent-accessible position in the protein-based carrier building block, the cargo can be attached or conjugated to the building block (directly or by means of a linker) by several chemical methods such as cross-linking via catalytic tyrosine mono-electronic oxidation, three-component Mannich-type tyrosine conjugation, conjugation via sulphur fluoride exchange chemistry (SuFEx), transition-metal complexes for tyrosine conjugation, diazonium coupling reaction, reactions with triazolinediones, etc. (for a review, see, e.g., D. Alvarez Dorta et al., Chem. Eur. J., 2020, 26, 14257).
Alternatively or additionally, if the conjugation site is the -OH group of an N- and/or C-terminal tyrosine, the cargo can be attached or conjugated to the building block (directly or by means of a linker) enzymatically as described, e.g., in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. As described therein, if conjugation of at least one cargo to an N- and/or C-terminal tyrosine is to be performed, the protein-based building block may preferably be extended with flexible (GG) or (G4S1)1-3GG tags (sequences) in order to facilitate the enzymatic addition. In this case, tyrosinase from Agaricus bisporus (abTYR), a copper-dependent enzyme that functions to convert tyrosine into melanin via an o-quinone intermediate, may be used. Alternatively, the much smaller Bacillus megaterium tyrosinase (bmTYR) may be used to catalyze the reaction.
For instance, if the conjugation site is the N-terminal primary amine of the protein-based carrier building block and/or the primary amine present in the side chain of an amino acid preferably located at a solvent-accessible position in the protein-based carrier building block (e.g., Lys, Orn, or any non-natural amino acid with a primary amine on its side chain), the cargo may be attached or conjugated to the carrier building block (directly or by means of a linker) by reaction of a group present in the cargo/linker (e.g., isothiocyanates, isocyanates, acyl azides, NHS esters, sulfonyl chlorides, aldehydes, glyoxals, epoxides, oxiranes, carbonates, aryl halides, imidoesters, carbodiimides, anhydrides, or fluorophenyl esters) with the primary amine. See, e.g., Bioconjugate Techniques (Third edition), 2013, Chapter 3 - "The reactions of bioconjugation", Greg T. Hermanson.
For instance, if the building block comprises at least two conjugation sites which include a -OH from a tyrosine and a -SH from a cysteine preferably located at solvent-accessible positions in the protein-based carrier building block, thiol nucleophiles can be conveniently capped through disulfide formation with Ellman's reagent. Following the coupling reaction at the -OH of the tyrosine, the thiol groups can be de-capped through brief exposure to an appropriate reducing agent, as described in Alan M. Marmelstein et al., mentioned above.
Another option to attach or conjugate a cargo to an attachment point or conjugation site in the protein-based building block (directly or by means of a linker) is the use of sortase-mediated transpeptidation reactions. Sortases allow functionalization of the N- or C-terminus and the creation of non-natural fusions (i.e., N-N or C-C chimeras) via the installation of click handles, see, e.g., Guimaraes C. P. et al. ("Site-specific C-terminal and internal loop labelling of proteins using sortase-mediated reactions", Nature Protocols, 2013, 8(9): 1787-1799). As described in this protocol, sortase-mediated reactions are applicable to any protein of interest (e.g., to the protein-based carrier building block), provided it contains (i) an LPXTG motif (where X can be any amino acid and glycine cannot be a free carboxylate) as the sortase target or (ii) a suitably exposed glycine residue to serve as the incoming nucleophile. The natural nucleophile of sortase can be replaced by any peptide/protein with an oligoglycine (Gly1-5) at the N-terminus (in many cases a single glycine suffices). In turn, the peptides can be decorated with any cargo molecule (e.g., fluorophores, biotin, cross-linkers, lipids, carbohydrates, nucleic acids), provided that a free N-terminal glycine remains available on the peptide used as the incoming nucleophile. Thus, incubation of sortase, LPXTG-containing protein and nucleophile leads to the covalent attachment of that nucleophile to the protein of interest in a site-specific manner. Guimaraes C. P. et al., mentioned above, provides a protocol that allows the functionalization of any given protein at its C-terminus. The target protein is engineered with a sortase-recognition motif (LPXTG). Upon recognition, sortase cleaves the protein between the threonine and glycine residues, facilitating the attachment of an exogenously added oligoglycine (Gly1-5) peptide modified with the functional group of choice (e.g., the cargo to be attached to the protein-based carrier building block). Theile C. S. et al. ("Site-specific N-terminal labeling of proteins using sortase-mediated reactions", Nature Protocols, 2013, 8(9): 1800-1807) describes the use of sortase-mediated reactions to label the N-terminus of any given protein of interest. As described in this protocol, the protein to be labeled is engineered with an exposed stretch of glycines or alanines at its N-terminus when using sortase A from S. aureus or S. pyogenes, respectively. A peptide decorated with a functional group of choice (fluorophores, biotin, lipids, nucleic acids, carbohydrates and so on) and comprising a sortase recognition motif LPXTG/A sequence (X being any amino acid, as stated above) at its C-terminus (e.g., the cargo) is then added to the reaction together with sortase. Sortase A cleaves between the threonine and glycine/alanine residues, forming a thioester intermediate with the peptide probe. Nucleophilic attack by the N-terminally modified protein of interest resolves the intermediate, resulting in the formation of a covalent bond between the peptide probe (e.g., the cargo) and the N-terminus of the protein (see Fig. 1 of Theile C. S. et al., mentioned above). Alternatively, depsi-peptides can be used for N-terminal labeling, see Theile C. S. et al., mentioned above. Finally, Witte M. D. et al. ("Production of unnaturally linked chimeric proteins using a combination of sortase-catalyzed transpeptidation and click chemistry", Nature Protocols, 2013, 8(9): 1808-1819) describes a procedure for the production of N-to-N and C-to-C fusion proteins.
By equipping the N-terminus or C-terminus of the proteins of interest with a set of click handles using sortase A, followed by a strain-promoted click reaction, unnatural N-to-N and C-to-C linked (hetero) fusion proteins are established. As described in Witte M. D. et al., peptides for creating C-to-C linked proteins are synthesized with an N-terminal triglycine motif and an azide or cyclooctyne (DIBAC) at the C-terminus (see also Fig. 2 of this document). The proteins of interest are engineered with C-terminal LPXTG sequences. To prepare N-to-N linked proteins, the authors of this protocol synthesize peptides containing the LPXTGG sortase A recognition sequence at the C-terminus (X can be any residue, but the authors prefer a polar residue, such as a glutamic acid, to aid precipitation of the peptide after cleavage from the resin and to increase the solubility of the peptide in water) and an azido or a cyclooctyne group at the N-terminus of the probe. The proteins to be linked should comprise 1-5 Gly at the N-terminus. The final step of the procedure is fusing the click handle-containing proteins, see Fig. 1 of Witte M. D. et al.
In view of the above, it is possible to attach or conjugate cargos to the protein-based carrier building block (directly or by means of a linker) using sortases, as described in detail in Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al., the content of which is incorporated herewith by reference. The cargos may be attached or conjugated (directly or by means of linkers) at conjugation sites or attachment points in the protein-based carrier building block, which are either the N- or C-terminus of the protein-based building block, using the above-described sortase methodology. Hence, if the conjugation site or attachment point of the protein-based carrier building block is the C-terminal end of the building block, a cargo may be attached or conjugated to it using sortase, provided that the C-terminal end of the building block comprises a sortase-recognition motif (LPXTG) and the cargo comprises an oligoglycine ((Gly)1-5) modified peptide at the N-terminus (see Fig. 2 of Guimaraes C. P. et al.). If the conjugation site or attachment point of the protein-based carrier building block is the N-terminal end of the building block, a cargo may be attached or conjugated to it using sortase, provided that the N-terminal end of the building block comprises a (Gly)1-5 tag sequence and the cargo comprises a sortase-recognition motif (LPXTG/A) at the C-terminus (see Fig. 1 of Theile C. S. et al.). In addition, protein or peptide cargos can be attached to the N/C-terminal end of the protein-based carrier building block in an N-to-N and/or C-to-C manner, as described in detail in Witte M. D. et al.
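The sequence prerequisites for sortase-mediated conjugation stated above can be checked with a short script. The following is a minimal sketch only: the function names, the 20-residue search window and the example sequences are hypothetical and are not taken from the cited protocols. It tests for an LPXTG motif near the C-terminus whose glycine is followed by at least one further residue (so that the glycine is not a free carboxylate) and counts the glycines at the N-terminus.

```python
import re

# Hypothetical helpers; the motif rules follow the description above,
# the 20-residue window is an arbitrary illustrative choice.
LPXTG = re.compile(r"LP[A-Z]TG")


def has_c_terminal_sortase_motif(sequence: str, window: int = 20) -> bool:
    """True if an LPXTG motif starts within the last `window` residues
    and is followed by at least one residue (Gly is not the last residue)."""
    tail_start = max(0, len(sequence) - window)
    for match in LPXTG.finditer(sequence):
        if match.start() >= tail_start and match.end() < len(sequence):
            return True
    return False


def n_terminal_glycine_count(sequence: str) -> int:
    """Number of consecutive glycines at the N-terminus."""
    match = re.match(r"G+", sequence)
    return len(match.group(0)) if match else 0


if __name__ == "__main__":
    # Made-up example sequence, not a sequence from this application.
    example = "GGGMKTAYIAKQRLPETGGHHHHHH"
    print("C-terminal LPXTG usable:", has_c_terminal_sortase_motif(example))
    print("N-terminal Gly count:", n_terminal_glycine_count(example))
```

A building block passing the first check would be a candidate for C-terminal labelling with an oligoglycine-modified cargo, and one with at least one N-terminal glycine a candidate for N-terminal labelling with an LPXTG-bearing cargo. Actual accessibility of the motif still has to be confirmed experimentally.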
Hence, by selecting appropriate (possibly initially capped) conjugation sites, the skilled person is able to attach or conjugate different cargos to the building block.
Solvent-accessible positions
As described above, the at least one conjugation site or attachment point present in the protein-based carrier building block is preferably located at a solvent-accessible position in the building block. Preferably all conjugation sites or attachment points present in the protein-based carrier building block are located at solvent-accessible positions in the building block.
The skilled person is able to identify "solvent-accessible positions" in the carrier building block precursor. This can be performed in silico by means of computer modelling. For instance, the skilled person can make use of readily available software tools such as MAESTRO (Schrödinger, LLC, New York, NY, 2021), a multi-agent prediction system, based on statistical scoring functions (SSFs) and different machine learning approaches, see, e.g., Laimer et al. BMC Bioinformatics (2015) 16:116. In addition, the skilled person can also make use of readily available software tools such as YASARA (www.yasara.org), for identifying at least potential solvent-accessible positions for the at least one conjugation site of the building block. With the help of in silico tools such as MAESTRO or YASARA, the skilled person is able to identify solvent-accessible positions that are potentially suitable for engineering conjugation sites as defined above. An example of how to identify such positions is provided in the examples of this application (e.g., Examples 1-3). As described therein, a protein is selected as starting point for developing the protein-based carrier building block (the so-called "building block precursor"). Using, e.g., MAESTRO, residues in the building block precursor with a Solvent-Accessible Surface Area (SASA) greater than or equal to, e.g., 27 Å² (square angstrom) can be considered to be solvent-accessible. The stability (ΔG in solvent) of the mutation of each of the identified residues (e.g., to a cysteine residue) can then be calculated, see, e.g., Laimer J. et al, "MAESTRO-multi agent stability prediction upon point mutations", BMC Bioinformatics, 2015, 16:116, for further details.
Destabilizing mutations (e.g., mutations for which the calculated ΔG in solvent is higher) are generally not further considered as potential positions for conjugation sites or attachment points. Hence, once potentially suitable conjugation sites are identified with the help of tools such as MAESTRO or YASARA, the stability (ΔG in solvent) of the mutation of each of the identified residues (e.g., to a cysteine residue) is calculated. Those residues with lower calculated ΔG in solvent would preferably be further selected as potential positions for conjugation sites or attachment points. For instance, ΔG values in the range of -20 to +5 kcal/mol can be considered as non-destabilizing mutations. The skilled person will understand that the ΔG value for each of the mutations of the identified residues may vary depending on the specific protein and/or the specific mutations considered. The skilled person will also understand that the preferred mutations are those whose ΔG values are the lowest. Depending on these ΔG values, the number of conjugation sites and the type of cargo that will be conjugated, the skilled person will further select certain positions over others among the ones initially identified as potentially solvent-accessible with the help of tools such as MAESTRO or YASARA.
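The two-step selection described above (a solvent-accessibility cut-off followed by a stability window) can be sketched as a simple filter. This is an illustration under stated assumptions: the record type, function name and all numerical values in the example are hypothetical, and the per-residue surface areas and stability values are assumed to have been exported beforehand from a modelling tool. The default thresholds are the exemplary values given in the text (27 square angstrom; -20 to +5 kcal/mol).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Candidate:
    """One precursor residue considered for mutation (e.g., to Cys)."""
    position: int      # residue number in the building block precursor
    residue: str       # one-letter code of the original residue
    sasa: float        # solvent-accessible surface area, square angstrom
    delta_g: float     # predicted stability value of the mutant, kcal/mol


def shortlist(candidates: List[Candidate],
              sasa_min: float = 27.0,
              dg_min: float = -20.0,
              dg_max: float = 5.0) -> List[Candidate]:
    """Keep solvent-accessible, non-destabilizing candidates,
    ordered from lowest to highest predicted stability value."""
    kept = [c for c in candidates
            if c.sasa >= sasa_min and dg_min <= c.delta_g <= dg_max]
    return sorted(kept, key=lambda c: c.delta_g)


if __name__ == "__main__":
    # Illustrative numbers only; not data from the examples.
    pool = [
        Candidate(12, "S", 45.0, -1.2),
        Candidate(30, "A", 10.0, -3.0),   # buried: fails the SASA cut-off
        Candidate(57, "T", 60.0, 7.5),    # destabilizing: outside the window
        Candidate(88, "S", 27.0, -4.0),
    ]
    for c in shortlist(pool):
        print(c.position, c.residue, c.sasa, c.delta_g)
```

The ordering by lowest value reflects the stated preference for the most stabilizing mutations; the final choice would additionally depend on the number of sites and the cargo type, which this sketch does not model.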
Alternatively or additionally, the skilled person can use hydrogen/deuterium exchange mass spectrometry (HDX-MS) to determine at least potential solvent-accessible positions in a protein. HDX-MS reports on the local chemical environment and solvent accessibility of the protein backbone by monitoring the exchange of peptide bond amide protons with the deuterons of a D2O solvent. The rate of hydrogen-deuterium exchange is dependent on the solvent accessibility and folded state of the protein (see Englander SW. et al., "Hydrogen exchange: the modern legacy of Linderstrøm-Lang", Protein Sci., 1997, 6(5):1101-9).
If the identified solvent-accessible position is to be occupied by a certain amino acid with a reactive group in its side chain (e.g., by a cysteine), the in silico modelling (e.g., with MAESTRO) will also take into account the potential interactions of the reactive group of that amino acid (e.g., the -SH present in the side chain of the cysteine) with other reactive groups present in the side chain of other amino acids present in the protein-based carrier building block (e.g., with other -SH groups present in the protein, if any).
Additionally or alternatively, the "solvent-accessible positions" can be identified and/or verified empirically. For instance, the "solvent-accessible positions" theoretically identified using available in silico software tools such as MAESTRO, as described above, may preferably be empirically confirmed by manufacturability. Formulation and process stability of potential building block candidates help narrow down lead candidates at an early stage, prior to large-scale manufacturing (see the examples and also, e.g., Ramachander, R., Rathore, N. (2013), "Molecule and manufacturability assessment leading to robust commercial formulation for therapeutic proteins" in: Kolhe, P., Shah, M., Rathore, N. (eds) Sterile Product Development, AAPS Advances in the Pharmaceutical Sciences Series, vol 6. Springer, New York, NY). Hence, once potentially suitable solvent-accessible positions have been theoretically identified in the protein-based building block precursor, expression levels, conjugation efficiency, formulation, quality control, solubility, process stability, etc., of the resulting protein-based carrier building block should preferably be evaluated. Solvent-accessible positions which lead to building blocks excelling in expression yield, manufacturability, solubility and/or stability are preferred, see the examples for further details.
For instance, once suitable solvent-accessible positions have been theoretically identified in the building block precursor, protein expression of the selected variants (i.e., the resulting protein-based building blocks with amino acid(s) bearing the conjugation site(s) in the theoretically-selected solvent-accessible position(s)) may take place. In this step it can be ascertained whether the introduction of the specific amino acids at the theoretically-identified solvent-accessible positions (e.g., point mutations, addition of amino acids at the N- and/or C-terminus of the protein, etc.) has a negative impact on, e.g., the synthesis, expression levels, conjugation efficiency or 3D globular structure of each specific variant. In addition, the minimal required solubility and lack of specific binding to human proteins (and, optionally, to non-protein molecules and/or non-human proteins, preferably to the precursor's target), as described in detail above, can be assessed. Possible changes in 3D structure could be assessed, for example, by CD (circular dichroism) spectrum analysis, as described in detail above. In addition, the stability of the resulting variants can also be confirmed with a Thermal Shift Assay. This assay detects protein melting temperatures (Tm) and can thus be used to check protein stability and to characterize the stability/folding of a protein's 3D structure. SYPRO® Orange is a naturally quenched dye that interacts with the hydrophobic core of proteins, which becomes exposed following thermal denaturation. The temperature at the midpoint of the thermal denaturation transition is taken as the melting temperature Tm. This is a way of assessing the stability of the resulting variants or mutants.
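The read-out of the Thermal Shift Assay described above can be illustrated with a short calculation. The following sketch is an assumption about how Tm might be estimated from a melt curve, not the analysis method used in the examples: it takes Tm as the temperature at which the fluorescence first rises through the halfway point between its minimum and maximum, using linear interpolation between measured points. The function name and the example readings are hypothetical.

```python
from typing import Sequence


def estimate_tm(temps: Sequence[float], fluorescence: Sequence[float]) -> float:
    """Estimate Tm as the half-maximum crossing on the rising part of a melt curve.

    temps and fluorescence are paired readings in order of increasing temperature.
    """
    if len(temps) != len(fluorescence) or len(temps) < 2:
        raise ValueError("need two equally long series with at least 2 points")
    low, high = min(fluorescence), max(fluorescence)
    half = (low + high) / 2.0
    peak = list(fluorescence).index(high)
    # Search only up to the fluorescence maximum; the signal often drops
    # again after unfolding, which is ignored here.
    for i in range(1, peak + 1):
        f0, f1 = fluorescence[i - 1], fluorescence[i]
        if f0 < half <= f1:
            fraction = (half - f0) / (f1 - f0)
            return temps[i - 1] + fraction * (temps[i] - temps[i - 1])
    raise ValueError("no rising half-maximum crossing found")


if __name__ == "__main__":
    # Made-up readings for illustration only.
    t = [40, 45, 50, 55, 60, 65, 70]
    f = [100, 100, 120, 300, 480, 500, 450]
    print("Estimated Tm (degrees C):", estimate_tm(t, f))
```

Comparing the Tm estimated this way for a variant against that of the building block precursor gives a simple indication of whether a mutation has destabilized the fold. Instrument software commonly uses derivative or curve-fitting approaches instead, which may give slightly different values.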
In addition, "model cargos" can be attached or conjugated to the selected variants, in order to quantify the extent of conjugation (conjugation efficiency), i.e., to ascertain whether the resulting protein-based building block with the conjugation sites at the selected solvent-accessible positions will in practice be suitable for the attachment or conjugation of the desired cargos. A "model cargo" may be any molecule with a molecular weight higher than, e.g., 100 Da. For instance, if a potential conjugation site is a thiol group, a "model cargo" may be a maleimide-modified alanine (e.g., N-Maleoyl-β-alanine), or a biotin-maleimide, as described in Junutula, J. et al. ("Site-specific conjugation of a cytotoxic drug to an antibody improves the therapeutic index", Nat Biotechnol, 26, 925-932 (2008)). For instance, if the conjugation of the "model cargo(s)" results in a stable conjugate (protein-based building block with one or more model cargos conjugated to it), with an acceptable extent of conjugation (to be decided on a case-by-case basis, for example 90% conjugation efficiency or more, such as 90%, 95%, 97%, or 99% conjugation efficiency or more), allowing a standard PK in vivo, preserving its globular 3D structure and the conjugation status in vivo, etc., those solvent-accessible positions should be preferred for cargo conjugation, and conjugation of the desired cargo(s) may take place, see also the examples below.
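One way to express the extent of conjugation numerically is as the fraction of available conjugation sites that carry a model cargo. The sketch below assumes, purely for illustration, that the relative abundance of each conjugate species (building block carrying 0, 1, 2, ... model cargos) has been measured, for instance by mass spectrometry; the function names, the input format and the numbers are hypothetical, and the 90% default is the exemplary threshold mentioned above.

```python
from typing import Dict


def conjugation_efficiency(species_abundance: Dict[int, float], n_sites: int) -> float:
    """Fraction of conjugation sites occupied, from species abundances.

    species_abundance maps 'number of cargos attached' to relative abundance.
    """
    if n_sites <= 0:
        raise ValueError("n_sites must be positive")
    if any(k < 0 or k > n_sites for k in species_abundance):
        raise ValueError("cargo count outside 0..n_sites")
    total = sum(species_abundance.values())
    if total <= 0:
        raise ValueError("total abundance must be positive")
    occupied = sum(k * a for k, a in species_abundance.items())
    return occupied / (n_sites * total)


def is_acceptable(efficiency: float, threshold: float = 0.90) -> bool:
    """Case-by-case acceptance check; 0.90 is only an exemplary threshold."""
    return efficiency >= threshold


if __name__ == "__main__":
    # Illustrative abundances for a building block with three Cys sites.
    observed = {0: 2.0, 1: 8.0, 2: 30.0, 3: 60.0}
    eff = conjugation_efficiency(observed, n_sites=3)
    print(f"efficiency = {eff:.3f}, acceptable = {is_acceptable(eff)}")
```

This definition weights each species by the number of cargos it carries. Other definitions, such as the fraction of fully conjugated molecules, are equally possible and would give a stricter figure.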
Point mutations
In one embodiment, the at least one conjugation site present in the building block may be generated by introducing specific point mutations at solvent-accessible positions in the peptide sequence of the building block precursor. For instance, point mutations may be introduced at solvent-accessible positions in the building block precursor in order to generate the protein-based building block comprised in the molecule of the present technology, which comprises at least one conjugation site or attachment point at defined solvent-accessible positions, as described herein.
For instance, the conjugation sites may be generated by mutating specific amino acids preferably at solvent-accessible positions of a building block precursor to cysteine ("Cys-mutations"). Alternatively or additionally, the conjugation sites may be generated by mutating specific amino acids preferably at solvent-accessible positions of a building block precursor to natural or non-natural amino acids with a reactive group in their side chain. Amino acid distribution data of occurrence at certain positions (e.g., Cys, Ser) in the building block precursor can also be used to guide the design and introduction of conjugation sites.
Additionally or alternatively, the building block precursor may be modified by adding one or more amino acids at the N- and/or C-terminal of the protein sequence, to introduce at least one conjugation site or attachment point preferably at a solvent-accessible position, as described herein, to generate the protein-based building block.
In another embodiment, the at least one conjugation site may already be present, preferably at solvent-accessible positions, in the protein-based building block precursor, and there is no need to generate it. This is the case for the primary amine at the N-terminus of the building block, the -COOH at the C-terminus or in the side chain of the building block, the primary amine in the side chain of, e.g., a lysine preferably already present at a solvent-accessible position in the building block precursor or the thiol group in the side chain of a cysteine preferably already present at a solvent-accessible position in the building block precursor.
If the building block comprises more than one conjugation site, these can be generated by introducing, e.g., specific point mutations preferably at solvent-accessible positions in the peptide sequence of the building block precursor. Additionally or alternatively, other suitable conjugation sites or attachment points may already be present, preferably at solvent-accessible positions, in the building block precursor, i.e., there is no need to generate these conjugation sites by introducing, e.g., specific point mutations and/or adding one or more amino acids at the N- and/or C-terminus of the building block precursor. The skilled person will decide on the number and position of the attachment point(s) or conjugation site(s) based on the protein-based building block and the cargo(s) to be attached to it, directly or by means of a linker, as described herein.
As described in detail above, preferably, the point mutations are non-destabilizing point mutations. Stability of mutants can be calculated with different methods which predict the impact of mutations on protein stability, e.g., based on artificial intelligence (AI). For instance, stability of mutants can be calculated with MAESTRO, as defined above and explained in detail in the examples, and can also be confirmed empirically by manufacturability (including but not limited to expression level and stability assessment, as described above).
In a preferred embodiment, the point mutations are mutations of amino acids preferably located at solvent-accessible positions in the building block precursor to cysteines. In another embodiment, the point mutation consists of the replacement of a serine residue preferably in a solvent-accessible position of the building block precursor by a cysteine. In another embodiment, the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to lysines. In another embodiment, the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to tyrosines. In another embodiment, the point mutations are mutations of preferably solvent-accessible amino acids in the building block precursor to a natural or non-natural amino acid, as described above.
Addition of a C- or N-terminal natural and/or non-natural amino acid with a reactive group in its side chain
For instance, the conjugation sites may be generated by adding, in the building block precursor, one or more C- or N-terminal natural and/or one or more C- or N-terminal non-natural amino acid(s) with a reactive group in its side chain. Preferably, if present, the one or more terminal natural or non-natural amino acid is added at the C-terminus of the building block precursor. For instance, one or more of the conjugation sites is (are) generated by adding an N- or C-terminal cysteine, an N- or C-terminal tyrosine and/or an N- or C-terminal non-natural amino acid to the protein-based building block precursor. Preferably, at least one of the conjugation sites is generated by adding an N- or C-terminal tyrosine to the protein-based building block precursor, preferably a C-terminal tyrosine. In a preferred embodiment, the N- and/or C-terminal Tyr is preceded/followed by flexible (GG) or ((G4S1)1-3GG) sequences (e.g., -GGY, -(G4S1)1-3GGY, YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
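The terminal tyrosine tags mentioned above can be assembled programmatically when designing variants. The sketch below makes an explicit assumption about the tag notation, namely that the flexible linker unit is the pentapeptide GGGGS repeated zero to three times (zero giving the plain GG tag); the function names and the precursor sequence are hypothetical.

```python
def add_c_terminal_tyr_tag(precursor: str, linker_repeats: int = 0) -> str:
    """Append -(GGGGS)n-GGY to a precursor sequence (n = 0 gives -GGY)."""
    if not 0 <= linker_repeats <= 3:
        raise ValueError("linker_repeats must be between 0 and 3")
    return precursor + "GGGGS" * linker_repeats + "GGY"


def add_n_terminal_tyr_tag(precursor: str, linker_repeats: int = 0) -> str:
    """Prepend Y-(GGGGS)n-GG- to a precursor sequence (n = 0 gives YGG-)."""
    if not 0 <= linker_repeats <= 3:
        raise ValueError("linker_repeats must be between 0 and 3")
    return "Y" + "GGGGS" * linker_repeats + "GG" + precursor


if __name__ == "__main__":
    # "MKT" stands in for a real precursor sequence.
    print(add_c_terminal_tyr_tag("MKT"))
    print(add_c_terminal_tyr_tag("MKT", linker_repeats=2))
    print(add_n_terminal_tyr_tag("MKT", linker_repeats=1))
```

Only two of the listed tag layouts are covered here. The remaining variants (with the linker placed after the GG spacer, or with the serine leading the repeat unit) would follow the same pattern.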
Hence, the at least one protein-based carrier building block comprised in the molecule of the present technology may comprise an N- and/or C-terminal Cys, Tyr, and/or non-natural amino acid, for instance a C-terminal Tyr, as in a -GGY or -(G4S1)1-3GGY tag (sequence).
In addition, the at least one protein-based carrier building block comprised in the molecule of the present technology may comprise an N- and/or C-terminal conjugation site or attachment point suitable for conjugation with sortase, as described above. In these cases, the protein-based carrier building block should be engineered to comprise a C-terminal sortase recognition motif (LPXTG, where X can be any amino acid), an N-terminal polygly ((Gly)1-5) tag or both. In addition, if N-to-N and/or C-to-C attachments or conjugations are desired, the protein-based carrier building block should be engineered to comprise a C-terminal sortase recognition motif (for C-to-C attachments) or an N-terminal polygly ((Gly)1-5) tag (for N-to-N attachments), as described in detail above. See in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al., listed above.
Finally, as described above, the conjugation sites may be generated by combinations of the above mechanisms, e.g., the at least one conjugation site can be obtained by performing point mutations (e.g., Ser to Cys at a solvent-accessible position of the building block, as described above), and/or by adding a C- and/or N-terminal amino acid, such as cysteine, or tyrosine, or a non-natural amino acid, or a sortase recognition motif, or a polygly ((Gly)1-5) tag to the protein-based building block precursor, as described above.
Examples of building blocks
Small globular non-human protein-based building blocks
The protein-based carrier building block(s) comprised in the molecule of the present technology may be based on a small globular non-human protein. In the context of the present technology, a "small globular non-human protein" refers to a non-human protein which has a size (molecular mass) of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa, as described herein and which has a globular three-dimensional (3D) structure, as described herein. In addition, the at least one non-human protein-based carrier building block does not specifically bind to any human protein, as defined in this specification, preferably it also does not specifically bind to any non-protein molecule (such as nucleic acids (e.g., DNA, RNA), glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), such as any human non-protein molecule (biomolecule) (such as human DNA, human RNA, human glycans, human lipids (e.g., such as phosphatidylserine (PS)), etc.), preferably it also does not specifically bind to any non-protein molecule (such as nucleic acids (DNA, RNA), glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor binds specifically, if any. Further, preferably, the at least one non-human protein-based carrier building block (i) does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay, (ii) does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to a virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, and/or (iii) does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein.
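As an illustration only, the KD criterion stated above (no specific binding, or a KD greater than 5×10⁻⁴ mol/litre) can be expressed as a simple check. This is a minimal sketch: the function name and the convention of passing None when no binding is measurable are assumptions made for the example and do not come from the specification.

```python
# Threshold taken from the text: KD greater than 5e-4 mol/litre.
KD_THRESHOLD_MOL_PER_L = 5e-4


def meets_non_binding_criterion(kd_mol_per_l):
    """Return True if the criterion is met.

    kd_mol_per_l: measured KD in mol/litre, or None when no binding
    could be measured (hypothetical convention for this sketch).
    """
    if kd_mol_per_l is None:
        # No measurable binding counts as "does not specifically bind".
        return True
    if kd_mol_per_l <= 0:
        raise ValueError("KD must be a positive concentration")
    # Strictly greater than the threshold, as worded in the text.
    return kd_mol_per_l > KD_THRESHOLD_MOL_PER_L
```

For example, a measured KD of 1e-3 mol/litre would satisfy the criterion, whereas a nanomolar KD (1e-9 mol/litre) would not.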
In a preferred embodiment, the protein-based carrier building block does not specifically bind any non-human protein and/or non-protein molecule, preferably the precursor's target, when a cargo is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, as described above. Hence, in a preferred embodiment, the molecule of the present technology, which comprises at least one protein-based building block and at least one cargo attached to the at least one protein-based building block through the at least one conjugation site or attachment point, does not specifically bind any non-human protein or non-protein molecule, such as any human non-protein molecule, as described herein, in particular it does not specifically bind any protein or non-protein molecule to which the building block precursor binds, if any.
As described above, in the context of the present technology, if the protein-based building block or molecule of the present technology shows any interaction with one or more human protein (or non-human protein, or non-protein molecule, as described above), such interaction is characterized by low specificity and/or low affinity, as described in detail above.
A human protein is a protein present in the human body. The Human Protein Atlas (HPA, https://www.proteinatlas.org) is a Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues, and organs.
Small globular non-human proteins, in the context of the present technology, include proteins which are derived from human proteins, but which have been modified so that they are no longer human proteins. Examples of small globular non-human proteins are ISVDs, such as "human ISVDs" (e.g., VH, VL) and "non-human ISVDs" (e.g., VHH, non-human VH, VL or engineered ISVDs), DARPins (derived from ankyrin repeat proteins), affibodies or affitins.
The small globular non-human proteins may have a therapeutic or targeting activity.
Immunoglobulin single variable domain (ISVD)-based building blocks
In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology is based on a polypeptide which comprises or, alternatively, consists of, at least one immunoglobulin single variable domain (ISVD), such as an ISVD derived from VH or VHH (a heavy-chain ISVD).
As described above, the protein-based carrier building block comprised in the molecule of the present technology has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, such as about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa. In addition, the at least one building block comprised in the molecule of the present technology does not specifically bind to any human protein, as defined in this specification, preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), such as any human non-protein molecule (such as human DNA, human RNA, human glycans, human lipids (e.g., such as phosphatidylserine (PS)), etc.), preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor binds specifically, if any.
In a preferred embodiment, the protein-based carrier building block does not specifically bind any non-human protein and/or non-protein molecule, preferably the precursor's target, when a cargo is conjugated to the at least one attachment point or conjugation site on the protein-based carrier building block, as described above. Hence, in a preferred embodiment, the molecule of the present technology, which comprises at least one protein-based building block and at least one NLS attached to the at least one protein-based building block through the at least one conjugation site or attachment point, does not specifically bind any non-human protein or non-protein molecule, such as any human non-protein molecule, as described herein, in particular it does not specifically bind any protein or non-protein molecule to which the building block precursor binds, if any.
As described above, in the context of the present technology, if the protein-based building block or molecule of the present technology shows any interaction with one or more human protein (or non-human protein, or non-protein molecule, as described above), such interaction is characterized by low specificity and/or low affinity, as described in detail above.
Hence, in the specific embodiment where the at least one protein-based building block is based on an ISVD, preferably a heavy-chain ISVD, the resulting ISVD-based building block does not specifically bind to any human protein. In addition, as explained above, it is preferred that the ISVD-based building block does not specifically bind to any non-protein molecule, such as any human non-protein molecule. Furthermore, it is also preferred that the ISVD-based building block does not specifically bind to any non-human protein or non-protein molecule to which the protein-based carrier building block precursor specifically binds, if any, as described above.
In the context of the present technology, an "ISVD-based building block" refers to a protein-based building block which derives from an ISVD, i.e., which is structurally similar to an ISVD but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the ISVD specifically binds. For instance, the ISVD-based building block has a sequence identity of at least 60%, or 70%, or 80% with an ISVD, e.g., with its ISVD precursor. For instance, the ISVD-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with an ISVD, e.g., with its ISVD precursor. For instance, an ISVD-based building block may share the whole amino acid sequence with its ISVD precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids. In addition, the ISVD-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind any human protein and preferably does not specifically bind any protein or non-protein molecule to which the precursor specifically binds.
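The sequence identity thresholds above can be illustrated with a short calculation. The sketch below assumes that the building block and its precursor have already been aligned to equal length (with "-" marking gaps) and that identity is counted as identical residues over the aligned length. Both points are assumptions made for illustration; the specification may define the calculation differently, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length.

    Gaps are written as '-' and never count as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)
```

With a toy ten-residue pair that differs at a single position (not a real ISVD sequence), the function returns 90%, which would satisfy the "at least 85%" and "at least 90%" thresholds but not "at least 91%".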
The skilled person is aware of means for eliminating the specific binding properties of a certain ISVD precursor, e.g., by performing mutations in the amino acids responsible for the binding of the ISVD to the target (e.g., in one or more of the amino acids forming the CDRs of the ISVD), by adding amino acids and/or by deleting amino acids from the precursor's sequence.
The term "immunoglobulin single variable domain" (ISVD), interchangeably used with "single variable domain", defines immunoglobulin molecules wherein the antigen binding site is present on, and formed by, a single immunoglobulin domain. This sets ISVDs apart from "conventional" immunoglobulins (e.g., monoclonal antibodies) or their fragments (such as Fab, Fab', F(ab')2, scFv, di-scFv), wherein two immunoglobulin domains, in particular two variable domains, interact to form an antigen binding site. Typically, in conventional immunoglobulins, a heavy chain variable domain (VH) and a light chain variable domain (VL) interact to form an antigen binding site. In this case, the complementarity determining regions (CDRs) of both VH and VL will contribute to the antigen binding site, i.e., a total of 6 CDRs will be involved in antigen binding site formation.
In view of the above definition, the antigen-binding domain of a conventional 4-chain antibody (such as an IgG, IgM, IgA, IgD or IgE molecule; known in the art) or of a Fab fragment, a F(ab')2 fragment, an Fv fragment such as a disulphide linked Fv or a scFv fragment, or a diabody (all known in the art) derived from such conventional 4-chain antibody, would normally not be regarded as an ISVD as, in these cases, binding to the respective epitope of an antigen would normally not occur by one single immunoglobulin domain but by a pair of associating immunoglobulin domains such as light and heavy chain variable domains, i.e., by a VH-VL pair of immunoglobulin domains, which jointly bind to an epitope of the respective antigen.
In contrast, generally, ISVDs are capable of specifically binding to an epitope of the antigen without pairing with an additional immunoglobulin variable domain. The binding site of an ISVD is formed by a single VH, a single VHH or single VL domain.
In the context of the present technology, in the specific embodiment where the at least one protein-based building block is based on an ISVD, the ISVD building block precursor may be a light chain variable domain sequence (e.g., a VL-sequence) or a suitable fragment thereof; or a heavy chain variable domain sequence (e.g., a VH-sequence or VHH sequence) or a suitable fragment thereof; as long as the resulting building block has a globular 3D structure, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and is soluble, as defined in detail above. An ISVD which may preferably be the precursor of the protein-based building block comprised in the molecule of the present technology can for example be a heavy chain ISVD, such as a VH, VHH, including a camelized VH or humanized VHH. In one embodiment, the protein-based building block precursor is a VHH, including a camelized VH or humanized VHH, as long as the resulting protein-based building block is soluble, has a globular 3D structure, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to human proteins. In addition, preferably, the resulting building block does not specifically bind to any non-protein molecule, such as DNA, RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans, e.g., glycolipids. Furthermore, preferably, the resulting building block does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above. Heavy chain ISVDs can be derived from a conventional four-chain antibody or from a heavy chain antibody.
For example, the ISVD precursor may be a single domain antibody (or an amino acid sequence that is suitable for use as a single domain antibody), a "dAb" or dAb (or an amino acid sequence that is suitable for use as a dAb) or a Nanobody® ISVD (as defined herein, and including but not limited to a VHH); other single variable domains, or any suitable fragment of any one thereof, as long as the resulting protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein (human) molecule, such as DNA, RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans, e.g., glycolipids, and, preferably, does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above.
Preferably, the ISVD precursor is a VH, a humanized VH, a human VH, a VHH, a humanized VHH or a camelized VH. More preferably, the ISVD precursor is a Nanobody® ISVD (such as a VHH, including a humanized VHH or camelized VH) or a suitable fragment thereof, as long as the protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein (human) molecule, such as DNA, RNA, lipids (e.g., such as phosphatidylserine (PS)) or glycans, e.g., glycolipids, and, preferably, does also not specifically bind to any non-human protein to which the protein-based carrier building block precursor specifically binds, if any, as described above. Nanobody® is a registered trademark of Ablynx N.V.
"VHH domains", also known as VHHs, VHH antibody fragments, and VHH antibodies, have originally been described as the antigen binding immunoglobulin variable domain of "heavy chain antibodies"; i.e., of "antibodies devoid of light chains", see Hamers-Casterman et al., Nature, 363: 446-448, 1993. The term "VHH domain" has been chosen in order to distinguish these variable domains from the heavy chain variable domains that are present in conventional 4-chain antibodies, which are referred to herein as "VH domains", and from the light chain variable domains that are present in conventional 4-chain antibodies, which are referred to herein as "VL domains". For a further description of VHHs, reference is made to the review article by Muyldermans ("Single domain camel antibodies: current status", J Biotechnol., 2001, 74: 277-302). VHH domains can be obtained from heavy chain-only antibodies (HCAbs) that are circulating in Camelidae, see e.g., Muyldermans S., "A guide to: generation and design of nanobodies", FEBS J., 2021, 288(7):2084-2102. Hence, in a preferred embodiment, the ISVD-based building block has a sequence identity of at least 80% with a VHH (such as a humanized VHH or camelized VH), e.g., its VHH precursor. For instance, the ISVD-based building block has a sequence identity of at least 60%, or at least 70%, or at least 80%, or at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with a VHH, e.g., its VHH precursor. For instance, the ISVD-based building block may share the whole amino acid sequence with its VHH precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids, which are different in the protein-based carrier building block.
Typically, the generation of immunoglobulins involves the immunization of experimental animals, fusion of immunoglobulin producing cells to create hybridomas and screening for the desired specificities. Alternatively, immunoglobulins can be generated by screening of naive, immune, or synthetic libraries, e.g., by phage display.
The generation of immunoglobulin sequences, such as VHHs, has been described extensively in various publications, among which WO 94/04678, Hamers-Casterman et al. 1993 ("Naturally occurring antibodies devoid of light chains", Nature, 363: 446-448, 1993) and Muyldermans et al. 2001 ("Single domain camel antibodies: current status", J Biotechnol., 2001, 74: 277-302) can be exemplified. In these methods, camelids are immunized with the target antigen in order to induce an immune response against said target antigen. The repertoire of VHHs obtained from said immunization is further screened for VHHs that bind (or not) a target antigen.
In the context of the present technology, immunoglobulin sequences of different origin may be used, comprising mouse, rat, rabbit, donkey, human and camelid immunoglobulin sequences. In the context of the present technology, fully human, humanized or chimeric sequences are also included. In the context of the present technology, camelid immunoglobulin sequences and humanized camelid immunoglobulin sequences, or camelized domain antibodies, e.g. camelized dAb as described by Ward et al. (Nature, 341: 544, 1989) (see for example WO 94/04678 and Davies and Riechmann, "'Camelising' human antibody fragments: NMR studies on VH domains", Febs Lett., 339:285-290, 1994 and "Single antibody domains as small recognition units: design and in vitro antigen selection of camelized, human VH domains with improved protein stability", Prot. Eng., 1996, 9(6):531-537) are also included.
A "humanized VHH" comprises an amino acid sequence that corresponds to the amino acid sequence of a naturally occurring VHH domain, but that has been "humanized", i.e., by replacing one or more amino acid residues in the amino acid sequence of said naturally occurring VHH sequence (and in particular in the framework sequences) by one or more of the amino acid residues that occur at the corresponding position(s) in a VH domain from a conventional 4-chain antibody from a human being (e.g., indicated above). This can be performed in a manner known per se, which will be clear to the skilled person, for example on the basis of the further description herein and the prior art (e.g., WO 2008/020079). Again, it should be noted that such humanized VHHs can be obtained in any suitable manner known per se and thus are not strictly limited to polypeptides that have been obtained using a polypeptide that comprises a naturally occurring VHH domain as a starting material. Preferably, if the building block is a VHH, the VHH is a humanized VHH.
A "camelized VH" comprises an amino acid sequence that corresponds to the amino acid sequence of a naturally occurring VH domain, but that has been "camelized", i.e., by replacing one or more amino acid residues in the amino acid sequence of a naturally occurring VH domain from a conventional 4-chain antibody by one or more of the amino acid residues that occur at the corresponding position(s) in a VHH domain of a heavy chain antibody. This can be performed in a manner known per se, which will be clear to the skilled person, for example on the basis of the further description herein and the prior art (e.g. WO 2008/020079). Such "camelizing" substitutions are usually inserted at amino acid positions that form and/or are present at the VH-VL interface, and/or at the so-called Camelidae hallmark residues, as defined herein (see for example WO 94/04678 and Davies and Riechmann, 1994 and 1996, supra). In one embodiment, the VH sequence that is used as a starting material or starting point for generating or designing the camelized VH is a VH sequence from a mammal, or the VH sequence of a human being, such as a VH3 sequence. However, it should be noted that such camelized VH can be obtained in any suitable manner known per se and thus are not strictly limited to polypeptides that have been obtained using a polypeptide that comprises a naturally occurring VH domain as a starting material.
The structure of an ISVD sequence can be considered to be comprised of four framework regions ("FRs"), which are referred to in the art and herein as "Framework region 1" ("FR1"); as "Framework region 2" ("FR2"); as "Framework region 3" ("FR3"); and as "Framework region 4" ("FR4"), respectively; which framework regions are interrupted by three complementarity determining regions ("CDRs"), which are referred to in the art and herein as "Complementarity Determining Region 1" ("CDR1"); as "Complementarity Determining Region 2" ("CDR2"); and as "Complementarity Determining Region 3" ("CDR3"), respectively.
Also, as further described in paragraph q) on pages 58 and 59 of WO 2008/020079, the amino acid residues of an ISVD are numbered according to the general numbering for VH domains given by Kabat et al. ("Sequence of proteins of immunological interest", US Public Health Services, NIH Bethesda, MD, Publication No. 91), as applied to VHH domains from Camelids in the article of Riechmann L. and Muyldermans S., "Single domain antibodies: comparison of camel VH and camelised human VH domains", J Immunol Methods, 1999, 231(1-2):25-38 (see for example Figure 2 of this publication). It should be noted that - as is well known in the art for VH domains and for VHH domains - the total number of amino acid residues in each of the CDRs may vary and may not correspond to the total number of amino acid residues indicated by the Kabat numbering. That is, one or more positions according to the Kabat numbering may not be occupied in the actual sequence, or the actual sequence may contain more amino acid residues than the number allowed for by the Kabat numbering. This means that, generally, the numbering according to Kabat may or may not correspond to the actual numbering of the amino acid residues in the actual sequence. The total number of amino acid residues in a VH domain and a VHH domain will usually be in the range of from 110 to 120, often between 112 and 115. It should however be noted that smaller and longer sequences may also be suitable for the purposes described herein.
In the present application CDR sequences may also be described according to Kabat numbering with AbM CDR annotation, as described in Kontermann and Dübel (Eds. 2010, Antibody Engineering, vol 2, Springer Verlag Heidelberg Berlin, Martin, Chapter 3, pp. 33-51).
According to this method, FR1 comprises the amino acid residues at positions 1-25, CDR1 comprises the amino acid residues at positions 26-35, FR2 comprises the amino acids at positions 36-49, CDR2 comprises the amino acid residues at positions 50-58, FR3 comprises the amino acid residues at positions 59-94, CDR3 comprises the amino acid residues at positions 95-102, and FR4 comprises the amino acid residues at positions 103-113.
Determination of CDR regions may also be done according to different methods. In the CDR determination according to Kabat, FR1 of an ISVD comprises the amino acid residues at positions 1-30, CDR1 of an ISVD comprises the amino acid residues at positions 31-35, FR2 of an ISVD comprises the amino acids at positions 36-49, CDR2 of an ISVD comprises the amino acid residues at positions 50-65, FR3 of an ISVD comprises the amino acid residues at positions 66-94, CDR3 of an ISVD comprises the amino acid residues at positions 95-102, and FR4 of an ISVD comprises the amino acid residues at positions 103-113.
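The two sets of region boundaries given above (Kabat numbering with AbM CDR annotation, and CDR determination according to Kabat) can be compared with a small lookup. This is a sketch for illustration only: the function and table names are hypothetical, and Kabat insertion positions (such as lettered positions within the CDRs) are deliberately ignored, so only plain integer positions 1 to 113 are handled.

```python
# Region boundaries (inclusive) exactly as listed in the text.
REGION_BOUNDARIES = {
    "abm": [
        ("FR1", 1, 25), ("CDR1", 26, 35), ("FR2", 36, 49),
        ("CDR2", 50, 58), ("FR3", 59, 94), ("CDR3", 95, 102),
        ("FR4", 103, 113),
    ],
    "kabat": [
        ("FR1", 1, 30), ("CDR1", 31, 35), ("FR2", 36, 49),
        ("CDR2", 50, 65), ("FR3", 66, 94), ("CDR3", 95, 102),
        ("FR4", 103, 113),
    ],
}


def region_of(position, scheme="kabat"):
    """Return the region name for an integer Kabat position."""
    for name, start, end in REGION_BOUNDARIES[scheme]:
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside 1-113")
```

For example, position 28 falls in FR1 under the Kabat CDR determination but in CDR1 under the AbM annotation, and position 60 falls in CDR2 under Kabat but in FR3 under AbM.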
In such an immunoglobulin sequence, the framework sequences may be any suitable framework sequences, and examples of suitable framework sequences will be clear to the skilled person, for example on the basis of the standard handbooks and the further disclosure and prior art mentioned herein.
The framework sequences are a suitable combination of immunoglobulin framework sequences or framework sequences that have been derived from immunoglobulin framework sequences, for example by humanization or camelization. For example, the framework sequences may be framework sequences derived from a light chain variable domain (e.g., a VL-sequence) and/or from a heavy chain variable domain (e.g. a VH-sequence or VHH sequence). In one aspect, the framework sequences are either framework sequences that have been derived from a VHH-sequence in which said framework sequences may optionally have been partially or fully humanized or are conventional VH sequences that have been camelized (as defined herein).
In particular, the framework sequences present in the ISVD sequences referred to in the present technology may contain one or more of the Hallmark residues (as defined herein), such that the ISVD sequence is a Nanobody® ISVD, such as, e.g., a VHH, including a humanized VHH or camelized VH. Some non-limiting examples of suitable combinations of such framework sequences will become clear from the further disclosure herein.
However, it should be noted that, in the context of the present technology, the origin of the ISVD sequence or the origin of the nucleotide sequence used to express it is not limited, nor as to the way that the ISVD sequence or nucleotide sequence is or has been generated or obtained. Thus, the ISVD sequences may be naturally occurring sequences (from any suitable species) or synthetic or semi-synthetic sequences. In a specific but non-limiting aspect, the ISVD sequence is a naturally occurring sequence (from any suitable species) or a synthetic or semi-synthetic sequence, including but not limited to "humanized" (as defined herein) immunoglobulin sequences (such as partially or fully humanized mouse or rabbit immunoglobulin sequences, and in particular partially or fully humanized VHH sequences), "camelized" (as defined herein) immunoglobulin sequences, as well as immunoglobulin sequences that have been obtained by techniques such as affinity maturation (for example, starting from synthetic, random or naturally occurring immunoglobulin sequences), CDR grafting, veneering, combining fragments derived from different immunoglobulin sequences, PCR assembly using overlapping primers, and similar techniques for engineering immunoglobulin sequences well known to the skilled person; or any suitable combination of any of the foregoing.
Similarly, nucleotide sequences may be naturally occurring nucleotide sequences or synthetic or semi-synthetic sequences, and may for example be sequences that are isolated by PCR from a suitable naturally occurring template, e.g., DNA or RNA isolated from a cell, nucleotide sequences that have been isolated from a library (and in particular, an expression library), nucleotide sequences that have been prepared by introducing mutations into a naturally occurring nucleotide sequence (using any suitable technique known per se, such as mismatch PCR), nucleotide sequences that have been prepared by PCR using overlapping primers, or nucleotide sequences that have been prepared using techniques for DNA synthesis known per se.
As described above, the ISVD precursor is preferably a VHH, including a humanized VHH or camelized VH, or a suitable fragment thereof, more preferably a humanized VHH or a suitable fragment thereof. The resulting protein-based building block should be soluble, have a globular 3D structure and not specifically bind to human proteins, preferably should also not specifically bind to any non-protein molecule and preferably should also not specifically bind to any non-human protein to which the VHH precursor specifically binds, if any, as described above. Preferably, as described above, the molecule comprising at least one VHH (including humanized VHH or camelized VH)-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the VHH (including humanized VHH or camelized VH) precursor specifically binds.
Further, preferably, the at least one ISVD-based carrier building block (i) does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay, (ii) does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to a virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, and/or (iii) does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein.
As described above, the ISVD precursor is preferably a VHH, humanized VHH or camelized VH, such as a Nanobody® ISVD, or a suitable fragment thereof, more preferably a humanized Nanobody® ISVD or a suitable fragment thereof. The resulting protein-based building block should be soluble, have a globular 3D structure and not specifically bind to human proteins, preferably should also not specifically bind to any non-protein molecule and preferably should also not specifically bind to any non-human protein to which the VHH, humanized VHH or camelized VH, such as Nanobody® ISVD, precursor specifically binds, if any, as described above. Preferably, as described above, the molecule comprising at least one VHH, humanized VHH or camelized VH, such as Nanobody® ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the VHH, humanized VHH or camelized VH, such as Nanobody® ISVD, precursor specifically binds. For a general description of Nanobody® ISVDs, reference is made to the present description, as well as to the prior art cited herein. In this respect, it should however be noted that this description and the prior art mainly describe Nanobody® ISVDs of the so-called "VH3 class", i.e. Nanobody® ISVDs with a high degree of sequence homology to human germline sequences of the VH3 class such as DP-47, DP-51 or DP-29. It should however be noted that the present technology in its broadest sense can generally use any type of Nanobody® ISVD, and for example also uses the Nanobody® ISVDs belonging to the so-called "VH4 class", i.e., Nanobody® ISVDs with a high degree of sequence homology to human germline sequences of the VH4 class such as DP-78, as for example described in WO 2007/118670.
In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology is derived from a Nanobody® ISVD belonging to the so-called "VH3 class", i.e. a Nanobody® ISVD with a high degree of sequence homology to human germline sequences of the VH3 class such as DP-47, DP-51 or DP-29, as long as the protein-based building block is soluble, has a globular 3D structure and does not specifically bind to human proteins, preferably does not specifically bind to any non-protein human molecule and preferably does also not specifically bind to any non-human protein to which the ISVD precursor specifically binds, as described above.
Generally, Nanobody® ISVDs (in particular VHH sequences, including (partially) humanized VHH sequences and camelized VH sequences) can be characterized by the presence of one or more "Hallmark residues" (as described herein) in one or more of the framework sequences (again as further described herein). Generally, a Nanobody® ISVD can be defined as an immunoglobulin sequence with the (general) structure
FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4
in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which one or more of the Hallmark residues are as further defined herein.
In particular, a Nanobody® ISVD can be an immunoglobulin sequence with the (general) structure
FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4
in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which the framework sequences are as further defined herein.
More in particular, a Nanobody® ISVD can be an immunoglobulin sequence with the (general) structure
FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4
in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to the Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 below.
Thus, a Nanobody® ISVD can be defined as an amino acid sequence with the (general) structure
FR1 - CDR1 - FR2 - CDR2 - FR3 - CDR3 - FR4
in which FR1 to FR4 refer to framework regions 1 to 4, respectively, and in which CDR1 to CDR3 refer to the complementarity determining regions 1 to 3, respectively, and in which one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to the Kabat numbering are chosen from the Hallmark residues mentioned in Table 3.
For instance, when the protein-based building block comprised in the molecule of the present technology is based on an ISVD, it may derive from anti-viral ISVDs, such as from anti-viral VHH or Nanobody® ISVDs. For instance, the building block comprised in the molecule of the present technology may derive from a functional ISVD (i.e., an ISVD which specifically binds to human proteins, and/or to non-human proteins, such as viral proteins and/or bacterial proteins, and/or to non-protein molecules, such as human non-protein molecules) which has been engineered/modified so that it no longer specifically binds to any human protein, preferably which has been engineered/modified so that it also no longer specifically binds to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably which has been engineered/modified so that it also no longer specifically binds to any non-protein molecule to which it originally bound, if any. In a further preferred embodiment, the ISVD-based building block comprised in the molecule of the present technology derives from an ISVD, such as from a heavy-chain ISVD, preferably from a Nanobody® ISVD, which has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) may be Val or Leu, preferably Val; and/or the amino acid at position 89 (according to Kabat) may be Val, Thr or Leu, preferably Leu; and/or the amino acid at position 110 (according to Kabat) may be Thr, Lys or Gln, preferably Thr; and/or the amino acid at position 112 (according to Kabat) may be Ser, Lys or Gln, preferably Ser; and/or the ISVD-based building block may contain a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
The resulting ISVD-based building block may be derived from a variant of an anti-hRSV ISVD, such as, e.g., variants of the anti-hRSV ISVDs depicted in Table A-2 starting on p. 69 of WO 2018/099968. In a preferred embodiment, the resulting ISVD-based building block is derived from a variant of ISVD RSV001A04 (SEQ ID NO.: 179 in the present description; described in detail as SEQ ID NO.: 5 in Table A-1, page 388 of WO 2010/139808, where it is referred to as NC41). In this specific embodiment, the protein-based carrier building block derived from RSV001A04 does not specifically bind any human protein. In this embodiment, the "building block precursor" (or "ISVD precursor") is RSV001A04, SEQ ID NO.: 179:
EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTISRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS.
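As a rough plausibility check against the size ranges stated for the building block in this description, the length and approximate average molecular mass of the precursor sequence can be computed. The sketch below is illustrative only: the residue masses are standard average values, the function name `average_mass_kda` is hypothetical, and disulfide bonds and any other modifications are ignored.

```python
# Standard average residue masses in Da (amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini

# RSV001A04 (SEQ ID NO.: 179), copied from the description.
RSV001A04 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def average_mass_kda(seq: str) -> float:
    """Approximate average mass in kDa of an unmodified linear polypeptide."""
    return (sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


mass_kda = average_mass_kda(RSV001A04)
print(f"length = {len(RSV001A04)} residues, mass = {mass_kda:.1f} kDa (approx.)")
```

An estimate of this kind only indicates which of the stated size windows a candidate sequence falls into; it does not replace an experimental mass determination.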
Once an ISVD is selected as starting point (as "ISVD precursor", see above), residues preferably located at solvent-accessible positions should be identified to generate the at least one conjugation site, as described in detail above in this description. Additionally or alternatively, one or more conjugation site(s) may already be present in the ISVD precursor, either as reactive groups in the side chain of amino acids preferably located at solvent-accessible positions or as a free N-terminal primary amine and/or a free C-terminal carboxylic acid.
For instance, one or more of the identified residues, preferably located at solvent-accessible positions in the amino acid sequence of the ISVD precursor, are replaced by a cysteine, a lysine, a tyrosine and/or a non-natural amino acid.
In one embodiment, the at least one protein-based building block carrier comprised in the molecule of the present technology is derived from an ISVD, such as an ISVD belonging to the so-called "VH3 class", wherein the resulting building block comprises at least one cysteine, at least one lysine, at least one non-natural amino acid and/or at least one tyrosine, preferably located at one or more solvent-accessible positions. In another embodiment, the at least one protein-based building block comprised in the molecule of the present technology is derived from an ISVD, such as an ISVD belonging to the so-called "VH3 class", wherein the resulting building block comprises at least one engineered cysteine, at least one engineered lysine, at least one non-natural amino acid and/or at least one engineered tyrosine, preferably located at one or more solvent-accessible positions.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology, when it is derived from an ISVD, preferably from a VHH (including humanized VHH or camelized VH) or Nanobody® ISVD, as described above, comprises a leucine at position 108 (according to Kabat numbering). In other embodiments, the protein-based carrier building block comprised in the molecule of the present technology, when it is derived from an ISVD, as described above, comprises a valine at position 11, a leucine at position 89 and/or a leucine at position 108 (according to Kabat numbering).
In one embodiment, the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186:
X1VX2LX3EX4X5GX6X7X8X9X10X11GX12X13X14IX15CX16AX17X18X19X20LX21X22X23VLGWFRX24AX25X26X27X28X29X30FVAAINX31X32X33X34X35X36X37X38PX39X40VX41X42X43FX44IX45X46X47X48X49X50X51TGX52LX53MX54X55LX56X57X58DX59AX60YX61CGAGX62PX63X64X65X66AYX67X68X69X70SYX71X72X73GX74X75TX76VX77VX78X79X80X81X82, wherein
X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any amino acid with a reactive group in its side chain, such as cysteine;
X8 (position 12 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X20: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X23: (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X24: (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X25: (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X26: (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X27: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X28: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29: (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X30: (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31: (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X32: (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X33: (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X35: (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X36: (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X37: (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X38: (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X39: (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X40: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X41: (position 64 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X42: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43: (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X44: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X45: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X46: (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X47: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X48: (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X49: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X50: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X51: (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52: (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X53: (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X54: (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X55: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X56: (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X57: (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X58: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X59: (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X60: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
X61: (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X62: (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X63: (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X64: (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X65: (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X66: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X67: (position 100d according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X68: (position 100e according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X69: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X70: (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X71: (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X72: (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X73: (position 103 according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X74: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X75: (position 106 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X76: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
X77: (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X78: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X79: (position 113 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X80: is absent or Gly;
X81: is absent or Gly;
X82: is absent or Cys,
or a sequence which has 80% or more identity with SEQ ID NO.: 186, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 186, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186 as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
In a further preferred embodiment, additionally or alternatively, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 186 as defined above, wherein SEQ ID NO.: 186 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 186 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 186 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or SEQ ID NO.: 186 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
Hence, the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 186 as defined above. Preferably, the polypeptide and/or molecule comprise SEQ ID NO.: 186 as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above. In a further embodiment, additionally or alternatively, the polypeptide and/or molecule comprise SEQ ID NO.: 186 as defined above, wherein SEQ ID NO.: 186 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 186 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 186 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 186 is preferably Lys or Gln and/or SEQ ID NO.: 186 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
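Substitutions specified by Kabat position, such as those at positions 11 and 89 mentioned above, can be applied to the precursor sequence once each Kabat label is mapped to a sequence index. The sketch below is illustrative only. The label list (insertions 52a, 82a to 82c and 100a to 100i) is an assumption inferred from the position annotations given for this particular sequence and will not hold for other ISVDs; the helper names `kabat_labels` and `substitute` are hypothetical.

```python
# RSV001A04 (SEQ ID NO.: 179), copied from the description.
RSV001A04 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def kabat_labels() -> list[str]:
    """Kabat labels for RSV001A04, in sequence order (assumed insertion pattern)."""
    labels = [str(n) for n in range(1, 53)] + ["52a"]
    labels += [str(n) for n in range(53, 83)] + ["82a", "82b", "82c"]
    labels += [str(n) for n in range(83, 101)]
    labels += ["100" + letter for letter in "abcdefghi"]
    labels += [str(n) for n in range(101, 114)]
    return labels


KABAT = kabat_labels()
if len(KABAT) != len(RSV001A04):
    raise RuntimeError("Kabat label list does not cover the sequence")


def substitute(seq: str, kabat_pos: str, new_residue: str, expected: str) -> str:
    """Replace the residue at a Kabat position after checking the current residue."""
    idx = KABAT.index(kabat_pos)
    if seq[idx] != expected:
        raise ValueError(f"expected {expected} at Kabat {kabat_pos}, found {seq[idx]}")
    return seq[:idx] + new_residue + seq[idx + 1:]


# Example: Val at position 11 and Leu at position 89.
variant = substitute(RSV001A04, "11", "V", expected="L")
variant = substitute(variant, "89", "L", expected="V")
print(variant)
```

The `expected` argument guards against applying a substitution at the wrong index, which is the most likely failure when a numbering scheme with insertion letters is converted to plain string offsets.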
In one embodiment, the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 206:
X1aVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7X7bGX7cLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPX19bDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
X1a (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X1 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z1 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X2 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X6: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X7: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X7b: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7c: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X8: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X10: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X13: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X14: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X19b: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
Z2: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X20: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
Z3: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
X23: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X24: is absent or Gly;
X25: is absent or Gly;
X26: is absent or Cys,
or a sequence which has 80% or more identity with SEQ ID NO.: 206, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 206, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
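To give a feel for what the identity thresholds above mean for a sequence of this length, the following sketch computes a simplified percent identity between two equal-length sequences. It is an assumption-laden illustration: it performs no alignment and handles no gaps, so it is not the identity calculation defined elsewhere in this description, and it uses the precursor RSV001A04 as a stand-in reference. The function name `percent_identity` is hypothetical.

```python
import math

# RSV001A04 (SEQ ID NO.: 179), used here only as a stand-in reference sequence.
RSV001A04 = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity of two equal-length sequences (simplified)."""
    if len(a) != len(b):
        raise ValueError("this simplified measure needs equal-length sequences")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Single engineered cysteine at Kabat position 7 (sequence index 6, Ser -> Cys).
s7c_variant = RSV001A04[:6] + "C" + RSV001A04[7:]
identity = percent_identity(RSV001A04, s7c_variant)

# Largest number of substitutions that still leaves at least 80% identity.
length = len(RSV001A04)
max_substitutions = length - math.ceil(0.80 * length)

print(f"S7C identity: {identity:.2f}%")
print(f"substitutions allowed at >= 80% identity: {max_substitutions}")
```

Under this simplified measure, a single engineered conjugation site leaves the identity above 99%, while the 80% floor leaves room for a few dozen substitutions in a domain of this size.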
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 206, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
In a further preferred embodiment, additionally or alternatively, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 206 as defined above, wherein SEQ ID NO.: 206 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 206 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 206 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or SEQ ID NO.: 206 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
Hence, the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 206, as defined above. Preferably, the polypeptide and/or molecule comprise SEQ ID NO.: 206, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above. In a further embodiment, additionally or alternatively, the polypeptide and/or molecule comprise SEQ ID NO.: 206 as defined above, wherein SEQ ID NO.: 206 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 206 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 206 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 206 is preferably Lys or Gln and/or SEQ ID NO.: 206 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
In one embodiment, the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185:
EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26
wherein
X1: (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z1: (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X2: (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3: (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X4: (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X6: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X7: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X8: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X10: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X13: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X14: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z2: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X20: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
Z3: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
X23: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X24: is absent or Gly;
X25: is absent or Gly;
X26: is absent or Cys,
or a sequence which has 80% or more sequence identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through at least one conjugation site or attachment point does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above.
In a further preferred embodiment, additionally or alternatively, the protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 185 as defined above, wherein SEQ ID NO.: 185 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 185 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 185 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or SEQ ID NO.: 185 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
Hence, the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 185, as defined above. Preferably, the polypeptide and/or molecule comprise SEQ ID NO.: 185, as defined above, wherein one or more of the amino acid residues at positions 11, 37, 44, 45, 47, 83, 84, 103, 104 and 108 according to Kabat numbering are chosen from the Hallmark residues mentioned in Table 3 above. In a further embodiment, additionally or alternatively, the polypeptide and/or molecule comprise SEQ ID NO.: 185 as defined above, wherein SEQ ID NO.: 185 has been further engineered/modified to include mutations which prevent/remove binding by pre-existing antibodies/factors. Examples of such mutations are described, e.g., in WO 2012/175741 and WO 2015/173325. For instance, to prevent/remove binding by pre-existing antibodies/factors, the amino acid at position 11 (according to Kabat) in SEQ ID NO.: 185 is preferably Val, and/or the amino acid at position 89 (according to Kabat) in SEQ ID NO.: 185 is preferably Thr or Leu and/or the amino acid at position 110 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or the amino acid at position 112 (according to Kabat) in SEQ ID NO.: 185 is preferably Lys or Gln and/or SEQ ID NO.: 185 contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid.
In one embodiment, the protein-based carrier building block comprises at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as three amino acids with a reactive group in its side chain, such as three cysteines, or three lysines, or three tyrosines, or three non-natural amino acids, preferably three cysteines, in the following solvent-accessible positions in SEQ ID NO.: 179 according to Kabat numbering:
* 43, 100f, 105; or
* 43, 75, 100a; or
* 21, 68, 100f; or
* 7, 44, 55; or
* 13, 72, 100a; or
* 13, 31, 100f; or
* C-terminal Cys (-GGC),
even more preferably in at least one of the following solvent-accessible positions in SEQ ID NO.: 179 (according to Kabat numbering):
* 43, 100f, 105; or
* 43, 75, 100a.
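As an illustration only, the placeholder scheme can be sketched in Python: the template string and the placeholder-to-Kabat-position mapping are copied from the definition of SEQ ID NO.: 185 in this section, with the spaces in the printed sequence assumed to be line-wrap artifacts. The function name `build_variant`, the default picks for Z1, Z2 and Z3 (Leu, Val, Gln, each one of the permitted options) and the choice of cysteine as the reactive residue are assumptions made for the sketch, as is applying the listed position triplets to the SEQ ID NO.: 185 template.

```python
import re

# Template of SEQ ID NO.: 185 as printed in the text (line-wrap spaces removed).
TEMPLATE = (
    "EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGP"
    "PX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWG"
    "X22GTZ3VTVX23SX24X25X26"
)

# Kabat position -> (placeholder, unmodified residue), taken from the definitions.
SITES = {
    "7": ("X1", "S"), "13": ("X2", "Q"), "17": ("X3", "S"), "19": ("X4", "S"),
    "21": ("X5", "S"), "23": ("X6", "A"), "25": ("X7", "S"), "31": ("X8", "N"),
    "43": ("X9", "K"), "44": ("X10", "E"), "55": ("X11", "D"), "62": ("X12", "N"),
    "65": ("X13", "G"), "68": ("X14", "T"), "70": ("X15", "S"), "72": ("X16", "D"),
    "74": ("X17", "A"), "75": ("X18", "K"), "82b": ("X19", "S"),
    "100a": ("X20", "G"), "100f": ("X21", "D"), "105": ("X22", "R"),
    "112": ("X23", "S"),
}


def build_variant(reactive_positions=(), c_terminal_ggc=False,
                  z1="L", z2="V", z3="Q", reactive="C"):
    """Fill the template; hypothetical helper, not part of the specification."""
    # Start from the unmodified residue at every X placeholder.
    fill = {placeholder: residue for placeholder, residue in SITES.values()}
    # Put the reactive residue (cysteine by default) at the chosen Kabat positions.
    for position in reactive_positions:
        fill[SITES[position][0]] = reactive
    fill.update({"Z1": z1, "Z2": z2, "Z3": z3})
    # X24-X26 are either all absent or form the C-terminal -GGC extension.
    tail = ("G", "G", "C") if c_terminal_ggc else ("", "", "")
    fill.update(zip(("X24", "X25", "X26"), tail))
    # \d+ is greedy, so X10 is never mistaken for X1 followed by "0".
    return re.sub(r"[XZ]\d+", lambda match: fill[match.group(0)], TEMPLATE)


if __name__ == "__main__":
    print(build_variant(("43", "100f", "105")))
    print(build_variant(c_terminal_ggc=True))
```

Calling `build_variant(("43", "75", "100a"))` gives the second triplet in the same way; passing `reactive="K"` or `reactive="Y"` would model the lysine and tyrosine alternatives.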
Hence, in one embodiment, the protein-based building block comprised in the molecule of the present technology comprises, or alternatively, consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent; or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys.
Hence, in one embodiment, the present technology provides a polypeptide which comprises or alternatively consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys.
For instance, the protein-based carrier building block comprised in the molecule of the present technology may comprise at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, preferably six amino acids each with a reactive group in its side chain, such as six cysteines, or six lysines, or six tyrosines, or six non-natural amino acids, preferably six cysteines, in the following solvent-accessible positions in SEQ ID NO.: 179, or three cysteines and three lysines in the following solvent-accessible positions in SEQ ID NO.: 179, according to Kabat numbering:
* 19, 44, 65, 70, 82b, 112; or
* 21, 43, 55, 68, 74, 112; or
* 19, 23, 31, 70, 82b, 100f;
* 13, 25, 43, 65, 72, 100a;
* 25, 43, 75, 82b, 100a, 112;
* 25, 43, 75, 100a, 105, 112;
* 25, 43, 75, 100a, 105, C-terminal Cys (-GGC);
* 43, 68, 75, 100a, 105, C-terminal Cys (-GGC);
* 25, 43, 75, 100f, 105, C-terminal Cys (-GGC); or
* 43, 68, 75, 100f, 105, C-terminal Cys (-GGC).
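By way of illustration only, the correspondence between the Kabat positions listed above and the placeholder labels used in the SEQ ID NO.: 185 definitions may be sketched as follows. The names `KABAT_TO_PLACEHOLDER`, `POSITION_SETS` and `placeholders_for` are hypothetical and are not part of the specification; the mapping is simply read off the position/label pairs recited in this section, and "C-term" is an assumed shorthand for the C-terminal Cys (-GGC), i.e. placeholder X26.

```python
# Illustrative sketch only; all names here are hypothetical helpers.
# Kabat position -> placeholder label, as recited for SEQ ID NO.: 185.
KABAT_TO_PLACEHOLDER = {
    "13": "X2", "19": "X4", "21": "X5", "23": "X6", "25": "X7",
    "31": "X8", "43": "X9", "44": "X10", "55": "X11", "65": "X13",
    "68": "X14", "70": "X15", "72": "X16", "74": "X17", "75": "X18",
    "82b": "X19", "100a": "X20", "100f": "X21", "105": "X22",
    "112": "X23",
    "C-term": "X26",  # assumed shorthand for the C-terminal Cys (-GGC)
}

# The ten sets of solvent-accessible positions listed above.
POSITION_SETS = [
    ["19", "44", "65", "70", "82b", "112"],
    ["21", "43", "55", "68", "74", "112"],
    ["19", "23", "31", "70", "82b", "100f"],
    ["13", "25", "43", "65", "72", "100a"],
    ["25", "43", "75", "82b", "100a", "112"],
    ["25", "43", "75", "100a", "105", "112"],
    ["25", "43", "75", "100a", "105", "C-term"],
    ["43", "68", "75", "100a", "105", "C-term"],
    ["25", "43", "75", "100f", "105", "C-term"],
    ["43", "68", "75", "100f", "105", "C-term"],
]


def placeholders_for(positions):
    """Return the placeholder labels carrying the reactive residues of one set."""
    return [KABAT_TO_PLACEHOLDER[p] for p in positions]


if __name__ == "__main__":
    for positions in POSITION_SETS:
        print(", ".join(positions), "->", ", ".join(placeholders_for(positions)))
```

Each set resolves to six placeholders, which are the positions defined as "any amino acid with a reactive group in its side chain" in the corresponding sequence definition.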
Hence, in one embodiment, the protein-based building block comprised in the molecule of the present technology comprises, or alternatively, consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys.
Hence, in one embodiment, the present technology provides a polypeptide which comprises or alternatively consists of one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Vai or Leu;
X20: (positionlOOa according to Kabat numbering) is Gly;
X21: (position lOOf according to Kabat numbering) is Asp, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys.
For instance, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 179 (or variants thereof with sequence identity of 80% or more, as described above) may comprise at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, preferably nine amino acids, each with a reactive group in its side chain, such as nine cysteines, or nine lysines, or nine tyrosines, or nine non-natural amino acids, preferably nine cysteines, in the following solvent-accessible positions according to Kabat numbering:
* 21, 31, 43, 68, 72, 82b, 100a, 100f, 105; or
* 7, 13, 19, 23, 44, 55, 62, 70, 74; or
* 7, 17, 31, 44, 55, 62, 68, 75, 112.
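The correspondence between the three position sets above and the variable residues of SEQ ID NO.: 185 can be sketched in a few lines of Python. This is an illustrative aid only, not part of the disclosed technology: the names `VARIABLE_POSITIONS`, `POSITION_SETS`, `REACTIVE` and `describe_variant` are hypothetical, and the default residues are simply those recited in this section for positions that do not carry a reactive group.

```python
# Variable residues X1..X23: (label, Kabat position, default residue),
# taken from the residue definitions recited in this section.
VARIABLE_POSITIONS = [
    ("X1", "7", "Ser"), ("X2", "13", "Gln"), ("X3", "17", "Ser"),
    ("X4", "19", "Ser"), ("X5", "21", "Ser"), ("X6", "23", "Ala"),
    ("X7", "25", "Ser"), ("X8", "31", "Asn"), ("X9", "43", "Lys"),
    ("X10", "44", "Glu"), ("X11", "55", "Asp"), ("X12", "62", "Asn"),
    ("X13", "65", "Gly"), ("X14", "68", "Thr"), ("X15", "70", "Ser"),
    ("X16", "72", "Asp"), ("X17", "74", "Ala"), ("X18", "75", "Lys"),
    ("X19", "82b", "Ser"), ("X20", "100a", "Gly"), ("X21", "100f", "Asp"),
    ("X22", "105", "Arg"), ("X23", "112", "Ser"),
]

# The three nine-position sets listed above (Kabat numbering).
POSITION_SETS = [
    {"21", "31", "43", "68", "72", "82b", "100a", "100f", "105"},
    {"7", "13", "19", "23", "44", "55", "62", "70", "74"},
    {"7", "17", "31", "44", "55", "62", "68", "75", "112"},
]

# Hypothetical marker for "any amino acid with a reactive side-chain group".
REACTIVE = "reactive (preferably Cys)"


def describe_variant(reactive_positions):
    """Map each label X1..X23 to REACTIVE or to its default residue."""
    known = {pos for _, pos, _ in VARIABLE_POSITIONS}
    unknown = set(reactive_positions) - known
    if unknown:
        raise ValueError(f"not a variable Kabat position: {sorted(unknown)}")
    return {
        label: (REACTIVE if pos in reactive_positions else default)
        for label, pos, default in VARIABLE_POSITIONS
    }


if __name__ == "__main__":
    for label, residue in describe_variant(POSITION_SETS[0]).items():
        print(f"{label}: {residue}")
```

Calling `describe_variant` with one of the three sets yields a per-label listing of the same shape as the residue definitions that follow; the optional C-terminal residues X24 to X26 and the Z1 to Z3 alternatives are left out of the sketch.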
Hence, in one embodiment, the protein-based building block comprised in the molecule of the present technology comprises, or alternatively, consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent.
Hence, in one embodiment, the present technology provides a polypeptide which comprises, or alternatively, consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent.
For instance, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 179 (or variants thereof with sequence identity of 80% or more, as described above) may comprise at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, preferably four amino acids with a reactive group in its side chain, such as four cysteines, or four lysines, or four tyrosines, or four non-natural amino acids, preferably four cysteines in the following solvent-accessible positions according to Kabat numbering:
* 19, 65, 82b, 112.
Hence, in one embodiment, the protein-based building block comprised in the molecule of the present technology comprises, or alternatively, consists of, one of the following sequences:
* SEQ ID NO.: 185, wherein
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent.
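As an illustration only, the variant definitions of SEQ ID NO.: 185 listed above can be held as a lookup from Kabat position to residue, with the reactive residues placed at chosen positions. The following minimal sketch assumes the default (non-reactive) residues are those named in the lists above; the names `DEFAULT_RESIDUES` and `define_variant` are hypothetical, and the full sequence of SEQ ID NO.: 185 is not reproduced.

```python
# Default residues at the variable positions X1..X23 of SEQ ID NO.: 185,
# keyed by Kabat position, taken from the variant lists above.
DEFAULT_RESIDUES = {
    "7": "Ser", "13": "Gln", "17": "Ser", "19": "Ser", "21": "Ser",
    "23": "Ala", "25": "Ser", "31": "Asn", "43": "Lys", "44": "Glu",
    "55": "Asp", "62": "Asn", "65": "Gly", "68": "Thr", "70": "Ser",
    "72": "Asp", "74": "Ala", "75": "Lys", "82b": "Ser", "100a": "Gly",
    "100f": "Asp", "105": "Arg", "112": "Ser",
}


def define_variant(conjugation_positions, reactive_residue="Cys"):
    """Return a Kabat-position -> residue map with reactive residues placed.

    `reactive_residue` defaults to cysteine, the preferred choice in the text;
    lysine, tyrosine or a non-natural amino acid are the stated alternatives.
    """
    unknown = set(conjugation_positions) - set(DEFAULT_RESIDUES)
    if unknown:
        raise ValueError(f"not a listed variable position: {sorted(unknown)}")
    variant = dict(DEFAULT_RESIDUES)  # copy, so the defaults stay untouched
    for position in conjugation_positions:
        variant[position] = reactive_residue
    return variant


# The four-site example given above: positions 19, 65, 82b and 112.
variant = define_variant(["19", "65", "82b", "112"])
```

Positions are kept as strings because Kabat numbering includes insertion letters such as 82b, 100a and 100f.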
Additionally, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 185, 186 and/or 206 (or any of the above-described variants, or variants thereof with sequence identity of 80% or more, as described above) may comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 185, 186 and/or 206 (or any of the above-described variants, or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 185, 186 and/or 206 (or any of the above-described variants, or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 185, 186 and/or 206 (or any of the above-described variants, or any variant thereof with sequence identity of 80% or more, as described above), the exposed tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
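The terminal tags described above can be sketched as simple string operations on a one-letter sequence. This is a hedged illustration, not a prescribed construction method: the function name `add_terminal_residue`, its parameters and the placeholder core sequence are assumptions, and only the YGG-, -GGY, Y(G4S)nGG-, -(G4S)nGGY, CGG- and -GGC layouts are covered (the YGG(S1G4) and YGG(G4S1) orderings are not).

```python
def add_terminal_residue(core, residue, terminus, g4s_repeats=0):
    """Attach an extra Cys ("C") or Tyr ("Y") to a sequence via a flexible tag.

    With g4s_repeats == 0 the tag is the plain GG tag. With 1 to 3 repeats a
    (G4S)n stretch is placed before the GG, a layout the text gives for Tyr only.
    """
    if residue not in ("C", "Y"):
        raise ValueError("residue must be 'C' or 'Y'")
    if not 0 <= g4s_repeats <= 3:
        raise ValueError("g4s_repeats must be between 0 and 3")
    if residue == "C" and g4s_repeats:
        raise ValueError("only the GG tag is described for cysteine")
    linker = "GGGGS" * g4s_repeats + "GG"
    if terminus == "N":
        return residue + linker + core  # e.g. CGG-core, YGG-core, Y(G4S)nGG-core
    if terminus == "C":
        return core + linker + residue  # e.g. core-GGC, core-GGY, core-(G4S)nGGY
    raise ValueError("terminus must be 'N' or 'C'")


# "AAAA" is a placeholder core, not a sequence from the specification.
preferred = add_terminal_residue("AAAA", "Y", "C")  # the preferred -GGY layout
```

A longer linker, for applications needing more flexibility, would be requested with `g4s_repeats` set to 1, 2 or 3.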
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 185, 186 and/or 206 (or any of the above-described variants, or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above; see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
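A minimal sketch of the sortase handles follows, assuming one-letter sequences. The helper names, the placeholder core and the choice of residue for the X position are hypothetical; the number of N-terminal glycines is left as a free parameter rather than fixed to a range.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# LPXTG at the very C-terminus, where X is any of the 20 standard amino acids.
SORTASE_MOTIF = re.compile(rf"LP[{AMINO_ACIDS}]TG$")


def has_c_terminal_sortase_motif(sequence):
    """True if the sequence ends with an LPXTG sortase-recognition motif."""
    return SORTASE_MOTIF.search(sequence) is not None


def add_sortase_handles(core, n_gly=0, motif_x=None):
    """Add an N-terminal oligoglycine tag and/or a C-terminal LPXTG motif.

    n_gly   -- number of glycines to prepend (0 adds none)
    motif_x -- one-letter residue for the X position, or None to add no motif
    """
    if n_gly < 0:
        raise ValueError("n_gly must not be negative")
    sequence = "G" * n_gly + core
    if motif_x is not None:
        if len(motif_x) != 1 or motif_x not in AMINO_ACIDS:
            raise ValueError("motif_x must be a single standard amino acid")
        sequence += f"LP{motif_x}TG"
    return sequence


# "AAAA" is a placeholder core; X = "E" is an arbitrary illustrative choice.
tagged = add_sortase_handles("AAAA", n_gly=3, motif_x="E")
```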
In one embodiment, the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, one of the sequences of Table 4, or a sequence which has 80% or more identity with a sequence of Table 4, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 4, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
Hence, in one embodiment, the present technology provides a polypeptide which comprises one of the sequences of Table 4, or a sequence which has 80% or more identity with a sequence of Table 4, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 4.
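The identity threshold and the size window can be checked numerically along the following lines. This is a sketch under stated assumptions: the two sequences are taken to be already aligned to equal length (with "-" for gaps), identity is counted over the full alignment length, and mass is estimated from standard average residue masses. How identity is actually determined is defined elsewhere in the specification and may differ; all function names are hypothetical.

```python
# Standard average residue masses in Da; one water is added per chain.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.015


def percent_identity(aligned_a, aligned_b):
    """Percent identity over two pre-aligned, equal-length sequences."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


def molecular_mass_kda(sequence):
    """Approximate average molecular mass of an unmodified chain, in kDa."""
    return (sum(RESIDUE_MASS[r] for r in sequence) + WATER_MASS) / 1000.0


def meets_identity_and_size(candidate, reference, min_identity=80.0,
                            mass_range_kda=(2.5, 70.0)):
    """Check the 80% identity floor and the broadest stated size window."""
    mass = molecular_mass_kda(candidate.replace("-", ""))
    low, high = mass_range_kda
    return percent_identity(candidate, reference) >= min_identity and low <= mass <= high
```

The narrower preferred windows (for example about 10 to about 16 kDa) can be tested by passing a different `mass_range_kda`. The remaining provisos (globular 3D structure, solubility, absence of specific binding) are experimental properties and are not captured by this check.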
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 80, or a polypeptide which has 80% or more identity with SEQ ID NO.: 80, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 43, 100f and 105 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 80 (or any variant thereof with sequence identity of 80% or more, as described above), positions 43, 100f and 105 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block. The building block comprising or consisting of SEQ ID NO.: 80 preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 80 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 80 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 80 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC).
If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 80 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 80 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above; see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 81, or a polypeptide which has 80% or more identity with SEQ ID NO.: 81, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 81, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 43, 75 and 100a (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 81 (or any variant thereof with sequence identity of 80% or more, as described above), positions 43, 75 and 100a (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 81 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 81 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 81 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 81 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 81 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above; see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 82, or a polypeptide which has 80% or more identity with SEQ ID NO.: 82, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 82, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 21, 68 and 100f (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 82 (or any variant thereof with sequence identity of 80% or more, as described above), positions 21, 68 and 100f (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 82 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 82 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 82 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC).
If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 82 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 82 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above; see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 83, or a polypeptide which has 80% or more identity with SEQ ID NO.: 83, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 83, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 13, 72 and 100a (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 83 (or any variant thereof with sequence identity of 80% or more, as described above), positions 13, 72 and 100a (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 83 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 83 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 83 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 83 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 83 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above; see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 84, or a polypeptide which has 80% or more identity with SEQ ID NO.: 84, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 84, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 13, 31 and 100f
(according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 84 (or any variant thereof with sequence identity of 80% or more, as described above), positions 13, 31 and 100f (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 84 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 84 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 84 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC). If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 84 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 84 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
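By way of illustration only, the tyrosine tag formats recited above can be enumerated as plain one-letter strings. The following Python sketch assumes that (G4S1) denotes the pentapeptide GGGGS and (S1G4) denotes SGGGG, each repeated 1 to 3 times; the function names and the placeholder core sequence are hypothetical and do not correspond to any SEQ ID NO. of the present disclosure.

```python
# Illustrative sketch only: enumerates the tyrosine tag formats listed in the text.
# Assumption: (G4S1) = "GGGGS" and (S1G4) = "SGGGG", repeated 1 to 3 times.
G4S = "GGGGS"
SG4 = "SGGGG"


def n_terminal_tyrosine_tags(max_repeats=3):
    """Return N-terminal tags: YGG-, Y(G4S)nGG-, YGG(SG4)n- and YGG(G4S)n-."""
    tags = ["YGG"]
    for n in range(1, max_repeats + 1):
        tags.append("Y" + G4S * n + "GG")  # Y(G4S)nGG-
        tags.append("YGG" + SG4 * n)       # YGG(SG4)n-
        tags.append("YGG" + G4S * n)       # YGG(G4S)n-
    return tags


def c_terminal_tyrosine_tags(max_repeats=3):
    """Return C-terminal tags: -GGY and -(G4S)nGGY."""
    tags = ["GGY"]
    for n in range(1, max_repeats + 1):
        tags.append(G4S * n + "GGY")       # -(G4S)nGGY
    return tags


def add_tags(core, n_tag="", c_tag=""):
    """Concatenate an optional N-terminal tag, the core polypeptide and an optional C-terminal tag."""
    return n_tag + core + c_tag


if __name__ == "__main__":
    core = "AAAA"  # hypothetical placeholder, not a sequence from the disclosure
    for tag in c_terminal_tyrosine_tags():
        print(add_tags(core, c_tag=tag))
```

The preferred -GGY format corresponds to the first entry returned by `c_terminal_tyrosine_tags()`; the longer entries correspond to the linkers that might be preferred where more flexibility is needed.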
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 85, or a polypeptide which has 80% or more identity with SEQ ID NO.: 85, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 85, provided that the building block
has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 7, 44 and 55 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 85 (or any variant thereof with sequence identity of 80% or more, as described above), positions 7, 44 and 55 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine.
In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 85 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 85 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 85 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 85 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed and/or preceded by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, YGG(G4S1)1-3-, or -(G4S1)1-3GGY), preferably -GGY, although
longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 85 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 175, or a polypeptide which has 80% or more identity with SEQ ID NO.: 175, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 175, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, a C-terminal cysteine has been engineered in the building-block precursor (-GGC). Hence, in the building block comprising or consisting of SEQ ID NO.: 175 (or any variant thereof with sequence identity of 80% or more, as described above), the C-terminal cysteine comprises a thiol group, which is the conjugation site present in the protein building block, as described above.
Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block
comprising or consisting of SEQ ID NO.: 175 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal of the polypeptide defined by SEQ ID NO.: 175 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N-terminal of the polypeptide defined by SEQ ID NO.: 175 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG-). If a tyrosine is present in the N-terminal of the polypeptide defined by SEQ ID NO.: 175 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 175 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
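As a rough illustration of the nested size ranges recited for the building blocks above, the following Python sketch estimates a molecular mass from sequence length and lists the recited ranges into which it falls. It assumes the common rule-of-thumb average of about 110 Da per amino acid residue, treats each "about" boundary as an exact number, and omits the "less than 50 kDa" variant; the constant and function names are hypothetical, and an actual mass determination would use residue-specific masses or an experimental measurement.

```python
# Illustrative sketch only. Assumption: ~110 Da average mass per residue (rule of thumb).
AVERAGE_RESIDUE_MASS_DA = 110.0

# Size ranges (kDa) recited in the text, ordered from broadest to narrowest.
SIZE_WINDOWS_KDA = [
    (2.5, 70.0),
    (2.5, 50.0),
    (2.5, 30.0),
    (2.5, 16.0),
    (5.0, 16.0),
    (7.0, 16.0),
    (10.0, 16.0),
]


def approximate_mass_kda(sequence):
    """Estimate the molecular mass in kDa from the number of residues."""
    return len(sequence) * AVERAGE_RESIDUE_MASS_DA / 1000.0


def matching_windows(mass_kda):
    """Return every recited size range that contains the given mass."""
    return [(low, high) for (low, high) in SIZE_WINDOWS_KDA if low <= mass_kda <= high]


if __name__ == "__main__":
    placeholder = "A" * 130  # hypothetical 130-residue polypeptide
    mass = approximate_mass_kda(placeholder)
    print(mass, matching_windows(mass))
```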
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 225, or a polypeptide which has 80% or more identity with SEQ ID NO.: 225, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 19, 65, 82b and 112 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 225 (or any variant thereof with sequence identity of 80% or more, as described above), positions
19, 65, 82b and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block. The building block comprising or consisting of SEQ ID NO.: 225 preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 225 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 225 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 225 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or -GGC).
If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 225 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 225 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, one of the sequences of Table 5, or a sequence which has 80% or more identity with a sequence of Table 5, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 5, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
Hence, in another embodiment, the present technology provides a polypeptide which comprises, or alternatively, consists of, one of the sequences of Table 5, or a sequence which has 80% or more identity with a sequence of Table 5, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 5.
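Purely as an illustration of the identity thresholds recited above (80%, 85%, 90%, 95%, 97% and 99% or more), the following Python sketch computes a percent identity between two sequences and reports the highest threshold that is met. It assumes the two sequences have already been aligned to equal length, with "-" marking gaps, and counts identity over the aligned columns; this simplified counting rule is an assumption and does not replace the definition of sequence identity given elsewhere in the present description. The function names and example strings are hypothetical.

```python
# Illustrative sketch only. Assumption: inputs are pre-aligned, equal-length
# strings in one-letter code, with "-" used for alignment gaps.
IDENTITY_THRESHOLDS = (99.0, 97.0, 95.0, 90.0, 85.0, 80.0)


def percent_identity(aligned_a, aligned_b):
    """Return the percentage of aligned columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity_pct):
    """Return the highest recited threshold satisfied, or None if below 80%."""
    for threshold in IDENTITY_THRESHOLDS:
        if identity_pct >= threshold:
            return threshold
    return None


if __name__ == "__main__":
    reference = "ACDEFGHIKL"  # hypothetical placeholder sequences
    variant = "ACDEFGHIKV"
    pct = percent_identity(reference, variant)
    print(pct, highest_threshold_met(pct))
```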
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 86, or a polypeptide which has 80% or more identity with SEQ ID NO.: 86, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 86, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 19, 44, 65, 70, 82b and 112 (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 86 (or any variant thereof with sequence identity of 80% or more, as described above), positions 19, 44, 65, 70, 82b and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block
comprising or consisting of SEQ ID NO.: 86 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 86 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 86 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 86 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 86 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 87, or a polypeptide which has 80% or more identity with SEQ ID NO.: 87, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 87, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or
preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 21, 43, 55, 68, 74 and 112 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 87 (or any variant thereof with sequence identity of 80% or more, as described above), positions 21, 43, 55, 68, 74 and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 87 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 87 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 87 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence) such as (GG) (e.g., -GGC or CGG-).
If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 87 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded and/or followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 87 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as
explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 88, or a polypeptide which has 80% or more identity with SEQ ID NO.: 88, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 88, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 19, 23, 31, 70, 82b and 100f (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 88 (or any variant thereof with sequence identity of 80% or more, as described above), positions 19, 23, 31, 70, 82b and 100f (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 88 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 88 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present
in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 88 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence) such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 88 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 88 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 89, or a polypeptide which has 80% or more identity with SEQ ID NO.: 89, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 89, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of
the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 13, 25, 43, 65, 72 and 100a (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 89 (or any variant thereof with sequence identity of 80% or more, as described above), positions 13, 25, 43, 65, 72 and 100a (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 89 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 89 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 89 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence) such as (GG) (e.g., -GGC or CGG-).
If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 89 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 89 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
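By way of illustration only, the terminal tags and sortase handles named above can be enumerated and checked programmatically. The following minimal sketch reads the linker notation as one to three repeats of GGGGS (or SGGGG) and the glycine tag as one to five N-terminal glycines; both readings are assumptions about the subscripts. The function names are hypothetical and do not form part of the present technology.

```python
import re

def tyrosine_tags(max_repeats=3):
    """Enumerate the N- and C-terminal tyrosine tags listed in the text.

    Assumes the linker notation means 1 to max_repeats copies of GGGGS.
    """
    n_term = ["YGG"]  # YGG-
    c_term = ["GGY"]  # -GGY
    for n in range(1, max_repeats + 1):
        linker = "GGGGS" * n
        n_term.append("Y" + linker + "GG")   # Y(G4S)nGG-
        n_term.append("YGG" + linker)        # YGG(G4S)n-
        n_term.append("YGG" + "SGGGG" * n)   # YGG(SG4)n-
        c_term.append(linker + "GGY")        # -(G4S)nGGY
    return n_term, c_term

# LPXTG, where X is any residue, located at the very C-terminal end.
SORTASE_MOTIF = re.compile(r"LP[A-Z]TG$")

def has_c_terminal_sortase_motif(seq):
    """True if the sequence ends with an LPXTG sortase-recognition motif."""
    return bool(SORTASE_MOTIF.search(seq))

def n_terminal_gly_count(seq):
    """Number of consecutive glycines at the N-terminal end."""
    return len(seq) - len(seq.lstrip("G"))

def has_n_terminal_gly_tag(seq, min_gly=1, max_gly=5):
    """True if the sequence starts with min_gly to max_gly glycines."""
    return min_gly <= n_terminal_gly_count(seq) <= max_gly
```

For example, `has_c_terminal_sortase_motif("AAALPETG")` is true for the placeholder sequence shown, whereas a motif followed by further residues is not counted as being at the C-terminal end.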
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 90, or a polypeptide which has 80% or more identity with SEQ ID NO.: 90, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 90, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 25, 43, 75, 82b, 100a and 112 (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 90 (or any variant thereof with sequence identity of 80% or more, as described above), positions 25, 43, 75, 82b, 100a and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 90 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 90 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 90 (or any variant
thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence) such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 90 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 90 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
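The sequence identity thresholds recited above (80%, 85%, 90%, 95%, 97% and 99%) may be evaluated, for instance, on a pairwise alignment. The sketch below assumes one common convention, namely identical residues divided by the number of aligned columns, with gaps written as "-"; the definition of sequence identity given elsewhere in this specification governs if it differs. The function names and example sequences are hypothetical.

```python
# Identity thresholds recited in the text, in percent.
THRESHOLDS = (80, 85, 90, 95, 97, 99)

def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a gap opposite
    a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)

def thresholds_met(identity):
    """Return the recited thresholds that a given identity satisfies."""
    return [t for t in THRESHOLDS if identity >= t]
```

A ten-residue alignment with a single substitution gives 90% identity and therefore satisfies the 80%, 85% and 90% thresholds, but not the 95% threshold.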
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 91, or a polypeptide which has 80% or more identity with SEQ ID NO.: 91, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 91, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein
molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 25, 43, 75, 100a, 105 and 112 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 91 (or any variant thereof with sequence identity of 80% or more, as described above), positions 25, 43, 75, 100a, 105 and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 91 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 91 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 91 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence) such as (GG) (e.g., -GGC or CGG-).
If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 91 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 91 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
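The size criterion recited above (a molecular mass of about 2.5 to about 70 kDa) can be screened from the amino acid sequence alone. The following sketch sums standard average residue masses and adds one water molecule for the free termini; it ignores conjugated cargo, disulfide bonds and other modifications, and it treats the bounds as exact although the text qualifies them with "about". The function names are hypothetical.

```python
# Standard average residue masses in daltons (amino acid minus water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added for the free N- and C-termini

def average_mass_da(seq):
    """Average molecular mass of an unmodified polypeptide, in daltons."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER

def within_size_window(seq, low_kda=2.5, high_kda=70.0):
    """True if the calculated mass lies within the recited size range."""
    return low_kda <= average_mass_da(seq) / 1000.0 <= high_kda
```

The narrower preferred ranges (for example about 2.5 to about 16 kDa) can be tested by passing different `low_kda` and `high_kda` values.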
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 92, or a polypeptide which has 80% or more identity with SEQ ID NO.: 92, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 92, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 25, 43, 75, 100a and 105 (according to Kabat numbering) and the C-terminal amino acid are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 92 (or any variant thereof with sequence identity of 80% or more, as described above), positions 25, 43, 75, 100a and 105 (according to Kabat numbering) and the C-terminal are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 92 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 92 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.:
92 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence) such as (GG) (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 92 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 92 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
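Whether the preferred conjugation positions of a given variant are in fact occupied by cysteine can be verified once its residues have been assigned Kabat numbers. The sketch below assumes that such an assignment is already available as a mapping from Kabat position labels (including insertion letters such as 100a) to one-letter residue codes; the mapping in the usage note is a made-up placeholder and does not reproduce any sequence of the present technology. The function names are hypothetical.

```python
import re

def kabat_sort_key(label):
    """Sort key for Kabat labels such as '25', '82b' or '100a'."""
    match = re.fullmatch(r"(\d+)([a-z]?)", label)
    if match is None:
        raise ValueError(f"not a Kabat position label: {label!r}")
    return (int(match.group(1)), match.group(2))

def non_cysteine_sites(kabat_residues, positions):
    """Return the listed positions that are absent or not cysteine."""
    return [p for p in positions if kabat_residues.get(p) != "C"]

def c_terminal_is_cysteine(seq):
    """True if the last residue of the polypeptide is cysteine."""
    return seq.endswith("C")
```

With the placeholder mapping `{"25": "C", "43": "C", "75": "S", "100a": "C", "105": "C"}`, the call `non_cysteine_sites(mapping, ["25", "43", "75", "100a", "105"])` reports position 75 as the only listed site not occupied by cysteine.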
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 93, or a polypeptide which has 80% or more identity with SEQ ID NO.: 93, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 93, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 43, 68, 75, 100a and 105
(according to Kabat numbering) and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 93 (or any variant thereof with sequence identity of 80% or more, as described above), positions 43, 68, 75, 100a and 105 (according to Kabat numbering) and the C-terminal are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 93 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 93 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 93 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence) such as (GG) (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 93 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(G4S1)1-3-, or YGG(S1G4)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M.
Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 93 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 94, or a polypeptide which has 80% or more identity with SEQ ID NO.: 94, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 94, provided that the building block
has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 25, 43, 75, 100f and 105 (according to Kabat numbering) and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 94 (or any variant thereof with sequence identity of 80% or more, as described above), positions 25, 43, 75, 100f and 105 (according to Kabat numbering) and the C-terminal are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above.
Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 94 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 94 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 94 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence) such as (GG) (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 94 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by
flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(G4S1)1-3-, or YGG(S1G4)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 94 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 95, or a polypeptide which has 80% or more identity with SEQ ID NO.: 95, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 95, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 43, 68, 75, 100f and 105 (according to Kabat numbering) and the C-terminal amino acid are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 95 (or any variant thereof with sequence identity of 80% or more, as described above), positions 43, 68, 75, 100f and 105 (according to Kabat numbering) and the C-terminal are solvent-accessible positions, and
are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine, the amino acid at position 11 (according to Kabat numbering) is preferably valine and the amino acid at position 89 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 95 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 95 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 95 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG), e.g., CGG-. If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 95 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(G4S1)1-3-, or YGG(S1G4)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 95 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 222, or a polypeptide which has 80% or more identity with SEQ ID NO.: 222, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 222, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about
7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 21, 31, 43, 68, 72, 82b, 100a, 100f and 105 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 222 (or any variant thereof with sequence identity of 80% or more, as described above), positions 21, 31, 43, 68, 72, 82b, 100a, 100f and 105 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 222 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 222 (or any variant thereof with sequence identity of 80% or more, as described above).
If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 222 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 222 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed and/or preceded by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, YGG(G4S1)1-3-, or -(G4S1)1-3GGY), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively,
consists of, the polypeptide defined by SEQ ID NO.: 222 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 223, or a polypeptide which has 80% or more identity with SEQ ID NO.: 223, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 223, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 7, 13, 19, 23, 44, 55, 62, 70 and 74 (according to Kabat numbering) are preferably cysteines.
Hence, in the building block comprising or consisting of SEQ ID NO.: 223 (or any variant thereof with sequence identity of 80% or more, as described above), positions 7, 13, 19, 23, 44, 55, 62, 70 and 74 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 223 (or any variant
thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 223 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 223 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as (GG) (e.g., -GGC or CGG-). If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 223 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed and/or preceded by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, YGG(G4S1)1-3-, or -(G4S1)1-3GGY), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 223 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
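The terminal tags described above can be illustrated with a minimal sketch. The helper names, the placeholder core sequence and the choice of glutamate for the X of LPXTG are assumptions made purely for illustration; the core string is not SEQ ID NO.: 223, and (G4S1) is read here as four glycines followed by one serine.

```python
# Illustrative sketch only: helper names and the core sequence are hypothetical.

def g4s_linker(repeats: int) -> str:
    """Return (G4S1)n for n = 1..3, i.e. 'GGGGS' repeated n times."""
    if not 1 <= repeats <= 3:
        raise ValueError("the text describes 1 to 3 G4S1 repeats")
    return "GGGGS" * repeats


def add_c_terminal_tyrosine(core: str, g4s_repeats: int = 0) -> str:
    """Append -GGY, or -(G4S1)nGGY when g4s_repeats is 1..3."""
    linker = g4s_linker(g4s_repeats) if g4s_repeats else ""
    return core + linker + "GGY"


def add_n_terminal_cysteine(core: str) -> str:
    """Prepend CGG- (a cysteine followed by a GG tag)."""
    return "CGG" + core


def add_sortase_motif(core: str, x_residue: str = "E") -> str:
    """Append a C-terminal LPXTG motif.

    x_residue fills the X position; 'E' is an arbitrary illustrative choice.
    """
    if len(x_residue) != 1:
        raise ValueError("X must be a single residue")
    return core + "LP" + x_residue + "TG"


core = "AEVQLK"  # placeholder core sequence (assumption, not from the source)
print(add_c_terminal_tyrosine(core))      # AEVQLKGGY
print(add_c_terminal_tyrosine(core, 2))   # AEVQLKGGGGSGGGGSGGY
print(add_n_terminal_cysteine(core))      # CGGAEVQLK
print(add_sortase_motif(core))            # AEVQLKLPETG
```

Longer linkers are obtained simply by raising `g4s_repeats`, which mirrors the remark that more flexibility may be preferred for some applications.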
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 224, or a polypeptide which has 80% or more identity with SEQ ID NO.: 224, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 224, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, about 7 kDa or about 15 kDa, preferably about 15 or 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound,
if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds. In this embodiment, the amino acids at the solvent-accessible positions 7, 17, 31, 44, 55, 62, 68, 75 and 112 (according to Kabat numbering) are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 224 (or any variant thereof with sequence identity of 80% or more, as described above), positions 7, 17, 31, 44, 55, 62, 68, 75 and 112 (according to Kabat numbering) are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. Further, in this embodiment, the amino acid at position 108 (according to Kabat numbering) is preferably leucine. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 224 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 224 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 224 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a (GG) tag (e.g., CGG- or CGG-).
If a tyrosine is present in the N- and/or C-terminal of the polypeptide defined by SEQ ID NO.: 224 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, -(G4S1)1-3GGY, or YGG(G4S1)1-3-), preferably YGG- or -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 224 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
Without being limiting, advantageous immunoglobulin single variable domains which may be used as starting point (ISVD precursors) for developing the preferred ISVD building block comprised in the molecule of the present technology are described in WO 2018/099968 and WO 2010/139808. Preferably, the anti-hRSV immunoglobulin single variable domain which may be used as starting point for developing the preferred ISVD building block is selected from any of the sequences depicted on Table A-2 on p. 69-70 of WO 2018/099968, incorporated herewith by reference.
An ISVD which may be used as starting point for developing ISVD building blocks is SEQ ID NO.: 5 depicted in Table A-1, page 388 of WO 2010/139808 (RSV001A04, SEQ ID NO.: 179 in the present description).
DARPin-based building block
In addition, suitable building blocks in the context of the present technology may be derived from small, globular proteins, as defined above, such as other biologicals, e.g., may be derived from DARPins.
In the context of the present technology, a "DARPin-based building block" refers to a protein-based building block which derives from a DARPin, i.e., which is structurally similar to a DARPin but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the DARPin precursor specifically binds. For instance, the DARPin-based building block has a sequence identity of at least 60%, or 70%, or 80% with a DARPin, e.g., its DARPin precursor. For instance, the DARPin-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with a DARPin, e.g., its DARPin precursor. For instance, a DARPin-based building block may share the whole amino acid sequence with its DARPin precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids. In addition, the DARPin-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about
2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, such as about 6 kDa, or 7 kDa, or 14-16 kDa, does not specifically bind any human protein and preferably does not specifically bind any (non-human) protein or non-protein molecule to which the precursor specifically binds. Preferably, as described above, at least one DARPin-based protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the DARPin precursor specifically binds.
DARPins (designed ankyrin repeat proteins) are small, single domain proteins (14-16 kDa) which can be selected to bind any given target protein with high affinity and specificity (from Stumpp MT et al., "DARPins: A new generation of protein therapeutics", Drug Discovery Today, 2008, 13(15-16):695-701). As explained above, the protein-based building block derived from DARPins no longer specifically binds any human protein, i.e., the precursor DARPin as defined in SEQ ID NO.: 187 has been engineered/modified so that it no longer specifically binds any human protein. For instance, DARPin K27 may be modified so that it no longer binds any human protein, as described above, in particular so that it no longer specifically binds human KRAS protein, as described above (or binds it with low specificity/low affinity, as described above). Preferably the protein-based building block derived from DARPins also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), preferably it does not specifically bind any human non-protein molecule (such as human DNA, human RNA, human glycans, human lipids (e.g., such as phosphatidylserine (PS)), etc.), preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., such as phosphatidylserine (PS)), etc.), to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor binds specifically, if any. Preferably, as described above, at least one DARPin-based protein-based building block, preferably when conjugated to at least one cargo, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any
target, such as protein and/or non-protein molecules, including biomolecules, to which the DARPin precursor specifically binds.
Further, preferably, the at least one DARPin-based carrier building block (i) does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5x10^-4 mol/litre, preferably as determined by cell-binding assay, (ii) does not specifically bind any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to virus with a KD (KD value) greater than 5x10^-4 mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, and/or (iii) does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5x10^-4 mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein.
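The KD criterion above amounts to a simple threshold test, sketched below under stated assumptions. The function name is hypothetical, and the convention that `None` stands for "no binding detected" is an assumption of this sketch, not something the text specifies.

```python
from typing import Optional

# Threshold taken from the text: a KD greater than 5x10^-4 mol/litre.
KD_THRESHOLD_MOL_PER_L = 5e-4


def counts_as_non_binding(kd_mol_per_l: Optional[float]) -> bool:
    """Return True when a measurement satisfies the criterion in the text.

    kd_mol_per_l is a measured KD in mol/litre, or None when no binding
    was detected at all (an assumed convention for this sketch).
    """
    if kd_mol_per_l is None:
        return True
    return kd_mol_per_l > KD_THRESHOLD_MOL_PER_L


print(counts_as_non_binding(None))   # True: no binding detected
print(counts_as_non_binding(1e-3))   # True: weaker than the threshold
print(counts_as_non_binding(1e-9))   # False: nanomolar binder
```

A larger KD means weaker binding, which is why the comparison is "greater than" rather than "less than".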
In one embodiment, the protein-based carrier building block comprised in the molecule of the present technology is based on the polypeptide as defined in SEQ ID NO.: 187 (DARPin K27 building block precursor):
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGRTPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKL
In one embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 187, or a sequence which has 80% or more identity with SEQ ID NO.: 187, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 187, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 187:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 187:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 187:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
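The three position lists above are nested tiers, which a short sketch can make explicit. The function and tier names are hypothetical; the position strings are copied from the lists above, with positions taken as 1-based indices into SEQ ID NO.: 187.

```python
def parse_positions(spec: str) -> set:
    """Expand a spec such as '1-2, 4-5, 8' into a set of 1-based positions."""
    positions = set()
    for token in spec.split(","):
        token = token.strip()
        if not token:
            continue
        if "-" in token:
            start, end = (int(part) for part in token.split("-"))
            positions.update(range(start, end + 1))
        else:
            positions.add(int(token))
    return positions


BROAD = parse_positions(
    "1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, "
    "60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, "
    "118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155"
)
PREFERRED = parse_positions(
    "5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155"
)
MORE_PREFERRED = parse_positions("85, 95, 143, 148")


def tier_of(position: int):
    """Return the narrowest tier containing a position, or None if unlisted."""
    if position in MORE_PREFERRED:
        return "more preferred"
    if position in PREFERRED:
        return "preferred"
    if position in BROAD:
        return "listed"
    return None


# Each narrower list is a subset of the wider one.
print(MORE_PREFERRED <= PREFERRED <= BROAD)  # True
print(tier_of(85), tier_of(5), tier_of(1), tier_of(3))
```

Position 3, for example, appears in none of the lists, so `tier_of(3)` returns `None`.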
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 187, or a sequence which has 80% or more identity with SEQ ID NO.: 187, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 187, wherein the polypeptide comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 187:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in more than one of the following positions in SEQ ID NO.: 187:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in more than one of the following positions in SEQ ID NO.: 187:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
Hence, the present technology further provides a polypeptide and/or molecule which comprises SEQ ID NO.: 187, or a sequence which has 80% or more identity with SEQ ID NO.: 187, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 187, wherein SEQ ID NO.: 187 comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 187:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 187:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 187:
85, 95, 143, 148.
Preferably, the polypeptide and/or molecule of the present technology comprise SEQ ID NO.: 187, or a sequence which has 80% or more identity with SEQ ID NO.: 187, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 187, wherein SEQ ID NO.: 187 comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 187:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in more than one of the following positions in SEQ ID NO.: 187:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in more than one of the following positions in SEQ ID NO.: 187:
85, 95, 143, 148.
For instance, the following point mutations can be introduced into the polypeptide as defined in SEQ ID NO.: 187, so that it no longer binds any human protein, in particular so that it no longer binds the precursor target, e.g., protein KRAS:
R69A, R102A and R111A.
See SEQ ID NO.: 180, which corresponds to SEQ ID NO.: 187 but with the above Arg to Ala mutations at positions 69, 102 and 111:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKL.
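The relationship between SEQ ID NO.: 187 and SEQ ID NO.: 180 can be checked with a short sketch that applies the three Arg to Ala substitutions and then computes a position-by-position identity. The function names are hypothetical, and the identity calculation is a simplification that assumes equal-length, already aligned sequences; it is not necessarily the alignment-based method defined elsewhere in the description.

```python
# Sequences copied from the text (SEQ ID NO.: 187 and SEQ ID NO.: 180).
SEQ_187 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGR"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFD"
    "ISIDNGNEDLAEILQKL"
)
SEQ_180 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAF"
    "DISIDNGNEDLAEILQKL"
)


def apply_point_mutations(sequence: str, mutations: list) -> str:
    """Apply mutations written like 'R69A' (wild type, 1-based position, mutant)."""
    residues = list(sequence)
    for mutation in mutations:
        wild_type, position, mutant = mutation[0], int(mutation[1:-1]), mutation[-1]
        if residues[position - 1] != wild_type:
            raise ValueError(f"{mutation}: found {residues[position - 1]} at {position}")
        residues[position - 1] = mutant
    return "".join(residues)


def percent_identity(first: str, second: str) -> float:
    """Ungapped identity for two equal-length sequences (a simplification)."""
    if len(first) != len(second):
        raise ValueError("this sketch only handles equal-length sequences")
    matches = sum(a == b for a, b in zip(first, second))
    return 100.0 * matches / len(first)


mutant = apply_point_mutations(SEQ_187, ["R69A", "R102A", "R111A"])
print(mutant == SEQ_180)  # True
print(round(percent_identity(SEQ_187, SEQ_180), 2))  # 98.08
```

With three substitutions in 156 residues, the mutant stays well above the 80% identity floor recited for variants of SEQ ID NO.: 187.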
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 180, or a sequence which has 80% or more identity with SEQ ID NO.: 180, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 180, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 180:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 180:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 180:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa,
such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
In another embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 180, or a sequence which has 80% or more identity with SEQ ID NO.: 180, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 180, wherein the polypeptide comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 180:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in more than one of the following positions in SEQ ID NO.: 180:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in more than one of the following positions in SEQ ID NO.: 180:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
Hence, the present technology further provides a polypeptide and/or molecule which comprise SEQ ID NO.: 180, or a sequence which has 80% or more identity with SEQ ID NO.: 180, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 180, wherein SEQ ID NO.: 180 comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 180:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 180:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 180:
85, 95, 143, 148.
In one embodiment, the polypeptide and/or molecule of the present technology comprise SEQ ID NO.: 180, or a sequence which has 80% or more identity with SEQ ID NO.: 180, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 180, wherein SEQ ID NO.: 180 comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 180:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155,
preferably in more than one of the following positions in SEQ ID NO.: 180:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in more than one of the following positions in SEQ ID NO.: 180:
85, 95, 143, 148.
In another embodiment, the polypeptide as described in SEQ ID NO.: 180 does not have the C-terminal leucine (K27m (without the C-terminal L)), see SEQ ID NO.: 68 and Figure 2:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQK
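As a rough plausibility check on the size criterion used throughout this section, the sketch below derives the sequence of SEQ ID NO.: 68 by dropping the C-terminal leucine from SEQ ID NO.: 180 and estimates its molecular mass. The function name is hypothetical, and the calculation assumes standard average residue masses for an unmodified, unconjugated chain, so it is an approximation only.

```python
# SEQ ID NO.: 180, copied from the text.
SEQ_180 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAF"
    "DISIDNGNEDLAEILQKL"
)

# Standard average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS_DA = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS_DA = 18.0153  # one water for the free N- and C-termini


def estimate_mass_kda(sequence: str) -> float:
    """Estimate the average molecular mass of an unmodified chain, in kDa."""
    total = sum(AVERAGE_RESIDUE_MASS_DA[residue] for residue in sequence)
    return (total + WATER_MASS_DA) / 1000.0


# K27m without the C-terminal leucine, as described for SEQ ID NO.: 68.
assert SEQ_180.endswith("L")
SEQ_68 = SEQ_180[:-1]

mass_kda = estimate_mass_kda(SEQ_68)
print(len(SEQ_68))                 # 155
print(f"{mass_kda:.1f} kDa")
print(2.5 <= mass_kda <= 70.0)     # True: inside the broadest recited range
```

The same helper could be applied to a cysteine-substituted variant to confirm that point substitutions barely move the estimated mass.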
In one embodiment, the protein-based building block comprised in the molecule of the present technology does not comprise or consist of a protein with SEQ ID NO.: 180.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 68, or a sequence which has 80% or more identity with SEQ ID NO.: 68, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 68, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 68:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 68:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 68:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 68, or a sequence which has 80% or more identity with SEQ ID NO.: 68, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 68, wherein the polypeptide comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 68:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in more than one of the following positions in SEQ ID NO.: 68:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155,
more preferably in more than one of the following positions in SEQ ID NO.: 68:
85, 95, 143, 148, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
Hence, the present technology further provides a polypeptide and/or molecule which comprise SEQ ID NO.: 68, or a sequence which has 80% or more identity with SEQ ID NO.: 68, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 68, wherein SEQ ID NO.: 68 comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 68:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 68:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 68:
85, 95, 143, 148.
Preferably, the polypeptide and/or molecule of the present technology comprise SEQ ID NO.: 68, or a sequence which has 80% or more identity with SEQ ID NO.: 68, preferably a sequence
which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 68, wherein SEQ ID NO.: 68 comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 68:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in more than one of the following positions in SEQ ID NO.: 68:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in more than one of the following positions in SEQ ID NO.: 68:
85, 95, 143, 148.
Hence, in one embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 188:
X1X2GX3X4LLX5AAX6X7X8X9X10X11X12VX13X14LMX15X16X17AX18VX19AX20X21X22X23GX24TPLHLAAX25X26X27X28X29X30IVX31VLLX32X33X34AX35VX36AX37DX38X39GATPLHLAAX40X41X42X43X44X45IVX46VLLX47X48X49AX50VX51AX52DX53X54GATPLHX55AAX56X57X58X59X60X61IVX62X63LX64X65X66X67AX68X69X70AX71DX72X73X74X75TAX76X77ISX78X79X80X81X82X83X84LAX85X86LX87X88X89X90, wherein:
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine; X4 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine; X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X6 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine; X7 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine; X8 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X9 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine; X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X13 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine; X14 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine; X15 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine; X16 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine; X17 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X18 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; X19 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine; X20 can be His or any amino acid with a reactive group in its side chain, such as cysteine; X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; X22 can be Thr or any amino acid with a reactive group in its side chain, such as cysteine; X23 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine; X24 can be Phe or any amino acid with a reactive group in its side chain,
such as cysteine; X25 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine; X26 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine; X27 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X28 can be His or any amino acid with a reactive group in its side chain, such as cysteine X29 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine X31 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine X32 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine X33 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine X34 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine
X35 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine X36 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine X37 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine X38 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine; X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine; X41 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine; X42 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X43 can be His or any amino acid with a reactive group in its side chain, such as cysteine; X44 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine; X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X46can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X47 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine; X48 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine; X49 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X50 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; X51 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine; X52 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine; X53 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X54 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X55 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine; X56 can be Lys or any amino acid with a reactive group in its side 
chain, such as cysteine; X57 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine; X58 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine; X59 can be His or any amino acid with a reactive group in its side chain, such as cysteine; Xeo can be Leu or any amino acid with a reactive group in its side chain, such as cysteine; Xei can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X62 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine; X63 can be Vai or any amino acid with a reactive group in its side chain, such as cysteine; X64 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X65 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X66 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X67 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X68 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X69 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X70 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X71 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X72 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X73 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X74 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X75 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X76 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X77 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X78 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X79 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X80 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X81 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X82 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X83 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X84 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X85 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X86 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X87 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X88 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X89 can be absent or Leu;
X90 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above.
Hence, the present technology provides a polypeptide and/or molecule which comprise SEQ ID NO.: 188 as defined above, or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188.
In another embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 189:
DLGKX1LLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLX2IVEVLLKNGAX3VNAX4DSY GATPLHLAAMRGHLX5IVX6VLLKYGAX7VX8AX9DEX10GATPLHLAAKAGHLX11IVEVLLKNGAX12VNAQ DKFGKTAFDISIX13NGNEX14LAEILQX15X16X17, wherein
X1 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be absent or Leu;
X17 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 189, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 189, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above.
Hence, the present technology provides a polypeptide and/or molecule which comprises SEQ ID NO.: 189 as defined above, or a sequence which has 80% or more identity with SEQ ID NO.: 189, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 189.
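As an illustration of how a sequence identity threshold of this kind (80%, 85%, 90% and so on) could be checked in practice, the following is a minimal sketch. It assumes a simple Needleman-Wunsch global alignment with unit scores and defines identity as matching positions divided by alignment length; both the scoring scheme and that denominator are assumptions of this sketch, not the definition of identity given elsewhere in this specification, and the function names are hypothetical.

```python
def global_alignment(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Needleman-Wunsch global alignment; returns both sequences with '-' for gaps."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned residues as a percentage of the alignment length (assumed definition)."""
    al_a, al_b = global_alignment(a, b)
    if not al_a:
        return 0.0
    matches = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * matches / len(al_a)


def meets_identity_threshold(candidate: str, reference: str, threshold: float = 80.0) -> bool:
    """True if the candidate reaches the given percent identity with the reference."""
    return percent_identity(candidate, reference) >= threshold
```

Established alignment tools typically use substitution matrices and affine gap penalties, so values from this sketch may differ from theirs for divergent sequences.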
In another embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 181:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA TPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTA FDISIX3NGNEX4LAEILQKX5X6, wherein
X1 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be absent or Leu; and
X6 can be absent or Cys,
or a sequence which has 80% or more identity with SEQ ID NO.: 181, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 181, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above.
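The size criterion above (about 2.5 to about 70 kDa, with narrower preferred ranges) can be estimated directly from a sequence. The sketch below sums standard average residue masses and adds one water molecule for the free termini. The function names are hypothetical, the default bounds are the widest range recited above, and because the text says "about", the hard cut-offs are a simplification made for this sketch.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def molecular_mass_kda(sequence: str) -> float:
    """Approximate average molecular mass of an unmodified polypeptide, in kDa."""
    unknown = set(sequence) - set(AVERAGE_RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    total = sum(AVERAGE_RESIDUE_MASS[res] for res in sequence) + WATER_MASS
    return total / 1000.0


def in_size_range(sequence: str, low_kda: float = 2.5, high_kda: float = 70.0) -> bool:
    """Check the estimated mass against an assumed closed interval in kDa."""
    return low_kda <= molecular_mass_kda(sequence) <= high_kda
```

The estimate ignores conjugated cargo, disulfide bonds and post-translational modifications, so it applies to the bare building block only.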
For instance, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 181 (or variants thereof with sequence identity of 80% or more, as described above) may comprise at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as two amino acids with a reactive group in its side chain, such as two cysteines, or two lysines, or two tyrosines, or two non-natural amino acids, preferably two cysteines in the following solvent-accessible positions (see SEQ ID NO.: 181), and X5 and X6 are absent:
X1 and X2; or
X3 and X4.
For instance, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 181 (or variants thereof with sequence identity of 80% or more, as described above) may comprise at least one amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as four amino acids with a reactive group in its side chain, such as four cysteines, or four lysines, or four tyrosines, or four non-natural amino acids, preferably four cysteines in the following solvent-accessible positions, and X5 and X6 are absent:
X1, X2, X3 and X4, see SEQ ID NO.: 181.
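The relationship between the placeholders X1 to X6 of SEQ ID NO.: 181 and residue numbering can be made concrete with a short sketch. It tokenizes the template as printed above, reports the 1-based position of each placeholder, and fills the placeholders with chosen residues. The helper names are hypothetical; the positions it reports for X1 to X4 (85, 95, 143 and 148) agree with the solvent-accessible positions recited for SEQ ID NOs.: 96 to 98.

```python
import re

# Template copied from SEQ ID NO.: 181 as printed; X1..X6 are variable positions.
SEQ181_TEMPLATE = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTA"
    "FDISIX3NGNEX4LAEILQKX5X6"
)

_TOKEN = re.compile(r"X\d+|[A-Z]")


def tokenize(template: str) -> list:
    """Split a template into single-letter residues and Xn placeholders."""
    return _TOKEN.findall(template)


def placeholder_positions(template: str) -> dict:
    """Map each placeholder to its 1-based position, counting every placeholder as one residue."""
    return {tok: idx for idx, tok in enumerate(tokenize(template), start=1) if len(tok) > 1}


def fill(template: str, choices: dict) -> str:
    """Replace each placeholder with the chosen residue; an empty string means 'absent'."""
    return "".join(choices[tok] if len(tok) > 1 else tok for tok in tokenize(template))


# Example: four cysteines at X1..X4, with X5 and X6 absent.
four_cys = fill(SEQ181_TEMPLATE, {"X1": "C", "X2": "C", "X3": "C", "X4": "C", "X5": "", "X6": ""})
```

Because X5 and X6 sit at the C-terminal end, leaving them absent does not shift the numbering of X1 to X4.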
Additionally, the protein-based carrier building block which comprises, or alternatively, consists of, any one of SEQ ID NOs.: 188, 189 or 181 (or variants thereof with sequence identity of 80% or more, as described above) may comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 181 (or any variant thereof with sequence identity of 80% or more, as described above). In a preferred embodiment, the polypeptide defined by any one of SEQ ID NOs.: 188, 189 or 181 comprises one C-terminal cysteine (i.e., X90, X17 and X6, respectively, are Cys). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by any one of SEQ ID NOs.: 188, 189 or 181 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a GG tag (e.g., -GGC or CGG-). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by any one of SEQ ID NOs.: 188, 189 or 181 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, -(G4S1)1-3GGY, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, any one of SEQ ID NOs.: 188, 189 or 181 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
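The terminal tag layouts mentioned above (for example CGG-, -GGC, YGG-, -GGY and the longer G4S-containing variants) can be assembled mechanically. The following is a small sketch under the assumption that the flexible linker is zero or more GGGGS repeats followed by GG, placed between the extra residue and the core polypeptide; the function and parameter names are hypothetical, and the YGG(S1G4) and YGG(G4S1) layouts are not covered.

```python
def add_terminal_tag(core: str, residue: str, terminus: str, g4s_repeats: int = 0) -> str:
    """Attach an extra Cys or Tyr to one end of `core` through a flexible linker.

    The linker is `g4s_repeats` copies of GGGGS followed by GG, so that
    g4s_repeats=0 gives the plain GG tag (e.g. CGG- or -GGC).
    """
    if residue not in ("C", "Y"):
        raise ValueError("residue must be 'C' (cysteine) or 'Y' (tyrosine)")
    if g4s_repeats < 0:
        raise ValueError("g4s_repeats must be non-negative")
    linker = "GGGGS" * g4s_repeats + "GG"
    if terminus == "N":
        return residue + linker + core   # e.g. YGG-core
    if terminus == "C":
        return core + linker + residue   # e.g. core-GGC
    raise ValueError("terminus must be 'N' or 'C'")
```

For instance, `add_terminal_tag(core, "Y", "N", g4s_repeats=2)` would prepend a tyrosine followed by two GGGGS repeats and GG to the core sequence.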
In one embodiment, the polypeptide defined by any one of SEQ ID NOs.: 188, 189 or 181 comprises a C-terminal cysteine, preferably wherein the C-terminal cysteine is not preceded by any tag (sequence), see, e.g., SEQ ID NO.: 182, which corresponds to SEQ ID NO.: 181 but comprises a C-terminal Cys (i.e., X5 in SEQ ID NO.: 181 is absent and X6 in SEQ ID NO.: 181 is Cys):
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA TPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTA FDISIX3NGNEX4LAEILQKC, wherein
X1 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; and
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine.
In one embodiment, the at least one protein-based carrier building block present in the molecule of the present technology comprises, or alternatively, consists of, one of the sequences of Table 6, or a sequence which has 80% or more identity with a sequence of Table 6, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 6, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above.
Hence, the present technology further provides a polypeptide and/or molecule which comprise one of the sequences of Table 6, or a sequence which has 80% or more identity with a sequence of Table 6, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 6.
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 96, or a polypeptide which has 80% or more identity with SEQ ID NO.: 96, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 143 and 148 (X3 and X4 in SEQ ID NOs.: 181 and 182) and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 96 (or any variant thereof with sequence identity of 80% or more, as described above), positions 143 and 148 (X3 and X4 in SEQ ID NOs.: 181 and 182) and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, whose thiol groups are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 96 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 96 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 96 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a GG tag (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 96 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 96 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 97, or a polypeptide which has 80% or more identity with SEQ ID NO.: 97, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 97, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 85 and 95 (X1 and X2 in SEQ ID NOs.: 181 and 182) and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 97 (or any variant thereof with sequence identity of 80% or more, as described above), positions 85 and 95 (X1 and X2 in SEQ ID NOs.: 181 and 182) and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, whose thiol groups are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 97 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 97 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 97 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a GG tag (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 97 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 97 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 98, or a polypeptide which has 80% or more identity with SEQ ID NO.: 98, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 98, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 85, 95, 143 and 148 (X1 to X4 in SEQ ID NOs.: 181 and 182) and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 98 (or any variant thereof with sequence identity of 80% or more, as described above), positions 85, 95, 143 and 148 (X1 to X4 in SEQ ID NOs.: 181 and 182) and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, whose thiol groups are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 98 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 98 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 98 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a GG tag (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 98 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 98 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 199, or a polypeptide which has 80% or more identity with SEQ ID NO.: 199, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 199, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above. In this embodiment, the C-terminal amino acid is a cysteine. Hence, in the building block comprising or consisting of SEQ ID NO.: 199 (or any variant thereof with sequence identity of 80% or more, as described above), the C-terminal cysteine occupies a solvent-accessible position and comprises a thiol group, which is a conjugation site present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 199 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 199 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 199 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a GG tag (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 199 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 199 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 208, or a polypeptide which has 80% or more identity with SEQ ID NO.: 208, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular does not specifically bind to human KRAS protein, as described in detail above. In this embodiment, the C-terminal amino acid is a cysteine. Hence, in the building block comprising or consisting of SEQ ID NO.: 208 (or any variant thereof with sequence identity of 80% or more, as described above), the C-terminal cysteine occupies a solvent-accessible position and comprises a thiol group, which is a conjugation site present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 208 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminus of the polypeptide defined by SEQ ID NO.: 208 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 208 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as a GG tag (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 208 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086.
In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 208 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)i-s tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In one embodiment, the DARPin-based building block is not and/or does not comprise the amino acid sequence with SEQ ID NO.: 180. Preferably, the at least one attachment point present in the DARPin-based building block is not the C-terminal reactive group, i.e., the -COOH reactive group present in the C-terminal amino acid of the DARPin-based building block, and/or a primary amine, preferably is not a primary amine present in the side chain of a lysine and/or at the N-terminus. In a preferred embodiment, the at least one attachment point of the DARPin-based building block is a thiol group (-SH), preferably present in the side chain of at least one cysteine located at a solvent-accessible position in the DARPin-based building block.
Affibody-based building block
Affibody molecules are a class of engineered affinity proteins with proven potential for therapeutic, diagnostic and biotechnological applications. Affibody molecules are small (6.5 kDa) single domain proteins that can be isolated for high affinity and specificity to any given protein target (from FEBS Letters, Volume 584, Issue 12, 18 June 2010, Pages 2670-2680). In the context of the present technology, an "affibody-based building block" refers to a protein-based building block which derives from an affibody, i.e., which is structurally similar to an affibody but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the affibody precursor specifically binds. For instance, the affibody-based building block has a sequence identity of at least 60%, or 70%, or 80% with an affibody, e.g., its affibody precursor. For instance, the affibody-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with an affibody, e.g., its affibody precursor. For instance, an affibody-based building block may share the whole amino acid sequence with its affibody precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids.
In addition, the affibody-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind any human protein and preferably does not specifically bind any (non-human) protein or non-protein molecule to which the precursor specifically binds.
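By way of a non-limiting illustration, the sequence-identity thresholds and the "all but N amino acids" wording above can be related to one another with a short calculation. The sketch below assumes substitution-only variants of the same length as the precursor, so identity is counted position by position without alignment or gaps, which is a simplification of how sequence identity is usually determined. The function names, the toy sequences and the 58-residue length are assumptions made for the example and are not taken from the present disclosure.

```python
def percent_identity(reference: str, variant: str) -> float:
    """Position-wise identity of two equal-length sequences (no alignment, no gaps)."""
    if len(reference) != len(variant):
        raise ValueError("this sketch only handles equal-length, substitution-only variants")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(reference, variant))
    return 100.0 * matches / len(reference)


def max_substitutions(length: int, min_identity_pct: int) -> int:
    """Largest number of substituted positions that still meets the identity threshold."""
    return length * (100 - min_identity_pct) // 100


# Hypothetical 10-residue toy sequences, used only to exercise the functions.
precursor = "ACDEFGHIKL"
building_block = "ACDEFGHIKC"  # one substitution, at the last position

print(percent_identity(precursor, building_block))
# Assumed scaffold length of 58 residues with an 80% identity threshold.
print(max_substitutions(58, 80))
```

Under these assumptions, a longer scaffold tolerates proportionally more substitutions at the same threshold, which is why the list of permitted differences above extends to thirty or more amino acids.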
Affitin-based building block
Affitins are artificial proteins with the ability to selectively bind antigens. They are structurally derived from the DNA-binding protein Sac7d, found in Sulfolobus acidocaldarius. Due to their small size and high solubility, they can be easily produced in large amounts using bacterial expression systems (see, e.g., Kalichuk V. et al., "A novel, smaller scaffold for Affitins: Showcase with binders specific for EpCAM", Biotechnol Bioeng. 2018; 115(2):290-299). In the context of the present technology, an "affitin-based building block" refers to a protein-based building block which derives from an affitin, i.e., which is structurally similar to an affitin but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the affitin precursor specifically binds. For instance, the affitin-based building block has a sequence identity of at least 60%, or 70%, or 80% with an affitin, e.g., its affitin precursor. For instance, the affitin-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with an affitin, e.g., its affitin precursor. For instance, an affitin-based building block may share the whole amino acid sequence with its affitin precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids. In addition, the affitin-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind any human protein and preferably does not specifically bind any (non-human) protein or non-protein molecule to which the precursor specifically binds.
Small globular human protein-based building block
The protein-based carrier building block(s) comprised in the molecule of the present technology may be based on a small globular human protein. In the context of the present technology, a "small globular human protein" refers to a human protein which has a size (molecular mass) of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa, as described herein, and which has a globular three-dimensional (3D) structure, as described herein.
Hence, the protein-based carrier building block(s) comprised in the molecule of the present technology may be based on a small globular human protein, such as cyclin-dependent kinase subunit (CKS), e.g., it may be based on cyclin-dependent kinase subunit 1 (CKS1, Gene ID: 983). The binding functionality of the building block precursor (a small globular human protein in this case, such as CKS1) should be eliminated by, e.g., introducing at least one conjugation site in its target binding site, or, e.g., by mutating residues in or near the binding site. The resulting protein-based building block should not specifically bind any human protein. Furthermore, preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., phosphatidylserine (PS)), etc.), in particular not to any non-protein molecule to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor binds specifically, if any.
Preferably, the small globular human protein on which the protein-based building block may be based does not comprise any cysteine in its original amino acid sequence, in particular if cysteine-engineering is carried out to create conjugation sites (e.g., if the protein-based carrier building block precursor will be modified by adding cysteines, preferably at solvent-accessible positions, to generate new conjugation sites or attachment points). In the context of the present technology, a "small globular human protein-based building block" refers to a protein-based building block which derives from a small globular human protein as defined herein, e.g., which is structurally similar to a small globular human protein but does not specifically bind to any human protein, preferably does not specifically bind to any target to which the small globular human protein precursor specifically binds. For instance, the small globular human protein-based building block has a sequence identity of at least 60%, or 70%, or 80% with a small globular human protein, e.g., its small globular human protein precursor. For instance, the small globular human protein-based building block has a sequence identity of at least 85%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99%, or more with a small globular human protein, e.g., its small globular human protein precursor. For instance, a small globular human protein-based building block may share the whole amino acid sequence with its small globular human protein precursor with the exception of at least one, such as one, two, three, four, five, six, seven, eight, nine, ten, fifteen, eighteen, twenty, twenty-five, thirty or more amino acids. In addition, the small globular human protein-based building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind any human protein and preferably does not specifically bind any (non-human) protein or non-protein molecule to which the precursor specifically binds.
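For illustration only, the size criterion recited above can be checked from an amino acid sequence by summing average residue masses and adding one water molecule. The sketch below applies this to the 78-residue polypeptide given in this section as SEQ ID NO.: 190; the function names and the choice of average (rather than monoisotopic) masses are assumptions of the example, and the calculation ignores any cargo, tags or post-translational modifications.

```python
# Average residue masses in Da (free amino acid minus one water), standard values.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain for the free N- and C-termini

# SEQ ID NO.: 190 as given in this section.
SEQ_190 = (
    "SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK"
)


def average_mass_kda(sequence: str) -> float:
    """Approximate average molecular mass of an unmodified polypeptide, in kDa."""
    return (sum(RESIDUE_MASS[aa] for aa in sequence) + WATER) / 1000.0


def in_size_window(sequence: str, low_kda: float = 2.5, high_kda: float = 70.0) -> bool:
    """True if the approximate mass lies within the stated size range."""
    return low_kda <= average_mass_kda(sequence) <= high_kda


print(round(average_mass_kda(SEQ_190), 2))
print(in_size_window(SEQ_190, 2.5, 16.0))
```

Replacing a residue with cysteine changes the mass by only a few tens of daltons, so cysteine-engineered variants of such a sequence would be expected to remain in the same size window.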
In one embodiment, the at least one protein-based carrier building block precursor is a small globular human protein such as CKS (e.g., CKS1). As explained in detail above, the resulting protein-based building block should no longer specifically bind to any human protein. Furthermore, preferably it also does not specifically bind to any non-protein molecule (such as DNA, RNA, glycans, lipids (e.g., phosphatidylserine (PS)), etc.), in particular not to any non-protein molecule to which the building block precursor binds specifically, if any, and preferably it also does not specifically bind to any non-human protein (e.g., a bacterial and/or viral protein) to which the building block precursor binds specifically, if any.
In one embodiment, the protein-based carrier building block comprised in the molecule of the present technology is based on the polypeptide as defined in SEQ ID NO.: 190:
SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK.
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 190, or a sequence which has 80% or more identity with SEQ ID NO.: 190, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 190, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 190:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64, 69-78, preferably in at least one of the following positions in SEQ ID NO.: 190:
9-13, 22, 33, 51, 57 and 78, or in at least one of the following positions in SEQ ID NO.: 190:
1, 4, 10, 12, 25, 29, 33, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
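As a non-limiting sketch, the position lists recited above can be expanded into explicit 1-based residue numbers and used to generate a cysteine-substituted variant of SEQ ID NO.: 190. The helper names below are assumptions of the example; the sketch only performs the substitution and does not assess solubility, fold or binding, which the proviso above still requires.

```python
# SEQ ID NO.: 190 as given in this section.
SEQ_190 = (
    "SHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK"
)


def expand_positions(spec: str) -> list:
    """Expand a list such as '1-4, 6-7, 22' into explicit 1-based positions."""
    positions = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            start, end = (int(x) for x in part.split("-"))
            positions.extend(range(start, end + 1))
        else:
            positions.append(int(part))
    return positions


# Position lists transcribed from the paragraph above.
ALLOWED = expand_positions(
    "1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, "
    "50-52, 54, 56-64, 69-78"
)
PREFERRED = expand_positions("9-13, 22, 33, 51, 57, 78")


def substitute(sequence: str, positions, residue: str = "C", allowed=ALLOWED) -> str:
    """Return a copy of `sequence` with `residue` at each listed 1-based position."""
    chars = list(sequence)
    for pos in positions:
        if pos not in allowed:
            raise ValueError(f"position {pos} is not in the recited list")
        chars[pos - 1] = residue
    return "".join(chars)


variant = substitute(SEQ_190, [10, 12])
print(variant)
```

Because SEQ ID NO.: 190 contains no cysteine, every cysteine in a variant produced this way corresponds to an engineered conjugation site.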
Preferably, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 190, or a sequence which has 80% or more identity with SEQ ID NO.: 190, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 190, wherein the polypeptide comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 190:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64, 69-78, preferably in at least one of the following positions in SEQ ID NO.: 190:
9-13, 22, 33, 51, 57 and 78, or in more than one of the following positions in SEQ ID NO.: 190:
1, 4, 10, 12, 25, 29, 33, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
Hence, the present technology further provides a polypeptide and/or molecule which comprises SEQ ID NO.: 190, or a sequence which has 80% or more identity with SEQ ID NO.: 190, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 190, wherein SEQ ID NO.: 190 comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 190:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64, 69-78, preferably in at least one of the following positions in SEQ ID NO.: 190:
9-13, 22, 33, 51, 57 and 78, or in at least one of the following positions in SEQ ID NO.: 190:
1, 4, 10, 12, 25, 29, 33.
Preferably, the polypeptide and/or molecule of the present technology comprises SEQ ID NO.: 190, or a sequence which has 80% or more identity with SEQ ID NO.: 190, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 190, wherein SEQ ID NO.: 190 comprises more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, amino acids with a reactive group in its side chain, such as cysteine, in more than one, such as two, three, four, five, six, seven, eight, nine, ten, or more, of the following positions in SEQ ID NO.: 190:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64, 69-78, preferably in at least one of the following positions in SEQ ID NO.: 190:
9-13, 22, 33, 51, 57 and 78, or in more than one of the following positions in SEQ ID NO.: 190:
1, 4, 10, 12, 25, 29, 33.
Hence, in one embodiment, the protein-based carrier building block comprised in the molecule of the present technology comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 191:
X1X2X3X4IX5X6SX7X8X9X10X11X12X13X14X15X16X17X18VX19LRX20X21X22AX23X24VX25X23bX24bX25bX26MX27X28X29X30WX31X32LX33VX34QX35X36X37WX38HX39X40X41X42X43X44X45X46X47ILLFX48X49X50X51X52X53X54X55X56X57, wherein
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X23b can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24b can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X25b can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
Hence, the present technology provides a polypeptide and/or molecule which comprises SEQ ID NO.: 191 as defined above, or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191.
In another embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 205:
SHKQIYYSX1X2X3X4X5EEFEYRHVX6LPKDIAKLVPX7THLMSESEWRNLGVQQSX8GWVHYX9IHEPEPHILLFRRPLPKKPKX10, wherein
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
Hence, the present technology provides a polypeptide and/or molecule which comprises SEQ ID NO.: 205 as defined above, or a sequence which has 80% or more identity with SEQ ID NO.: 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 205.
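For illustration, the generic formula of SEQ ID NO.: 205 can be treated as a template in which each placeholder X1 to X10 is filled either with the residue named first in its definition or with a residue bearing a reactive side chain, such as cysteine. The sketch below is one possible way to do this; the function and variable names are assumptions of the example. With every placeholder left at its first-named residue, the template reproduces SEQ ID NO.: 190.

```python
import re

# Generic formula of SEQ ID NO.: 205, with placeholders X1 to X10.
SEQ_205_TEMPLATE = (
    "SHKQIYYSX1X2X3X4X5EEFEYRHVX6LPKDIAKLVPX7THLMSESEWRNLGVQQSX8GWVHYX9"
    "IHEPEPHILLFRRPLPKKPKX10"
)

# First-named residue for each placeholder, as listed in the definitions above.
DEFAULTS = {1: "D", 2: "K", 3: "Y", 4: "D", 5: "D", 6: "M", 7: "K", 8: "Q", 9: "M", 10: "K"}


def fill_template(template: str, choices=None) -> str:
    """Replace each X<n> placeholder with the chosen residue, else its default."""
    residues = {**DEFAULTS, **(choices or {})}
    return re.sub(r"X(\d+)", lambda m: residues[int(m.group(1))], template)


# Unmodified sequence, and a variant with cysteine at X2 and X4.
print(fill_template(SEQ_205_TEMPLATE))
print(fill_template(SEQ_205_TEMPLATE, {2: "C", 4: "C"}))
```

The placeholders X1 to X5, X6, X7, X8, X9 and X10 correspond to positions 9-13, 22, 33, 51, 57 and 78 of SEQ ID NO.: 190, respectively, which is the preferred position list recited earlier in this section.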
In another embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, SEQ ID NO.: 192:
X1HKX2IYYSDX3YX4DEEFEYRHVMLPX5DIAX6LVPX7THLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK, wherein
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 192, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 192, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
Hence, the present technology provides a polypeptide and/or molecule which comprises SEQ ID NO.: 192 as defined above, or a sequence which has 80% or more identity with SEQ ID NO.: 192, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 192.
Additionally, the protein-based carrier building block which comprises, or alternatively, consists of, any one of SEQ ID NOs.: 191, 192 or 205 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by any one of SEQ ID NOs.: 191, 192 or 205 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by any one of SEQ ID NOs.: 191, 192 or 205 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as a (GG) sequence (e.g., -GGC or CGG-). If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by any one of SEQ ID NOs.: 191, 192 or 205 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, Y(G4S1)1-3GG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or -(G4S1)1-3GGY), preferably -GGY, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, any one of SEQ ID NOs.: 191, 192 or 205 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
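Purely as an illustrative sketch, the terminal tags mentioned above (e.g., CGG-, YGG-, -GGC, -GGY and the longer Y(G4S1)1-3GG- form) can be assembled as plain strings. The sketch assumes that (G4S1)1-3 denotes one to three repeats of GGGGS; the function names and the short placeholder core are hypothetical and are not sequences of the present disclosure.

```python
G4S = "GGGGS"  # one (G4S1) unit, under the assumption stated above


def n_terminal_tag(core: str, residue: str, linker_repeats: int = 0) -> str:
    """residue + optional (G4S) repeats + GG + core, e.g. CGG- or Y(G4S)nGG-."""
    if residue not in ("C", "Y"):
        raise ValueError("this sketch only covers an extra cysteine or tyrosine")
    if not 0 <= linker_repeats <= 3:
        raise ValueError("linker_repeats must be between 0 and 3")
    return residue + G4S * linker_repeats + "GG" + core


def c_terminal_tag(core: str, residue: str) -> str:
    """core + GG + residue, e.g. -GGC or -GGY."""
    if residue not in ("C", "Y"):
        raise ValueError("this sketch only covers an extra cysteine or tyrosine")
    return core + "GG" + residue


core = "AAAA"  # hypothetical stand-in for a building-block sequence
print(n_terminal_tag(core, "C"))                    # CGG- form
print(n_terminal_tag(core, "Y", linker_repeats=2))  # Y(G4S)2GG- form
print(c_terminal_tag(core, "Y"))                    # -GGY form
```

The sortase-related elements (the LPXTG motif and the oligoglycine tag) are deliberately left out of the sketch, since the residue at the X position of the motif is not specified in this passage.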
In one embodiment, the at least one protein-based carrier building block comprised in the molecule of the present technology comprises, or alternatively, consists of, one of the sequences of Table 7, or a sequence which has 80% or more identity with a sequence of Table 7, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 7, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above.
In another embodiment, the present technology also provides a polypeptide and/or molecule which comprise one of the sequences of Table 7, or a sequence which has 80% or more identity with a sequence of Table 7, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with a sequence of Table 7.
In one embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 99, or a polypeptide which has 80% or more identity with SEQ ID NO.: 99, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 10 and 13 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 99 (or any variant thereof with sequence identity of 80% or more, as described above), positions 10 and 13 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 99 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 99 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 99 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag, such as a (GG) tag (sequence) (e.g., CGG-).
If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 99 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 99 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 100, or a polypeptide which has 80% or more identity with SEQ ID NO.: 100, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 100, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 10, 12, 22 and 51 are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 100 (or any variant thereof with sequence identity of 80% or more, as described above), positions 10, 12, 22 and 51 are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 100 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 100 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 100 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be preceded/followed by a flexible tag (sequence), such as (GG) (e.g., -GGC or CGG-).
If a tyrosine is present at the N- and/or C-terminus of the polypeptide defined by SEQ ID NO.: 100 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be preceded/followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, -GGY, -(G4S1)1-3GGY, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably -GGY or YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, SEQ ID NO.: 100 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a sortase-recognition motif (LPXTG) at the C-terminal end, and/or a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 101, or a polypeptide which has 80% or more identity with SEQ ID NO.: 101, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 101, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 11 and 51 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 101 (or any variant thereof with sequence identity of 80% or more, as described above), positions 11 and 51 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 101 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 101 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 101 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG) (e.g., CGG-).
If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 101 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 101 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 102, or a polypeptide which has 80% or more identity with SEQ ID NO.: 102, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 102, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 13 and 33 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 102 (or any variant thereof with sequence identity of 80% or more, as described above), positions 13 and 33 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 102 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 102 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 102 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG) (e.g., CGG-).
If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 102 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 102 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 103, or a polypeptide which has 80% or more identity with SEQ ID NO.: 103, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 103, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 11, 33, and 51 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 103 (or any variant thereof with sequence identity of 80% or more, as described above), positions 11, 33, and 51 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 103 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 103 (or any variant thereof with sequence identity of 80% or more, as described above). If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 103 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG) (e.g., CGG-).
If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 103 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 103 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 104, or a polypeptide which has 80% or more identity with SEQ ID NO.: 104, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 104, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 9, 13, 22, 33 and 51 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 104 (or any variant thereof with sequence identity of 80% or more, as described above), positions 9, 13, 22, 33 and 51 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 104 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 104 (or any variant thereof with sequence identity of 80% or more, as described above).
If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 104 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG) (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 104 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 104 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
In another embodiment, the at least one protein-based carrier building block comprises, or alternatively, consists of, SEQ ID NO.: 105, or a polypeptide which has 80% or more identity with SEQ ID NO.: 105, preferably which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 105, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, as described in detail above. In this embodiment, the amino acids at the solvent-accessible positions 9, 13, 33, 51 and 57 and the C-terminal amino acid are preferably cysteines. Hence, in the building block comprising or consisting of SEQ ID NO.: 105 (or any variant thereof with sequence identity of 80% or more, as described above), positions 9, 13, 33, 51 and 57 and the C-terminus are solvent-accessible positions, and are preferably occupied by cysteines, which comprise thiol groups, which are the conjugation sites present in the protein building block, as described above. In addition, in this embodiment, the building block comprising or consisting of SEQ ID NO.: 105 (or any variant thereof with sequence identity of 80% or more, as described above) may additionally comprise an extra cysteine and/or an extra tyrosine at the N-terminal end of the polypeptide defined by SEQ ID NO.: 105 (or any variant thereof with sequence identity of 80% or more, as described above).
If a cysteine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 105 (or any variant thereof with sequence identity of 80% or more, as described above), the cysteine may be followed by a flexible tag (sequence), such as (GG) (e.g., CGG-). If a tyrosine is present at the N-terminus of the polypeptide defined by SEQ ID NO.: 105 (or any variant thereof with sequence identity of 80% or more, as described above), the tyrosine may be followed by flexible tags (sequences), such as (GG) or (G4S1)1-3GG tags (sequences) (e.g., YGG-, YGG(G4S1)1-3-, YGG(S1G4)1-3- or Y(G4S1)1-3GG-), preferably YGG-, although longer linkers might be preferred for applications where, e.g., more flexibility is needed, as described in detail in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. In addition, the protein-based carrier building block which comprises, or alternatively, consists of, the polypeptide defined by SEQ ID NO.: 105 (or variants thereof with sequence identity of 80% or more, as described above) may additionally comprise a (Gly)1-5 tag at the N-terminal end, to allow for conjugation of the cargo using sortase, as explained in detail above, see in particular Guimaraes C. P. et al., Theile C. S. et al. and Witte M. D. et al.
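Two of the criteria recited in the preceding embodiments (sequence identity of 80% or more with a reference sequence, and cysteines at stated positions) can be illustrated with the following minimal sketch. It assumes that the two sequences have already been aligned to equal length and counts identity over the full alignment length, which is only one of several conventions in use; the toy sequences and function names are hypothetical and are not SEQ ID NOs.: 99-105.

```python
# Illustrative sketch only. Identity is computed over a pre-made alignment;
# the convention (matches / alignment length) is an assumption.

def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue (gaps never match)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


def cysteines_at(sequence, positions, c_terminal=False):
    """Check for cysteine at 1-indexed positions and, optionally, at the C-terminus."""
    ok = all(1 <= p <= len(sequence) and sequence[p - 1] == "C" for p in positions)
    if c_terminal:
        ok = ok and sequence.endswith("C")
    return ok


if __name__ == "__main__":
    reference = "AAAAACAAAC"  # hypothetical 10-residue reference
    variant = "AAGAACAAAC"    # hypothetical variant with one substitution
    identity = percent_identity(reference, variant)
    print(identity >= 80.0, cysteines_at(variant, [6], c_terminal=True))
```

The remaining provisos (globular 3D structure, solubility, molecular mass range, absence of specific binding to human proteins) are experimental or structural properties and are not captured by a sequence comparison of this kind.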
Cargos
As described in detail above, the molecule of the present technology comprises at least one protein-based carrier building block, at least one cargo which is a NLS, preferably also at least one further cargo which is a (cell)-targeting moiety and/or a CPP and, in addition, it may further comprise at least one, preferably at least two, attachment point(s) or conjugation site(s) suitable for attachment of different molecules, including proteins, peptides, toxic payloads, nucleic acids, glycans, radio-isotopes, PEG, etc. and combinations thereof, see below for further examples of suitable cargos. In the context of the present technology, a "cargo" is any molecule which is/may be attached or conjugated to the protein-based carrier building block through the attachment point(s) or conjugation site(s) present therein. Hence, a "cargo", in the context of the present technology, may be any molecule, including proteins, peptides, small molecules, toxic payloads, nucleic acids (such as DNA, RNA, ASOs, etc.), vitamins, lipids, glycans, radio-isotopes, PEG, etc. and combinations thereof, see below for specific examples. "Cargo", as defined herein, comprises NLSs, as described herein, CPPs, as described herein, therapeutic moieties, as described herein, and (cell)-targeting moieties, as defined herein.
Besides the NLS(s), as described herein, and the optional (cell)-targeting moiety(ies), therapeutic moieties, and/or CPPs, as described herein, the molecule of the present technology may comprise at least one further cargo as defined herein conjugated to one of the attachment points or conjugation sites present in the at least one protein-based carrier building block.
In a preferred embodiment, the at least one further cargo which may be attached to the conjugation site(s) of the at least one carrier building block is an ISVD as described herein (also referred to in the present description as "cargo ISVD"). In this context, the cargo ISVD may preferably specifically bind to one or more proteins in the human body, such as human proteins and/or may also specifically bind other proteins present in the human body (e.g., viral or bacterial proteins which are in the human body).
In a preferred embodiment, the molecule of the present technology comprises at least one further cargo attached to one of the conjugation sites or attachment points present in the protein-based building block, preferably to a conjugation site or attachment point which is the side chain of an amino acid preferably located at a solvent-accessible position of the protein-based building block. In a preferred embodiment, the at least one protein-based carrier building block is an ISVD-derived building block, as described herein, and the further cargo attached to the building block is an ISVD as described herein. Hence, in this embodiment, the molecule of the present technology comprises at least one protein-based carrier building block which is derived from an ISVD, preferably from a heavy-chain ISVD, and one further cargo attached to it, wherein the cargo is an ISVD, preferably a heavy-chain ISVD.
In another embodiment, the at least one further cargo which may be attached to a conjugation site(s) of the at least one carrier building block is a group, residue, moiety or binding unit which provides the protein-based carrier building block (and/or molecule) with increased (in vivo) half-life compared to the corresponding carrier building block/molecule without said one or more other groups, residues, moieties or binding units.
The cargos (including the NLS(s), (cell)-targeting moiety(ies), CPPs, therapeutic moieties and/or further cargos, if present) are attached (or "anchored", "conjugated", "linked") to the at least one protein-based building block via at least one conjugation site, as described above. The cargo(s) and the at least one protein-based building block may be directly linked to each other (as for example described in WO 1999/23221) and/or may be linked to each other via one or more suitable linkers, or any combination thereof. Suitable linkers for use in the molecule of the present technology will be clear to the skilled person, and may generally be any linker used in the art to link amino acid sequences or any other molecule comprised in the cargo. Preferably, said linker is suitable for use in constructing proteins or polypeptides that are intended for pharmaceutical use. Some particularly preferred linkers include the linkers that are used in the art to link antibody fragments or antibody domains. These include the linkers mentioned in the publication cited above, as well as for example linkers that are used in the art to construct diabodies or scFv fragments. In this respect, however, it should be noted that, whereas in diabodies and in scFv fragments the linker sequence used should have a length, a degree of flexibility and other properties that allow the pertinent VH and VL domains to come together to form the complete antigen-binding site, there is no particular limitation on the length or the flexibility of the linker used in the molecule of this technology; this can be tuned depending on the specific application, e.g., on the number and nature of the cargos to be attached to the protein-based building block, on the specific position and number of attachment points or conjugation sites and on the nature of the linker. As also shown in the examples, the skilled person will be able to select an appropriate linker for a certain application.
For example, a linker may be a suitable amino acid or amino acid sequence, and in particular amino acid sequences of between 1 and 50, preferably between 1 and 30, such as between 1 and 10 amino acid residues. Some preferred examples of such amino acid sequences include Gly-Ser linkers, for example of the type (GlyxSery)z, such as, for example, (Gly4Ser)3 or (Gly3Ser2)3, as described in WO 1999/42077, and the GS30, GS15, GS9 and GS7 linkers described in the applications by Ablynx mentioned herein (see for example WO 2006/040153 and WO 2006/122825), as well as hinge-like regions, such as the hinge regions of naturally occurring heavy chain antibodies or similar sequences (such as described in WO 1994/04678). Some linkers are depicted in Table A-1 below. Hence, any of the linkers shown in Table A-1 may be used in the molecule of the present technology. For instance, a preferred linker which may be used in the molecule of the present technology is depicted in SEQ ID NO.: 163 (15GS). Preferred linkers, e.g., comprise or consist of SEQ ID NO.: 158, SEQ ID NO.: 163, SEQ ID NO.: 168 or SEQ ID NO.: 161.
Table A-1 Preferred linker sequences
Some other particularly preferred linkers are poly-alanine (such as AAA), as well as the linkers GS30 (SEQ ID NO: 85 in WO 2006/122825) and GS9 (SEQ ID NO: 84 in WO 2006/122825). In a preferred aspect the linker is chosen from the group consisting of SEQ ID NOs: 158-169 and 193-196, or 298, and the linker "A3 (3A)" as depicted in Table A-1. For instance, if the cargo is a NLS, the NLS may be covalently attached to the at least one (and preferably more) conjugation sites comprised in the protein-based building block via a 3G linker (GGG). Preferred linkers, e.g., comprise or consist of SEQ ID NO.: 158, SEQ ID NO.: 163, SEQ ID NO.: 168 or SEQ ID NO.: 161.
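As a minimal sketch of the Gly-Ser linker notation used above, a linker of the type (GlyxSery)z can be written out in one-letter code as follows. The function name and the optional length check (1 to 50 residues, taken from the broadest range stated above) are illustrative assumptions; no correspondence between the generated strings and any particular SEQ ID NO. is asserted.

```python
# Illustrative sketch: expands (GlyxSery)z into a one-letter amino acid string.

def gly_ser_linker(x, y, z, max_length=50):
    """Return z repeats of a unit made of x glycines followed by y serines."""
    if x < 0 or y < 0 or z < 1 or x + y == 0:
        raise ValueError("need z >= 1 and at least one residue per repeat unit")
    linker = ("G" * x + "S" * y) * z
    if len(linker) > max_length:
        raise ValueError("linker exceeds the chosen maximum length")
    return linker


if __name__ == "__main__":
    print(gly_ser_linker(4, 1, 3))  # (Gly4Ser)3
    print(gly_ser_linker(3, 2, 3))  # (Gly3Ser2)3
    print(gly_ser_linker(3, 0, 1))  # a 3G linker (GGG)
```

Tuning x, y and z changes the length and composition of the linker, which is one simple way of exploring the length and flexibility considerations discussed in this section.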
Polyethylene glycol (PEG), in any of the variants described below, may also be used as a linker in the molecule of the present technology. Other suitable linkers for use in the molecule of the present technology are described, e.g., in Kjeldsen T. et al. ("Dually reactive long recombinant linkers for bioconjugations as an alternative to PEG", ACS Omega, 2020, 5:19827-19833). As described therein, polar protein sequences with PEG-like properties, sometimes called "recombinant PEG", have in recent years been described by Alvarez ("Improving protein pharmacokinetics by genetic fusion to simple amino acid sequences", J. Biol. Chem., 2004, 279:3375-3381), Amunix (mixed sequences of GEDSTAP residues, termed "ELNN polypeptides", see, e.g., US 2014/0301974 A1), XL-protein (PAS repeats), Novo Nordisk (GQAP-like repeats), SOBI and others.
As used herein, the terms "ELNN polypeptides" and "ELNNs" are synonymous and refer to extended length polypeptides comprising non-naturally occurring, substantially non-repetitive sequences (e.g., polypeptide motifs) that are composed mainly of small hydrophilic amino acids, with the sequence having a low degree or no secondary or tertiary structure under physiologic conditions. ELNN polypeptides include unstructured hydrophilic polypeptides comprising repeating motifs of 6 natural amino acids (G, A, P, E, S, and/or T). In some embodiments, an ELNN polypeptide comprises multiple motifs of 6 natural amino acids (G, A, P, E, S, T), wherein the motifs are the same or comprise a combination of different motifs. In some embodiments, ELNN polypeptides can confer certain desirable pharmacokinetic, physicochemical, and pharmaceutical properties when linked to proteins (e.g., when linked to the protein-based building block). Such desirable properties may include but are not limited to enhanced pharmacokinetic parameters and solubility characteristics, as well as improved therapeutic index. ELNN polypeptides are known in the art, and non-limiting descriptions relating to and examples of ELNN polypeptides known as XTEN® polypeptides are available in Schellenberger et al., (2009), Nat Biotechnol 27(12):1186-90; Brandl et al., (2020), Journal of Controlled Release 327:186-197; and Radon et al., (2021), Advanced Functional Materials 31, 2101633 (pages 1-33), the entire contents of each of which are incorporated herein by reference.
Ravtansine/soravtansine (N2'-Deacetyl-N2'-(4-mercapto-4-methyl-1-oxopentyl)-maytansine or DM4, CAS Registry Number: 796073-69-3) is a maytansinoid connected via a cleavable chemical linker to the targeting mAb which may be used as a cytotoxic component of, e.g., antibody-drug conjugates. This cleavable linker is also suitable for being used in the molecule of the present technology.
The length, the degree of flexibility and/or other properties of the linker(s) used may have some influence on the properties of the molecule of the present technology. Based on the disclosure herein and the disclosure of other publications, such as, for example, WO 2017/089618, the skilled person will be able to determine the optimal linker(s) for use in the specific molecule of the present technology, optionally after some limited routine experiments.
Further suitable linkers for use in the molecule of the present technology are, e.g., cleavable linkers, i.e., linkers which have a trigger in their structure that can be efficiently cleaved. For instance, Su, Z. et al. ("Antibody-drug conjugates: Recent advances in linker chemistry", Acta Pharmaceutica Sinica B, 2021, 11(12): 3889-3907) reviews linkers that may be comprised in antibody-drug conjugates and which may also be used in the molecule of the present technology. For example, suitable linkers for use in the molecule of the present technology are the APN-maleimide linker (3-(4-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)phenyl)propiolonitrile, MAPN) or the bis-maleimido-PEG3 linker (BM(PEG)3, 1,11-bismaleimido-triethyleneglycol).
In addition, bifunctional linkers may be used. For instance, the APN-Maleimide linker (806536, Sigma-Aldrich) can be used. This linker allows for conjugation twice, via cysteine-based chemistry. Both APN and maleimide couple to free thiols, albeit at different rates, see Figure 4 and Example 5 below. Other bifunctional linkers may also be used. For instance, a linker may be used which comprises (i) a maleimide group on one end and (ii) either an LPXTG motif (where X can be any amino acid and glycine cannot be a free carboxylate) as the sortase target or an oligoglycine (Gly1-5) on the other end, as described in detail above.
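The sequence features relevant to sortase-mediated conjugation mentioned above (an LPXTG motif, where X is any amino acid, and an N-terminal oligoglycine) can be located in a one-letter sequence with the following minimal sketch. The helper names and example sequences are hypothetical; the sketch only inspects sequence strings and says nothing about reaction efficiency. It also reports whether the glycine of the motif is the last residue, since the text notes that this glycine cannot be a free carboxylate.

```python
import re

# Illustrative sketch: locate sortase-related sequence features in a peptide string.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
SORTASE_MOTIF = re.compile("LP[" + AMINO_ACIDS + "]TG")


def find_sortase_motif(sequence):
    """Return details of the first LPXTG motif, or None if absent."""
    match = SORTASE_MOTIF.search(sequence)
    if match is None:
        return None
    return {
        "start": match.start() + 1,  # 1-indexed position of the leucine
        "motif": match.group(),
        "glycine_is_last_residue": match.end() == len(sequence),
    }


def n_terminal_glycines(sequence):
    """Count consecutive glycines at the N-terminus."""
    match = re.match("G+", sequence)
    return len(match.group()) if match else 0


if __name__ == "__main__":
    print(find_sortase_motif("AAALPETGG"))  # hypothetical donor sequence
    print(n_terminal_glycines("GGGAK"))     # hypothetical acceptor sequence
```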
When two or more linkers are used in the molecule of the present technology, these linkers may be the same or different. Again, based on the disclosure herein, the skilled person will be able to determine the optimal linkers for use in the specific molecule of the present technology, optionally after some limited routine experiments.
Nuclear localization sequences (NLS)
In the context of the present invention, a "nuclear localization sequence" or "NLS" refers to generally short peptides (usually 4-8 amino acids long) that are recognized by specific receptor proteins involved in nuclear import. These receptors are part of the importin family, which includes importin α and importin β. During the transport process, a protein with an NLS binds to importin α, then the importin α-NLS complex interacts with importin β, which enables the entire complex to move through the nuclear pore complex, which acts as a gateway between the cytoplasm and the nucleus. Once inside the nucleus, the protein is released from the importins, allowing it to perform its nuclear functions, see, e.g., Chang CC and Hsia KC., "More than a zip code: global modulation of cellular function by nuclear localization signals", FEBS J. 2021 Oct;288(19):5569-5585. NLSs can be classified as either monopartite or bipartite. Monopartite NLSs have a single basic amino acid cluster (e.g., PKKKRKV, SEQ ID NO.: 256, in SV40 Large T-antigen). Bipartite NLSs have two clusters of basic amino acids separated by a short spacer (e.g., nucleoplasmin's KR[PAATKKAGQA]KKKK, SEQ ID NO.: 257). Suitable examples of NLSs that can be used in the present technology are described, e.g., in Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60, e.g., on Table 1, page 3 or page 4 of this document. See also Table A-0 below. In one embodiment, the at least one NLS comprises or consists of the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221), see, e.g., Dang CV and Lee WM., "Identification of the human c-myc protein nuclear translocation signal", Mol Cell Biol. 1988, 8(10):4048-5.
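The distinction between monopartite and bipartite NLSs can be illustrated with the following heuristic sketch. The patterns used, K(K/R)X(K/R) for a monopartite signal and two basic residues followed by a spacer of 10 to 12 residues and at least three basic residues among the next five for a bipartite signal, are consensus patterns commonly cited in the literature and are an assumption here; they are not taken from the present description, and the function names are hypothetical. The sketch is a rough classifier for illustration only and is no substitute for experimental verification of nuclear import.

```python
import re

# Heuristic sketch only: consensus patterns are assumptions from general literature.

BASIC = set("KR")
MONOPARTITE = re.compile(r"K[KR].[KR]")


def is_bipartite(sequence):
    """Two basic residues, a 10-12 residue spacer, then >= 3 basic residues in the next 5."""
    for i in range(len(sequence) - 1):
        if sequence[i] in BASIC and sequence[i + 1] in BASIC:
            for spacer in (10, 11, 12):
                start = i + 2 + spacer
                window = sequence[start:start + 5]
                if sum(residue in BASIC for residue in window) >= 3:
                    return True
    return False


def classify_nls(sequence):
    """Return 'bipartite', 'monopartite' or 'unclassified' for a one-letter sequence."""
    if is_bipartite(sequence):
        return "bipartite"
    if MONOPARTITE.search(sequence):
        return "monopartite"
    return "unclassified"


if __name__ == "__main__":
    for name, seq in [
        ("SV40 Large T-antigen", "PKKKRKV"),
        ("nucleoplasmin", "KRPAATKKAGQAKKKK"),
        ("cMyc", "PAAKRVKLD"),
    ]:
        print(name, classify_nls(seq))
```

Because the bipartite test is applied first, concatenated repeats of a monopartite signal may also satisfy the bipartite pattern; the output should therefore be read as a pattern match rather than a mechanistic assignment.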
Table A-0 (adapted from Table 1 on p. 2 of Lu J. et al., "Types of nuclear localization signals and mechanisms of protein import into the nucleus", Cell Commun Signal., 2021, 19(1):60)
Further NLSs suitable for use in the present technology are as follows:
KRPAATKKAGQAKKKK, SEQ ID NO.: 275, which is the BP NLS at the C-terminus of nucleoplasmin; and
FGNYNNQSSNFGPMKGGNFGGRSSGPY, SEQ ID NO.: 276, hPY-NLS (Human heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1)).
Preferably, the NLS comprises or consists of PAAKRVKLD, SEQ ID NO.: 221.
In another embodiment, the NLS comprises or consists of SV40mono NLS (SEQ ID NO.: 256, PKKKRKV).
In another embodiment, the NLS comprises or consists of SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV).
In another embodiment, the NLS comprises or consists of NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
The NLS may be preceded by a peptide linker, such as a linker as depicted in Table A-1, e.g., a GGG linker.
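The linker-plus-NLS arrangement described above can be sketched in a few lines of Python. This is only an illustration: the sequences are copied from the text, while the dictionary keys and the function names `linker_nls` and `basic_residue_count` are hypothetical labels introduced here, not part of the disclosure.

```python
# NLS sequences copied from the text; the keys are illustrative labels only.
NLS_SEQUENCES = {
    "cMyc": "PAAKRVKLD",                 # SEQ ID NO.: 221
    "SV40mono": "PKKKRKV",               # SEQ ID NO.: 256
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",  # SEQ ID NO.: 304
    "NLP": "AVKRPAATKKAGQAKKKKLD",       # SEQ ID NO.: 305
}


def linker_nls(nls_name: str, linker: str = "GGG") -> str:
    """Return the peptide linker followed by the named NLS (hypothetical helper)."""
    return linker + NLS_SEQUENCES[nls_name]


def basic_residue_count(peptide: str) -> int:
    """Count the basic residues lysine (K) and arginine (R) in a peptide."""
    return sum(1 for aa in peptide if aa in "KR")


for name in NLS_SEQUENCES:
    peptide = linker_nls(name)
    print(name, peptide, basic_residue_count(peptide))
```

The basic-residue count is included only because NLSs are characterized above by their clusters of basic amino acids; it is not a classifier for monopartite versus bipartite signals.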
The molecule of the present technology comprises one or more NLSs, as defined above. The NLSs comprised in the molecule of the present invention (covalently linked, directly or by means of a linker, to one or more attachment points or conjugation sites comprised in the at least one protein-based building block) may be the same or different. For instance, the molecule of the present technology comprises more than one NLS, wherein all of the NLSs are the same (e.g., they are all PAAKRVKLD, SEQ ID NO.: 221; they are all SV40mono NLS (SEQ ID NO.: 256, PKKKRKV), they are all SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV), they are all NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD)).
Hence, in a preferred embodiment, the molecule of the present technology comprises at least one NLS covalently linked to at least one conjugation site or attachment point comprised in at least one protein-based building block.
In a preferred embodiment, the at least one NLS is covalently attached to at least one conjugation site comprised in the protein-based building block via a linker, such as a peptide linker. Some useful linkers are depicted in Table A-1. For instance, the NLS may be covalently attached to the at least one conjugation site comprised in the protein-based building block via a 3G linker (GGG). In a preferred embodiment, the NLS is PAAKRVKLD, SEQ ID NO.: 221, and it is covalently linked to one attachment point or conjugation site via a GGG linker. In another embodiment, the NLS is SV40mono NLS (SEQ ID NO.: 256, PKKKRKV) and it is covalently linked to one attachment point or conjugation site via a GGG linker. In another embodiment, the NLS is SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV) and it is covalently linked to one attachment point or conjugation site via a GGG linker. In another embodiment, the NLS is NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD) and it is covalently linked to one
attachment point or conjugation site via a GGG linker. In a further preferred embodiment, the attachment point is the C-terminal carboxylic acid of the protein-based building block, which is preferably derived from an ISVD. Of course, the one or more NLSs can also be attached or conjugated, directly or by means of a linker, to one or more attachment points comprised in the protein-based building block by genetic fusion. For instance, a NLS can be attached or conjugated, directly or by means of a linker, to the N-terminal primary amine or to the C-terminal carboxylic acid of the protein-based building block by genetic fusion. Alternatively, the NLS peptides are site-specifically conjugated onto the attachment point(s) or conjugation site(s) that are reactive groups present in the side chain of an amino acid in the protein-based carrier building block, preferably an amino acid present at a solvent-accessible position in the protein-based carrier building block, more preferably reactive groups present in the side chain of a cysteine and/or in the side chain of a tyrosine, and/or in the side chain of a lysine, and/or in the side chain of a non-natural amino acid, preferably located at solvent-accessible positions in the protein-based carrier building block (e.g., -SH group present on the side chain of solvent-accessible cysteines).
As described herein, in the molecule of the present technology, at least one of the attachment points or conjugation sites present in the protein-based building block is linked (directly or via a linker) to a cargo which is a NLS, as defined herein. More than one attachment point or conjugation site present in the protein-based building block may be linked (directly or via a linker) to more than one NLS, which may be the same or different. Hence, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block. It is preferred that the molecule of the present technology comprises, besides the NLS, other cargos covalently linked (directly or via a linker) to conjugation sites or attachment points comprised in at least one protein-based building block, such as CPPs, therapeutic moieties or (cell)-targeting moieties. In one embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block. In another embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block and at least one,
preferably more than one, CPPs covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block. In another embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block, at least one, preferably more than one, CPPs covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block and at least one, preferably more than one, further cargos, such as (cell)-targeting moieties, e.g., tumor-targeting moieties, covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block. In another embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block and at least one, preferably more than one, therapeutic moiety covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block. 
In another embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block, at least one, preferably more than one, therapeutic moiety covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block and at least one, preferably more than one, further cargos, such as (cell)-targeting moieties, e.g., tumor-targeting moieties, covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block. In another embodiment, the molecule of the present technology comprises at least one NLS covalently linked (directly or via a linker) to at least one conjugation site or attachment point comprised in at least one protein-based building block, at least one, preferably more than one, CPPs covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block, at least one, preferably more than one, therapeutic moiety covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point
comprised in at least one protein-based building block and at least one, preferably more than one, further cargos, such as (cell)-targeting moieties, e.g., tumor-targeting moieties, covalently linked (directly or via a linker) to at least one, preferably more than one, conjugation site or attachment point comprised in at least one protein-based building block.
Cell-penetrating peptides (CPPs)
As described above, the protein-based carrier building block may have attached or conjugated, via its one or more conjugation sites or attachment points, one or more other groups, residues, moieties or binding units, optionally linked via one or more peptidic linkers, wherein said one or more other groups, residues, moieties or binding units may be used for delivery purposes ("cell-penetrating peptides", CPPs). For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more CPPs attached or conjugated to the at least one protein-based carrier building block.
Examples of CPPs are provided, e.g., in Lehto T., "Cell-penetrating peptides for the delivery of nucleic acids", Expert Opin. Drug Deliv., 2012, 9(7):823-36 or in Zhang H. et al., "Recent advances of cell-penetrating peptides and their application as vectors for delivery of peptide and protein-based cargo molecules", Pharmaceutics, 2023, 15(8):2093. Examples of suitable CPPs are provided below:
- Transportan, GWTLNSAGYLLGKINLKALAALAKKIL-NH2, SEQ ID NO.: 278;
- TP10, AGYLLGKINLKALAALAKKIL-NH2, SEQ ID NO.: 279;
- MPG, GALFLGWLGAAGSTMGAPKKKRKV-cya (cysteamide), SEQ ID NO.: 280;
- Pep-1, KETWWETWWTEWSQPKKKRKV-cya, SEQ ID NO.: 281;
- Penetratin, RQIKIWFQNRRMKWKK, SEQ ID NO.: 282;
- MAP, KLALKLALKALKAALKLA-NH2, SEQ ID NO.: 283;
- CADY, GLWRALWRLLRSLWRLLWRA-cya, SEQ ID NO.: 284;
- Tat (48-60), GRKKRRQRRRPPQ, SEQ ID NO.: 285;
- Oligoarginine, (R)n;
- CMA-1, a cationic CPP which adopts its active conformation in acidic conditions (see, e.g., Yang Y. et al., "Application of peptides in construction of nonviral vectors for gene delivery", Nanomaterials (Basel), 2022, 12(22):4076). This peptide has the following sequence: GGGIGAVLEVLTTGLPALISWIEEEEQQ (SEQ ID NO.: 218);
- MT23, LPKQKRRQRRRM, SEQ ID NO.: 286;
- tLyP-1, CGNKRTR, SEQ ID NO.: 287;
- gH625, HGLASTLTRWAHYNALIRA, SEQ ID NO.: 288;
- Crotamine, AASSSGGPPPGGGGGCCCCCMILTPPTTLLLLLLLLLHHAATAV, SEQ ID NO.: 289;
- Melittin, GIGAVLKVLTTGLPALISWIKRKRQQ-NH2, SEQ ID NO.: 290;
- Apamin, CNCKAPETALCARRCQQH, SEQ ID NO.: 291;
- LAH4, KKALLALALHHLAHLALHLALALKKA-NH2, SEQ ID NO.: 292; and
- TAT, YGRKKRRQRRREV, SEQ ID NO.: 277.
Further suitable CPPs are described in Xie et al., "Cell-penetrating peptides in diagnosis and treatment of human diseases: from preclinical research to clinical application", Front. Pharmacol., 2020, 11:697.
The CPPs may comprise a poly-Glu sequence to neutralize the cationic nature in neutral conditions as described, e.g., in Farkhani SM. et al., "Effect of poly-glutamate on uptake efficiency and cytotoxicity of cell penetrating peptides", IET Nanobiotechnol., 2016, 10(2):87-95.
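The charge-neutralizing effect of a poly-Glu sequence can be illustrated with a rough residue count. The sketch below is an assumption-laden approximation, not a method taken from the source: it counts +1 for each lysine or arginine and -1 for each aspartate or glutamate, and ignores histidine, the termini and any pKa shifts. The function name `approx_net_charge`, the length of the Glu stretch and its N-terminal placement are all hypothetical choices made for illustration.

```python
def approx_net_charge(peptide: str) -> int:
    """Rough net charge near neutral pH: +1 per K/R, -1 per D/E.

    Simplifying assumptions: histidine and the peptide termini are ignored.
    """
    positive = sum(peptide.count(aa) for aa in "KR")
    negative = sum(peptide.count(aa) for aa in "DE")
    return positive - negative


tat_48_60 = "GRKKRRQRRRPPQ"     # Tat (48-60), SEQ ID NO.: 285
masked = "E" * 8 + tat_48_60    # hypothetical poly-Glu extension, illustration only

print(approx_net_charge(tat_48_60))  # 8
print(approx_net_charge(masked))     # 0
```

Under these assumptions, eight glutamates offset the eight basic residues of Tat (48-60); a real design would need an experimentally grounded charge estimate.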
Hence, the present technology provides molecules as defined herein which comprise at least one protein-based building block with at least one CPP, such as CMA-1, or LAH4, or TAT, attached to at least one attachment point or conjugation site comprised therein. For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more CPPs attached or conjugated, directly or by means of a linker, to the at least one protein-based carrier building block.
(In vivo) half-life extending moieties
As described above, the molecule of the present technology may comprise one or more other groups, residues, moieties or binding units, optionally linked via one or more linkers, such as peptide linkers, as defined above, in which said one or more other groups, residues, moieties or binding units provide the molecule of the present technology with increased (in vivo) half-life, compared to the corresponding molecule without said one or more other groups,
residues, moieties or binding units ("(in vivo) half-life extending moiety", or "half-life extending (HLE) moiety"). The HLE moiety is a cargo as described above when it is attached or conjugated to the at least one attachment point or conjugation site comprised in the protein-based carrier building block.
As such, in one embodiment, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) a half-life extending moiety. The (ii) at least one NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
The term "half-life" as used here can generally be defined as described in paragraph o) on page 57 of WO 2008/020079 and as mentioned therein refers to the time taken for the serum concentration of the compound or polypeptide to be reduced by 50%, in vivo, for example due to degradation of the sequence or compound and/or clearance or sequestration of the sequence or compound by natural mechanisms. The in vivo half-life of the protein-based carrier building block and/or molecule comprising the protein-based carrier building block can be determined in any manner known per se, such as by pharmacokinetic analysis. Suitable techniques will be clear to the person skilled in the art and may for example generally be as described in paragraph o) on page 57 of WO 2008/020079. As also mentioned in paragraph o) on page 57 of WO 2008/020079, the half-life can be expressed using parameters such as the t1/2-alpha, t1/2-beta and the area under the curve (AUC). In this respect it should be noted that the term "half-life" as used herein in particular refers to the t1/2-beta or terminal half-life (in which the t1/2-alpha and/or the AUC or both may be kept out of considerations). Reference is for example made to the standard handbooks, such as Kenneth, A et al: Chemical Stability of Pharmaceuticals: A Handbook for Pharmacists and Peters et al, Pharmacokinetic analysis: A Practical Approach (1996). Reference is also made to "Pharmacokinetics", M Gibaldi & D Perron, published by Marcel Dekker, 2nd Rev. edition (1982). Similarly, the terms "increase in half-life" or "increased half-life" are also as defined in paragraph o) on page 57 of WO 2008/020079 and in particular refer to an increase in the t1/2-beta, either with or without an increase in the t1/2-alpha and/or the AUC or both.
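As an illustration of how a terminal half-life might be estimated from pharmacokinetic data, the following sketch fits a straight line to the logarithm of terminal-phase concentrations and converts the slope to a half-life (ln 2 divided by the elimination rate constant). This is a generic log-linear approach assumed here for illustration; it is not stated to be the method of WO 2008/020079. The function name `terminal_half_life` and the synthetic data are hypothetical.

```python
import math


def terminal_half_life(times, concentrations):
    """Estimate the terminal half-life by log-linear least squares.

    Fits ln(C) = intercept - beta * t and returns ln(2) / beta.
    The caller is assumed to pass only terminal-phase samples.
    """
    if len(times) != len(concentrations) or len(times) < 2:
        raise ValueError("need at least two matched (time, concentration) points")
    logs = [math.log(c) for c in concentrations]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    beta = -sxy / sxx  # elimination rate constant (per unit time)
    if beta <= 0:
        raise ValueError("concentrations do not decline over the given window")
    return math.log(2) / beta


# Synthetic data (not from the source): C(t) = 10 * exp(-0.1 * t), t in hours
times = [24.0, 48.0, 72.0, 96.0]
concs = [10.0 * math.exp(-0.1 * t) for t in times]
half_life = terminal_half_life(times, concs)
print(round(half_life, 2))  # 6.93
```

Choosing which samples belong to the terminal phase is left to the analyst; including distribution-phase points would bias the estimate.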
(In vivo) half-life can be extended by an increase in the hydrodynamic radius (size) or by a decrease in the molecule's clearance. For instance, (in vivo) half-life extending moieties such as polyethylene glycol or ELNN polypeptides increase the size of the molecules to which they are attached, therefore bypassing renal clearance, and thus increasing the half-life of those molecules. Other (in vivo) half-life extending moieties such as binding units that can bind to, e.g., serum albumin, increase the half-life of the molecules to which they are attached by binding, e.g., to serum albumin. Albumin is the most abundant plasma protein, is highly soluble, very stable and has an extraordinarily long circulatory half-life as a direct result of its size and interaction with the FcRn mediated recycling pathway, see, e.g., Sleep D. et al., "Albumin as a versatile platform for drug half-life extension", Biochim Biophys Acta, 2013, 1830(12):5526-34.
The type of groups, residues, moieties or binding units is not generally restricted and may for example be chosen from the group consisting of a polyethylene glycol (PEG) molecule, ELNN polypeptides or fragments thereof, as described above, serum proteins or fragments thereof, binding units that can bind to serum proteins, an Fc portion, and small proteins or peptides that can bind to serum proteins.
More specifically, said one or more other groups, residues, moieties or binding units that provide the molecule of the present technology with increased half-life can be chosen from the group consisting of a polyethylene glycol (PEG) molecule, ELNN polypeptides or fragments thereof, binding units that can bind to serum albumin, such as human serum albumin, or a serum immunoglobulin, such as IgG, or Fc fusions which might provide extra functionalities in vivo such as HLE via FcRn and immune effector functions via Fcγ receptors. In one embodiment, said one or more other groups, residues, moieties or binding units that provide the molecule of the present technology with increased half-life is a binding unit that can bind to human serum albumin. In one embodiment, the binding unit is an ISVD.
For example, WO 2004/041865 describes ISVDs binding to serum albumin (and in particular human serum albumin) that can be linked or attached to other proteins (such as one or more protein-based building blocks comprised in the molecules of the present technology) in order to increase the half-life of the molecule of the present technology.
The international application WO 2006/122787, the content of which is herein incorporated by reference, describes a number of ISVDs against (human) serum albumin. These ISVDs include the ISVDs called Alb-1 (SEQ ID NO: 52 in WO 2006/122787) and humanized variants thereof, such as Alb-8 (SEQ ID NO: 62 in WO 2006/122787). Again, these can be used to extend the half-life of therapeutic proteins and polypeptides, and other entities or moieties, such as the molecule of the present technology.
Moreover, WO 2012/175400, the content of which is herein incorporated by reference, describes a further improved version of Alb-1, called Alb-23.
In one embodiment, the molecule of the present technology further comprises a serum albumin binding moiety selected from Alb-1, Alb-3, Alb-4, Alb-5, Alb-6, Alb-7, Alb-8, Alb-9, Alb-10 (described in WO 2006/122787) and Alb-23. In one embodiment, the serum albumin binding moiety is Alb-8 or Alb-23 or its variants, as shown on pages 7-9 of WO 2012/175400. In one embodiment, the serum albumin binding moiety is selected from the albumin binders described in WO 2012/175741, WO 2015/173325, WO 2017/080850, WO 2017/085172, WO 2018/104444, WO 2018/134235, and WO 2018/134234, the content of which is herein incorporated by reference. Some serum albumin binders are also shown in Table 8 below.
In one embodiment, the molecule of the present technology comprises a serum albumin binding moiety as defined in Table 8 below, or a sequence which has at least 80% amino acid sequence identity, preferably at least 85% amino acid sequence identity, more preferably at least 90% amino acid sequence identity, such as 95% amino acid sequence identity or 99% amino acid sequence identity or more, or even essentially 100% amino acid sequence identity with one or more of the serum albumin binding moieties of Table 8.
In one embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23 (SEQ ID NO.: 51) as defined in Table 8 below, or a sequence which has at least 80% amino acid sequence identity, preferably at least 85% amino acid sequence identity, more preferably at least 90% amino acid sequence identity, such as 95% amino acid sequence identity or 99% amino acid sequence identity or more, or even essentially 100%
amino acid sequence identity with Alb23 (SEQ ID NO.: 51). In one embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002 (SEQ ID NO.: 63) as defined in Table 8 below, or a sequence which has at least 80% amino acid sequence identity, preferably at least 85% amino acid sequence identity, more preferably at least 90% amino acid sequence identity, such as 95% amino acid sequence identity or 99% amino acid sequence identity or more, or even essentially 100% amino acid sequence identity with Alb23002 (SEQ ID NO.: 63). In one embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002(E1D) (SEQ ID NO.: 106) as defined in Table 8 below, or a sequence which has at least 80% amino acid sequence identity, preferably at least 85% amino acid sequence identity, more preferably at least 90% amino acid sequence identity, such as 95% amino acid sequence identity or 99% amino acid sequence identity or more, or even essentially 100% amino acid sequence identity with Alb23002(E1D) (SEQ ID NO.: 106). In one embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23 (SEQ ID NO.: 51) as defined in Table 8 below. In one preferred embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002 (SEQ ID NO.: 63) as defined in Table 8 below. In another preferred embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002(E1D) (SEQ ID NO.: 106) as defined in Table 8 below.
In one embodiment, the molecule of the present technology comprises a HLE moiety as described in the following item A:
A. An ISVD that binds to human serum albumin and comprises
i. a CDR1 that is the amino acid sequence of SEQ ID NO: 65 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 65;
ii. a CDR2 that is the amino acid sequence of SEQ ID NO: 66 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 66; and
iii. a CDR3 that is the amino acid sequence of SEQ ID NO: 67 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 67.
In one embodiment, the ISVD comprises a CDR1 that is the amino acid sequence of SEQ ID NO: 65, a CDR2 that is the amino acid sequence of SEQ ID NO: 66 and a CDR3 that is the amino acid sequence of SEQ ID NO: 67.
Examples of such an ISVD that binds to human serum albumin have one or more, or all, framework regions as indicated for construct ALB23002 (SEQ ID NO.: 63) in Tables 9 and 10 (in addition to the CDRs as defined in the preceding item A). In one embodiment, it is an ISVD comprising or consisting of the full amino acid sequence of construct ALB23002 (SEQ ID NO: 63).
Table 9: Sequences for CDRs according to AbM CDR and framework annotation ("ID" refers to the given SEQ ID NO)
Table 10: Sequences for CDRs according to Kabat CDR and frameworks annotation ("ID" refers to the given SEQ ID NO)
Item A can also be described using the Kabat CDR definition as item A':
A'. An ISVD that binds to human serum albumin and comprises
i. a CDR1 that is the amino acid sequence of SEQ ID NO: 74 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 74;
ii. a CDR2 that is the amino acid sequence of SEQ ID NO: 76 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 76; and
iii. a CDR3 that is the amino acid sequence of SEQ ID NO: 78 or an amino acid sequence with 2 or 1 amino acid difference with SEQ ID NO: 78.
In one embodiment, the ISVD comprises a CDR1 that is the amino acid sequence of SEQ ID NO: 74, a CDR2 that is the amino acid sequence of SEQ ID NO: 76 and a CDR3 that is the amino acid sequence of SEQ ID NO: 78.
Examples of such an ISVD that binds to human serum albumin have one or more, or all, framework regions as indicated for construct ALB23002 in Table 10 (in addition to the CDRs as defined in the preceding item A'). In one embodiment, it is an ISVD comprising or consisting of the full amino acid sequence of construct ALB23002 (SEQ ID NO: 63, see also Table 10).
Also in another embodiment, the amino acid sequence of an ISVD binding to human serum albumin may have a sequence identity of more than 90%, such as more than 95% or more than 99%, with SEQ ID NO: 63, wherein the CDRs are as defined in the preceding item A or A'. In one embodiment, the ISVD binding to human serum albumin comprises or consists of the amino acid sequence of SEQ ID NO: 63.
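The two criteria used above, a limit of 2 or 1 amino acid differences per CDR and a minimum percentage of sequence identity, can be checked with a simple position-by-position comparison. The sketch below is a simplification under stated assumptions: it handles only equal-length, already aligned sequences (substitutions only, no insertions or deletions), and it does not claim to reproduce the definition of sequence identity or amino acid difference used in this disclosure. All function names and the placeholder sequences are hypothetical; they are not the CDRs of any SEQ ID NO.

```python
def count_differences(seq_a: str, seq_b: str) -> int:
    """Number of mismatched positions between two equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("this sketch only handles equal-length sequences")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions over the (shared) sequence length."""
    return 100.0 * (len(seq_a) - count_differences(seq_a, seq_b)) / len(seq_a)


def cdr_within_tolerance(candidate: str, reference: str, max_diff: int = 2) -> bool:
    """True if the candidate differs from the reference at no more than max_diff positions."""
    return count_differences(candidate, reference) <= max_diff


# Placeholder sequences for illustration only
reference = "ACDEFGHIKL"
variant = "ACDEFGHIKV"

print(count_differences(reference, variant))     # 1
print(percent_identity(reference, variant))      # 90.0
print(cdr_within_tolerance(variant, reference))  # True
```

For sequences of different lengths, a proper pairwise alignment would be needed before counting identical positions.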
When such an ISVD binding to human serum albumin has 2 or 1 amino acid difference in at least one CDR relative to a corresponding reference CDR sequence (item A or A' above), the ISVD has at least half the binding affinity, or at least the same binding affinity, to human serum
albumin compared to construct ALB23002 (SEQ ID NO: 63), wherein the binding affinity is measured using the same method, such as SPR.
In one embodiment, when such an ISVD binding to human serum albumin is at the C-terminal position, it exhibits a C-terminal extension, such as a C-terminal alanine, cysteine, or glycine extension. In one embodiment such an ISVD is selected from SEQ ID NOs: 52, 53, 55, 57, 58, 59, 60, 61, 62, and 64 (see Table 8 above). In another embodiment, the ISVD binding to human serum albumin has another position than the C-terminal position (i.e., is not the C-terminal ISVD of the molecule of the present technology). In one embodiment such an ISVD is selected from SEQ ID NOs: 63, 50, 51, 54, 56 and 106 (see Table 8 above).
In one embodiment, said one or more other groups, residues, moieties or binding units that provide the molecule with increased half-life is a peptide that can bind to human serum albumin.
In particular the "serum-albumin binding polypeptide or binding domain" may be any suitable serum-albumin binding peptide capable of increasing the half-life (preferably t1/2-beta, as defined above) of the molecule (compared to the same molecule without the serum-albumin binding peptide or binding domain).
Specifically, the polypeptide sequence suitable for extending serum half-life is a polypeptide sequence capable of binding to a serum protein with a long serum half-life, such as serum albumin, transferrin, IgG, etc, in particular serum albumin.
Polypeptide sequences capable of binding to serum albumin have previously been described and may in particular be serum albumin binding peptides as described in WO 2008/068280 (and in particular WO 2009/127691 and WO 2011/095545), the content of which is herewith incorporated by reference.
In one embodiment, said one or more other groups, residues, moieties or binding units that provide the molecule with increased half-life is straight or branched chain poly(ethylene glycol) (PEG) or derivatives thereof (such as methoxypoly(ethylene glycol) or mPEG), which
increase the hydrodynamic radius of the molecule, thus exceeding the renal clearance threshold and hence providing the molecule with tuneable half-life extension. Generally, any suitable form of PEGylation can be used, such as the PEGylation used in the art for antibodies and antibody fragments (including but not limited to domain antibodies and scFv's); reference is made, for example, to: Chapman, Nat. Biotechnol., 54, 531-545 (2002); Veronese and Harris, Adv. Drug Deliv. Rev. 54, 453-456 (2003); Harris and Chess, Nat. Rev. Drug. Discov. 2 (2003); WO 04/060965; and US6,875,841. Various reagents for PEGylation of polypeptides are also commercially available, for example from Nektar Therapeutics, USA, or NOF Corporation, Japan, such as the Sunbright® EA Series, SH Series, MA Series, CA Series, and ME Series, such as Sunbright® ME-100MA, Sunbright® ME-200MA, and Sunbright® ME-400MA.
After covalent attachment of PEG, molecules can have prolonged blood circulation half-lives, improved drug solubility and stability, and reduced immunogenicity (Swierczewska M., et al., "What is the future of PEGylated therapies?", Expert Opin Emerg Drugs. 2015; 20(4):531-536).
The PEG may be linear or branched and have a molecular weight from about 1 to about 40 kDa, such as from about 1 to about 30 kDa, or from about 1 to about 20 kDa, or from about 1 to about 10 kDa, preferably from about 2 to about 7 kDa, more preferably from about 4 to about 6 kDa and even more preferably of about 5 kDa. The smaller PEG size (e.g., 5 kDa) should enable renal clearance of the PEG moieties, thus avoiding the disadvantages of the large 40-60 kDa PEGs used as standard. In one embodiment, the protein-based carrier building block comprises more than one PEG molecule as described above. For instance, the protein-based carrier building block may comprise 2, 3, 4, 5, 6, 7, 8 or more PEG molecules, such as from 4 to 8 PEG molecules, such as from 5 to 7 PEG molecules, such as 6 PEG molecules. In one embodiment, the protein-based carrier building block comprises 6 molecules of linear 5 kDa PEG. Suitable PEG-groups and methods for attaching them to the protein-based carrier building block will be clear to the skilled person.
High-MW PEGs accumulate in the circulation and cannot be efficiently cleared by the kidneys. This may have a toxic effect in the human body. Conversely, lower-MW PEGs undergo renal clearance, which reduces the toxicity of the PEG-comprising molecules. See, e.g., Fang JL. et al., "Toxicity of
high-molecular-weight polyethylene glycols in Sprague Dawley rats", Toxicol Lett., 2022, 359:22-30.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, onto which at least one PEG molecule is conjugated, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa or less, attached to at least one attachment point or conjugation site. It is preferred that the PEG molecule attached or conjugated to the protein-based building block has a low MW, so that it can be efficiently cleared by the kidneys. The skilled person would understand that, if a PEG molecule has a high MW, it may not be efficiently cleared by the kidneys. Thus, it is preferred that the PEG molecule(s) comprised in the protein-based building block has (have) a low MW, as explained herein (e.g., less than 20 kDa, preferably less than 10 kDa, or less than 7.5 kDa, or less than 5 kDa, or even 1-5 kDa). Of course, as the skilled person would understand, the chosen size of the PEG molecules also depends on the number of PEG molecules attached to the protein-based carrier building block.
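The trade-off between PEG size and PEG number can be made concrete with simple arithmetic. The sketch below only adds nominal masses; the function names and the 15 kDa building-block mass are hypothetical values chosen for illustration, and nominal mass is at best a rough proxy for the hydrodynamic size that actually governs renal clearance.

```python
def total_peg_mass_kda(n_peg: int, peg_kda: float) -> float:
    """Total nominal PEG mass for n_peg chains of peg_kda each."""
    return n_peg * peg_kda


def conjugate_mass_kda(block_kda: float, n_peg: int, peg_kda: float) -> float:
    """Nominal mass of a building block carrying n_peg PEG chains."""
    return block_kda + total_peg_mass_kda(n_peg, peg_kda)


# Six linear 5 kDa PEG chains, as in the embodiment described in the text
print(total_peg_mass_kda(6, 5.0))        # 30.0
# Hypothetical 15 kDa building block (value assumed for illustration)
print(conjugate_mass_kda(15.0, 6, 5.0))  # 45.0
```

Six 5 kDa chains add the same nominal mass as a single 30 kDa chain, while each individual PEG moiety remains small.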
For instance, the ISVD-based building block may comprise or, alternatively, consist of a building block selected from SEQ ID NOs.: 80-95, 175, 185, 186, 206 or 222-225, or a sequence which has 80% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206 or 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206 or 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one cargo attached to it through the at least one conjugation site
or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
For instance, the present technology provides molecules as described herein which comprise at least one such ISVD-based building block, as defined herein, onto which at least one PEG molecule is conjugated, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, attached to at least one attachment point or conjugation site.
As mentioned, other means of increasing the half-life of the molecule of the present technology (such as the presence of linear or branched 40-60 kDa PEGylation, or fusion to human albumin or a suitable fragment thereof), although less preferred, are also included in the scope of the technology.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, onto which at least one PEG molecule is conjugated, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, attached to at least one attachment point or conjugation site. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, onto which at least one PEG molecule is conjugated, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, onto which at least one PEG molecule is conjugated, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block with SEQ ID NO.: 90, onto which at least one, preferably more than one, such as two, or, preferably, three PEG molecules, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, are conjugated to the attachment points or conjugation sites of the ISVD-based building block. In some embodiments, the molecule further comprises an HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In some embodiments, the at least one PEG molecule, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, is conjugated to the ISVD-based building block comprised in molecule T028100118 (SEQ ID NO.: 117). As described above, there is preferably more than one PEG molecule, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, conjugated to the ISVD-derived building blocks, preferably 3 PEG molecules, preferably with a MW of less than 20 kDa, such as less than 15 kDa, or less than 10 kDa, or less than 7.5 kDa, such as about 5 kDa, or less, more preferably 5 kDa, per ISVD-based building block.
The presence of two or more PEG molecules with lower molecular weight (e.g., 5 kDa) is able to increase the half-life of the molecules comprising them, see Example 15 of WO 2024/133935, the content of which is incorporated herein by reference.
Generally, when the protein-based carrier building block and/or molecule of the present technology has increased half-life (e.g., through the presence of a half-life increasing ISVD, PEG moieties or any other suitable way of increasing half-life, as described above), the resulting protein-based carrier building block and/or molecule preferably has a half-life (as defined herein) that is at least 2 times, preferably at least 5 times, for example at least 10 times or more, such as at least 20 times, or at least 50 times, or at least 100 times, or at least 150 times, or at least 200 times, or at least 300 times, or at least 400 times, or at least 500 times, greater than the half-life of the protein-based carrier building block and/or molecule without the half-life increasing group, residue, moiety or binding unit (as measured either in man and/or a suitable animal model, such as mouse or cynomolgus monkey). In particular, the protein-based carrier building block and/or molecule may have a half-life (as defined herein) in human subjects of at least 1 day, such as at least 3 days, or at least 7 days, such as at least 10 days, or at least 15 days, or at least 20 days. The skilled person is able to select the HLE moiety based on the desired half-life of the protein-based carrier building block and/or molecule of the present technology. For certain applications, however, it may be desirable that the protein-based carrier building block and/or molecule of the present technology has a shorter half-life (e.g., radio imaging/therapy in a theranostic setting).
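The fold-increase comparison described above is simple arithmetic and may be sketched as follows. The sketch assumes that both half-lives are measured in the same species and expressed in the same unit; the function names and the numerical values in the example are hypothetical and are not measured data from this specification.

```python
# Sketch only: ratio of half-life with and without the half-life extending moiety,
# compared against the fold tiers recited in the text.

FOLD_TIERS = (2, 5, 10, 20, 50, 100, 150, 200, 300, 400, 500)


def fold_increase(extended_half_life: float, baseline_half_life: float) -> float:
    """Ratio of the half-life with the extending moiety to the half-life without it."""
    if baseline_half_life <= 0:
        raise ValueError("baseline half-life must be positive")
    return extended_half_life / baseline_half_life


def highest_tier_met(fold: float):
    """Return the largest recited tier reached by the fold increase, or None."""
    met = [tier for tier in FOLD_TIERS if fold >= tier]
    return met[-1] if met else None


# Hypothetical values in hours, for illustration only.
fold = fold_increase(extended_half_life=36.0, baseline_half_life=0.5)
print(fold, highest_tier_met(fold))
```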
In one embodiment, the molecule of the present technology may comprise a half-life extending moiety, as described above, attached or conjugated to the protein-based carrier building block. For instance, the half-life extending moiety may be an albumin-binding ISVD, such as an albumin-binding ISVD as defined in Table 8 above, or a polypeptide which has at least 80% identity with a polypeptide of Table 8, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide of Table 8. For instance, the molecule according to this embodiment may comprise a half-life extending moiety which is the polypeptide with SEQ ID NO.: 106, or the polypeptide with SEQ ID NO.: 63, and one protein-based carrier building block.
The protein-based carrier building block in this embodiment may be any protein-based building block as described above in this specification. In particular, the protein-based carrier building block in this embodiment may be an ISVD-based building block, as described above, a DARPin-based building block or a small globular human protein-based building block (e.g., a CKS-1-based building block), as described above.
For instance, the protein-based carrier building block in this embodiment may be an ISVD-derived protein-based carrier building block as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225 or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175 or 222-225, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175 or 222-225.
For instance, the protein-based carrier building block in this embodiment may be derived from a small globular protein, such as a protein based on cyclin-dependent kinase subunit (CKS). For instance, the protein-based carrier building block in this embodiment may be a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192.
For instance, the protein-based carrier building block in this embodiment may be derived from a DARPin protein. For instance, the protein-based carrier building block in this embodiment may be a protein as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189.
Hence, the present technology further provides a molecule comprising or, alternatively consisting of, at least one protein-based building block as described herein and at least one (in vivo) half-life extending moiety as described herein.
For instance, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225 or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, and (ii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
More specifically, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225 or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, (ii) at least one nuclear localization sequence (NLS), and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
More specifically, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225 or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 80-95, 175, 185-186, 206, 222-225, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
For instance, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, and (ii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
More specifically, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, (ii) at least one nuclear localization sequence (NLS), and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
More specifically, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 96-98, 181-182, 199, 208 or 188-189, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
For instance, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 99-105, 205 or 191-192, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, and (ii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
For instance, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 99-105, 205 or 191-192, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, (ii) at least one nuclear localization sequence (NLS), and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
For instance, the molecule of the present technology may comprise (i) a protein-based building block comprising or, alternatively, consisting of SEQ ID NO.: 99-105, 205 or 191-192, or a polypeptide which has at least 80% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, preferably at least 85%, more preferably at least 90%, or at least 95%, or at least 99% identity with a polypeptide as defined in any one of SEQ ID NO.: 99-105, 205 or 191-192, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a half-life extending moiety as described herein, such as a serum albumin binding ISVD, e.g., as defined in Table 8, and/or a PEG molecule, and/or an ELNN polypeptide or a fragment thereof.
In one preferred embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002 (SEQ ID NO.: 63) as defined in Table 8. In another preferred embodiment, the molecule of the present technology comprises the serum albumin binding moiety Alb23002(E1D) (SEQ ID NO.: 106) as defined in Table 8.
The half-life extending moiety may be covalently attached to the conjugation site on the protein-based carrier building block either directly or by means of a linker, such as a linker selected from the linkers depicted in Table A-1 (e.g., GGG or SEQ ID NO.: 158-169 and 193-196, or 298). For instance, the half-life extending moiety may be covalently attached to the protein-based carrier building block by means of a linker, such as a 15GS linker (SEQ ID NO.: 163).
Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is based on an ISVD are depicted in SEQ ID NOs.: 107-122, 176 and SEQ ID NO: 306, see Table 11 below.
Table 11: Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is based on an ISVD ("ID" refers to the SEQ ID NO as used herein)
Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is derived from the small globular protein CKS are depicted in SEQ ID NO.: 123-127 and 170-171, see Table 12 below.
Table 12: Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is derived from the small globular protein CKS ("ID" refers to the SEQ ID NO as used herein)
Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is derived from DARPins are depicted in SEQ ID NO.: 172-174 and 200, see Table 13 below.
Table 13: Examples of polypeptides comprising a half-life extending moiety which is an albumin-binding ISVD and a protein-based carrier building block which is a DARPin-based building block ("ID" refers to the SEQ ID NO as used herein)
Targeting moieties or cell-targeting moieties
As described above, the protein-based carrier building block may have attached or conjugated, via one or more conjugation sites or attachment points, one or more groups, residues, moieties or binding units, optionally attached via one or more linkers, in which said one or more other groups, residues, moieties or binding units target the molecule of the present technology to target molecules on cells, organs or tissues ("targeting moiety" or "cell-targeting moiety" in the context of the present technology). For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more targeting moieties attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) a targeting moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a targeting moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) a targeting moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) a targeting moiety as described herein.
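The combinations of components listed above can be pictured as a simple record. The following is an illustrative sketch only: the class name, field names and helper methods are hypothetical and are not part of the present technology. The SEQ ID NOs used in the example (97 for a DARPin-based building block, 106 for an albumin-binding ISVD, and 221, 256, 304 and 305 for the NLS) are those recited in this specification.

```python
# Sketch only: a hypothetical record describing which components a molecule carries.
from dataclasses import dataclass, field
from typing import List, Optional

# NLS SEQ ID NOs recited in the text.
LISTED_NLS_SEQ_IDS = frozenset({221, 256, 304, 305})


@dataclass
class MoleculeDesign:
    building_block_seq_id: int                      # protein-based building block
    nls_seq_ids: List[int] = field(default_factory=list)
    half_life_extender: Optional[str] = None        # e.g. albumin-binding ISVD, PEG
    targeting_moieties: List[str] = field(default_factory=list)

    def has_listed_nls(self) -> bool:
        """True if at least one NLS is among the recited SEQ ID NOs."""
        return any(seq_id in LISTED_NLS_SEQ_IDS for seq_id in self.nls_seq_ids)

    def components(self) -> List[str]:
        """Names of the component classes present in this design."""
        parts = ["building block"]
        if self.nls_seq_ids:
            parts.append("NLS")
        if self.half_life_extender:
            parts.append("half-life extending moiety")
        if self.targeting_moieties:
            parts.append("targeting moiety")
        return parts


design = MoleculeDesign(
    building_block_seq_id=97,
    nls_seq_ids=[221],
    half_life_extender="albumin-binding ISVD (SEQ ID NO.: 106)",
    targeting_moieties=["GE11"],
)
print(design.components(), design.has_listed_nls())
```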
A targeting moiety, as defined herein, is any group, residue, moiety, or binding unit which is capable of being directed through its binding to a target. An amino acid sequence (such as an ISVD, an antibody, antigen-binding domains or fragments such as VHH domains or VH/VL domains, or generally an antigen binding protein or polypeptide or a fragment thereof) that "(specifically) binds", that "can (specifically) bind to", that "has affinity for" and/or that "has specificity for" a specific antigenic determinant, epitope, antigen or protein, or for a specific non-protein molecule, such as nucleic acids (such as DNA or RNA) or glycans (or for at least one part, fragment or epitope thereof) is said to be "against" or "directed against" said antigenic determinant, epitope, antigen, protein or non-protein molecule. Specific binding of an antigen-binding protein to an antigen or antigenic determinant can be determined in any suitable manner known per se, including, for example, Scatchard analysis and/or competitive binding assays, such as radio-immunoassays (RIA), enzyme immunoassays (EIA) and sandwich competition assays, and the different variants thereof known per se in the art; as well as the other techniques mentioned herein.
As described above, the protein-based carrier building block may comprise, attached to at least one attachment point or conjugation site (directly or by means of a linker), one or more cell-targeting moieties, as described in detail below. Alternatively, or additionally, the protein-based building block may comprise one or more (further) targeting moieties.
For instance, the molecule of the present technology may comprise a targeting moiety such as blood brain barrier (BBB) shuttling moieties, HM-3 integrin antagonists, cell-penetrating peptides (CPPs), short linear motifs (SLiMs) such as retinoblastoma-binding LxCxE motif or nuclear localization signals, folic acid, distearoyl, cholesterol, targeting nucleic acids and the like.
Non-limiting examples of targeting moieties which may be present in the molecule of the present technology, e.g., attached to the at least one protein-based carrier building block through the at least one attachment point or conjugation sites are the following:
Human serum albumin (HSA)-binding molecules, as described, e.g., in WO 2006/122787, WO 2011/095545, WO 2012/175400, WO 2017/085172 or WO 2018/104444, the content of which is herewith incorporated by reference. Any of the HSA-binding molecules described in these documents may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
Aggrecan-binding molecules, as described, e.g., in WO 2018/220225, the content of which is incorporated herewith by reference. Any of the aggrecan-binding molecules described in WO 2018/220225 may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
Glypican-3 (GPC3)-binding molecules, as described, e.g., in WO 2022/129560, the content of which is incorporated herewith by reference. Any of the GPC3-binding molecules described in WO 2022/129560 may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
T-cell receptor (TCR)-binding molecules, as described, e.g., in WO 2016/180969, WO 2018/091606 or WO 2022/129637. Any of the TCR-binding molecules described in these documents may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
Neonatal Fc receptor (FcRn)-binding molecules, as described, e.g., in WO 2008/074867 or WO 2009/080764. Any of the FcRn-binding molecules described in these documents may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
Polymeric immunoglobulin receptor (pIgR)-binding molecules, as described, e.g., also in WO 2008/074867 or WO 2009/080764. Any of the pIgR-binding molecules described in these documents may be incorporated in the molecule of the present technology, e.g., by attaching it (directly or via a linker, as described herein) to the protein-based carrier building block through the one or more attachment points or conjugation sites.
Interleukin-6 receptor (IL-6R)-binding molecules, as described, e.g., in WO 2010/115995 or WO 2010/115998. Any of the anti-IL-6R sequences described in WO 2010/115995 or WO 2010/115998 may be attached to or conjugated to the attachment point(s) or conjugation site(s) of the protein-based building block.
Vascular endothelial growth factor receptor 1 (VEGF-R1, also called Flt-1)-binding molecules, as described, e.g., in WO 2008/142165. Any of the VEGF-R1 sequences described in WO 2008/142165 may be attached to or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
Platelet derived growth factor receptor beta (PDGF-Rβ)-binding molecules, as described, e.g., in WO 2008/142165. Any of the PDGF-Rβ-binding sequences described in WO 2008/142165 may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
Fibroblast growth factor receptor 4 (FGF-R4)-binding molecules, as described, e.g., in WO 2008/142165. Any of the FGF-R4-binding sequences described in WO 2008/142165 may be attached to or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
Epidermal growth factor receptor (EGFR)-binding molecules, as described, e.g., in WO 2005/044858, WO 2007/042289 or WO 2016/097313. Any of the EGFR-binding sequences described in WO 2005/044858, WO 2007/042289 or WO 2016/097313 may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
In addition, EGFR-binding oligopeptide GE11 (YHWYGYTPQNVI, SEQ ID NO.: 212, Mw (Molecular weight) 1540 g/mol, IP 7.67) may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block. GE11 is a dodecapeptide with excellent EGFR affinity. Similar to human EGF or the anti-EGFR monoclonal antibody cetuximab, it actively binds the surface of EGFR-positive tumor cells. GE11 is a potentially safe and efficient targeting moiety for selective drug delivery systems mediated through EGFR (cf. Li Z. et al., "Identification and characterization of a novel peptide ligand of epidermal growth factor receptor for targeted delivery of therapeutics", FASEB J., 2005, 19(14):1978-85). Hence, GE11 can be used to bind to and internalize in EGFR+ tumor cells and can be synthesized by solid-phase peptide synthesis. For more details, see, e.g., Pola R. et al., "Targeted polymer-based probes for fluorescence guided visualization and potential surgery of EGFR-positive head-and-neck tumors", Pharmaceutics, 2020, 12(1):31 or Hailing T. et al., "Challenges for the application of EGFR-targeting peptide GE11 in tumor diagnosis and treatment", J Control Release, 2022, 349:592-605.
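The molecular weight stated above for GE11 can be cross-checked from its sequence. The sketch below sums standard average residue masses and adds one water molecule for the free termini; it assumes an unmodified peptide with free N- and C-termini and no counter-ions, which is an assumption and not a statement from this specification. The function and variable names are hypothetical.

```python
# Sketch only: average molecular mass of a linear, unmodified peptide.
# Standard average residue masses in g/mol (residue = amino acid minus water).
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def average_peptide_mass(sequence: str) -> float:
    """Sum residue masses for a one-letter sequence and add one water."""
    try:
        residues = sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence.upper())
    except KeyError as err:
        raise ValueError(f"unknown residue: {err.args[0]}") from None
    return residues + WATER_MASS


GE11 = "YHWYGYTPQNVI"  # sequence as given in the text (SEQ ID NO.: 212)
print(len(GE11), round(average_peptide_mass(GE11), 1))  # 12 residues, about 1540.7 g/mol
```

The result, about 1540.7 g/mol for the 12-residue sequence, agrees with the value of 1540 g/mol given above.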
Hence, the present technology provides molecules as defined herein which comprise at least one protein-based building block with at least one GE11 peptide attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one GE11 peptide attached to at least one attachment point or conjugation site. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, the molecule comprising at least one such ISVD-derived protein-based building block and at least one GE11 peptide attached to it through the at least one conjugation site or attachment point, does not specifically bind to any non-protein molecule and/or does not specifically bind to any non-human protein to which the ISVD precursor specifically binds.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one GE11 peptide attached to at least one attachment point or conjugation site. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one GE11 peptide attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one GE11 peptide attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS1-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
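The sequence-identity and size provisos recited for the building blocks above can be illustrated with a minimal Python sketch. It is only an approximation under stated assumptions: identity is computed without gaps over equal-length sequences (a real comparison would normally use a pairwise alignment), mass is estimated from standard average residue masses, and the function names and the toy sequence are hypothetical and do not come from the present disclosure. The provisos on globular structure, solubility and binding cannot be checked from sequence alone and are not modelled.

```python
# Standard average residue masses in daltons; one water is added per chain.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.015


def mass_kda(seq: str) -> float:
    """Approximate molecular mass of an unmodified polypeptide, in kDa."""
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


def percent_identity(a: str, b: str) -> float:
    """Ungapped identity; assumes both sequences have the same length."""
    if len(a) != len(b):
        raise ValueError("this sketch only compares equal-length sequences")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def passes_screen(candidate: str, reference: str,
                  min_identity: float = 80.0,
                  min_kda: float = 2.5, max_kda: float = 70.0) -> bool:
    """True if the candidate meets the identity and mass thresholds."""
    return (percent_identity(candidate, reference) >= min_identity
            and min_kda <= mass_kda(candidate) <= max_kda)


# Hypothetical toy sequence, used only to exercise the functions.
TOY_REFERENCE = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
print(round(mass_kda(TOY_REFERENCE), 2), passes_screen(TOY_REFERENCE, TOY_REFERENCE))
```

The narrower preferred ranges (for example about 2.5 to about 16 kDa) can be tested by passing different `min_kda` and `max_kda` values.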
In one embodiment, the molecule of the present technology comprises at least one DARPin-based building block selected from SEQ ID NO.: 199, 97 and/or 98, preferably SEQ ID NO.: 97 or 98, and at least one, preferably more than one, such as two, or, preferably, three GE11 peptides conjugated to the attachment points or conjugation sites of the DARPin-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one GE11 peptide is conjugated to the DARPin-based building block comprised in molecule ALB-1C_K27m (SEQ ID NO.: 200), ALB-3C_K27m_wl (SEQ ID NO.: 173) and/or ALB-5C_K27m (SEQ ID NO.: 174), preferably ALB-3C_K27m_wl and/or ALB-5C_K27m.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block selected from SEQ ID NO.: 80, 81 or 175, preferably SEQ ID NO.: 80 or 81, and at least one, preferably more than one, such as two, or, preferably, three GE11 peptides conjugated to the attachment points or conjugation sites of the ISVD-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one GE11 peptide is conjugated to the ISVD-based building block comprised in molecule T028100069 (SEQ ID NO.: 107), T028100070 (SEQ ID NO.: 108) and/or T028100075 (SEQ ID NO.: 176), preferably T028100069 and/or T028100070.
As described above, preferably more than one GE11 peptide is conjugated to the ISVD- and/or DARPin-derived building blocks; preferably there are 3 GE11 peptides per ISVD-based building block.
In one preferred embodiment, the EGFR-binding molecule is an EGFR-binding ISVD. Preferably, the EGFR-binding ISVD comprises or consists of SEQ ID NO.: 216:
DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSS
Human epidermal growth factor receptor 2 (HER-2 or receptor tyrosine-protein kinase erbB-2)-binding molecules, as described, e.g., in WO 2009/068625. Any of the HER-2-binding sequences described in WO 2009/068625 may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
Blood-brain barrier (BBB) delivery molecules, such as peptides (see, e.g., Sanchez-Navarro and Giralt, "Peptide shuttles for blood-brain barrier drug delivery", Pharmaceutics 2022, 14, 1874) or other carrier systems (see, e.g., Mahringer et al., "Crossing the blood-brain barrier: A review on drug delivery strategies using colloidal carrier systems", Neurochem Int., 2021, 147:105017).
HM-3 integrin antagonist, see, e.g., Li T., et al., "Albumin Fusion at the N-Terminus or C-Terminus of HM-3 Leads to Improved Pharmacokinetics and Bioactivities", Biomedicines, 2021, 9(9):1084.
Short linear motifs (SLiMs), such as retinoblastoma-binding LxCxE motif or nuclear localization signals.
Folic acid, distearoyl, cholesterol, and the like.
In one embodiment, the molecule of the present technology comprises more than one targeting moiety. The two or more targeting moieties which may be comprised in the molecule of the present technology may be the same or different. They may target the same or different epitopes in a cell. In one embodiment, they are different. For instance, the molecule of the present technology may comprise two targeting moieties which are two ISVDs different from each other. They may target the same or different epitopes. In a preferred embodiment, the molecule of the present technology comprises two targeting moieties targeting the same cell or the same epitope, but they are different from each other.
In another embodiment, the molecule comprises more than two targeting moieties, such as three targeting moieties. They may be the same or different. In one embodiment, all targeting moieties comprised in the molecule of the present technology are the same. The targeting moiety(ies) may be covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. For instance, if more than one, the targeting moieties may be each covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. For instance, the at least one protein-based building block may comprise two targeting moieties, which may be the same or different, each attached (directly or by means of a linker, as described herein) to one attachment point comprised in the protein-based building block (i.e., the protein-based building block comprises at least two attachment points for conjugation of the two targeting moieties). For instance, the at least one protein-based building block may comprise three targeting moieties, which may be the same or different, each attached (directly or by means of a linker, as described herein) to one attachment point comprised in the protein-based building block (i.e., the protein-based building block comprises at least three attachment points for conjugation of the three targeting moieties). In another embodiment, if more than one (such as two, or three, or more), the targeting moieties may be covalently linked to each other (directly, or by means of a linker, e.g., N- to C-terminal) and then, all of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site present in the protein-based building block. For instance, if two, the targeting moieties may be covalently linked to each other (N- to C-terminal), directly or by means of a linker, and then, both of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. For instance, if three, the targeting moieties may be covalently linked to each other (N- to C-terminal), directly or by means of a linker, and then, the three of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. The targeting moiety(ies) may be attached or conjugated, directly or by means of a linker, to at least one conjugation site present in the protein-based building block via genetic fusion.
Tumor-targeting moieties
The targeting moiety which may be comprised in the molecule of the present technology may preferably be a tumor-targeting moiety. In the context of the present technology, a "tumor-binding moiety" or "tumor-targeting moiety" is any molecule which can specifically bind one or more tumoral cells. The tumor-targeting moiety may comprise at least one attachment point or conjugation site, as defined herein, so that it can be covalently linked or attached to one attachment point or conjugation site present in the protein-based building block comprised in the molecule of the present technology.
The tumor-targeting moiety comprised in the molecule of the present technology is capable of specifically binding to a cell surface molecule on a target cell, such as a tumor antigen. The term "tumor antigen" as used herein may be understood as those antigens that are presented on tumor cells. These antigens can be presented on the cell surface with an extracellular part,
which is often combined with a transmembrane and cytoplasmic part of the molecule. These antigens can sometimes be presented only by tumor cells and never by a normal or healthy cell. Tumor antigens can be exclusively expressed on tumor cells or might represent a tumor-specific mutation compared to normal (non-tumoral) cells. In this case, they are called "tumor-specific antigens (TSA)". However, this will not be the case generally. More common are antigens that are presented by tumor cells and normal cells, and they are called "tumor-associated antigens (TAA)". Tumor-associated antigens can be overexpressed on tumor cells compared to normal (non-tumoral) cells or are better accessible for antibody binding in tumor cells due to the less compact structure of the tumor tissue compared to normal (non-tumoral) tissue. TAA are preferably antigens that are expressed on cells of particular tumors, but that are preferably not expressed in normal (non-tumoral) cells. Often, TAA are antigens that are normally expressed in cells only at particular points in an organism's development (such as during fetal development) and that are being inappropriately expressed in the organism at the present point of development, or are antigens not expressed in normal (non-tumoral) tissues or cells of an organ now expressing the antigen.
In an embodiment, the tumor-targeting moiety binds to a tumor antigen, preferably a tumor specific antigen (TSA).
In an embodiment, the tumor-targeting moiety binds to a tumor antigen, preferably a tumor associated antigen (TAA).
In an embodiment, said antigen is present more abundantly on a cancer cell than on a normal (non-tumoral) cell. The antigen on a target cell to which the tumor-targeting moiety comprised in the molecule of the present technology binds is preferably a tumor-associated antigen (TAA). Preferred TAAs include MART-1, carcinoembryonic antigen ("CEA"), gp100, MAGE-1, HER-2, CD20, LewisY antigens, Melanoma-associated Chondroitin Sulfate Proteoglycan (MCSP), Epidermal Growth Factor Receptor (EGFR), Fibroblast Activation Protein (FAP), CD19 and CD33. Further preferred TAAs include EGFR, HER2, CD133, Mesothelin, PSMA, Claudin, GD2, IL13Ra2, B7-H3, EGFRvIII, Tan-MUC1 (e.g., as described in Table 1 of Kembuan GJ., et al., "Targeting solid tumor antigens with chimeric receptors: cancer biology meets synthetic immunology", Trends Cancer, 2024, (4):312-331), CEA and other examples listed in Figure 2 of Kembuan GJ., et al., "Targeting solid tumor antigens with chimeric receptors: cancer biology meets synthetic immunology", Trends Cancer, 2024, (4):312-331.
Other TAA suitable as an antigen on a target cell for binding by the tumor-targeting moiety comprised in the molecule of the present technology include CD123, CD44, CLL-1, CD96, CD47, CD32, CXCR4, Tim-3 and CD25, or TAG-72, Ep-CAM, PSMA, PSA, glycolipids such as GD2 and GD3.
The TAA also includes hematopoietic differentiation antigens, i.e., glycoproteins usually associated with cluster of differentiation (CD) grouping, such as CD4, CD5, CD19, CD20, CD22, CD33, CD36, CD45, CD52, CD69 and CD147; growth factor receptors, including HER2, ErbB3 and ErbB4; cytokine receptors, including Interleukin-2 receptor gamma chain (CD132 antigen), Interleukin-10 receptor alpha chain (IL-10R-A), Interleukin-10 receptor beta chain (IL-10R-B), Interleukin-12 receptor beta-1 chain (IL-12R-beta1), Interleukin-12 receptor beta-2 chain (IL-12 receptor beta-2), Interleukin-13 receptor alpha-1 chain (IL-13R-alpha-1) (CD213a1 antigen), Interleukin-13 receptor alpha-2 chain (Interleukin-13 binding protein), Interleukin-17 receptor (IL-17 receptor), Interleukin-17B receptor (IL-17B receptor), Interleukin 21 receptor precursor (IL-21R), Interleukin-1 receptor type I (IL-1R-1) (CD121a), Interleukin-1 receptor type II (IL-1R-beta) (CDw121b), Interleukin-1 receptor antagonist protein (IL-1ra), Interleukin-2 receptor alpha chain (CD25 antigen), Interleukin-2 receptor beta chain (CD122 antigen), Interleukin-3 receptor alpha chain (IL-3R-alpha) (CD123 antigen); as well as others, such as CD30, IL23R, IGF-1R, IL5R, IgE, CD248 (endosialin), CD44v6, gpA33, Ron, Trop2, PSCA, claudin 6, claudin 18.2, CLEC12A, CD38, ephA2, c-Met, CD56, MUC16, EGFRvIII, AGS-16, CD27L, Nectin-4, SLITRK6, mesothelin, folate receptor, tissue factor, axl, glypican-3, CA9, Cripto, CD138, CD37, MUC1, CD70, gastrin releasing peptide receptor, PAP, CEACAM5, CEACAM6, CXCR7, N-cadherin, FXYD2 gamma a, CD21, CD133, Na/K-ATPase, mIgM (membrane-bound IgM), mIgA (membrane-bound IgA), Mer, Tyro2, CD120, CD95, CA 195, DR5, DR6, DcR3 and CAIX.
Accordingly, the molecule of the present technology may comprise a tumor-targeting moiety as described herein, which specifically binds to a TAA present on a tumoral cell. The TAA may also be chosen from the group consisting of Melanoma-associated Chondroitin Sulfate Proteoglycan (MCSP), Epidermal Growth Factor Receptor (EGFR), Fibroblast Activation Protein (FAP), MART-1, carcinoembryonic antigen ("CEA"), gp100, MAGE-1, HER-2, LewisY antigens, CD123, CD44, CLL-1, CD96, CD47, CD32, CXCR4, Tim-3, CD25, TAG-72, Ep-CAM, PSMA, PSA, GD2, GD3, CD4, CD5, CD19, CD20, CD22, CD33, CD36, CD45, CD52, CD147; growth factor receptors, including ErbB3 and ErbB4; cytokine receptors, including Interleukin-2 receptor gamma chain (CD132 antigen), Interleukin-10 receptor alpha chain (IL-10R-A), Interleukin-10 receptor beta chain (IL-10R-B), Interleukin-12 receptor beta-1 chain (IL-12R-beta1), Interleukin-12 receptor beta-2 chain (IL-12 receptor beta-2), Interleukin-13 receptor alpha-1 chain (IL-13R-alpha-1) (CD213a1 antigen), Interleukin-13 receptor alpha-2 chain (Interleukin-13 binding protein), Interleukin-17 receptor (IL-17 receptor), Interleukin-17B receptor (IL-17B receptor), Interleukin 21 receptor precursor (IL-21R), Interleukin-1 receptor type I (IL-1R-1) (CD121a), Interleukin-1 receptor type II (IL-1R-beta) (CDw121b), Interleukin-1 receptor antagonist protein (IL-1ra), Interleukin-2 receptor alpha chain (CD25 antigen), Interleukin-2 receptor beta chain (CD122 antigen), Interleukin-3 receptor alpha chain (IL-3R-alpha) (CD123 antigen), CD30, IL23R, IGF-1R, IL5R, IgE, CD248 (endosialin), CD44v6, gpA33, Ron, Trop2, PSCA, claudin 6, claudin 18.2, CLEC12A, CD38, ephA2, c-Met, CD56, MUC16, EGFRvIII, AGS-16, CD27L, Nectin-4, SLITRK6, mesothelin, folate receptor, tissue factor, axl, glypican-3, CA9, Cripto, CD138, CD37, MUC1, CD70, gastrin releasing peptide receptor, PAP, CEACAM5, CEACAM6, CXCR7, N-cadherin, FXYD2 gamma a, CD21, CD133, Na/K-ATPase, mIgM (membrane-bound IgM), mIgA (membrane-bound IgA), Mer, Tyro2, CD120, CD95, CA 195, DR5, DR6, DcR3 and CAIX, and related polymorphic variants and isoforms; preferably said TAA is CD20 (UniProt P11836), HER2 (UniProt P04626), EGFR, or CEACAM, polymorphic variants and/or isoforms thereof.
In a further embodiment, the tumor-targeting moiety specifically binds a tumor-associated antigen or tumor antigen, as described herein. Further examples of tumor-associated antigens and tumor antigens are HER2, EGFR, CEACAM5 and PSMA.
In another embodiment, the molecule of the present technology comprises more than one tumor-binding moiety. The two or more tumor-binding moieties comprised in the molecule of the present technology may be the same or different. In one embodiment, they are different. For instance, the molecule of the present technology may comprise two tumor-targeting moieties which are two ISVDs different from each other. They may target the same or different tumor cell. In a preferred embodiment, the molecule of the present technology comprises two tumor-binding moieties targeting the same cell, but different from each other. In one embodiment, the molecule of the present technology comprises the ISVD as defined in SEQ ID NO.: 227 and the ISVD as defined in SEQ ID NO.: 228. The tumor-binding moieties may be covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. For instance, if more than one, the tumor-binding moieties may be each covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. In another embodiment, if more than one, the tumor-binding moieties may be covalently linked to each other (directly, or by means of a linker) and then, all of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. For instance, if two, the tumor-binding moieties may be covalently linked to each other (N- to C-terminal), directly or by means of a linker, and then, both of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. See, e.g., SEQ ID NO.: 226:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPAA KRVKLD
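The N- to C-terminal arrangement of two tumor-binding moieties joined by a linker can be checked directly on the sequences. The Python sketch below is an illustration only: it removes the line-wrap whitespace from SEQ ID NO.: 226 as printed above and looks for SEQ ID NO.: 227 and SEQ ID NO.: 228 inside it. The helper names and the treatment of the intervening residues as a (GGGGS)3 linker are assumptions made for this example, not designations taken from the present disclosure.

```python
import re


def clean(seq: str) -> str:
    """Remove whitespace introduced by line wrapping."""
    return re.sub(r"\s+", "", seq)


SEQ_226 = clean("""
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT
ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG
GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK
GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS
GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV
ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPAA
KRVKLD
""")
SEQ_227 = clean("""
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT
ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSS
""")
SEQ_228 = clean("""
EVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVKGRF
TISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSS
""")
ASSUMED_LINKER = "GGGGS" * 3  # assumption: a 15-residue Gly/Ser linker


def locate(domain: str, construct: str):
    """Return the (start, end) span of domain in construct, or None."""
    start = construct.find(domain)
    return None if start < 0 else (start, start + len(domain))


span_227 = locate(SEQ_227, SEQ_226)
span_228 = locate(SEQ_228, SEQ_226)
between = SEQ_226[span_227[1]:span_228[0]]  # residues separating the two ISVDs
remainder = SEQ_226[span_228[1]:]           # everything C-terminal of SEQ ID NO.: 228

print("SEQ 227 span:", span_227)
print("SEQ 228 span:", span_228)
print("between:", between)
print("remainder length:", len(remainder))
```

The residues C-terminal of SEQ ID NO.: 228 are only reported by length here; the sketch does not attempt to annotate them.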
A "tumor targeting moiety", in the context of the present technology, may be any molecule, including proteins, peptides, small molecules, vitamins, lipids, glycans, etc., and derivatives thereof, or combinations thereof, provided that it specifically binds an antigen present in/on a tumoral cell.
The tumor targeting moiety may hence be any moiety that, when present in the molecule of the present technology, can bring the molecule of the present technology in close proximity with a tumor cell.
In an embodiment, the tumor-targeting moiety is a small molecule specifically binding to a tumor cell, such as folate.
In a preferred embodiment, said (tumor)-targeting moiety is an ISVD, as described herein. Said ISVD may be a VHH, a humanized VHH, a (single) domain antibody, a dAb, or a camelized VH.
In a preferred embodiment, the tumor-targeting moiety may be selected from:
SEQ ID NO.: 227
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT
ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSS,
SEQ ID NO.: 228
EVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVKGRF
TISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSS and
SEQ ID NO.: 216:
DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSS
Hence, in one embodiment, the tumor-targeting moiety comprised in the molecule of the present technology may specifically bind to EGFR present in the tumoral cells, or to CEACAM5 present in the tumoral cells. Hence, the tumor-targeting moiety may be an anti-EGFR ISVD or an anti-CEACAM5 ISVD, e.g., comprising or consisting of SEQ ID NO.: 216, 227 and/or 228.
In a further preferred embodiment, the molecule comprises two tumor-targeting moieties which comprise or consist of SEQ ID NO.: 227 and 228.
In one embodiment, the tumor-targeting moiety comprises or consists of an ISVD specifically binding to GPC3.
In one embodiment, the ISVD binds to human GPC3 of SEQ ID NO: 297 (Human GPC3, (P51654)):
MAGTVRTACLVVAMLLSLDFPGQAQPPPPPPDATCHQVRSFFQRLQPGLKWVPETPVPGSDLQVCLPK GPTCCSRKMEEKYQLTARLNMEQLLQSASIVIELKFLIIQNAAVFQEAFEIVVRHAKNYTNAIVIFKNNYPSLT PQAFEFVGEFFTDVSLYILGSDINVDDMVNELFDSLFPVIYTQLIVINPGLPDSALDINECLRGARRDLKVFG
NFPKLIMTQVSKSLQVTRIFLQALNLGIEVINTTDHLKFSKDCGRMLTRMWYCSYCQGLIVIIVIVKPCGGYC NVVMQGCMAGVVEIDKYWREYILSLEELVNGMYRIYDMENVLLGLFSTIHDSIQYVQKNAGKLTTTIGKL CAHSQQRQYRSAYYPEDLFIDKKVLKVAHVEHEETLSSRRRELIQKLKSFISFYSALPGYICSHSPVAENDTLC
WNGQELVERYSQKAARNGMKNQFNLHELKMKGPEPVVSQIIDKLKHINQLLRTIVISIVIPKGRVLDKNLD EEGFESGDCGDDEDECIGGSGDGMIKVKNQLRFLAELAYDLDVDDAPGNSQQATPKDNEISTFHNLGNV HSPLKLLTSMAISVVCFFFLVH
In a preferred embodiment, the targeting moiety is an EGFR-binding ISVD. Preferably, the EGFR-binding ISVD comprises at least three CDRs and four FRs, wherein the CDRs comprise or consist of the following sequences:
CDR1: GRTSRSYGMG (SEQ ID NO.: 301)
CDR2: SGISWRGDSTG (SEQ ID NO.: 302)
CDR3: AAGSAWYGTLYEYDY (SEQ ID NO.: 303)
Preferably, the EGFR-binding ISVD (VHH) comprises or consists of SEQ ID NO.: 216:
DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF
TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSS
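The relationship between the three CDRs (SEQ ID NO.: 301-303) and the full VHH of SEQ ID NO.: 216 can be made explicit by locating each CDR in the sequence and reading off the intervening stretches. The Python sketch below is a minimal illustration; the function name is hypothetical, and labelling the stretches between the CDRs as FR1 to FR4 is an assumption of this example that simply follows the "three CDRs and four FRs" description above.

```python
SEQ_216 = (
    "DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF"
    "TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSS"
)
CDRS = {
    "CDR1": "GRTSRSYGMG",       # SEQ ID NO.: 301
    "CDR2": "SGISWRGDSTG",      # SEQ ID NO.: 302
    "CDR3": "AAGSAWYGTLYEYDY",  # SEQ ID NO.: 303
}


def split_regions(seq: str, cdrs: dict) -> list:
    """Split seq into alternating (name, subsequence) pairs: FR1, CDR1, ..., FR4."""
    regions = []
    cursor = 0
    for index, name in enumerate(("CDR1", "CDR2", "CDR3"), start=1):
        start = seq.find(cdrs[name], cursor)
        if start < 0:
            raise ValueError(f"{name} not found after position {cursor}")
        regions.append((f"FR{index}", seq[cursor:start]))
        regions.append((name, cdrs[name]))
        cursor = start + len(cdrs[name])
    regions.append(("FR4", seq[cursor:]))
    return regions


regions = split_regions(SEQ_216, CDRS)
for name, subseq in regions:
    print(f"{name}: {subseq}")
```

Concatenating the seven regions in order reproduces SEQ ID NO.: 216, which is a quick consistency check on the CDR sequences as listed.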
In a preferred embodiment, the tumor-targeting moiety may be selected from a tumor-targeting ISVD specifically binding human CEACAM5, that essentially consists of 4 framework regions (FR1 to FR4, respectively) and 3 complementarity determining regions (CDR1 to CDR3, respectively), in which:
CDR1 (AbM numbering) consists of an amino acid sequence selected from: a) the amino acid sequence of GLTFSTYTMG (SEQ ID NO: 229); b) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of GLTFSTYTMG (SEQ ID NO: 229); c) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of GLTFSTYTMG (SEQ ID NO: 229); and
CDR2 (AbM numbering) consists of an amino acid sequence selected from: d) the amino acid sequence of AIIWSGSNTY (SEQ ID NO: 230); e) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of AIIWSGSNTY (SEQ ID NO: 230); f) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of AIIWSGSNTY (SEQ ID NO: 230); and
CDR3 (AbM numbering) consists of an amino acid sequence selected from: g) the amino acid sequence of QHFGPIGLTTRGYXY (SEQ ID NO: 231), wherein the amino acid residue X is selected from N, A, F, G, I, L, Y or H; h) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of QHFGPIGLTTRGYXY (SEQ ID NO: 231), wherein the amino acid residue X is selected from N, A, F, G, I, L, Y or H; i) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of QHFGPIGLTTRGYXY (SEQ ID NO: 231), wherein the amino acid residue X is selected from N, A, F, G, I, L, Y or H.
The tumor-targeting moiety may be a tumor-targeting ISVD specifically binding human CEACAM5 in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 231 (QHFGPIGLTTRGYNY); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 232 (QHFGPIGLTTRGYHY); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYAY (SEQ ID NO: 233); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYFY (SEQ ID NO: 234); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYGY (SEQ ID NO: 235); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYIY (SEQ ID NO: 236); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYLY (SEQ ID NO: 237); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of QHFGPIGLTTRGYYY (SEQ ID NO: 238).
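The CDR3 definition with a variable position X, together with the "at least 80% identity" and "3, 2, or 1 amino acid difference" alternatives, can be sketched in Python as follows. This is an illustration under stated assumptions: differences are counted as substitutions between equal-length sequences (insertions and deletions are not modelled), and the function names are hypothetical. Under that assumption, for a 15-residue CDR3 the 80% identity threshold and the limit of 3 differences select the same candidates, since 12 of 15 matching residues is exactly 80%.

```python
TEMPLATE = "QHFGPIGLTTRGYXY"  # SEQ ID NO: 231 with variable position X
ALLOWED_X = "NAFGILYH"        # residues permitted at X, as recited above


def expand(template: str, allowed: str) -> list:
    """Enumerate the concrete sequences encoded by a single-X template."""
    return [template.replace("X", aa) for aa in allowed]


def hamming(a: str, b: str) -> int:
    """Number of substituted positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def evaluate_cdr3(candidate: str) -> dict:
    """Compare a candidate CDR3 with the closest variant of the template."""
    variants = [v for v in expand(TEMPLATE, ALLOWED_X) if len(v) == len(candidate)]
    if not variants:
        return {"exact": False, "differences": None, "identity": 0.0, "qualifies": False}
    differences = min(hamming(candidate, v) for v in variants)
    identity = 100.0 * (len(candidate) - differences) / len(candidate)
    return {
        "exact": differences == 0,
        "differences": differences,
        "identity": identity,
        "qualifies": differences <= 3 or identity >= 80.0,
    }


print(expand(TEMPLATE, ALLOWED_X))
print(evaluate_cdr3("QHFGPIGLTTRGYHY"))  # one of the listed variants
print(evaluate_cdr3("QHFGPIGLTTRGYKY"))  # hypothetical candidate, K at X
```

The second call uses a made-up candidate to show a sequence that is not one of the eight enumerated variants but still falls within one amino acid difference of them.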
In a preferred embodiment, the tumor targeting moiety may be a tumor-targeting ISVD specifically binding human CEACAM5 in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 229;
- CDR2 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 230; and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 232 (QHFGPIGLTTRGYHY).
In another preferred embodiment, the tumor-targeting moiety may be selected from a tumor-targeting ISVD specifically binding human CEACAM5, that essentially consists of 4 framework regions (FR1 to FR4, respectively) and 3 complementarity determining regions (CDR1 to CDR3, respectively), in which:
CDR1 (AbM numbering) consists of an amino acid sequence selected from: a) the amino acid sequence of GX1TFSX2YAX3G (SEQ ID NO: 239), wherein the amino acid residue X1 is selected from R or H, the amino acid residue X2 is selected from E or D and the amino acid residue X3 is selected from L or M; b) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of GX1TFSX2YAX3G (SEQ ID NO: 239), wherein the amino acid residue X1 is selected from R or H, the amino acid residue X2 is selected from E or D and the amino acid residue X3 is selected from L or M; c) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of GX1TFSX2YAX3G (SEQ ID NO: 239), wherein the amino acid residue X1 is selected from R or H, the amino acid residue X2 is selected from E or D and the amino acid residue X3 is selected from L or M; and
CDR2 (AbM numbering) consists of an amino acid sequence selected from: d) the amino acid sequence of AINWGGXWTY (SEQ ID NO: 240), wherein the amino acid residue X is selected from T, G or S; e) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of AINWGGXWTY (SEQ ID NO: 240), wherein the amino acid residue X is selected from T, G or S; f) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of AINWGGXWTY (SEQ ID NO: 240), wherein the amino acid residue X is selected from T, G or S; and
CDR3 (AbM numbering) consists of an amino acid sequence selected from: g) the amino acid sequence of SX1DYAGGX2PTGYX3Y (SEQ ID NO: 241), wherein the amino acid residue X1 is selected from S, P or L, the amino acid residue X2 is selected from N or S, the amino acid residue X3 is selected from P or A; h) amino acid sequences that have at least 80% amino acid identity with the amino acid sequence of SX1DYAGGX2PTGYX3Y (SEQ ID NO: 241), wherein the amino acid residue X1 is selected from S, P or L, the amino acid residue X2 is selected from N or S, the amino acid residue X3 is selected from P or A; i) amino acid sequences that have 3, 2, or 1 amino acid difference with the amino acid sequence of SX1DYAGGX2PTGYX3Y (SEQ ID NO: 241), wherein the amino acid residue X1 is selected from S, P or L, the amino acid residue X2 is selected from N or S, the amino acid residue X3 is selected from P or A.
The tumor targeting moiety may be a tumor-targeting ISVD specifically binding human CEACAM5 in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 242 (GHTFSEYALG);
- CDR2 (AbM numbering) consists of one of the amino acid sequences of SEQ ID NO: 243 (AINWGGGWTY); and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 244 (SSDYAGGNPTGYPY), or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 251 (GRTFSDYAMG);
- CDR2 (AbM numbering) consists of one of the amino acid sequences of SEQ ID NO: 252 (AINWGGTWTY); and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 253 (SLDYAGGSPTGYAY); or in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 251;
- CDR2 (AbM numbering) consists of one of the amino acid sequences of SEQ ID NO: 254 (AINWGGSWTY); and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 255 (SPDYAGGNPTGYAY).
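The generic CDR definitions with several variable positions (SEQ ID NO: 239-241) map naturally onto regular expressions in which each X position becomes a character class. The Python sketch below, offered as an illustration only, checks the specific CDR sequences listed above against those patterns; the dictionary and function names are hypothetical, and only the exact-motif alternatives (a, d and g) are covered, not the identity or amino-acid-difference alternatives.

```python
import re

# Each X position of the generic sequence becomes a character class.
MOTIFS = {
    "CDR1": re.compile(r"G[RH]TFS[ED]YA[LM]G"),       # GX1TFSX2YAX3G, SEQ ID NO: 239
    "CDR2": re.compile(r"AINWGG[TGS]WTY"),            # AINWGGXWTY, SEQ ID NO: 240
    "CDR3": re.compile(r"S[SPL]DYAGG[NS]PTGY[PA]Y"),  # SX1DYAGGX2PTGYX3Y, SEQ ID NO: 241
}

# Specific CDRs recited above, keyed by SEQ ID NO.
LISTED = {
    242: ("CDR1", "GHTFSEYALG"),
    243: ("CDR2", "AINWGGGWTY"),
    244: ("CDR3", "SSDYAGGNPTGYPY"),
    251: ("CDR1", "GRTFSDYAMG"),
    252: ("CDR2", "AINWGGTWTY"),
    253: ("CDR3", "SLDYAGGSPTGYAY"),
    254: ("CDR2", "AINWGGSWTY"),
    255: ("CDR3", "SPDYAGGNPTGYAY"),
}


def matches_motif(region: str, sequence: str) -> bool:
    """True if the whole sequence fits the generic motif for that CDR."""
    return MOTIFS[region].fullmatch(sequence) is not None


for seq_id, (region, sequence) in LISTED.items():
    print(seq_id, region, sequence, matches_motif(region, sequence))
```

Using `fullmatch` rather than `search` ensures that a longer sequence merely containing the motif is not counted as the CDR itself.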
In a further preferred embodiment, the tumor targeting moiety may be a tumor-targeting ISVD specifically binding human CEACAM5 in which
- CDR1 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 242;
- CDR2 (AbM numbering) consists of one of the amino acid sequences of SEQ ID NO: 243; and
- CDR3 (AbM numbering) consists of the amino acid sequence of SEQ ID NO: 244.
In a preferred embodiment, the tumor-targeting moiety may be selected from:
SEQ ID NO.: 227 (A031500384 (E1D), T028501789 (E1D)):
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT
ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSS
and
SEQ ID NO.: 228 (A031500099, T028501817):
EVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVKGRF
TISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSS
Table 17: Sequences for CDRs according to AbM numbering and frameworks ("ID" refers to the given SEQ ID NO)
In a further preferred embodiment, the molecule comprises two tumor-targeting moieties which are two ISVDs specifically binding human CEACAM5, as defined above. In another preferred embodiment the molecule comprises two tumor-targeting moieties which comprise or consist of SEQ ID NO.: 227 and 228.
In one embodiment, the molecule of the present technology comprises more than one tumor-targeting moiety. The two or more tumor-targeting moieties which may be comprised in the molecule of the present technology may be the same or different. They may target the same or different epitopes in a cell. In one embodiment, they are different. For instance, the molecule of the present technology may comprise two tumor-targeting moieties which are two ISVDs different from each other. They may target the same or different epitopes. In a preferred embodiment, the molecule of the present technology comprises two tumor-targeting moieties targeting the same cell or the same epitope, but they are different from each other.
In another embodiment, the molecule comprises more than two tumor-targeting moieties, such as three targeting moieties. They may be the same or different. In one embodiment, all targeting moieties comprised in the molecule of the present technology are the same. The tumor-targeting moiety(ies) may be covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. For instance, if more than one, the tumor-targeting moieties may be each covalently linked to the at least one protein-based building block comprised in the molecule of the present technology directly or by means of a linker, as described herein. For instance, the at least one protein-based building block may comprise two tumor-targeting moieties, which may be the same or different, each attached (directly or by means of a linker, as described herein) to one attachment point comprised in the protein-based building block (i.e., the protein-based building block comprises at least two attachment points for conjugation of the two tumor-targeting moieties). For instance, the at least one protein-based building block may comprise three tumor-targeting moieties, which may be the same or different, each attached (directly or by means of a linker, as described herein) to one attachment point comprised in the protein-based building block (i.e., the protein-based building block comprises at least three attachment points for conjugation of the three tumor-targeting moieties). In another embodiment, if more than one (such as two, or three, or more), the tumor-targeting moieties may be covalently linked to each other (directly, or by means of a linker, e.g., N- to C-terminal) and then, all of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site present in the protein-based building block. For instance, if two, the tumor-targeting moieties may be covalently linked to each other (N- to C-terminal), directly or by means of a linker, and then, both of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. For instance, if three, the tumor-targeting moieties may be covalently linked to each other (N- to C-terminal), directly or by means of a linker, and then, the three of them, covalently linked (directly or by means of a linker) to the at least one protein-based building block comprised in the molecule of the present technology, via one single attachment point or conjugation site. The tumor-targeting moiety(ies) may be attached or conjugated, directly or by means of a linker, to at least one conjugation site present in the protein-based building block via genetic fusion.
Therapeutic moieties or precursors therefrom
As described above, the protein-based carrier building block comprised in the molecule of the present technology may have attached or conjugated, via one or more conjugation sites or attachment points, one or more other groups, residues, moieties or binding units, optionally linked via one or more peptidic linkers, in which said one or more other groups, residues, moieties or binding units are capable of exerting a therapeutic activity in the animal or human body ("therapeutic moiety or precursor therefrom"). For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more therapeutic moieties or precursors therefrom attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, and (iv) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, and (iv) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, (iv) a targeting moiety, and (v) a therapeutic moiety as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, (iv) a targeting moiety, and (v) a therapeutic moiety as described herein.
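The eight formats listed above follow one pattern: the protein-based building block, the NLS and the therapeutic moiety are always present, the half-life extending moiety and the targeting moiety are each optional, and the NLS is either unrestricted or selected from SEQ ID NO.: 221, 256, 304 and 305. The following is a minimal illustrative sketch of that enumeration, not part of the disclosure; the function name `enumerate_formats` and the dictionary layout are hypothetical and chosen only for this illustration.

```python
from itertools import product

# Components that appear in every format listed in the text.
REQUIRED = ("protein-based building block", "nuclear localization sequence (NLS)")
# Components that appear in only some of the listed formats.
OPTIONAL = ("half-life extending moiety", "targeting moiety")
# The payload named last in every listed format.
PAYLOAD = "therapeutic moiety"
# SEQ ID NOs from which the NLS may be selected in the restricted variants.
NLS_SEQ_IDS = (221, 256, 304, 305)


def enumerate_formats():
    """Return every combination of optional components, with and without
    the restriction of the NLS to the listed SEQ ID NOs (hypothetical helper)."""
    formats = []
    for include in product((False, True), repeat=len(OPTIONAL)):
        chosen = [name for name, keep in zip(OPTIONAL, include) if keep]
        for nls_restricted in (False, True):
            formats.append(
                {
                    "components": [*REQUIRED, *chosen, PAYLOAD],
                    "nls_seq_ids": NLS_SEQ_IDS if nls_restricted else None,
                }
            )
    return formats


if __name__ == "__main__":
    for fmt in enumerate_formats():
        print(fmt["components"], fmt["nls_seq_ids"])
```

The same pattern could be reused for the imaging and toxic moiety formats by changing `PAYLOAD` and `OPTIONAL`.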
A therapeutic moiety, as defined herein, is any group, residue, moiety, or binding unit which is capable of exerting a therapeutic activity in the animal and/or human body. The therapeutic moiety may also be in the form of a precursor, which then gets activated to exert its therapeutic activity. For instance, a therapeutic moiety according to the present technology may be any therapeutic agent such as a drug, protein, peptide, gene, compound or any other pharmaceutically active ingredient which may be used for the treatment and/or prevention of a certain disease condition. For instance, a therapeutic moiety may be a therapeutic antibody, or a therapeutic ISVD.
Non-limiting examples of therapeutic moieties which may be present in the molecule of the present technology, e.g., attached to the at least one protein-based carrier building block through the at least one attachment point or conjugation site, are those which specifically target the cell nucleus and which often focus on modulating gene expression, such as gene editing tools (e.g., CRISPR-Cas9); antisense oligonucleotides (ASOs), which bind to complementary mRNA sequences in the nucleus, affecting gene expression; chemotherapeutic agents such as topoisomerase inhibitors (e.g., etoposide); and histone deacetylase (HDAC) inhibitors, which affect histone acetylation levels, also influencing gene expression. Further non-limiting examples of therapeutic moieties which may be present in the molecule of the present technology, e.g., attached to the at least one protein-based carrier building block through the at least one attachment point or conjugation site, are cell cycle blockers such as mimosine, ciclopirox and deferoxamine (G1/S blockers, as described, e.g., in Farinelli SE and Greene LA, "Cell cycle blockers mimosine, ciclopirox, and deferoxamine prevent the death of PC12 cells and postmitotic sympathetic neurons after removal of trophic support", J Neurosci., 1996, 16(3):1150-62) or mitotic inhibitors such as paclitaxel and vinblastine (and other mitotic inhibitors as listed in the "List of Mitotic inhibitors - Drugs.com", https://www.drugs.com/drug-class/mitotic-inhibitors.html).
Hence, the present technology provides molecules as defined herein which comprise at least one protein-based building block with at least one therapeutic molecule attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one toxic molecule, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
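The sequence identity threshold (80% or more) and the molecular mass window (about 2.5 to about 70 kDa) recited above are the two criteria that can be screened numerically. The following is a minimal illustrative sketch under stated assumptions: the two sequences are already aligned to equal length with "-" as the gap character, identity is counted over aligned columns, and "about" is treated as a hard bound. None of these conventions is taken from the present disclosure, and the function names are hypothetical. The remaining criteria (globular 3D structure, solubility, absence of specific binding) require experimental assessment and are not modelled.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Assumption: both strings have equal length and use '-' for gaps.
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment contains no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def within_size_window(mass_kda: float, low_kda: float = 2.5, high_kda: float = 70.0) -> bool:
    """True if the molecular mass lies in the window (hard bounds assumed)."""
    return low_kda <= mass_kda <= high_kda


def passes_sequence_and_size_screen(
    aligned_candidate: str,
    aligned_reference: str,
    mass_kda: float,
    min_identity: float = 80.0,
) -> bool:
    """Combine the identity threshold and the size window (hypothetical helper)."""
    identity = percent_identity(aligned_candidate, aligned_reference)
    return identity >= min_identity and within_size_window(mass_kda)
```

The narrower preferred windows (for example about 2.5 to about 16 kDa) and higher identity thresholds (85%, 90%, 95%, 97%, 99%) can be screened by passing different `low_kda`, `high_kda` and `min_identity` values.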
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one toxic molecule attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96- 98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one toxic molecule attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one building block based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one building block based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one toxic molecule (e.g., resiquimod) attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS1-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one DARPin-based building block selected from SEQ ID NO.: 199, 97 and/or 98, preferably SEQ ID NO.: 97 or 98, and at least one, preferably more than one, such as two, or, preferably, three toxic molecules (e.g., resiquimod) conjugated to the attachment points or conjugation sites of the DARPin-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one toxic molecule is conjugated to the DARPin-based building block comprised in molecule ALB-1C_K27m (SEQ ID NO.: 200), ALB-3C_K27m_wl (SEQ ID NO.: 173) and/or ALB-5C_K27m (SEQ ID NO.: 174), preferably ALB-3C_K27m_wl and/or ALB-5C_K27m.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block selected from SEQ ID NO.: 80, 81 or 175, preferably SEQ ID NO.: 80 or 81, and at least one, preferably more than one, such as two, or, preferably, three toxic molecules conjugated to the attachment points or conjugation sites of the ISVD-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one toxic molecule is conjugated to the ISVD-based building block comprised in molecule T028100069 (SEQ ID NO.: 107), T028100070 (SEQ ID NO.: 108) and/or T028100075 (SEQ ID NO.: 176), preferably T028100069 and/or T028100070.
As described above, preferably more than one therapeutic molecule is conjugated to the ISVD- and/or DARPin-derived building blocks, preferably 3 therapeutic molecules per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) 3 therapeutic molecules per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) 3 therapeutic molecules per ISVD- and/or DARPin-based building block.
Further therapeutic moieties or precursors therefrom that may be comprised in the molecule of the present technology are CDK (cyclin-dependent kinase) inhibitors. These molecules inhibit the function of CDKs, and are widely used, e.g., to treat cancers by preventing overproliferation of cancer cells. For instance, a CDK inhibitor that could be comprised in the molecule of the present technology is palbociclib (Ibrance), a CDK4/6 inhibitor approved by the US FDA in February 2015 for use in postmenopausal women with breast cancer that is estrogen receptor positive and HER2 negative. While there are multiple cyclin/CDK complexes regulating the cell cycle, CDK inhibitors targeting CDK4/6 are preferred. Bai J. et al., "Cell cycle regulation and anticancer drug discovery", Cancer Biol Med., 2017 Nov;14(4):348-362, provides an outline of some promising CDK inhibitors currently in preclinical and clinical trials that target cell cycle abnormalities in various cancers. Any of the CDK inhibitors disclosed in this review (e.g., Table 3) may be comprised in the molecule of the present technology (e.g., palbociclib, letrozole, fulvestrant, LEE011, tamoxifen, anastrozole, BYL719, exemestane, everolimus and/or abemaciclib).
Imaging moieties
As described above, the protein-based carrier building block may have attached or conjugated, via its one or more conjugation sites or attachment points one or more other groups, residues, moieties or binding units, optionally linked via one or more peptidic linkers, wherein said one or more other groups, residues, moieties or binding units are used for imaging purposes ("imaging moiety"). For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more imaging moieties attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, and (iv) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, and (iv) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, and (iv) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, and (iv) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, (iv) a therapeutic moiety, and (v) an imaging moiety, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, (iv) a therapeutic moiety, and (v) an imaging moiety, as described herein.
Examples of imaging moieties are provided in Agdeppa ED, Spilker ME. A review of imaging agent development. AAPS J. 2009 Jun;11(2):286-99. For instance, the imaging moiety present in the molecule of the present technology may be suitable for radiotherapy and for radio/fluorescence-guided cancer surgery. For instance, the imaging moiety may comprise radioactive isotopes that can be used for diagnostic and therapeutic purposes. For instance, the imaging moiety may be a contrast agent. For instance, the imaging moiety may be a non-radioactive medical isotope. For instance, the imaging moiety may include desferrioxamine (DFO), such as used for 89Zirconium-DFO-labeling. For instance, the imaging moiety may be a fluorophore such as Alexa 647 or pHAb.
Toxic moieties or drugs
As described above, the protein-based carrier building block may have attached or conjugated, via its one or more conjugation sites or attachment points one or more other groups, residues, moieties or binding units, optionally linked via one or more (e.g., cleavable or non-cleavable, peptidic or non-peptidic) linkers, wherein said one or more other groups, residues, moieties or binding units are able to impart certain toxicity to cells and/or tissues ("toxic moiety" or "drug"). For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more toxic moieties attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, and (iv) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, and (iv) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, (iv) a half-life extending moiety, and (v) a "toxic moiety" or "drug", as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, (iv) a half-life extending moiety, and (v) a "toxic moiety" or "drug", as described herein.
A toxic moiety which may be attached or conjugated to the protein-based carrier building block may belong to the "tubulin inhibitor" family (e.g., maytansinoids, auristatins, taxol derivatives) or to the "DNA-modifying agents" family (e.g., calicheamicins, duocarmycins). Toxic moieties can also be antibiotics or enzymes. For a review, see Criscitiello C. et al., "Antibody-drug conjugates in solid tumors: a look into novel targets", J Hematol Oncol, 2021, 14:20.
A special class of cytotoxic compounds (or toxic moieties) are tubulin inhibitors such as the maytansinoid DM4 and the macrolide Cryptophycin, which stop cell division. These cytotoxic compounds may be conjugated to other molecules such as antibodies. Ideally, the payload should only be released in the tumor cell and have little bystander effect. The toxins are usually attached to the antibody of interest either by stochastic conjugation to lysines (e.g., Trastuzumab-DM1, see, e.g., Sang H. et al., "Conjugation site analysis of lysine-conjugated ADCs", Methods Mol Biol., 2020, 2078:235-250) or by site-specific conjugation, the latter usually via the free thiol of endogenous cysteines, see, e.g., Nadkarni DV., "Conjugations to endogenous cysteine residues", Methods Mol Biol., 2020, 2078:37-49.
The present technology provides non-targeting protein-based building blocks comprising attachment points or conjugation sites (e.g., with engineered surface-exposed cysteines) for conjugation of payloads. If needed, the cysteine conjugation can be combined with stochastic conjugation on solvent-accessible lysines for another type of cargo (e.g., one peptide on cysteines and pHAb and Alexa Fluor on lysines, see Example 8, in particular molecule EGFR7D12-3C_hCKS1_c3-cMyc NLS, SEQ ID NO.: 215).
Hence, the present technology provides molecules as defined herein which comprise at least one protein-based building block with at least one cytotoxic compound or payload, such as a DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides molecules as defined herein which comprise (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one cytotoxic compound or payload. As such, the present technology provides molecules as defined herein which comprise (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one cytotoxic compound or payload.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221,
256, 304 and 305, and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically binds to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically binds to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID
NO.: 221, 256, 304 and 305, and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one cytotoxic compound or payload, such as a DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one cytotoxic compound or payload, such as a DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one cytotoxic compound or payload, such as a DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one building block based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one building block based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one cytotoxic compound or payload, such as DM4 and/or Cryptophycin molecule, attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS1-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one DARPin-based building block selected from SEQ ID NO.: 199, 97 and/or 98, preferably SEQ ID NO.: 97 or 98, and at least one, preferably more than one, such as two, or, preferably, three DM4 and/or three Cryptophycin molecules conjugated to the attachment points or conjugation sites of the DARPin-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one DM4 and/or at least one Cryptophycin molecule is conjugated to the DARPin-based building block comprised in molecule ALB-1C_K27m (SEQ ID NO.: 200), ALB-3C_K27m_wl (SEQ ID NO.: 173) and/or ALB-5C_K27m (SEQ ID NO.: 174), preferably ALB-3C_K27m_wl and/or ALB-5C_K27m.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block selected from SEQ ID NO.: 80, 81 or 175, preferably SEQ ID NO.: 80 or 81, and at least one, preferably more than one, such as two, or, preferably, three DM4 and/or three Cryptophycin molecules conjugated to the attachment points or conjugation sites of the ISVD-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one DM4 and/or at least one Cryptophycin molecule is conjugated to the ISVD-based building block comprised in molecule T028100069 (SEQ ID NO.: 107), T028100070 (SEQ ID NO.: 108) and/or T028100075 (SEQ ID NO.: 176), preferably T028100069 and/or T028100070.
As described above, preferably more than one DM4 and/or more than one Cryptophycin molecule is conjugated to the ISVD- and/or DARPin-derived building blocks, preferably 3 DM4 and/or 3 Cryptophycin molecules per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) 3 DM4 and/or 3 Cryptophycin molecules per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) 3 DM4 and/or 3 Cryptophycin molecules per ISVD- and/or DARPin-based building block.
Nucleic acids such as Antisense Oligonucleotides (ASOs)
The protein-based carrier building block may also have one or more nucleic acids, such as one or more ASO molecules, attached or conjugated via its one or more conjugation sites or attachment points. For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more nucleic acids, such as one, two, three, four, five, six, seven, eight, nine, ten or more ASO molecules, attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, and (iv) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, and (iv) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a targeting moiety, (iv) a half-life extending moiety, and (v) one or more nucleic acids, such as one or more ASO molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a targeting moiety, (iv) a half-life extending moiety, and (v) one or more nucleic acids, such as one or more ASO molecules, as described herein.
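The eight combinations recited above follow one pattern: a generic or SEQ ID-restricted NLS, an optional targeting moiety, an optional half-life extending moiety, and the cargo. The following minimal sketch enumerates that pattern. The function name and the shortened wording of each component are illustrative assumptions and do not replace the recitations above.

```python
from itertools import product

ROMAN = ("i", "ii", "iii", "iv", "v")


def enumerate_embodiments(cargo: str) -> list[str]:
    """List the component combinations in the order they are recited above."""
    embodiments = []
    # The last flag varies fastest, so each generic-NLS variant is
    # immediately followed by its SEQ ID-restricted counterpart.
    for with_hle, with_targeting, nls_restricted in product((False, True), repeat=3):
        parts = ["a protein-based building block"]
        nls = "at least one NLS"
        if nls_restricted:
            nls += " selected from SEQ ID NO.: 221, 256, 304 and 305"
        parts.append(nls)
        if with_targeting:
            parts.append("a targeting moiety")
        if with_hle:
            parts.append("a half-life extending moiety")
        parts.append(cargo)
        # Join the numbered components into a single recitation string.
        embodiments.append(
            "; ".join(f"({numeral}) {part}" for numeral, part in zip(ROMAN, parts))
        )
    return embodiments


for line in enumerate_embodiments("one or more nucleic acids"):
    print(line)
```

Passing a different cargo string, for example "one or more ASO molecules", yields the same eight combinations for that cargo.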
Vitamins
Vitamins are also suitable cargos to be attached to the conjugation sites or attachment points present in the at least one protein-based building block comprised in the molecule of the present technology. Non-limiting examples of vitamins are folate (folic acid), biotin, vitamin C, etc.
Folate
The folate receptor (FOLR) constitutes a useful target for tumor-specific drug delivery, primarily because it is upregulated in many different types of cancers, including those of the ovary, endometrium, lung, kidney, mesothelium, and head and neck. In normal human tissues, FOLR has a very limited distribution, mainly restricted to the kidneys, lungs, choroid plexus, and placenta. The receptors in these tissues, except the placenta, are localized on the surface facing away from the blood. These attributes make folate receptors an attractive target for efficient and selective binding. See Parashar S. et al., "clickable folic acid-rhamnose conjugate for selective binding to cancer cells", Results in Chemistry, 2022, 4:100409. Hence, the protein-based carrier building block may also have one or more folic acid (folate) molecules attached or conjugated via its one or more conjugation sites or attachment points. For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more folate molecules attached or conjugated to the at least one protein-based carrier building block.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, and (iv) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, and (iv) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) one or more folic acid (folate) molecules, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) one or more folic acid (folate) molecules, as described herein.
Hence, folic acid (folate) may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
The present technology therefore provides molecules as defined herein which comprise at least one protein-based building block with at least one folate molecule attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one folate attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one folate, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one folate attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one folate attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one folate attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one folate molecule, attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS1-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block selected from SEQ ID NO.: 81 or 175, preferably SEQ ID NO.: 81, and at least one, preferably more than one, such as two, or, preferably, three folate molecules conjugated to the attachment points or conjugation sites of the ISVD-based building blocks. Preferably, the molecule further comprises a HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one folate is conjugated to the ISVD-based building block comprised in molecule T028100070 (SEQ ID NO.: 108) and/or T028100075 (SEQ ID NO.: 176), preferably T028100070. As described above, preferably more than one folate molecule is conjugated to the ISVD-derived building blocks, preferably 3 or more folate molecules per ISVD-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) 3 or more folate molecules per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) 3 or more folate molecules per ISVD- and/or DARPin-based building block.
Toll-like receptor (TLR) agonists
Toll-like receptor (TLR) agonists may be a promising approach to the treatment of autoimmune diseases, some cancers, and bacterial and viral infections (Farooq M. et al., "Toll-like receptors as a therapeutic target in the era of immunotherapies", Front. Cell Dev. Biol., 2021). Table 1 of Farooq M. et al. provides a list of TLR-based ligands in clinical trials. The protein-based carrier building block may also have one or more TLR agonists attached or conjugated via one or more conjugation sites or attachment points. For instance, the molecule of the present technology may comprise one, two, three, four, five, six, seven, eight, nine, ten or more TLR agonists attached or conjugated to the at least one protein-based carrier building block.
Glycans
Targeted protein degradation is gaining importance as a basis for new therapeutic strategies. Lysosome-targeting chimeras (LYTACs) make use of receptors, such as the cation-independent mannose 6-phosphate receptor (CI-M6PR), to direct extracellular proteins to lysosomes. See, e.g., Ahn G. et al., "Elucidating the cellular determinants of targeted membrane protein degradation by lysosome-targeting chimeras", Science, 2023, 382(6668):eadf6249 or Stevens CM. et al., "Development of oligomeric mannose-6-phosphonate conjugates for targeted protein degradation", ACS Med Chem Lett., 2023, 14(6):719-726.
Hence, the present technology provides molecules as defined herein which comprise at least one protein-based building block with at least one glycan attached to at least one attachment point or conjugation site. Aberrant glycosylation is a common feature of many cancers, playing crucial roles in tumor development and biology. Tumor-associated carbohydrate antigens (TACAs) that have been studied for selective cancer targeting and which may be comprised in the molecule of the present technology include truncated O-glycans (Tn, TF, and sialyl-Tn antigens), gangliosides (GD2, GD3, GM2, GM3, and fucosyl-GM1), globo-series glycans (Globo-H, SSEA-3, and SSEA-4), Lewis antigens, and polysialic acid; see, e.g., Berois N. et al., "Targeting tumor glycans for cancer therapy: successes, limitations, and perspectives", Cancers, 2022, 14(3):645.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, and (iv) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, and (iv) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) at least one glycan, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) at least one glycan, as described herein.
Hence, the glycan may be attached or conjugated (directly or via a linker, as described herein) to the attachment point(s) or conjugation site(s) of the protein-based building block.
The present technology therefore provides molecules as defined herein which comprise at least one protein-based building block with at least one glycan molecule attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan, attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one glycan attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, as described in detail above and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one glycan, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one glycan attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one glycan, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one glycan, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one glycan molecule, attached to at least one attachment point or conjugation site. For instance, the building block may be a CKS1-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one ISVD-based building block selected from SEQ ID NO.: 80, 81 or 175, preferably SEQ ID NO.: 80 or 81, and at least one, preferably more than one, such as two, or, preferably, three or more glycans, conjugated to the attachment points or conjugation sites of the ISVD-based building blocks. Preferably, the molecule further comprises an HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one glycan, preferably an M6P or bisM6P molecule, is conjugated to the ISVD-based building block comprised in molecule T028100069 (SEQ ID NO.: 107), T028100070 (SEQ ID NO.: 108) and/or T028100075 (SEQ ID NO.: 176), preferably T028100069 and/or T028100070. As described above, there is preferably more than one glycan, preferably M6P or bisM6P, conjugated to the ISVD-derived building blocks, preferably 3 or more glycans, preferably M6P or bisM6P, per ISVD-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) 3 or more glycans, preferably M6P or bisM6P, per ISVD- and/or DARPin-based building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD- and/or DARPin-derived building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) 3 or more glycans, preferably M6P or bisM6P, per ISVD- and/or DARPin-based building block.
Lipids
The conjugation of lipids to drug-comprising molecules may have several advantages, such as increased lipophilicity, changes in other drug properties, improved targeting to the lymphatic system, enhanced tumour targeting, enhanced cell internalization and reduced toxicity; see, e.g., Irby, D. et al., "Lipid-drug conjugate for enhancing drug delivery", Mol. Pharmaceutics, 2017, 14(5):1325-1338.
The present technology further provides molecules as defined herein which comprise at least one protein-based building block with at least one lipid attached to at least one attachment point or conjugation site.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, and (iv) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, and (iv) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a half-life extending moiety, and (iv) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a half-life extending moiety, and (iv) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS), (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) at least one lipid, as described herein.
As such, the molecule of the present technology may comprise (i) a protein-based building block, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, (iii) a therapeutic moiety, (iv) a half-life extending moiety, and (v) at least one lipid, as described herein.
The lipid is preferably a fatty acid. The at least one lipid may be a short-chain fatty acid, a medium-chain lipid or a lipid with larger chain. The lipid may be saturated, unsaturated or partially saturated. The lipid may be branched or unbranched. For instance, the lipid may be selected from:
Hexanoic acid, which is the carboxylic acid derived from hexane with the chemical formula CH3(CH2)4COOH, comprising 6 C atoms and a carboxylic head group;
Undecanoic acid, which is a carboxylic acid with chemical formula CH3(CH2)9COOH.
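Both acids listed above follow the straight-chain pattern CH3(CH2)nCOOH, with n = 4 for hexanoic acid and n = 9 for undecanoic acid. The following is a minimal illustrative sketch, not part of the described technology, showing how the carbon count and approximate molar mass follow from n. The function name `straight_chain_fatty_acid` is hypothetical, and the atomic masses are standard average values, rounded.

```python
# Average atomic masses in g/mol (standard values, rounded).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def straight_chain_fatty_acid(n_methylene: int) -> dict:
    """Describe CH3(CH2)nCOOH for a given number n of CH2 groups.

    Hypothetical helper for illustration only.
    """
    if n_methylene < 0:
        raise ValueError("n_methylene must be non-negative")
    carbons = n_methylene + 2              # CH3 + n x CH2 + COOH
    hydrogens = 3 + 2 * n_methylene + 1    # CH3 + n x CH2 + acidic H
    oxygens = 2                            # carboxylic head group
    mass = (carbons * ATOMIC_MASS["C"]
            + hydrogens * ATOMIC_MASS["H"]
            + oxygens * ATOMIC_MASS["O"])
    return {
        "formula": f"CH3(CH2){n_methylene}COOH",
        "carbons": carbons,
        "molar_mass": round(mass, 2),
    }


hexanoic = straight_chain_fatty_acid(4)    # 6 C atoms
undecanoic = straight_chain_fatty_acid(9)  # 11 C atoms
print(hexanoic)
print(undecanoic)
```

The same helper covers other unbranched saturated fatty acids; branched or unsaturated lipids, which the text also allows, would need a different hydrogen count.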
For instance, the present technology provides one molecule as described herein which comprises at least one ISVD-based building block, as defined herein, with at least one lipid (preferably fatty acid) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one lipid (preferably fatty acid), attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one ISVD-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS) selected from SEQ ID NO.: 221, 256, 304 and 305, and (iii) at least one lipid (preferably fatty acid), attached to at least one attachment point or conjugation site. For instance, the ISVD-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, or a sequence which has 80% or more identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 80-95, 175, 185, 186, 206, 222-225, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, preferably does not specifically bind to any non-human protein to which it originally bound, such as bacterial and/or viral proteins, and/or preferably does not specifically bind to any non-protein molecule to which it originally bound, if any, all as described in detail above. Preferably, as described above, at least one ISVD-derived protein-based building block, preferably when conjugated to at least one lipid, through the at least one conjugation site or attachment point, comprised in the molecule of the present technology, does not specifically bind to any target, such as protein and/or non-protein molecules, including biomolecules, to which the ISVD precursor specifically binds.
For instance, the present technology provides one molecule as described herein which comprises at least one DARPin-based building block, as defined herein, with at least one lipid (preferably fatty acid) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one DARPin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one lipid (preferably fatty acid), attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305. For instance, the DARPin-based building block comprises or, alternatively, consists of a building block selected from SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 96-98, 181, 182, 188, 189, 199 or 208, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein, in particular it does not specifically bind human KRAS protein, as described in detail above.
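The two numerical provisos recited above (a minimum sequence identity and a molecular-mass window) can be screened computationally before any wet-lab work. The following is a minimal sketch under stated assumptions: identity is taken as the fraction of identical positions over a pre-aligned, equal-length pair, which is only one common convention and may differ from the definition given elsewhere in this document; the sequences are made-up toy strings, not any SEQ ID NO.; and the function names are hypothetical. The remaining provisos (globular structure, solubility, absence of specific binding) are experimental properties and are not modelled.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical positions over two pre-aligned sequences.

    Assumption: both inputs have the same length and use '-' for gaps.
    A gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


def passes_numeric_screen(aligned_ref: str, aligned_candidate: str,
                          mass_kda: float, min_identity: float = 80.0,
                          mass_window: tuple = (2.5, 70.0)) -> bool:
    """Check the identity threshold and the mass window only."""
    low, high = mass_window
    identity = percent_identity(aligned_ref, aligned_candidate)
    return identity >= min_identity and low <= mass_kda <= high


# Toy, hypothetical sequences for illustration only.
reference = "MKTAYIAKQR"
close_variant = "MKTAYIAKQC"     # one substitution
distant_variant = "MKTCCCAKQC"   # four substitutions

print(percent_identity(reference, close_variant))
print(passes_numeric_screen(reference, close_variant, mass_kda=14.0))
print(passes_numeric_screen(reference, distant_variant, mass_kda=14.0))
```

Narrower embodiments can be checked by passing, for example, `min_identity=95.0` or `mass_window=(2.5, 16.0)`. The word "about" in the text leaves the window edges fuzzy, which a hard comparison like this one does not capture.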
For instance, the present technology provides one molecule as described herein which comprises at least one affitin-based building block and/or at least one affibody-based building block, as defined herein, with at least one lipid (preferably fatty acid) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one affitin-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one lipid (preferably fatty acid), attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
For instance, the present technology provides one molecule as described herein which comprises at least one building block based on a small globular protein, such as CKS1, as defined herein, with at least one lipid (preferably fatty acid) attached to at least one attachment point or conjugation site. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, based on a small globular protein, such as CKS1, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) at least one lipid (preferably fatty acid), attached to at least one attachment point or conjugation site. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305. For instance, the building block may be a CKS-derived building block selected from SEQ ID NO.: 99-105, 191, 192 and 205, or a sequence which has 80% or more identity with SEQ ID NO.: 99-105, 191, 192 and 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 99-105, 191, 192 and 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
In one embodiment, the molecule of the present technology comprises at least one building block selected from SEQ ID NO.: 86 (ISVD-derived, with 6 Cys) and SEQ ID NO: 101 (CKS-derived, comprising 3 Cys), and at least one, preferably more than one, such as two, or, preferably, three lipids (preferably fatty acids) conjugated to the attachment points or conjugation sites of the building blocks. For example, the molecule may further comprise an HLE moiety, such as an albumin-binding ISVD, e.g., SEQ ID NO.: 106. In one embodiment, the at least one lipid (preferably fatty acid, even more preferably short-chain fatty acid) is conjugated to the ISVD-based building block comprised in molecule T028100078 (SEQ ID NO.: 113). As described above, there is preferably more than one lipid (preferably fatty acid, even more preferably short-chain fatty acid) conjugated to the building blocks, preferably 3 lipid (preferably fatty acid) molecules per building block. As such, the present technology provides one molecule as described herein which comprises (i) at least one protein-based building block, as defined herein, (ii) at least one nuclear localization sequence (NLS), and (iii) 3 or more lipid (preferably fatty acid) molecules per protein-based building block. The (ii) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
NUCLEIC ACID MOLECULES
The present technology also provides a nucleic acid molecule encoding the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology.
A "nucleic acid molecule" (used interchangeably with "nucleic acid") is a chain of nucleotide monomers linked to each other via a phosphate backbone to form a nucleotide sequence. A nucleic acid may be used to transform/transfect a host cell or host organism, e.g. for expression and/or production of a polypeptide. Suitable (non-human) hosts or host cells for production purposes will be clear to the skilled person, and may for example be any suitable fungal, prokaryotic or eukaryotic cell or cell line or any suitable fungal, prokaryotic or eukaryotic organism. A host or host cell comprising a nucleic acid encoding the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology is also encompassed by the present technology.
A nucleic acid may be, for example, DNA, RNA, or a hybrid thereof, and may also comprise (e.g., chemically) modified nucleotides, like PNA. It can be single- or double-stranded. In one embodiment, it is in the form of double-stranded DNA. For example, the nucleotide sequences of the present technology may be genomic DNA or cDNA.
The nucleic acids of the present technology can be prepared or obtained in a manner known per se, and/or can be isolated from a suitable natural source. Nucleotide sequences encoding naturally occurring (poly)peptides can for example be subjected to site-directed mutagenesis, so as to provide a nucleic acid molecule encoding a polypeptide with sequence variation. Also, as will be clear to the skilled person, to prepare a nucleic acid, several nucleotide sequences, such as at least one nucleotide sequence encoding a targeting moiety and, for example, nucleic acids encoding one or more linkers, can be linked together in a suitable manner.
Techniques for generating nucleic acids will be clear to the skilled person and may for instance include, but are not limited to, automated DNA synthesis; site-directed mutagenesis; combining two or more naturally occurring and/or synthetic sequences (or two or more parts thereof), introduction of mutations that lead to the expression of a truncated expression product; introduction of one or more restriction sites (e.g. to create cassettes and/or regions that may easily be digested and/or ligated using suitable restriction enzymes), and/or the introduction of mutations by means of a PCR reaction using one or more "mismatched" primers.
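As an illustration of site-directed mutagenesis at the sequence-design stage, the sketch below swaps one codon in a coding sequence and translates the result with the standard genetic code. It is a toy example under stated assumptions: the open reading frame `toy_gene` is made up and does not correspond to any SEQ ID NO., the function names are hypothetical, and the choice of a serine-to-cysteine substitution (to create a thiol that could serve as a conjugation site) is only an example of the kind of change a designer might make.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0 up to the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def substitute_codon(dna: str, residue_number: int, new_codon: str) -> str:
    """Replace the codon of a 1-based residue position with new_codon."""
    if new_codon not in CODON_TABLE:
        raise ValueError("new_codon must be a valid DNA codon")
    start = 3 * (residue_number - 1)
    if residue_number < 1 or start + 3 > len(dna):
        raise ValueError("residue_number is outside the coding sequence")
    return dna[:start] + new_codon + dna[start + 3:]


toy_gene = "ATGGCTAGCAAATAA"                   # hypothetical ORF: M-A-S-K-stop
mutant = substitute_codon(toy_gene, 3, "TGC")  # Ser3 -> Cys

print(translate(toy_gene))
print(translate(mutant))
```

In practice the designed change would then be introduced by one of the techniques named above, for instance PCR with "mismatched" primers or automated DNA synthesis.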
VECTORS
Also provided is a vector comprising the nucleic acid molecule encoding the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology. A vector as used herein is a vehicle suitable for carrying genetic material into a cell. A vector includes naked nucleic acids, such as plasmids or mRNAs, or nucleic acids embedded into a bigger structure, such as liposomes or viral vectors.
In some embodiments, vectors comprise at least one nucleic acid that is optionally linked to one or more regulatory elements, such as for example one or more suitable promoter(s), enhancer(s), terminator(s), etc. In one embodiment, the vector is an expression vector, i.e. a vector suitable for expressing an encoded polypeptide or construct under suitable conditions, e.g. when the vector is introduced into a (e.g. human) cell. DNA-based vectors include elements for transcription (e.g., a promoter and a polyA signal) and translation (e.g., Kozak sequence).
In one embodiment, in the vector, said at least one nucleic acid and said regulatory elements are "operably linked" to each other, by which is generally meant that they are in a functional relationship with each other. For instance, a promoter is considered "operably linked" to a coding sequence if said promoter is able to initiate or otherwise control/regulate the transcription and/or the expression of a coding sequence (in which said coding sequence should be understood as being "under the control of" said promoter). Generally, when two nucleotide sequences are operably linked, they will be in the same orientation and usually also in the same reading frame. They will usually also be essentially contiguous, although this may also not be required.
In one embodiment, any regulatory elements of the vector are such that they are capable of providing their intended biological function in the intended host cell or host organism.
For instance, a promoter, enhancer or terminator should be "operable" in the intended host cell or host organism, by which is meant that for example said promoter should be capable of initiating or otherwise controlling/regulating the transcription and/or the expression of a nucleotide sequence - e.g. a coding sequence - to which it is operably linked.
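Whether a promoter is operably linked to a coding sequence is a functional question that only an expression experiment in the intended host can answer. At the vector-design stage, however, the usual positional hallmarks mentioned above (same orientation, promoter upstream, essentially contiguous) can be checked on an annotated map. The following is a crude sketch of such a positional proxy, not a test of function; the `Element` record, the `max_gap` threshold of 200 bp and the coordinates are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    kind: str     # e.g. "promoter" or "cds"
    start: int    # 0-based start coordinate on the vector
    end: int      # end coordinate (exclusive)
    strand: int   # +1 (forward) or -1 (reverse)


def looks_operably_linked(promoter: Element, cds: Element,
                          max_gap: int = 200) -> bool:
    """Positional heuristic: same strand, promoter upstream, small gap.

    max_gap is an arbitrary illustrative threshold; the text notes that
    contiguity is usual but not strictly required.
    """
    if promoter.strand != cds.strand:
        return False
    if promoter.strand == 1:
        gap = cds.start - promoter.end    # promoter lies to the left
    else:
        gap = promoter.start - cds.end    # promoter lies to the right
    return 0 <= gap <= max_gap


promoter = Element("promoter", start=100, end=600, strand=1)
carrier_cds = Element("cds", start=650, end=1100, strand=1)
flipped_cds = Element("cds", start=650, end=1100, strand=-1)

print(looks_operably_linked(promoter, carrier_cds))
print(looks_operably_linked(promoter, flipped_cds))
```

A map that fails this check is not necessarily non-functional (enhancers, for example, act at a distance), so the result is best read as a prompt for manual review.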
COMPOSITIONS
The present technology also provides a composition comprising the protein-based carrier building block and/or the molecule of the present technology. The composition may be a pharmaceutical composition. The composition may further comprise at least one pharmaceutically acceptable carrier, diluent or excipient and/or adjuvant, and optionally comprise one or more further pharmaceutically active polypeptides and/or compounds.
HOST ORGANISMS
The present technology also pertains to host cells or host organisms expressing the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology, comprising the nucleic acid encoding the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology, and/or the vector comprising the nucleic acid molecule encoding the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology.
In one embodiment, the host is a non-human host. Suitable host cells or host organisms are clear to the skilled person, and are for example any suitable fungal, prokaryotic or eukaryotic cell or cell line or any suitable fungal, prokaryotic or eukaryotic organism. Specific examples include HEK293 cells, CHO cells, Escherichia coli or Komagataella phaffii (Pichia pastoris; see Bernauer L., et al., "Komagataella phaffii as emerging model organism in fundamental research", Front. Microbiol., 2021, 11:1-16). In one embodiment, the host is Komagataella phaffii (Pichia pastoris). In another embodiment, the host is Escherichia coli. Of course, cell-free systems may also be employed to produce the protein-based carrier building block and/or the molecule of the present technology, as reviewed, for instance, in Gregorio NE, Levine MZ, Oza JP, "A user's guide to cell-free protein synthesis", Methods Protoc. 2019, 2(1):24.
METHODS AND USES OF THE MOLECULE
The present technology also provides a method for producing the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology. The method may comprise transforming/transfecting a host cell or host organism with a nucleic acid encoding the at least one protein-based carrier building block and/or the molecule (or part of the molecule), expressing the at least one protein-based carrier building block and/or the molecule (or part of the molecule) in the host, optionally followed by one or more isolation and/or purification steps. Specifically, the method may comprise: a) expressing, in a suitable host cell or host organism or in another suitable expression system, a nucleic acid sequence encoding the at least one protein-based carrier building block and/or the molecule (or part of the molecule); optionally followed by: b) isolating and/or purifying the at least one protein-based carrier building block and/or the molecule (or part of the molecule).
During expression in any suitable expression system, capping agents may be used in order to cap the at least one attachment point or conjugation site present in the protein-based carrier building block. For instance, if the at least one protein-based carrier building block comprises one or more -SH groups as attachment points or conjugation sites, cysteamine may be added during expression, in order to cap any free thiol group. Of course, the cap must be removed before attachment or conjugation of the cargo (e.g., with TCEP if the attachment point was capped with cysteamine).
For instance, the protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology may encompass a Protein A binding building block, so that the protein-based carrier building block and/or the molecule (or part of the molecule) can be easily purified with Protein A chromatography after expression. Hence, Protein A chromatography can be employed to purify the protein-based carrier building block and/or the molecule (or part of the molecule). Further purification steps such as size exclusion chromatography (SEC) or ultrafiltration and/or ion-exchange chromatography may be applied in order to purify the protein-based carrier building block and/or the molecule (or part of the molecule).
To produce/obtain the at least one protein-based carrier building block and/or the molecule (or part of the molecule) of the present technology, both in genetic fusion or as a single polypeptide, the host cell or host organism or cell free system may generally be kept, maintained and/or cultured under conditions such that the (desired) protein-based carrier building block and/or molecule (or part of the molecule) of the technology is optimally expressed/produced. Suitable conditions will be clear to the skilled person and will usually depend upon the host cell/host organism or cell free system used, as well as on the regulatory elements that control the expression of the protein-based carrier building block or molecule (or part of the molecule) of the present technology.
Suitable host cells or host organisms for production purposes will be clear to the skilled person, and may for example be any suitable fungal, prokaryotic or eukaryotic cell or cell line or any suitable fungal, prokaryotic or eukaryotic organism. Specific examples include HEK293 cells, CHO cells, Escherichia coli or Komagataella phaffii (Pichia pastoris). In one embodiment, the host is Komagataella phaffii (Pichia pastoris). In another embodiment, the host is Escherichia coli.
Hence, the at least one protein-based building block and/or the molecule (or part of the molecule) of the present technology can be encoded in a nucleic acid molecule, optionally as part of an expression vector, and expressed and produced recombinantly, as described above.
In one embodiment, the molecule of the present technology comprises more than one protein-based carrier building block as defined above. For instance, the molecule of the present technology may comprise 2, 3 or more carrier building blocks as defined above. The more than one protein-based carrier building block can be encoded in a single nucleic acid molecule, optionally as part of an expression vector, and expressed and produced recombinantly, as described above.
In addition to the at least one protein-based carrier building block, the molecule of the present technology may further comprise one or more other groups, residues, moieties or binding units, in which said one or more other groups, residues, moieties or binding units provide the molecule with several functionalities, such as binding specificity (e.g., by the presence of a targeting moiety in the molecule of the present technology), increased (in vivo) half-life (e.g., by the presence of a half-life extending moiety in the molecule of the present technology), therapeutic properties (e.g., by the presence of a pharmaceutically active moiety in the molecule of the present technology), etc.
The one or more protein-based building blocks and/or the at least one cargo comprised in the molecule of the present technology may be recombinantly expressed as part of one or more genetic construct(s) and/or may be independently chemically synthesized (e.g., by SPPS). For instance, one or more protein-based carrier building block(s) may be expressed recombinantly, as part of a single genetic construct, and the one or more cargo(s) may also be expressed as part of another genetic construct. In a further step, the one or more cargo(s) may be attached or conjugated to the at least one protein-based carrier building block(s) through the conjugation site(s) or attachment point(s).
The molecule of the present technology may comprise at least one protein-based carrier building block, (i) at least one nuclear localization sequence (NLS) covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, (ii) at least one targeting moiety covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, and (iii) one or more other groups, residues, moieties or binding units. In one embodiment, the at least one protein-based carrier building block and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to one or more conjugation site(s) or attachment point(s) present in the protein-based carrier building block(s). The (i) at least one nuclear localization sequence (NLS) and the (ii) at least one targeting moiety may then be attached or conjugated to the conjugation sites or attachment points present in the protein-based carrier building block(s). The (i) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (ii) at least one targeting moiety may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s). The (i) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
In another embodiment, the at least one protein-based carrier building block, the (ii) at least one targeting moiety and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (i) at least one nuclear localization sequence (NLS) may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (ii) at least one targeting moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (iii) one or more other groups, residues, moieties or binding units may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS), the (ii) at least one targeting moiety and (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
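The embodiments above differ only in which of components (i) to (iii) are encoded together with the carrier building block in the single genetic construct and which are conjugated afterwards. The sketch below, offered purely as an illustration with hypothetical labels, enumerates every such split in which at least one component is fused. It yields seven layouts; five of them correspond to the embodiments spelled out above, and the two with only (i) or only (ii) fused are listed by the enumeration but are not recited in the preceding paragraphs.

```python
from itertools import combinations

# Hypothetical labels for the three component types discussed above.
COMPONENTS = ("NLS (i)", "targeting moiety (ii)", "other units (iii)")


def layouts(components=COMPONENTS):
    """Yield (fused, conjugated) tuples for every non-empty fused subset."""
    for size in range(1, len(components) + 1):
        for fused in combinations(components, size):
            conjugated = tuple(c for c in components if c not in fused)
            yield fused, conjugated


for fused, conjugated in layouts():
    print("fused with carrier:", ", ".join(fused),
          "| conjugated afterwards:", ", ".join(conjugated) or "none")
```

Replacing the second label with a therapeutic moiety gives the analogous enumeration for the embodiments that pair an NLS with a therapeutic moiety.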
The molecule of the present technology may comprise at least one protein-based carrier building block, (i) at least one nuclear localization sequence (NLS) covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, (ii) at least one therapeutic moiety covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, and (iii) one or more other groups, residues, moieties or binding units. In one embodiment, the at least one protein-based carrier building block and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to one or more conjugation site(s) or attachment point(s) present in the protein-based carrier building block(s). The (i) at least one nuclear localization sequence (NLS) and the (ii) at least one therapeutic moiety may then be attached or conjugated to the conjugation sites or attachment points present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (ii) at least one therapeutic moiety may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (ii) at least one therapeutic moiety and the (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (i) at least one nuclear localization sequence (NLS) may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (ii) at least one therapeutic moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (iii) one or more other groups, residues, moieties or binding units may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS), the (ii) at least one therapeutic moiety and (iii) one or more other groups, residues, moieties or binding units are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
The (i) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
The molecule of the present technology may comprise at least one protein-based carrier building block, (i) at least one nuclear localization sequence (NLS) covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, (ii) at least one therapeutic moiety covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, and (iii) at least one targeting moiety covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block. In one embodiment, the at least one protein-based carrier building block and the (iii) at least one targeting moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to one or more conjugation site(s) or attachment point(s) present in the protein-based carrier building block(s). The (i) at least one nuclear localization sequence (NLS) and the (ii) at least one therapeutic moiety may then be attached or conjugated to the conjugation sites or attachment points present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (iii) at least one targeting moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (ii) at least one therapeutic moiety may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (ii) at least one therapeutic moiety and the (iii) at least one targeting moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (i) at least one nuclear localization sequence (NLS) may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS) and the (ii) at least one therapeutic moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. The (iii) at least one targeting moiety may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
In another embodiment, the at least one protein-based carrier building block, the (i) at least one nuclear localization sequence (NLS), the (ii) at least one therapeutic moiety and (iii) at least one targeting moiety are part of a single genetic construct and expressed recombinantly, i.e., they are expressed recombinantly as a single polypeptide. Further groups, residues, moieties or binding units may then be attached or conjugated to at least one conjugation site or attachment point present in the protein-based carrier building block(s).
The (i) NLS may be selected from SEQ ID NO.: 221, 256, 304 and 305.
For instance, the molecule of the present technology may comprise two or more protein-based carrier building blocks which may be part of a genetic construct and recombinantly expressed.
For instance, the molecule of the present technology may comprise one or more protein-based carrier building blocks and one or more other groups, residues, moieties or binding units in which said one or more other groups, residues, moieties or binding units provide the molecule with several functionalities, such as binding specificity, increased (in vivo) half-life, therapeutic properties, etc. In this case, the one or more protein-based carrier building blocks and the one or more other groups, residues, moieties or binding units may be part of a single genetic construct and recombinantly expressed as a single polypeptide.
For instance, the molecule of the present technology may comprise one protein-based carrier building block and one half-life extension moiety, wherein the half-life extension moiety and protein-based carrier building block may be part of a genetic construct and expressed recombinantly as a single polypeptide.
Hence, the molecule of the present technology may comprise or consist of more than one protein-based building block(s), which may be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, the at least one targeting moiety, at least one therapeutic moiety and/or one or more cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building blocks.
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s) and one or more half-life extending moieties, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, the at least one targeting moiety, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s) and one or more targeting moieties, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, the at least one therapeutic moiety, one or more half-life extending moieties and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s) and one or more therapeutic moieties, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, the at least one targeting moiety, one or more half-life extending moieties, and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s) and one or more targeting and/or therapeutic moieties, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, one or more half-life extending moieties and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), one or more half-life extending moiety and one or more targeting moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), one or more half-life extending moiety and one or more therapeutic moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, the at least one targeting moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), one or more half-life extending moiety and/or one or more targeting and/or therapeutic moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. The at least one nuclear localization sequence (NLS) and, optionally, one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s) and at least one nuclear localization sequence (NLS), as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, one or more half-life extending moiety, at least one targeting moiety, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), and one or more half-life extending moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, at least one targeting moiety, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), and one or more targeting moieties, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, one or more half-life extending moiety, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), and at least one therapeutic moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, one or more half-life extending moiety, at least one targeting moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), one or more half-life extending moiety and at least one targeting moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, at least one therapeutic moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), one or more half-life extending moiety and at least one therapeutic moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, at least one targeting moiety and/or one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
In another embodiment, the molecule of the present technology may comprise one or more protein-based building block(s), at least one nuclear localization sequence (NLS), one or more half-life extending moiety, at least one targeting moiety and at least one therapeutic moiety, as described above, and they may all be part of a genetic construct and expressed recombinantly as a single polypeptide. Optionally, one or more further cargos, as defined above, may then be attached or conjugated to one or more attachment points or conjugation sites present in the protein-based building block(s).
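The embodiments above differ only in which components are expressed together with the protein-based building block as a single polypeptide and which are conjugated afterwards. As a minimal illustrative sketch (the component labels and the function name `enumerate_embodiments` are assumptions made for illustration and are not terms defined in this description), the possible splits can be enumerated as follows. Note that the NLS is always part of the molecule, whether fused or conjugated, whereas the other components are optional when conjugated.

```python
from itertools import combinations

# Hypothetical labels for the components discussed in the embodiments above.
COMPONENTS = (
    "NLS",
    "half-life extending moiety",
    "targeting moiety",
    "therapeutic moiety",
)

def enumerate_embodiments(components=COMPONENTS):
    """Yield (fused, conjugated) pairs.

    'fused' components are expressed recombinantly with the building block
    as a single polypeptide; the remaining components may be attached to
    its conjugation sites afterwards.
    """
    for k in range(len(components) + 1):
        for fused in combinations(components, k):
            conjugated = tuple(c for c in components if c not in fused)
            yield fused, conjugated

embodiments = list(enumerate_embodiments())
for fused, conjugated in embodiments:
    print("fused:", ", ".join(fused) or "none",
          "| conjugated:", ", ".join(conjugated) or "none")
```

With four components, the sketch lists every subset that may be fused, from none (all components conjugated) to all four (only further cargos conjugated).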
Alternatively, the at least one protein-based carrier building block and/or molecule (or part of the molecule) of the present technology can be produced synthetically, e.g., using solid-phase peptide synthesis (SPPS), see, e.g., Jaradat, D.M.M., Thirteen decades of peptide synthesis: key developments in solid phase peptide synthesis and amide bond formation utilized in peptide ligation, Amino Acids 50, 39-68 (2018).
As it will be evident to the skilled reader, if the molecule of the present technology comprises one or more protein-based building blocks, the at least one nuclear localization sequence (NLS) and, optionally, one or more half-life extending moiety, at least one targeting moiety, at least one therapeutic moiety and/or one or more other groups, residues, moieties or binding units, as defined above, part or the whole molecule may be encoded in a nucleic acid molecule, optionally as part of an expression vector, as defined above, and part or the whole molecule may be produced synthetically.
Once the one or more protein-based building blocks and, optionally, the one or more other groups, residues, moieties or binding units, as defined above, are produced, the cargos may be attached to the at least one protein-based building block via the attachment points or conjugation sites (preferably engineered conjugation sites or attachment points), as described above. For instance, the at least one protein-based carrier building block may be expressed recombinantly, as described above, and the cargo(s) conjugated to it via the at least one attachment point or conjugation site, thus rendering the molecule of the present technology. For instance, the at least one protein-based carrier building block may be produced synthetically, e.g., using SPPS, as described above, and the cargo(s) conjugated to it via the at least one attachment point or conjugation site, thus rendering the molecule of the present technology. For instance, the at least one protein-based carrier building block and one or more further moieties, such as HLE moieties and/or NLS, may be encoded in an expression vector and be expressed recombinantly, as described above, and the cargo(s) conjugated to the protein-based carrier building block via the at least one attachment point or conjugation site, thus rendering the molecule of the present technology. The at least one protein-based carrier building block, the at least one nuclear localization sequence (NLS), and the optional one or more further moieties, such as HLE moieties, targeting moieties and/or therapeutic moieties, may be linked through a linker, as described in detail above.
For instance, the at least one protein-based carrier building block may be expressed recombinantly (alone or together with, e.g., at least some of the half-life extension moieties), as described above, and the cargo(s) (i.e., the at least one nuclear localization sequence (NLS), and optionally at least one targeting moiety, at least one therapeutic moiety and/or one or more further moieties or cargos) conjugated to it via the at least one attachment point or conjugation site. If a conjugation site is a -SH group (free or capped) present in the side chain of a cysteine present in the recombinantly-expressed protein-based carrier building block (which may be expressed alone or together with, e.g., at least some of the half-life extension moieties, as explained herein), a cargo can be attached or conjugated to the building block (directly or by means of a linker) by alkylation, metal-assisted arylation, disulphide exchange or addition to a maleimide Michael acceptor, see above in this description for further details. If a conjugation site is a -OH group of a tyrosine present in the recombinantly-expressed protein-based carrier building block (which may be expressed alone or together with, e.g., at least some of the half-life extension moieties, as explained herein), a cargo can be attached or conjugated to the building block (directly or by means of a linker) by several chemical methods such as cross-linking via catalytic tyrosine mono-electronic oxidation, three-component Mannich-type tyrosine conjugation, conjugation via sulphur fluoride exchange chemistry (SuFEx), transition-metal complexes for tyrosine conjugation, diazonium coupling reaction, reactions with triazolinediones, etc. (for a review, see, e.g., D. Alvarez Dorta et al., Chem. Eur. J., 2020, 26, 14257).
If a conjugation site is the -OH group of an N- and/or C-terminal tyrosine present in the recombinantly-expressed protein-based carrier building block (which may be expressed alone or together with, e.g., at least some of the half-life extension moieties, as explained herein), a cargo can be attached or conjugated to the building block (directly or by means of a linker) enzymatically as described, e.g., in Alan M. Marmelstein et al., Journal of the American Chemical Society, 2020, 142 (11), 5078-5086. If a conjugation site is the N-terminal primary amine of the recombinantly-expressed protein-based carrier building block and/or the primary amine present in the side chain of an amino acid present in the recombinantly-expressed protein-based carrier building block (which may be expressed alone or together with, e.g., at least some of the half-life extension moieties, as explained herein) (e.g., Lys, Orn, or any non-natural amino acid with a primary amine on its side chain), a cargo may be attached or conjugated to the carrier building block (directly or by means of a linker) by reaction of a group present in the cargo/linker (e.g., isothiocyanates, isocyanates, acyl azides, NHS esters, sulfonyl chlorides, aldehydes, glyoxals, epoxides, oxiranes, carbonates, aryl halides, imidoesters, carbodiimides, anhydrides, or fluorophenyl esters) and the primary amine.
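The site-dependent choice of conjugation chemistry described above can be summarised as a simple lookup. The following is only an illustrative sketch: the dictionary keys and the function name `options_for` are hypothetical labels, while the listed chemistries and reactive groups are those named in the preceding paragraphs.

```python
# Conjugation options named in the text above, keyed by hypothetical site labels.
CONJUGATION_CHEMISTRIES = {
    "cysteine_thiol": [
        "alkylation",
        "metal-assisted arylation",
        "disulphide exchange",
        "addition to a maleimide Michael acceptor",
    ],
    "tyrosine_hydroxyl": [
        "cross-linking via catalytic tyrosine mono-electronic oxidation",
        "three-component Mannich-type conjugation",
        "sulphur fluoride exchange (SuFEx)",
        "transition-metal complexes",
        "diazonium coupling",
        "reaction with triazolinediones",
    ],
    "terminal_tyrosine_hydroxyl": ["enzymatic conjugation"],
    # For primary amines the text lists reactive groups on the cargo/linker.
    "primary_amine": [
        "isothiocyanates", "isocyanates", "acyl azides", "NHS esters",
        "sulfonyl chlorides", "aldehydes", "glyoxals", "epoxides", "oxiranes",
        "carbonates", "aryl halides", "imidoesters", "carbodiimides",
        "anhydrides", "fluorophenyl esters",
    ],
}

def options_for(site_type):
    """Return the conjugation options listed for a site type (empty if unknown)."""
    return CONJUGATION_CHEMISTRIES.get(site_type, [])

print(options_for("cysteine_thiol"))
```

The lookup is not exhaustive; it only mirrors the examples given in this passage.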
The skilled person is familiar with groups, residues, or moieties able to provide therapeutic properties to the molecule of the present technology, such as pharmaceutically active moieties. The skilled person is also familiar with groups, residues or moieties able to provide specific targeting of the molecule of the technology to desired organs/tissues/cells in the human or animal body, such as targeting moieties.
For instance, at least some of the half-life extension moieties, targeting moieties, therapeutic moieties or precursors therefrom described above in the "Cargos" section may be incorporated in the molecule of the present technology as part of a genetic construct, expressed recombinantly, possibly together with the at least one protein-based carrier building block, as described in detail above. Hence, at least some of the half-life extension moieties, targeting moieties, therapeutic moieties or precursors therefrom or imaging molecules described above in the "Cargos" section may be incorporated in the molecule of the present technology (i) by attaching or conjugating them to the at least one attachment point or conjugation site present in the protein-based carrier building block, or (ii) by expressing them recombinantly together with the protein-based carrier building block. Hence, the at least one nuclear localization sequence (NLS) and, optionally, one or more half-life extending moiety, at least one targeting moiety and/or at least one therapeutic moiety, described above may be incorporated in the molecule of the present technology, independently, (i) by attaching or conjugating one or all of them to the at least one attachment point or conjugation site present in the protein-based carrier building block, or (ii) by expressing one or all of them recombinantly together with the protein-based carrier building block. Of course, combinations of the above mechanisms are possible; for instance, the at least one nuclear localization sequence (NLS), one or more of the half-life extension moieties, targeting moieties, therapeutic moieties or precursors therefrom described above in the "Cargos" section may be incorporated in the molecule of the present technology as part of a genetic construct, expressed recombinantly possibly together with the at least one protein-based carrier building block, and/or the at least one nuclear localization sequence (NLS), one or more of the half-life extension moieties, targeting moieties, therapeutic moieties or precursors therefrom described above in the "Cargos" section may be incorporated in the molecule of the present technology by attaching or conjugating them to the at least one attachment point or conjugation site present in the protein-based carrier building block. The skilled person will understand and decide how to generate the molecule of the present technology in light of the number of protein-based carrier building blocks and specific moieties and/or cargos that the molecule will incorporate.
If two or more proteins (e.g., one ISVD-derived protein-based carrier building block, and one further protein, such as one ISVD, which may be a targeting moiety, which may increase the in vivo half-life of the protein-based building block and/or molecule of the present technology and/or which may have therapeutic properties; or one CSK-based carrier building block and one further protein which may be a targeting moiety, which may increase the in vivo half-life of the protein-based building block and/or molecule of the present technology and/or which may have therapeutic properties; or one DARPin-based carrier building block and one further protein, which may be a targeting moiety, which may increase the in vivo half-life of the protein-based building block and/or molecule of the present technology, and/or which may have therapeutic properties) are comprised in the molecule of the present technology, they may be directly linked to each other, and/or may be linked to each other via one or more suitable linkers, or any combination thereof. Suitable linkers have been described above in this description.
As already described, the conjugation of cargos to the attachment points or conjugation sites may be performed directly or via a linker. A suitable linker is, for instance, the APN-Maleimide linker (806536, Sigma-Aldrich), which is a bifunctional linker (see also Formula I below). This linker allows for conjugation twice, via cysteine-based chemistry. Both APN and maleimide couple to free thiols, albeit at different rates. An example is shown in Figure 4. Hence, the APN-Maleimide linker may be used to attach cargos to attachment points or conjugation sites which are -SH groups present, e.g., in the side chain of a cysteine. Another linker is, for instance, N-ethylmaleimide (see, e.g., Formula II) or Maleimido-PEG12-acid (PubChem CID 68757103, IUPAC name 3-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[3-(2,5-dioxopyrrol-1-yl)propanoylamino]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]propanoic acid, also known as "Mal-amido-PEG12-acid"), see Example 5.3 of WO 2024/133935, the content of which is incorporated herein by reference.
Formula II, N-ethylmaleimide
The present technology further provides a method for producing the protein-based building block comprised in the molecule of the present technology, wherein the method comprises:
a. Providing a protein-based building block precursor, as defined herein;
b. Generating at least one, preferably more, as described herein, attachment points or conjugation sites in the protein-based building block precursor provided in a., as described in detail herein, and/or eliminating the specific binding properties of the protein-based building block precursor provided in a. to produce the protein-based building block, wherein the protein-based building block:
a) comprises at least one conjugation site or attachment point, preferably at least two attachment points or conjugation sites;
b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as from about 2.5 kDa to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa;
c) has a globular 3D structure;
d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at RT, preferably measured in a buffer or water at RT, more preferably in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%));
e) does not specifically bind to any human protein or binds one or more human proteins with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by surface plasmon resonance (SPR), for instance as described herein and/or in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214.
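The non-optional, quantitative criteria listed above (number of conjugation sites, molecular mass, solubility and KD for human proteins) can be illustrated with a short check. This is only a sketch under stated assumptions: the class `BuildingBlock`, the function `meets_core_criteria` and the example values are hypothetical, the "about" ranges are treated as hard limits for simplicity, and the globular-structure criterion and all optional criteria are not modelled.

```python
from dataclasses import dataclass
from typing import Optional

KD_THRESHOLD_MOLAR = 5e-4  # binding counts as non-specific above this KD (mol/litre)

@dataclass
class BuildingBlock:
    """Hypothetical record of measured properties of a candidate building block."""
    n_conjugation_sites: int
    mass_kda: float
    solubility_mg_per_ml: float               # measured in aqueous solution at RT
    kd_human_protein_molar: Optional[float]   # None means no binding detected

def meets_core_criteria(block, min_sites=1):
    """Check criteria a), b), d) and e) using the broadest ranges given in the text."""
    has_sites = block.n_conjugation_sites >= min_sites
    mass_ok = 2.5 <= block.mass_kda <= 70.0
    soluble = block.solubility_mg_per_ml >= 10.0
    kd = block.kd_human_protein_molar
    non_binding = kd is None or kd > KD_THRESHOLD_MOLAR
    return has_sites and mass_ok and soluble and non_binding

# Example values are invented for illustration only.
candidate = BuildingBlock(2, 13.0, 25.0, None)
tight_binder = BuildingBlock(2, 13.0, 25.0, 1e-9)
print(meets_core_criteria(candidate))      # True
print(meets_core_criteria(tight_binder))   # False
```

Passing `min_sites=2` applies the preferred requirement of at least two attachment points or conjugation sites.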
Further, the present technology comprises a method to produce the molecule of the present technology which comprises:
(i) Providing a protein-based building block, wherein the protein-based building block:
a) comprises at least two attachment points or conjugation sites;
b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as from about 2.5 kDa to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa;
c) has a globular 3D structure;
d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at RT, preferably measured in a buffer or water at RT, more preferably in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%));
e) does not specifically bind to any human protein or binds one or more human proteins with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by surface plasmon resonance (SPR), for instance as described herein and/or in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214;
(ii) covalently linking, directly or by means of a linker, at least one nuclear localization sequence (NLS) to at least one of the attachment points or conjugation sites present in the protein-based carrier building block provided in (i);
(iii) optionally, covalently linking, directly or by means of a linker, one or more half-life extending moieties, at least one targeting moiety and/or at least one therapeutic moiety to at least one of the attachment points or conjugation sites present in the protein-based carrier building block provided in (i); and
(iv) optionally, covalently linking, directly or by means of a linker, at least one further cargo as described herein to at least one of the attachment points or conjugation sites present in the protein-based carrier building block provided in (i), wherein steps (ii), (iii) and (iv) can be carried out in any order and/or simultaneously.
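As an illustration only, the numeric criteria a), b), d) and e) above could be screened as in the following minimal sketch. The class name, field names and the treatment of the "about" ranges as hard bounds are assumptions made for the example; criterion c) (globular 3D structure) and the optional criteria are not modelled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BuildingBlock:
    """Hypothetical record of measured properties of a candidate building block."""
    name: str
    n_conjugation_sites: int
    mass_kda: float
    solubility_mg_per_ml: float
    # Lowest KD (mol/L) observed against any human protein;
    # None means no specific binding was detected.
    kd_human_mol_per_l: Optional[float] = None


def meets_core_criteria(bb: BuildingBlock) -> bool:
    """Check criteria a), b), d) and e); 'about' ranges are treated as hard bounds."""
    return (
        bb.n_conjugation_sites >= 2                      # a) at least two sites
        and 2.5 <= bb.mass_kda <= 70.0                   # b) broadest mass range
        and bb.solubility_mg_per_ml >= 10.0              # d) 10 mg/mL or more
        and (bb.kd_human_mol_per_l is None               # e) no specific binding, or
             or bb.kd_human_mol_per_l > 5e-4)            #    KD greater than 5x10^-4 M
    )
```

A candidate with three sites, a mass of 13.5 kDa, a solubility of 25 mg/mL and no detected binding would pass this check, whereas the same candidate with a nanomolar KD for a human protein would not.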
Hence, the present technology provides a molecule which comprises
(i) a protein-based building block, wherein the protein-based building block:
a) comprises at least two attachment points or conjugation sites;
b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, such as from about 2.5 kDa to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa;
c) has a globular 3D structure;
d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at RT, preferably measured in a buffer or water at RT, more preferably in a buffer such as citrate buffer or phosphate-buffered saline (PBS) at pH 7.0 or 7.4, at RT, or histidine buffer at pH 6.5, at RT (comprising histidine (10 mM to 100 mM, such as 10 mM), sucrose (1% to 10%, such as 10%) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%)), or phosphate buffer pH 7.0, at RT (comprising NaH2PO4/Na2HPO4 (10 to 50 mM, such as 10 mM), sodium chloride (NaCl) (100-150 mM, such as 130 mM NaCl) and, optionally, Tween 80 (0.001% to 1%, such as 0.01%));
e) does not specifically bind to any human protein or binds one or more human proteins with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by surface plasmon resonance (SPR), for instance as described herein and/or in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind to any microorganism such as bacteria, fungi, protists, yeast and/or to any virus, or binds to a microorganism such as bacteria, fungi, protists, yeast and/or to a virus with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5x10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214;
(ii) at least one nuclear localization sequence (NLS), covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the at least one protein-based building block;
(iii) optionally, at least one half-life extending (HLE) moiety, at least one targeting moiety, and/or at least one therapeutic moiety;
(iv) optionally, at least one further cargo, wherein the at least one further cargo is selected from:
a) one or more further half-life extending (HLE) moieties, such as PEG or an albumin binder;
b) a cell-penetrating peptide (CPP);
c) a further targeting moiety, such as an EGFR-targeting moiety, e.g., GE11 peptide, an anti-EGFR VHH, or an anti-CEACAM5 VHH;
d) a further therapeutic moiety or precursor thereof, such as a Death receptor 5 (DR5) antagonist;
e) an imaging moiety, such as deferoxamine (DFO);
f) a toxic moiety, such as maytansinoid (DM4) or cryptophycin;
g) nucleic acids, such as antisense oligonucleotides (ASOs);
h) vitamins, such as folate;
i) Toll-like receptor agonists, such as resiquimod;
j) glycans; and/or
k) lipids,
wherein the at least two antibody-binding components can be covalently linked in the form of a cluster to a single attachment point or conjugation site present in the protein-based carrier building block or can be each covalently linked to one attachment point or conjugation site present in the protein-based carrier building block.
The molecule of the present technology or the composition comprising the molecule of the present technology are useful as a medicament.
Accordingly, the present technology provides the molecule of the present technology or a composition comprising the molecule of the present technology for use as a medicament.
Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use in entering the nucleus. Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for intranuclear delivery. Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use in intranuclear treatment.
Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use in (prophylactic and/or therapeutic) treatment.
Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use in the (prophylactic and/or therapeutic) treatment of an autoimmune/inflammatory disease and/or cancer, such as hematological (blood) and solid tumor cancer disease.
Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use in the (prophylactic and/or therapeutic) treatment of an infectious disease.
Also provided is the molecule of the present technology or a composition comprising the molecule of the present technology for use as a vaccine. Hence, the present technology provides a vaccine comprising the molecule of the present technology or a composition comprising the molecule of the present technology, optionally further comprising further components such as pharmaceutically acceptable carriers and/or adjuvants.
A "subject" as referred to in the context of the present technology can be any animal. In one embodiment, the subject is a mammal. Among mammals, a distinction can be made between humans and non-human mammals. Non-human animals may be for example companion animals (e.g. dogs, cats), livestock (e.g. bovine, equine, ovine, caprine, or porcine animals), or animals used generally for research purposes and/or for producing antibodies (e.g. mice, rats, rabbits, cats, dogs, goats, sheep, horses, pigs, non-human primates, such as cynomolgus monkeys, or camelids, such as llama or alpaca).
In the context of prophylactic and/or therapeutic purposes, the subject can be any animal, and more specifically any mammal. In one embodiment, the subject is a human subject.
Substances, including molecules or compositions may be administered to a subject by any suitable route of administration, for example by enteral (such as oral or rectal) or parenteral (such as epicutaneous, sublingual, buccal, nasal, intratracheal, intra-articular, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, transdermal, or transmucosal) administration. In one embodiment, substances are administered by parenteral administration, such as intramuscular, subcutaneous or intradermal, administration.
An effective amount of a molecule as described, or a composition comprising the molecule of the present technology can be administered to a subject in order to provide the intended treatment results.
One or more doses can be administered. If more than one dose is administered, the doses can be administered in suitable intervals in order to maximize the effect of the molecule or composition comprising the same.
EXAMPLES
Example 1. In silico design of ISVD-based carrier building blocks
ISVD RSV001A04 (SEQ ID NO.: 179, also referred to as RSV 001A04) was selected as starting point for developing the ISVD-based carrier building block. Figure 1 shows the amino acid sequence of ISVD RSV001A04 (SEQ ID NO.: 179). Using MAESTRO, residues in the building block precursor with a Solvent-Accessible Surface Area (SASA) greater than or equal to 27 Å² (square angstroms) were considered to be solvent-accessible and were further considered for calculating the stability of a mutation to cysteine. As a result, 78 potential solvent-accessible residues were selected in SEQ ID NO.: 179 as potential positions for point mutations to cysteine, to generate conjugation sites or attachment points in the protein-based carrier building block. These positions are (in SEQ ID NO.: 179, Kabat numbering):
1, 3, 5, 7-8, 10-15, 17-19, 21, 23, 25-28, 30-32, 39, 41-46, 52a-59, 61-62, 64-66, 68, 70-76, 79, 81, 82a-82b, 83-85, 87, 89, 91, 96, 98-100a, 100d-100g, 101-103, 105-106, 108, 110 and 112-113.
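The accessibility filter described above can be sketched as follows. This is a minimal illustration, not the MAESTRO workflow itself: the per-residue SASA values are assumed to be already available as a mapping from Kabat position label to SASA in Å², and the function names are hypothetical.

```python
import re

SASA_THRESHOLD = 27.0  # Å², the cut-off stated in the text


def kabat_key(position: str):
    """Sort key for Kabat labels such as '52a' or '100f': number first, then insertion letter."""
    match = re.fullmatch(r"(\d+)([a-z]?)", position)
    if match is None:
        raise ValueError(f"not a Kabat position label: {position!r}")
    return int(match.group(1)), match.group(2)


def solvent_accessible(sasa_by_position, threshold=SASA_THRESHOLD):
    """Return positions whose SASA is >= threshold, in Kabat order."""
    hits = [pos for pos, sasa in sasa_by_position.items() if sasa >= threshold]
    return sorted(hits, key=kabat_key)
```

With made-up values, `solvent_accessible({"100a": 40.0, "52a": 27.0, "2": 5.0, "7": 61.3})` keeps positions 7, 52a and 100a and drops position 2.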
Stability (ΔG in solvent) of each mutant was calculated using MAESTRO, see, e.g., Laimer J. et al, "MAESTRO-multi agent stability prediction upon point mutations", BMC Bioinformatics, 2015, 16:116, for further details. Destabilizing cysteine mutations (i.e., those with higher calculated ΔG in solvent) were not further considered as potential positions for conjugation sites or attachment points. Based on the stability data, 27 potential positions were further selected; see the following amino acid positions in SEQ ID NO.: 179 according to Kabat numbering:
E1, S7, Q13, S17, S19, S21, A23, S25, G26, S28, N31, K43, E44, D55, N62, G65, T68, S70, D72, A74, K75, S82b, D85, G100a, D100f, R105, S112.
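The stability-based selection can be pictured with the short sketch below. It assumes the predicted stability change of each cysteine mutant is available as a number in which a positive value means destabilizing; both this sign convention and the cut-off of 0.0 are assumptions for illustration, since the text does not state the threshold that was applied.

```python
def keep_non_destabilizing(stability_change_by_mutation, cutoff=0.0):
    """Return mutations whose predicted stability change is <= cutoff, most stabilizing first.

    stability_change_by_mutation maps a label such as 'S7C' to a predicted change
    (assumed convention: positive = destabilizing).
    """
    kept = [(label, change)
            for label, change in stability_change_by_mutation.items()
            if change <= cutoff]
    kept.sort(key=lambda item: item[1])
    return [label for label, _ in kept]
```

For example, with hypothetical values `{"S7C": -0.3, "L11C": 1.2, "S17C": -0.8}`, the call retains S17C and S7C and discards L11C.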
In addition, the -SH group of an engineered C-terminal Cys (i.e., not present in the building block precursor) preceded by a GG tag was also selected as potential attachment point (-GGC).
The following potential solvent-accessible positions were finally selected (in SEQ ID NO.: 179, according to Kabat numbering) as preferred combinations of potential solvent-accessible positions based on the in-silico predictions:
9x CYS
* S21, N31, K43, T68, D72, S82b, G100a, D100f, R105
* S7, Q13, S19, A23, E44, D55, N62, S70, A74
* S7, S17, N31, E44, D55, N62, T68, K75, S112.
6x CYS
* S19, E44, G65, S70, S82b, S112
* S21, K43, D55, T68, A74, S112
* S19, A23, N31, S70, S82b, D100f
* Q13, S25, K43, G65, D72, G100a
* S25, K43, K75, S82b, G100a, S112
* S25, K43, K75, G100a, R105, S112
* S25, K43, K75, G100a, R105 and C-terminal C (-GGC)
* K43, T68, K75, G100a, R105 and C-terminal C (-GGC)
* S25, K43, K75, D100f, R105 and C-terminal C (-GGC)
* K43, T68, K75, D100f, R105 and C-terminal C (-GGC).
4x CYS
* S19, G65, S82b, S112
3x CYS
* K43, D100f, R105
* K43, K75, G100a
* S21, T68, D100f
* S7, E44, D55
* Q13, D72, G100a
* Q13, N31, D100f.
As a result, the following protein-based carrier building blocks comprising three attachment points or conjugation sites which are the -SH group in the side chain of three cysteines located at solvent-accessible positions were designed:
a) 13001, corresponding to SEQ ID NO.: 80, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): K43C, D100fC, R105C and Q108L;
b) 13002, corresponding to SEQ ID NO.: 81, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): K43C, K75C, G100aC and Q108L;
c) 13003, corresponding to SEQ ID NO.: 82, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S21C, T68C, D100fC and Q108L;
d) 13004, corresponding to SEQ ID NO.: 83, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): Q13C, D72C, G100aC and Q108L;
e) 13005, corresponding to SEQ ID NO.: 84, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): Q13C, N31C, D100fC and Q108L; and
f) 13006, corresponding to SEQ ID NO.: 85, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S7C, E44C, D55C and Q108L.
Mutation Q108L was performed in all cases as shown in Table 3 above. In addition, the polypeptide of SEQ ID NO.: 175 was designed. In this polypeptide only mutation Q108L was introduced, together with a -GGC tag at the C-terminal.
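The mutation labels used for these designs follow the pattern wild-type residue, Kabat position, new residue (for instance K43C or D100fC). As a minimal sketch, assuming labels are written with digits as in these examples, the engineered cysteine sites of a design can be extracted as follows; the function names are hypothetical.

```python
import re

# Wild-type residue, Kabat position (optionally with insertion letter), new residue.
MUTATION = re.compile(r"^([A-Z])(\d+[a-z]?)([A-Z])$")


def parse_mutation(label: str):
    """Split a label such as 'D100fC' into ('D', '100f', 'C')."""
    match = MUTATION.match(label)
    if match is None:
        raise ValueError(f"unrecognised mutation label: {label!r}")
    return match.group(1), match.group(2), match.group(3)


def engineered_cysteine_sites(mutations, c_terminal_ggc=False):
    """List the positions at which a cysteine is introduced, plus an optional -GGC tag."""
    sites = []
    for label in mutations:
        wild_type, position, new = parse_mutation(label)
        if new == "C" and wild_type != "C":
            sites.append(position)
    if c_terminal_ggc:
        sites.append("C-term")
    return sites
```

Applied to the mutations listed for 13001 (K43C, D100fC, R105C and Q108L), this returns the three positions 43, 100f and 105; Q108L is ignored because it does not introduce a cysteine.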
In addition, the following protein-based carrier building block comprising four attachment points or conjugation sites which are the -SH group in the side chain of four cysteines located at solvent-accessible positions was designed: a) RSV001A04(S19C,G65C,S82bC,Q108L,S112C), corresponding to SEQ ID NO.: 225, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S19C, G65C, S82bC, Q108L and S112C.
In addition, the following protein-based carrier building blocks comprising six attachment points or conjugation sites which are the -SH group in the side chain of six cysteines located at solvent-accessible positions were designed:
a) 16001, corresponding to SEQ ID NO.: 86, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S19C, E44C, G65C, S70C, S82bC, Q108L and S112C;
b) 16002, corresponding to SEQ ID NO.: 87, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S21C, K43C, D55C, T68C, A74C, Q108L and S112C;
c) 16003, corresponding to SEQ ID NO.: 88, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): S19C, A23C, N31C, S70C, S82bC, D100fC and Q108L;
d) 16004, corresponding to SEQ ID NO.: 89, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): Q13C, S25C, K43C, G65C, D72C, G100aC and Q108L;
e) 16005, corresponding to SEQ ID NO.: 90, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, S25C, K43C, K75C, S82bC, V89L, G100aC, Q108L and S112C;
f) 16006, corresponding to SEQ ID NO.: 91, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, S25C, K43C, K75C, V89L, G100aC, R105C, Q108L and S112C;
g) 16007, corresponding to SEQ ID NO.: 92, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, S25C, K43C, K75C, V89L, G100aC, R105C, Q108L and a C-terminal -GGC tag;
h) 16008, corresponding to SEQ ID NO.: 93, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, K43C, T68C, K75C, V89L, G100aC, R105C, Q108L and a C-terminal -GGC tag;
i) 16009, corresponding to SEQ ID NO.: 94, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, S25C, K43C, K75C, V89L, D100fC, R105C, Q108L and a C-terminal -GGC tag; and
j) 16010, corresponding to SEQ ID NO.: 95, which comprises the following point mutations in SEQ ID NO.: 179, at the following positions (according to Kabat numbering): L11V, K43C, T68C, K75C, V89L, D100fC, R105C, Q108L and a C-terminal -GGC tag.
Mutation Q108L was performed in all cases as shown in Table 3 above. In some cases, mutations L11V and V89L were also performed to avoid binding by pre-existing antibodies.
Example 2. In silico design of DARPin-based carrier building blocks
DARPin K27 was selected as starting point for developing the DARPin-based carrier building block. In particular, the polypeptide as defined in SEQ ID NO.: 187 was chosen as the building block precursor in this case. Arginine residues at positions 69, 102 and 111 were mutated to alanine, so that the polypeptide no longer binds any human protein, in particular its original target protein, KRAS. In addition, the C-terminal leucine was removed. See Figure 2 and SEQ ID NO.: 68. Using MAESTRO, residues in the building block precursor with a Solvent-Accessible Surface Area (SASA) greater than or equal to 27 Å² (square angstroms) were considered to be solvent-accessible and were further considered for calculating the stability of a mutation to cysteine. As a result, the following potential well solvent-accessible residues were selected in SEQ ID NO.: 187 as potential positions for point mutations to cysteine, to generate conjugation sites or attachment points in the protein-based carrier building block:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152 and 154-155.
In addition, the -SH group of an engineered C-terminal Cys (i.e., not present in the building block precursor) was also selected as potential attachment point or conjugation site.
Stability (ΔG in solvent) of each mutant was calculated using MAESTRO, see, e.g., Laimer J. et al, "MAESTRO-multi agent stability prediction upon point mutations", BMC Bioinformatics, 2015, 16:116, for further details. Destabilizing cysteine mutations were not further considered as potential positions for conjugation sites or attachment points. Based on the stability data, the following potential positions in SEQ ID NO.: 187 were further selected as potential solvent-accessible positions based on the in-silico predictions:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155 and C-terminal Cys.
The following potential solvent-accessible positions in SEQ ID NO.: 187 were finally selected as potential solvent-accessible positions based on the in-silico predictions:
85, 95, 143, 148 and C-terminal Cys.
As a result, the following protein-based carrier building blocks comprising three or five attachment points or conjugation sites which are the -SH group in the side chain of three or five cysteines located at solvent-accessible positions were designed:
a) 33001, corresponding to SEQ ID NO.: 96, which comprises the following point mutations in SEQ ID NO.: 68, at the following positions: D143C, D148C and a C-terminal Cys;
b) 33002, corresponding to SEQ ID NO.: 97, which comprises the following point mutations in SEQ ID NO.: 68, at the following positions: E85C, N95C and a C-terminal Cys; and
c) 35001, corresponding to SEQ ID NO.: 98, which comprises the following point mutations in SEQ ID NO.: 68, at the following positions: E85C, N95C, D143C, D148C and a C-terminal Cys.
In addition, the polypeptides of SEQ ID NOs.: 197, 199 and 208 were designed. These polypeptides comprise a C-terminal Cys but do not comprise the Cys point mutations at positions 85, 95, 143 and/or 148. In addition, the polypeptides of SEQ ID NOs.: 197 and 208 comprise a Leu before the C-terminal Cys. The polypeptide of SEQ ID NO.: 197 comprises Arg at positions 69, 102 and 111, whereas the polypeptides of SEQ ID NOs.: 199 and 208 comprise Ala at positions 69, 102 and 111:
SEQ ID NO.: 199:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKC
SEQ ID NO.: 197:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGRTPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKLC
SEQ ID NO.: 208:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIDNGNEDLAEILQKLC
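The relationship between these sequences can be checked with a short sketch that applies point mutations to a sequence using 1-based linear numbering and verifies the expected wild-type residue first. The helper name is hypothetical; the two sequences are copied from the listings above with line-break spaces removed.

```python
def apply_point_mutations(sequence: str, mutations) -> str:
    """Apply labels such as 'R69A' (1-based linear numbering), checking the wild-type residue."""
    residues = list(sequence)
    for label in mutations:
        wild_type, position, new = label[0], int(label[1:-1]), label[-1]
        if not 1 <= position <= len(residues):
            raise ValueError(f"{label}: position outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(
                f"{label}: expected {wild_type}, found {residues[position - 1]}"
            )
        residues[position - 1] = new
    return "".join(residues)


SEQ_ID_197 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGR"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGRTPLHLAAKRGHLEIVEVLLKNGADVNAQDKFGKTAFD"
    "ISIDNGNEDLAEILQKLC"
)
SEQ_ID_208 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVEVLLKYGADVNAADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAF"
    "DISI"
    "DNGNEDLAEILQKLC"
)

if __name__ == "__main__":
    mutated = apply_point_mutations(SEQ_ID_197, ["R69A", "R102A", "R111A"])
    print(mutated == SEQ_ID_208)
```

The wild-type check is a deliberate safeguard: a label whose first letter does not match the residue actually present at that position raises an error instead of silently producing a wrong sequence.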
Example 3. In silico design of CKS-based carrier building blocks
Cyclin-dependent kinase subunit 1 (CKS1, Gene ID: 983) was selected as starting point for developing the CKS1-based carrier building block. In particular, the polypeptide of SEQ ID NO.: 190 was selected as the building block carrier precursor. Figure 3 shows the amino acid sequence of the CKS1 building block precursor. Using MAESTRO, residues in the building block precursor with a Solvent-Accessible Surface Area (SASA) greater than or equal to 27 Å² (square angstroms) were considered to be solvent-accessible and were further considered for calculating the stability of a mutation to cysteine. As a result, the following potential well solvent-accessible residues were selected in SEQ ID NO.: 190 as potential positions for point mutations to cysteine, to generate conjugation sites or attachment points in the protein-based carrier building block:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64 and 69-78.
Stability (ΔG in solvent) of each mutant was calculated using MAESTRO, see, e.g., Laimer J. et al, "MAESTRO-multi agent stability prediction upon point mutations", BMC Bioinformatics, 2015, 16:116, for further details. Destabilizing cysteine mutations were not further considered as potential positions for conjugation sites or attachment points. Based on the stability data, the following potential positions in SEQ ID NO.: 190 were further selected as potential solvent-accessible positions based on the in-silico predictions:
1, 4, 9, 10, 11, 12, 13, 22, 25, 29, 33, 51, 57 and an engineered C-terminal Cys.
As a result, the following protein-based carrier building blocks comprising three or six attachment points or conjugation sites which are the -SH group in the side chain of three or six cysteines located at solvent-accessible positions were designed:
a) 23001, corresponding to SEQ ID NO.: 99, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: K10C, D13C, and a C-terminal Cys;
b) 23002, corresponding to SEQ ID NO.: 100, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: K10C, D12C, M22C, Q51C;
c) 23003, corresponding to SEQ ID NO.: 101, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: Y11C, Q51C and a C-terminal Cys;
d) 23004, corresponding to SEQ ID NO.: 102, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: D13C, K33C, and a C-terminal Cys;
e) 23005, corresponding to SEQ ID NO.: 103, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: Y11C, K33C, Q51C, and a C-terminal Cys;
f) 26001, corresponding to SEQ ID NO.: 104, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: D9C, D13C, M22C, K33C, Q51C, and a C-terminal Cys; and
g) 26002, corresponding to SEQ ID NO.: 105, which comprises the following point mutations in SEQ ID NO.: 190, at the following positions: D9C, D13C, K33C, Q51C, M57C, and a C-terminal Cys.
As controls, the polypeptides of SEQ ID NO.: 201
(SHKQIYYSDKCDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK) and SEQ ID NO.: 202
(SHKQIYYSDKYDCEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLP) were designed.
Example 4. Construction, expression and purification of molecules comprising a protein-based building block, a 15GS linker and a HLE moiety
In addition to the protein-based building blocks as described in Examples 1-3, it was contemplated that the molecules of this example would also comprise a HLE moiety linked to the protein-based building block through a 15GS linker (as defined in SEQ ID NO.: 163). The HLE moiety chosen was Alb23002 (SEQ ID NO.: 63) or its E1D variant (Alb23002(E1D), SEQ ID NO.: 106). Both Alb23002 and Alb23002(E1D) are ISVDs which bind to human serum albumin, as explained above in this description. The design of the molecule was as follows:
(N-terminal) HLE - 15GS linker - building block (C-terminal)
To produce the above-described molecules comprising a HLE moiety, a 15GS linker and the protein-based building block, Komagataella phaffii (Pichia pastoris) was used for expression. This organism is an expression system well known to the skilled person, as for instance described in Bustos, C. et al., "Advances in cell engineering of the Komagataella phaffii platform for recombinant protein production", Metabolites, 2022, 12, 346. E. coli may also be used, as described, e.g., in Correa A and Oppezzo P., "Overcoming the solubility problem in E. coli: available approaches for recombinant protein production", Methods Mol Biol., 2015;1258:27-44.
In order to facilitate purification, constructs were made with at least one Protein A-binding building block (as the HLE moiety) and expressed via Komagataella phaffii. To this end, an Alb23002 building block (SEQ ID NO.: 106 or SEQ ID NO.: 63) was genetically fused to the selected protein-based carrier building block via a 15GS linker (SEQ ID NO.: 163) as described above, which also gives the constructs a longer in vivo half-life. During fermentation, cysteamine was added to cap any free thiol of the engineered cysteines and render a homogeneous product for down-stream processing (DSP).
The sequences of the protein-based carrier building blocks used are summarized in Tables 4-7 (SEQ ID NO.: 80-105, 175, 199, 208, 222-225), and described in detail in Examples 1-3 and 9. The sequences of the multivalent molecules comprising the HLE moiety, linker and protein-based building block, expressed in Komagataella phaffii and purified as described above, are summarized in Tables 11-13 (SEQ ID NO.: 107-127, 170-174, 176, 200, 306).
In particular, fusion constructs, with secretion signal, were expressed via Komagataella phaffii, as described above. Production of ISVDs in lower eukaryotic hosts such as Komagataella phaffii has been described by Frenken et al. ("Isolation of antigen specific llama VHH antibody fragments and their high level secretion by Saccharomyces cerevisiae", J. Biotechnol., 2000, 78: 11-21) and in WO 94/25591, WO 2010/125187, WO 2012/056000, WO 2012/152823 and WO 2017/137579. The contents of these applications are explicitly referred to in connection with general culturing techniques and methods, including suitable media and conditions. The skilled person can also devise suitable genetic constructs for expression of domains in host cells on the basis of common general knowledge.
Following high cell density fermentation, fusion products were separated from the cells via centrifugation and the molecules were purified from the spent medium. To promote a homogeneous product, a minimum of 10 mM cysteamine (30070, Sigma-Aldrich) was added at the end of the induction phase of the fermentation to cap any free thiol of the engineered cysteines and render a homogeneous product for DSP.
Following 0.22 µm filtration, the thiol-capped product was captured from spent medium via Protein A chromatography and separated on a sizing column and/or via ion exchange (all methods are generally applied during protein purification, see, e.g., Remans, K. et al., "Protein purification strategies must consider downstream applications and individual biological characteristics", Microb Cell Fact, 2022, 21(52) or Rathore AS. et al., "Recent developments in chromatographic purification of biopharmaceuticals" Biotechnol Lett., 2018, 40(6):895-905). General chromatography conditions for Protein A were applied, and the proteins of interest were eluted with 100 mM glycine pH 2.5 and neutralised using 1 M Tris pH 8. The reduced material was formulated in D-PBS + 0.1 mM TCEP to maintain the reduced state and allow direct conjugation upon thawing.
Example 5. Conjugation of the APN-maleimide linker to the protein-based carrier building block with three conjugation sites
Conjugation experiments were performed using the protein-based building block as defined in SEQ ID NO.: 80 (13001, see Table 4), comprised in a molecule (SEQ ID NO.: 107) which further comprises the Alb23002 HLE moiety (Alb23002(E1D), SEQ ID NO.: 106, see Table 8) and a 15GS linker (SEQ ID NO.: 163, see Table A-1), as defined above, see also, e.g., Table 11. The protein-based carrier building block as defined in SEQ ID NO.: 80 comprises three attachment points or conjugation sites which are the -SH groups of three cysteines located at positions 43, 100f and 105 (according to Kabat numbering). The cysteines have been introduced in the original molecule (SEQ ID NO.: 179) as point mutations in the above-mentioned positions (according to Kabat numbering): K43C, D100fC and R105C. In addition, the glutamine at hallmark position 108 has been replaced by a leucine (Q108L).
The engineered cysteines present in the ISVD-based building block (SEQ ID NO.: 80)-comprising molecule (SEQ ID NO.: 107) or in the control molecule (SEQ ID NO.: 176, whose protein-based building block (SEQ ID NO.: 175) comprises a C-terminal Cys with an -SH in its side chain, which is the attachment point of this building block) were reduced and/or uncapped using 10 mM DTT in PBS, after which DTT was separated from the molecule solution (comprising the ISVD-building block with reduced engineered cysteines) via HiPrep™ 26/10 Desalting (Cytiva 17-5087-01) or Size Exclusion Chromatography (SEC) on Superdex® Increase 75 10/300 GL (Cytiva 29-1487-21), equilibrated in D-PBS with 0.1 mM TCEP (20490 Thermo Scientific™ Pierce™). This material can be used directly for conjugations or frozen at -20°C to be used later (TCEP prevents reoxidation of the free thiols).
For an assessment of the conjugability of the engineered cysteines (with the side chain -SH groups as attachment points or conjugation sites) present in the building block, any small, maleimide-activated ligand can be used. For instance, the APN-Maleimide linker (806536, Sigma-Aldrich) or N-Maleoyl-β-alanine (394815, Sigma-Aldrich, also referred to as maleimide-Alanine) can be used to this end. The excess of unconjugated ligand was removed via SEC or desalting, and the conjugation product was analysed via Mass Spectrometry (MS). LC-MS was carried out using a Q Exactive™ Plus Hybrid Quadrupole-Orbitrap™ Mass Spectrometer and a Vanquish Flex UHPLC (both Thermo Scientific®) with an online Waters MassPREP™ Micro Desalting Column (2.1 x 5 mm, P/N. 186004032).
Alternatively, an early assessment of the conjugability of the engineered cysteines could be carried out making use of tris(2-carboxyethyl)phosphine (TCEP) as reductant. TCEP does not need to be removed during maleimide based conjugations. Typically, 0.1 mM TCEP in D-PBS is used. A 5-fold or higher excess of maleimide-Alanine vs engineered cysteines was used to promote full conjugation of available free thiols. The crude mixture was not separated via SEC, but instead directly analysed via LC-MS to assess the number of cysteines with (near) full conjugation.
In the present example, an APN-maleimide 'bifunctional' linker (Formula I, Sigma-Aldrich #806536) was used to connect different cargos to the protein-based building block comprised in the molecule. This linker allows for conjugation twice, via cysteine-based chemistry. Both APN and maleimide couple to free thiols, albeit at different speeds. The linker is shown in Formula I (APN-maleimide, also known as 3-(4-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)phenyl)propiolonitrile).
For maleimide-based conjugation, generic conditions were applied (if not otherwise indicated). A 5x molar excess of APN-maleimide in DMSO was added to 1x ISVD-based building block (SEQ ID NO.: 80)-comprising molecule (SEQ ID NO.: 107). As a control, SEQ ID NO.: 176 was used (same molecule but without the cysteine point mutations as described above and with a C-terminal Cys (-GGC)). The ISVD-based building block-comprising molecule (SEQ ID NO.: 107) and control molecule (SEQ ID NO.: 176) were formulated in D-PBS+0.1 mM TCEP, to prevent thiol oxidation. The mixture was incubated for 10 min at room temperature (RT), head over head rotating. Afterwards, the excess APN-maleimide was removed via size exclusion chromatography (SEC), and the resulting molecules were formulated in D-PBS. At this point a sample was taken for mass spectrometry analysis, to check the conjugation efficiency on the engineered cysteine residues. LC-MS was done using a Q Exactive™ Plus Hybrid Quadrupole-Orbitrap™ Mass Spectrometer and a Vanquish Flex UHPLC (both Thermo Scientific®) with an online Waters MassPREP™ Micro Desalting Column (2.1 x 5 mm, P/N. 186004032). As shown in Figure 5, additions of 222 Da (corresponding to the APN-maleimide linker addition) and 286 Da (corresponding to TCEP, used to keep the free thiols reduced) indicated efficient conjugation onto both molecules (SEQ ID NO.: 107 and control SEQ ID NO.: 176). The presence of two or three peaks for each of the molecules simply reflects the mass of the molecule on its own (with the APN-maleimide linker) or the molecule plus one or two molecules of TCEP, as also indicated in the figure. On the control, the added mass clearly corresponded to linker conjugation (Figure 5A). On all 3 positions of SEQ ID NO.: 107 (43, 100f and 105, according to Kabat), the linker was present, as the corresponding masses were detected (Figure 5B).
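The mass additions described above can be tabulated with a short script. This is only an illustrative sketch: it uses the two increments stated in the text (222 Da per linker, 286 Da per TCEP), and the function name and the cap of two TCEP adducts are assumptions made for the example, not part of the reported analysis.

```python
from itertools import product

LINKER_DA = 222.0  # mass added per APN-maleimide linker, as stated in the text
TCEP_DA = 286.0    # mass added per TCEP adduct, as stated in the text


def expected_shifts(n_sites, max_tcep=2):
    """Map (linkers, TCEP adducts) to the added mass in Da.

    n_sites is the number of conjugatable cysteines; max_tcep is an
    assumed upper bound on TCEP adducts (the text mentions one or two).
    """
    return {
        (n_link, n_tcep): n_link * LINKER_DA + n_tcep * TCEP_DA
        for n_link, n_tcep in product(range(n_sites + 1), range(max_tcep + 1))
    }


shifts = expected_shifts(n_sites=3)  # three engineered cysteines
for (n_link, n_tcep), delta in sorted(shifts.items()):
    print(f"{n_link} linker(s) + {n_tcep} TCEP: +{delta:.0f} Da")
```

For the control molecule with a single C-terminal Cys, the same function would be called with `n_sites=1`.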
Example 6. Non-target specificity profiling of carriers using a Membrane Proteome Array (MPA)
As discussed above, the protein-based building blocks comprised in the molecule of the present technology do not bind any protein or other biological compound (biomolecule), in particular they do not bind any human protein. With binding-FACS experiments, used to study cell binding and internalization of loaded ISVD-Carrier constructs, it was demonstrated that the protein-based building blocks according to the present technology do not specifically bind to any of the cell lines used in the examples: K-562, HeLa, SK-OV3, NCI-H226 and BxPC-3. Table 14 shows the molecules comprising protein-based carrier building blocks of the present technology which have been conjugated with Alanine and tested for cell binding (the tested cells are also disclosed in Table 14).
Table 14.
In addition, 3 different protein-based building blocks were produced, their engineered cysteines (Cys present at solvent-accessible positions in the building blocks) were blocked using maleimide-Alanine and their non-target binding was tested via the Membrane Proteome Array™ (https://www.integralmolecular.com/membrane-proteome-array/). This cell-based array contains one of the largest sets of human membrane proteins (including heterocomplexes) assembled to determine specificity and preclinical safety of proteins such as antibodies, CAR-T cell therapies, and other biotherapeutics, see, e.g., Tucker DF., et al., "Isolation of state-dependent monoclonal antibodies against the 12-transmembrane domain glucose transporter 4 using virus-like particles", 2018, 115 (22) E4990-E4999.
The 3 selected carrier molecules were T028100069 (SEQ ID NO.: 107), ALB-3C_hCKS1_c3 (SEQ ID NO.: 125) and ALB-3C_K27m_wl (SEQ ID NO.: 173), comprising an ISVD-based building block (SEQ ID NO.: 80), a human protein-based building block (SEQ ID NO.: 101) and a DARPin-based building block (SEQ ID NO.: 97), respectively.
The assay consisted of screening 6,000 human membrane proteins (>5,300 unique) which are natively expressed in unfixed human cells in a 384-well plate format. Assay read out was done via sensitive flow cytometry detection and an anti-VHH secondary detection reagent from Jackson ImmunoResearch (Cat# 128-605-232) was used. As the above molecules all comprise a serum albumin binding VHH, excess human serum albumin was present to exclude any albumin interactions. The screening of the three selected carrier molecules did not generate positive signals and only background signals were observed.
Example 7. Solubility of molecule T028100070 (SEQ ID NO.: 108)
Before Size Exclusion Chromatography as formulation step, 1.4 gram of purified T028100070 molecule (purified via Protein A Amsphere A3 (JSR) and Capto Q ImpRes (Cytiva)) was concentrated using a 10 kDa MWCO Vivaflow cassette, a disposable and ready-to-use crossflow device (Sartorius), to a final concentration of 35.9 mg/mL in PBS + 0.1 mM TCEP, at RT. The molecule was soluble at this concentration.
Example 8. Conjugation of different cargos on to a single protein-based building block
To confirm the feasibility of different cargo conjugation on to a single protein-based building block, the following construct was generated: EGFR7D12-3C_hCKS1_c3-cMyc NLS (SEQ ID NO.: 215, also referred to as "7D12 EGFR ISVD"):
DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSSGGGGSGGGGSG GGGSHKQIYYSDKCDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSCGWVHYMIHEPEPHIL LFRRPLPKKPKCGGGPAAKRVKLD
This construct comprises an epidermal growth factor receptor (EGFR)-binding ISVD (VHH) (SEQ ID NO.: 216):
DVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF
TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSS
It also comprises a linker (GGGGSGGGGSGGGG, SEQ ID NO.: 298) and a nuclear localization sequence (cMyc NLS, SEQ ID NO.: 221, see also Day AH. et al., "Targeted cell imaging properties of a deep red luminescent iridium(iii) complex conjugated with a c-Myc signal peptide", Chem Sci., 2020, 11(6):1599-1606) preceded by 3 Gly residues (GGGPAAKRVKLD, SEQ ID NO.: 217). The protein-based building block is a CKS-based building block (SEQ ID NO.: 101).
EGFR7D12-3C_hCKS1_c3-cMyc NLS was produced via Pichia in shake flasks. Cysteine capping was carried out during fermentation via cysteamine addition. Spent medium was harvested via centrifugation and DTT and PMSF were added at 10 mM and 1 mM, respectively. Buffer exchange was carried out via TFF to 20 mM Acetate with 10 mM DTT, pH 5.0; this buffer was used as loading buffer for cation exchange chromatography on Capto S (Cytiva). The eluted material was further purified via size exclusion chromatography (SEC) on Superdex 75, equilibrated in D-PBS+0.1 mM TCEP. QC via Mass Spec confirmed intact product.
Site-specific conjugation of maleimide-CMA-1 on the -SH group present on the side chain of solvent-accessible cysteines and stochastic conjugation of pHAb and Alexa 647 on the primary amine present in the side chain of lysines was performed.
CMA-1 is a cationic cell penetrating peptide (CPP) which adopts its active conformation in acidic conditions (see, e.g., Yang Y. et al., "Application of peptides in construction of nonviral vectors for gene delivery", Nanomaterials (Basel), 2022, 12(22):4076). This peptide has the following sequence: GGGIGAVLEVLTTGLPALISWIEEEEQQ (SEQ ID NO.: 218). The peptide was custom synthesized via solid phase synthesis with a maleimide group on the amino terminus for conjugation purpose and an amidated C-terminus for higher potency: Maleimide-GGGIGAVLEVLTTGLPALISWIEEEEQQ-NH2 (SEQ ID NO.: 220).
A limited CPP conjugation was carried out using a 1.2 molar ratio/free thiol of maleimide-CMA-1, resulting in a mixed population of CMA-1 on the 3Cysteine CKS carrier (SEQ ID NO.: 101). A wide range of DOL was obtained, ranging from 0 to 3. The remaining free thiols were blocked with an excess of maleimide-Alanine. The loaded carrier construct was analyzed via SDS-PAGE; the result is shown in Figure 6.
Next, the CMA-1-loaded carrier was labeled with Alexa 647 and pHAb for tracking the molecule in cell-based assays. A 6 times molar excess of both NHS-dyes was added to the carrier in D-PBS + 0.1 mM TCEP and incubated for 1 hr at room temperature. After 1 hr of incubation, another 6 times molar excess of both NHS-dyes was added, followed by an additional hour of incubation at room temperature. The excess dye was removed via SEC and the material was collected in D-PBS buffer.
Protein concentration and DOL of the fluorophores were determined via Nanodrop Spectrophotometer (Thermo Scientific), using the Proteins & Labels application module. This module displays the UV spectrum, measures the protein's absorbance at 280 nm (A280) (protein absorbance at 280 nm minus absorbance at 340 nm), measures the fluorochrome's absorbance at λmax and calculates the concentration of the labeled antibody or protein (mg/mL) and of the fluorochrome (µM). The extinction coefficient of the protein was added and the fluorochromes were selected in the Dye 1 and Dye 2 box; the λmax was set automatically for the two fluorochromes. See Table 15.
The spectrum automatically gave the maximum absorbance values in the scanned range (200 nm to 750 nm), the normalized absorbance at 280 nm and the calculated protein concentration (mg/mL). The DOL was calculated as [ODmax × Mw protein (Da)] / [conc. labeled protein (mg/mL) × molar extinction coefficient of label (M⁻¹cm⁻¹)].
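The DOL calculation stated above can be written out as a small function. This is a minimal sketch of that formula only; the function name and the input values in the usage lines are hypothetical and are not taken from Table 15 or Table 16. A 1 cm path length is assumed, so that ODmax divided by the molar extinction coefficient gives the molar dye concentration.

```python
def degree_of_labeling(od_max, mw_protein_da, conc_mg_per_ml, epsilon_label):
    """DOL = (ODmax * Mw protein) / (protein conc. in mg/mL * epsilon of label).

    od_max          absorbance of the label at its lambda-max (1 cm path assumed)
    mw_protein_da   molecular weight of the protein in Da (g/mol)
    conc_mg_per_ml  concentration of the labeled protein in mg/mL (equals g/L)
    epsilon_label   molar extinction coefficient of the label in M^-1 cm^-1
    """
    if conc_mg_per_ml <= 0 or epsilon_label <= 0:
        raise ValueError("concentration and extinction coefficient must be positive")
    return (od_max * mw_protein_da) / (conc_mg_per_ml * epsilon_label)


# Hypothetical inputs chosen only to illustrate the arithmetic
example_dol = degree_of_labeling(
    od_max=0.8, mw_protein_da=30000.0, conc_mg_per_ml=0.05, epsilon_label=240000.0
)
print(f"DOL = {example_dol:.2f}")
```

The ratio is dimensionless: the numerator carries g/mol and the denominator g/L times L/(mol·cm), with the 1 cm path length absorbed into the absorbance reading.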
The result is shown in Table 16. For both pHAb and Alexa647, a DOL of 2 and a DOL of 3 were obtained on the CMA-1-loaded carrier and the control construct, respectively.
Table 16.
The conjugations carried out on both engineered cysteines (site-specific conjugation) and surface-exposed lysines (stochastic conjugation) show the feasibility of conjugating different cargos onto the protein-based building blocks. Here we demonstrate the conjugation of five different cargos onto one single building block. In particular, it has been demonstrated that a single protein-based carrier building block can comprise different types of attachment points or conjugation sites (e.g., in this case, the primary amine of the N-terminus, the C-terminal carboxylic acid, primary amines comprised in the side chain of Lys comprised in the protein-based building block and -SH groups comprised in the side chain of Cys comprised in the protein-based building block). Four different cargos (an EGFR-binding ISVD (SEQ ID NO.: 216), two different dyes (pHAb and Alexa647) and the CMA-1 peptide (SEQ ID NO.: 218)) were attached to the protein-based building block. Each cargo has been specifically conjugated to one type of attachment point or conjugation site. The EGFR-binding ISVD is conjugated to the N-terminal primary amine of the carrier building block. The CMA-1 peptide is conjugated to the -SH group comprised in the side chain of Cys comprised in the protein-based building block. The two different dyes, pHAb and Alexa647, are conjugated to the primary amine comprised in the side chain of Lys comprised in the protein-based building block and targeting ISVD. Finally, an NLS (cMyc NLS preceded by 3 Gly residues (GGGPAAKRVKLD, SEQ ID NO.: 217)) is also a cargo, conjugated to the C-terminal carboxylic acid of the protein-based building block.
Example 9. Generation of ISVD-based building block comprising 9 Cys at solvent accessible positions
The ISVD precursor was RSV001A04, SEQ ID NO.: 179:
EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI
SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS
Starting from this precursor, the following ISVD-derived building blocks are generated, as described in detail in Example 1:
19001 (SEQ ID NO.: 222)
EVQLVESGGGLVQAGGSLSICCAASGGSLSCYVLGWFRQAPGCEREFVAAINWRGDITIGPPNVEGRFCI
SRCNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPCAYIYCWSYDYWGCGTLVTVSS
19002 (SEQ ID NO.: 223)
EVQLVECGGGLVCAGGSLCISCCASGGSLSNYVLGWFRQAPGKCREFVAAINWRGCITIGPPCVEGRFTIC
RDNCKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVSS
19003 (SEQ ID NO.: 224)
EVQLVECGGGLVQAGGCLSISCAASGGSLSCYVLGWFRQAPGKCREFVAAINWRGCITIGPPCVEGRFCI
SRDNACNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS
These building blocks comprise 9 Cys located at solvent-accessible positions, whose -SH groups are attachment points or conjugation sites for site-directed conjugation of cargos.
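The substitutions that distinguish a variant from the precursor can be listed by a position-wise comparison. The sketch below does this for building block 19001 against RSV001A04, using the sequences as given in this example. Positions are reported in plain sequential numbering (1-based), which is an assumption made for simplicity and does not correspond to the Kabat numbering used elsewhere in the text; the function name is illustrative.

```python
# Sequences as listed in this example (SEQ ID NO.: 179 and SEQ ID NO.: 222)
PRECURSOR = (
    "EVQLVESGGGLVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI"
    "SRDNAKNTGYLQMNSLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTQVTVSS"
)
VARIANT_19001 = (
    "EVQLVESGGGLVQAGGSLSICCAASGGSLSCYVLGWFRQAPGCEREFVAAINWRGDITIGPPNVEGRFCI"
    "SRCNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPCAYIYCWSYDYWGCGTLVTVSS"
)


def substitutions(parent, variant):
    """Return (parent residue, sequential 1-based position, variant residue) per mismatch."""
    if len(parent) != len(variant):
        raise ValueError("sequences must have equal length for a position-wise comparison")
    return [
        (p, i, v)
        for i, (p, v) in enumerate(zip(parent, variant), start=1)
        if p != v
    ]


subs = substitutions(PRECURSOR, VARIANT_19001)
to_cys = [s for s in subs if s[2] == "C"]      # engineered cysteines
other = [s for s in subs if s[2] != "C"]       # any remaining changes
print(len(to_cys), "substitutions to Cys:", to_cys)
print("other substitutions:", other)
```

The same function can be applied to 19002 and 19003 by swapping in their sequences.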
Example 10. NLS-conjugated carrier
Preparation of the NLS-conjugated carrier
A molecule comprising the 3Cysteine CKS1 carrier (3C_hCKS1, SEQ ID NO.: 101) fused with a GGG spacer on its C-terminal carboxylic group and the monopartite NLS of cMyc (PAAKRVKLD, SEQ ID NO.: 221) was fused at its N-terminus to the monovalent 7D12 EGFR ISVD (SEQ ID NO.: 216). The molecule EGFR7D12-3C_hCKS1_c3-cMycNLS (SEQ ID NO.: 215) was produced via Pichia in shake flasks and purified as described in Example 8. Mass spectrometry demonstrated that the NLS comprised in the molecule was intact despite Pichia being prone to degrade unprotected peptides. An experimental mass was detected which confirmed the theoretical one, see Figure 7.
Site-specific conjugation of maleimide-CMA-1 (SEQ ID NO.: 220, see below) on the -SH group present on the side chain of solvent-accessible cysteines and stochastic conjugation of pHAb and Alexa 647 on the primary amine present in the side chain of lysines was performed, as described in Example 8. The CMA-1 peptide was custom synthesized via solid phase synthesis with a maleimide group on the amino terminus for conjugation purpose and an amidated C-terminus for higher potency: Maleimide-GGGIGAVLEVLTTGLPALISWIEEEEQQ-NH2 (SEQ ID NO.: 220). For negative control, Mal-Ala was used. A limited CPP (CMA-1) conjugation was carried out using a 1.2 molar ratio/free thiol of maleimide-CMA-1, resulting in a mixed population of CMA-1 on the 3Cysteine CKS carrier (SEQ ID NO.: 101). A wide range of DOL was obtained, ranging from 0 to 3 (50%, DOL0; 15%, DOL1; 15%, DOL2 and 15%, DOL3, approximately). The remaining free thiols were blocked with an excess of maleimide-Alanine. The loaded carrier construct was analyzed via SDS-PAGE; the result is shown in Figure 6.
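As a rough illustration of what the reported DOL distribution implies, the population-average DOL can be estimated from the approximate fractions given above. This is a sketch only: the stated percentages sum to about 95% rather than 100%, so the example normalizes by their total, which is an assumption of this illustration rather than part of the reported analysis.

```python
# Approximate fractions reported in the text, keyed by DOL (sum is ~95%, not 100%)
dol_percent = {0: 50.0, 1: 15.0, 2: 15.0, 3: 15.0}


def mean_dol(distribution):
    """Weighted mean DOL, normalized by the total of the supplied weights."""
    total = sum(distribution.values())
    if total <= 0:
        raise ValueError("distribution weights must sum to a positive value")
    return sum(dol * weight for dol, weight in distribution.items()) / total


average = mean_dol(dol_percent)
print(f"Mean DOL over the whole population: {average:.2f}")
```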
Internalization assay
The molecule (T023800001: EGFR007D12(Q1E,K3Q)-20GS-ALB11-GGC; SEQ ID NO.: 299) was functionally characterized in an internalization assay. To follow the subcellular localization, the carrier constructs were labelled with amine reactive dyes Alexa647 and pHAb NHS-dye on Lysines (stochastic conjugation with an average of 1-2 each per molecule), as described in Example 8.
Internalization was demonstrated for T023800001, which contains the anti-EGFR ISVD 7D12 as follows: EGFR007D12(Q1E,K3Q)-20GS-ALB11-GGC (SEQ ID NO.: 299, see Figure 13):
EVQLEESGGGSVQTGGSLRLTCAASGRTSRSYGMGWFRQAPGKEREFVSGISWRGDSTGYADSVKGRF TISRDNAKNTVDLQMNSLKPEDTAIYYCAAAAGSAWYGTLYEYDYWGQGTQVTVSSGGGGSGGGGSG GGGSGGGGSEVQLVESGGGLVQPGNSLRLSCAASGFTFSSFGMSWVRQAPGKGLEWVSSISGSGSDTL YADSVKGRFTISRDNAKTTLYLQMNSLRPEDTAVYYCTIGGSLSRSSQGTLVTVSSGGC
Generic conditions were applied to block the free thiol via maleimide-Alanine (Mal-Ala), generating T023800001-Mal-Ala (EGFR007D12(Q1E,K3Q)-20GS-ALB11-Mal-Ala). An internalization assay via HSA-pHAb binding to Alb11 ISVD (SEQ ID NO.: 54) using BxPC-3 and NCI-H226 cell lines (ATCC) was used.
To each well of a 96-well F-bottom plate (Corning, cat# 3596), 100 µL of NCI-H226 or BxPC-3 cells (5 x 10^3 cells/well) were added in their culture medium: RPMI 1640 (Gibco-Technologies, cat# 72400), 10% Heat-Inactivated Fetal Bovine Serum (HI FBS, Sigma-Aldrich, cat# F7524), 1% sodium pyruvate (Gibco-Technologies, cat# 11360) and 1% Penicillin-Streptomycin (P/S, Gibco-Technologies, cat# 15140). Outer wells were not used and were filled with 200 µL D-PBS (Gibco-Technologies, cat# 14190).
The 96-well F-bottom plate was placed in the incubator at 37°C, 5% CO2 for ~24 hours. Thereafter, serially diluted ISVD compounds (4X concentration) in cell culture medium were mixed with HSA-pHAb (in house, cat# A8763_PRT1822, 4X concentration, final concentration is 1 µM) in a 1:1 ratio. After 20 minutes at room temperature in the dark, 100 µL of the serially diluted ISVD constructs with HSA-pHAb mix was added to the cells. The 96-well F-plate was placed in the incubator at 37°C, 5% CO2 for ~42 hours.
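The 4X working concentration follows from two successive two-fold dilutions: the 1:1 mix of ISVD dilution and HSA-pHAb, and the addition of 100 µL of that mix to the well. The sketch below checks this arithmetic for HSA-pHAb. It assumes that the 100 µL of cell suspension plated the day before is still in the well when the mix is added, which the text implies but does not state explicitly; the helper name is illustrative.

```python
def dilute(concentration, volume_added, total_volume):
    """Concentration after bringing volume_added of a solution to total_volume."""
    if total_volume <= 0 or volume_added < 0 or volume_added > total_volume:
        raise ValueError("volumes must satisfy 0 <= volume_added <= total_volume, total > 0")
    return concentration * volume_added / total_volume


HSA_PHAB_FINAL_UM = 1.0                      # final concentration stated in the text
hsa_stock_um = 4 * HSA_PHAB_FINAL_UM         # 4X working stock

# Step 1: 1:1 mix with the 4X ISVD dilution (one part in two parts total)
after_mix_um = dilute(hsa_stock_um, 1, 2)    # each component is now at 2X

# Step 2: 100 µL of the mix added to the assumed 100 µL already in the well
in_well_um = dilute(after_mix_um, 100, 200)  # each component is now at 1X

print(f"after 1:1 mix: {after_mix_um} µM; in well: {in_well_um} µM")
```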
The NCI-H226 or BxPC-3 cells were harvested after ~42 hours and transferred to a 96-well V-bottom plate (Thermo Scientific, cat# 249570). The cells were washed twice with 50 µL/well D-PBS. Between each washing step, the cells were centrifuged for 2 minutes at 300 g at 4°C. Thereafter, the cells were resuspended in 50 µL LIVE/DEAD™ Fixable Near-IR Dead Cell Stain Kit (diluted in D-PBS) (Molecular Probes, cat# L10119, final dilution of 1/1000) for 15 minutes at room temperature. After incubation the cells were centrifuged twice for 2 minutes at 300 g at 4°C and washed with 100 µL/well cold FACS buffer, consisting of D-PBS + 2% HI FBS + 0.05% Sodium Azide (Interchim, cat# NJK63A). The cells were diluted in 120 µL FACS buffer and readout was performed on the Attune NxT (Thermo Fisher Scientific). Data were analyzed with FlowLogic™ software.
Internalization of T023800001 (SEQ ID NO.: 299) in the BxPC-3 and NCI-H226 cell lines, as well as EGFR expression on both cell lines, was confirmed in house.
In first instance, T023800001-Mal-Ala was used to determine the internalization in a flow cytometry-based read-out. As illustrated in Figure 8A and 8B, a dose-dependent internalization of T023800001-Mal-Ala was obtained on NCI-H226 (8A) and BxPC-3 (8B) cells. The negative control (CNB00010) did not internalize and was comparable to the baseline set with 1 µM HSA-pHAb.
CNB00010: RSV001A04(E1D,L11V,V89L,Q108L)-9GS-ALB23002-A (SEQ ID NO.: 294):
DVQLVESGGGVVQAGGSLSISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVEGRFTI SRDNAKNTGYLQMNSLAPDDTALYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVSSGGGGSGGGSEV QLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRFTIS RDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSA
Possible toxic effects of CMA-1
Possible cytotoxic effects of CMA-1 were determined using EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala (negative control), as described in Example 8, on BxPC-3 and on NCI-H226 cells in IncuCyte®. Staurosporine, a non-selective inhibitor known to induce cell death (Karaman, M. et al., "A quantitative analysis of kinase inhibitor selectivity", Nat Biotechnol, 2008, 26:127-132), was included as positive control.
Green fluorescence-based cytotoxicity assay
To each well of a 96-well F-bottom plate (Corning, cat# 3596), 100 µL NCI-H226 or BxPC-3 cells (6 x 10^3 cells/well) were added in their culture medium: RPMI 1640 (Gibco-Technologies, cat# 72400), 10% HI FBS (Sigma-Aldrich, cat# F7524) and 1% sodium pyruvate (Gibco-Technologies, cat# 11360). Outer wells were not used and were filled with 200 µL D-PBS (Gibco-Technologies, cat# 14190).
The 96-well F-bottom plate was placed in the incubator at 37°C, 5% CO2 for ~24 hours. Thereafter, 50 µL of IncuCyte™ Cytotox Green Reagent (Essen Bioscience, cat# 4633, final concentration = 250 nM) and 50 µL serially diluted ISVD compounds or staurosporine (Sigma-Aldrich, cat# S6942, 4X concentration) in the cell culture medium were added to the cells. The 96-well F-plate was placed in the IncuCyte® S3b (Sartorius) in the incubator at 37°C, 5% CO2 while confluency (% phase) and cell death (% green) were measured every 4 hours for 2 days. Images were analyzed and data were generated using the software from Incucyte®.
Results
Possible cytotoxic effects of CMA-1 were evaluated on BxPC-3 and NCI-H226 cells using IncuCyte™ Cytotox Green Reagent: cell death will result in increased % green confluency. This was observed using staurosporine which resulted in increased green, fluorescent signal, hence cell death of the cells (Figure 9 A and B).
As illustrated in Figure 9A and 9B, the % green confluency of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala is comparable to the baseline set with Cytotox Green only. These results illustrate that no cytotoxic effects of CMA-1 were observed.
Internalization and endosomal escape of EGFR7D12-3C_hCKS1_c3-cMyc NLS
EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 (Figure 10) and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala (data not shown) were additionally labeled with two amine reactive dyes, Alexa Fluor™ 647 (AF647) Carboxylic Acid, Succinimidyl Ester (Molecular Probes, cat# A37573) and pHAb Amine Reactive Dye (Promega, cat# G9841), via NHS to evaluate internalization and endosomal escape in High-Content Imaging (HCI) using the ImageXpress® (Molecular Devices).
High-Content Imaging analysis
To each well of a 96-well F-bottom glass bottom SensoPlate™ (Greiner, cat# 655892), 100 µL NCI-H226 or BxPC-3 cells (1.2 x 10^4 cells/well) were added in their culture medium: RPMI 1640 (Gibco-Technologies, cat# 72400), 10% HI FBS (Sigma-Aldrich, cat# F7524) and 1% sodium pyruvate (Gibco-Technologies, cat# 11360). Outer wells were not used and were filled with 200 µL D-PBS (Gibco-Technologies, cat# 14190).
The 96-well F-bottom glass bottom SensoPlate™ plate was placed in the incubator at 37°C, 5% CO2 for ~24 hours. Thereafter, 50 µL of the cell culture medium was removed and replaced with 50 µL of ISVD compounds (2X concentration, final concentration = 2 µM). The plate was placed in the incubator at 37°C, 5% CO2 for 4 or 24 hours. After the incubation with ISVD compounds, the cells were gradually fixed with 4% pre-warmed paraformaldehyde (PFA, Thermo Scientific, cat# J61899) at room temperature. After fixation, the cells were washed 3 times with 100 µL D-PBS (5 minutes at room temperature). Thereafter, cells were permeabilized with 50 µL permeabilization buffer, consisting of D-PBS supplemented with 0.1% Tween-20 (in house), for 10 minutes at room temperature. Cells were washed 3 times with 100 µL D-PBS and 150 µL blocking buffer, consisting of D-PBS supplemented with 2% BSA (Sigma-Aldrich, cat# A3059), was added for 45 minutes at room temperature. Thereafter, Anti-EEA1 polyclonal antibody (Early Endosome Marker, Abcam, cat# ab2900, final concentration = 5 µg/mL) was added in 1:1 permeabilization/blocking buffer and incubated for 2 hours at room temperature. After washing, the cells were stained with Goat anti-Rabbit IgG (H+L) Cross-Adsorbed Secondary Antibody, Alexa Fluor™ 488 (Life Technologies-Invitrogen, cat# A-11008) for 45 minutes at room temperature, protected from light. Thereafter, cells were washed and NucBlue™ Fixed Cell ReadyProbes™ Reagent (DAPI) (Molecular Probes, cat# R37606, according to manufacturer's guideline) was added for 10 minutes at room temperature. Finally, cells were washed 3 times and 150 µL D-PBS/well was added to perform image acquisition using the ImageXpress® device (60x magnification, confocal 50 µm slit). Z-stacks were also acquired spanning the nucleus from top to bottom using 20 planes, 0.5 µm step size and 9.5 µm range. All images were analyzed and data were generated using the software from ImageXpress®.
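The Z-stack parameters above are mutually consistent: 20 planes separated by 0.5 µm steps span 19 intervals, i.e. 9.5 µm. The short sketch below makes that relation explicit. The function name is illustrative, and the positions are offsets relative to the first plane, which is an assumption of this example rather than a setting reported for the instrument.

```python
def z_plane_offsets(n_planes, step_um):
    """Offsets (in µm) of each plane relative to the first plane of the stack."""
    if n_planes < 1 or step_um <= 0:
        raise ValueError("need at least one plane and a positive step size")
    return [i * step_um for i in range(n_planes)]


offsets = z_plane_offsets(n_planes=20, step_um=0.5)
z_range_um = offsets[-1] - offsets[0]  # (n_planes - 1) * step_um
print(f"{len(offsets)} planes, range = {z_range_um} µm")
```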
Visualization / tracking of EGFR7D12-3C_hCKS1_c3-cMyc NLS ISVD was done using the AF647 signal in the Cy5 channel and not via the pHAb signal in the YFP channel, as a bright signal originating from the endosomal staining (AF488 in FITC channel) was observed in the YFP channel.
Results
Internalization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 is shown in Figure 10. EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala were additionally labeled with AF647 to visualize internalization of the compounds. Analysis was done with the ImageXpress® in the Cy5 channel. Results illustrated intracellular uptake, hence internalization, of both compounds on BxPC-3 and NCI-H226 cells. As expected, a more intense signal, thus more internalization, was obtained after 24 hours.
Co-localization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala (data not shown) (AF647 in Cy5 channel) with endosomes, stained using anti-EEA1 polyclonal antibody (AF488 in FITC channel), is illustrated in Figure 11 (for EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1; similar results were achieved for EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala). Limited co-localization of the compounds with endosomes was noticed, most likely because an early endosomal marker (EEA1) was used. In next experiments, late endosomal (e.g., Rab7) and lysosomal (e.g., LAMP-1) markers will be included in the analysis. See Figure 11.
Co-localization of EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1 and EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala (data not shown) (AF647 in Cy5 channel), anti-EEA1 polyclonal antibody staining (endosomal marker AF488 in FITC channel) and nuclei (DAPI channel) was demonstrated using Z-stacks. See Figure 12 (for EGFR7D12-3C_hCKS1_c3-cMyc NLS+CMA-1; similar results were achieved for EGFR7D12-3C_hCKS1_c3-cMyc NLS+Ala).
Example 11: NLS-conjugated carriers
NLS is conjugated to RSV001A04(S19C,G65C,S82bC,Q108L,S112C), SEQ ID NO.: 225, comprised in SEQ ID NO.: 226. Different NLS sequences are tested: cMyc NLS (SEQ ID NO.: 221, PAAKRVKLD, see also Day AH. et al., "Targeted cell imaging properties of a deep red luminescent iridium(iii) complex conjugated with a c-Myc signal peptide", Chem Sci., 2020, 11(6):1599-1606), SV40mono NLS (SEQ ID NO.: 256, PKKKRKV), SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV), NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
The NLS peptide is conjugated by means of a peptide linker, GGG, to the C-terminal carboxylic acid of the protein-based building block RSV001A04(S19C,G65C,S82bC,Q108L,S112C), SEQ ID NO.: 225, to generate the following NLS-building blocks:
SEQ ID NO.: 295:
EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPAAKRVKL D
SEQ ID NO.: 307:
EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKKKRKV
SEQ ID NO.: 308:
EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKKKRKVP KKKRKVPKKKRKV
SEQ ID NO.: 309:
EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGAVKRPAAT KKAGQAKKKKLD
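The four NLS-building blocks listed above share one layout: building block, GGG spacer, NLS. The sketch below assembles them from those parts. The building block sequence is taken here as the shared prefix of the four listed sequences and is assumed to correspond to SEQ ID NO.: 225, which is not written out separately in this example; the dictionary keys and the function name are illustrative only.

```python
# Shared prefix of SEQ ID NO.: 295, 307, 308 and 309 (assumed to be SEQ ID NO.: 225)
BUILDING_BLOCK = (
    "EVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTI"
    "SRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCS"
)
SPACER = "GGG"

# NLS sequences as listed in this example
NLS_SEQUENCES = {
    "cMyc": "PAAKRVKLD",                  # SEQ ID NO.: 221
    "SV40mono": "PKKKRKV",                # SEQ ID NO.: 256
    "SV40tri": "PKKKRKVPKKKRKVPKKKRKV",   # SEQ ID NO.: 304
    "NLP": "AVKRPAATKKAGQAKKKKLD",        # SEQ ID NO.: 305
}


def fuse_c_terminal(block, nls, spacer=SPACER):
    """Append spacer and NLS to the C-terminal end of the building block."""
    return block + spacer + nls


constructs = {
    name: fuse_c_terminal(BUILDING_BLOCK, nls) for name, nls in NLS_SEQUENCES.items()
}
for name, seq in constructs.items():
    print(f"{name}: {len(seq)} residues, C-terminal end ...{seq[-15:]}")
```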
The molecule further comprises two different tumor-targeting moieties which specifically target CEACAM5 molecules in the tumoral cells. These two tumor-targeting moieties are covalently attached to the NLS-building block carrier via an attachment point which is the N-terminal primary amine of the ISVD-based building block carrier. The sequences of the two tumor-targeting moieties, which are two ISVDs, are as follows:
SEQ ID NO.: 227:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT
ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSS
SEQ ID NO.: 228:
EVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVKGRF
TISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSS
Both tumor-targeting moieties are covalently linked to each other by means of a linker (GGGGSGGGGSGGGGS, SEQ ID NO.: 163). They are both covalently linked to the N-terminal primary amine of the ISVD-based building block carrier by means of a linker (GGGGSGGGGSGGGGS, SEQ ID NO.: 163).
The resulting molecule is defined by SEQ ID NO.: 226:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPAA KRVKLD
The resulting molecule with SV40mono NLS is defined by SEQ ID NO. 310:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKK KRKV
The resulting molecule with SV40tri NLS is defined by SEQ ID NO. 311:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG
GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKK KRKVPKKKRKVPKKKRKV
The resulting molecule with NLP NLS is defined by SEQ ID NO. 312:
DVQLVESGGGVVQPGGSLRLSCAASGLTFSTYTMGWFRQAPGKEREFVAAIIWSGSNTYYADSVKGRFT ISRDNAKNTVYLQMNSLRPEDTALYYCAAQHFGPIGLTTRGYHYWGQGTLVTVSSGGGGSGGGGSGGG GSEVQLVESGGGVVQPGGSLRLSCAASGHTFSEYALGWFRQAPGKEREFVAAINWGGGWTYYADSVK GRFTISRDNAKNTLYLQMNSLRPEDTALYYCAASSDYAGGNPTGYPYWGQGTLVTVSSGGGGSGGGGS GGGGSEVQLVESGGGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNV ECRFTISRDNAKNTGYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGAVK RPAATKKAGQAKKKKLD
Alternatively or in addition, the molecule further comprises an HLE moiety linked to the protein-based building block, e.g., through a 15GS linker (as defined in SEQ ID NO.: 163). The HLE moiety can be chosen from Alb23002 (SEQ ID NO.: 63) or its E1D variant (Alb23002(E1D), SEQ ID NO.: 106). Both Alb23002 and Alb23002(E1D) are ISVDs which bind to human serum albumin, as explained above in this description. This albumin binding ISVD is covalently attached to the NLS-building block carrier via an attachment point which is the N-terminal primary amine of the ISVD-based building block carrier.
The resulting molecule is defined by SEQ ID NO.: 313:
DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPAAKRVKLD
The resulting molecule with SV40mono NLS is defined by SEQ ID NO. 314:
DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKKKRKV
The resulting molecule with SV40tri NLS is defined by SEQ ID NO. 315:
DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKKKRKVPKKKRKVPKK KRKV
The resulting molecule with NLP NLS is defined by SEQ ID NO. 316:
DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGAVKRPAATKKAGQAKK KKLD
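The layout of the fusion constructs listed above (albumin-binding ISVD, 15GS linker, carrier building block, GGG spacer, NLS) can be illustrated with a short sketch that splits SEQ ID NO.: 314 into its parts. The sequence is copied from the listing above with line-wrap spaces removed. The function name `split_fusion` and the reading of the 15GS linker as three GGGGS repeats are assumptions made for illustration; the repeat reading agrees with the printed sequence but is not stated in this section.

```python
# SEQ ID NO.: 314 as printed above, with line-wrap spaces removed.
SEQ_ID_314 = (
    "DVQLVESGGGVVQPGGSLRLSCAASGFTFRSFGMSWVRQAPGKGPEWVSSISGSGSDTLYADSVKGRF"
    "TISRDNSKNTLYLQMNSLRPEDTALYYCTIGGSLSRSSQGTLVTVSSGGGGSGGGGSGGGGSEVQLVESG"
    "GGLVQAGGSLCISCAASGGSLSNYVLGWFRQAPGKEREFVAAINWRGDITIGPPNVECRFTISRDNAKNT"
    "GYLQMNCLAPDDTAVYYCGAGTPLNPGAYIYDWSYDYWGRGTLVTVCSGGGPKKKRKV"
)

LINKER_15GS = "GGGGS" * 3  # assumption: 15GS read as (GGGGS)3
SPACER = "GGG"             # glycine spacer preceding the NLS
SV40_MONO_NLS = "PKKKRKV"  # SV40mono NLS as given in the text


def split_fusion(seq, nls, linker=LINKER_15GS, spacer=SPACER):
    """Split an HLE-linker-carrier-spacer-NLS fusion into named parts.

    Hypothetical helper: it only checks the layout described in the text.
    """
    hle, found, rest = seq.partition(linker)  # split at the first linker
    if not found:
        raise ValueError("linker not found in sequence")
    tail = spacer + nls
    if not rest.endswith(tail):
        raise ValueError("sequence does not end with spacer + NLS")
    carrier = rest[: -len(tail)]
    return {"hle": hle, "linker": linker, "carrier": carrier,
            "spacer": spacer, "nls": nls}


parts = split_fusion(SEQ_ID_314, SV40_MONO_NLS)
for name, value in parts.items():
    print(f"{name:8s} {len(value):4d} aa  {value[:12]}...")
```

The same helper can be applied to the other fusion constructs by passing the corresponding NLS string.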
The molecules are expressed via Komagataella phaffii (Pichia pastoris) or via CHO cells (Protein expression in Mammalian cells: Methods and Protocols. Methods in Molecular Biology, vol 801) and purified via a Protein A-based capture step, followed by a reduction of the engineered cysteines with 10 mM DTT (minimum of 1 hr at RT), and then formulated in D-PBS with 0.1 mM TCEP via size exclusion chromatography, as also described, e.g., in Example 4.
Alternatively, the NLS peptides are site-specifically conjugated on the -SH group present on the side chain of solvent-accessible cysteines (as engineered in RSV001A04(S19C,G65C,S82bC,Q108L,S112C), SEQ ID NO.: 225). The NLS peptides are custom synthesized via solid phase synthesis with a maleimide group on the amino terminus for conjugation purposes and an amidated C-terminus for higher potency. In addition, the NLS peptides carry three additional glycine residues between the maleimide group and the NLS.
The resulting molecule with cMyc NLS is defined by SEQ ID NO. 317: Mal-GGGPAAKRVKLD-NH2.
The resulting molecule with SV40mono NLS is defined by SEQ ID NO. 318: Mal-GGGPKKKRKV-NH2.
The resulting molecule with SV40tri NLS is defined by SEQ ID NO. 319: Mal-GGGPKKKRKVPKKKRKVPKKKRKV-NH2.
The resulting molecule with NLP NLS is defined by SEQ ID NO. 320: Mal-GGGAVKRPAATKKAGQAKKKKLD-NH2.
Conjugations are done as described in Example 5. As negative controls, the protein-based carrier is conjugated with Mal-Alanine using the same conjugation protocol.
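The naming pattern of the synthetic NLS peptides (N-terminal maleimide, three glycines, NLS, amidated C-terminus) can be sketched as follows. The NLS sequences are taken from the peptides listed above; the dictionary keys and the function name `maleimide_peptide` are illustrative choices, not identifiers from the present description.

```python
# NLS sequences as they appear in the peptides listed above.
NLS_SEQUENCES = {
    "cMyc": "PAAKRVKLD",
    "SV40mono": "PKKKRKV",
    "SV40tri": "PKKKRKV" * 3,  # three tandem copies of the SV40mono NLS
    "NLP": "AVKRPAATKKAGQAKKKKLD",
}


def maleimide_peptide(nls, spacer="GGG"):
    """Return the shorthand string for a Mal-spacer-NLS-amide peptide."""
    return f"Mal-{spacer}{nls}-NH2"


for name, seq in NLS_SEQUENCES.items():
    print(f"{name:9s} {maleimide_peptide(seq)}")
```

Writing SV40tri as three repeats of SV40mono makes the relationship between the two peptides explicit.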
Example 12: Transfection protocols
Protein delivery of AF647-labelled ISVD compounds conjugated with NLS or Mal-Ala is performed in different cell lines, e.g. BxPC-3 (ATCC), HEK293T and/or NCI-H226. Protein delivery is facilitated using lipid-based delivery (e.g., Bioporter protein delivery reagent (Genlantis), ProteoJuice Protein Transfection Reagent (Sigma), PULSin® Protein Delivery (Sartorius)), photoporation (e.g., LumiPore™ (Trince Bio)) or electroporation (e.g., Neon™ NxT Electroporation). Transfection protocols are executed according to the manufacturer's guidelines, but optimization of delivery conditions might be needed depending on the cell type, e.g., amount of protein to deliver and volume of transfection reagent. Cells are seeded in 96-well F-bottom glass-bottom SensoPlate™ plates (Greiner, cat# 655892) to allow High-Content Imaging analysis. The plates are placed in the incubator at 37°C, 5% CO2 for up to 24 hours (several timepoints will be evaluated). Outer wells are not used and are filled with 200 µL D-PBS (Gibco-Technologies, cat# 14190).
Example 13: Evaluation of ISVD protein delivery and nuclear co-localization
At different timepoints after protein delivery of AF647-labelled ISVD compounds conjugated with NLS or Mal-Ala to the cells (e.g. 4-16 and 24 hours), the cells are gradually fixed with 4% pre-warmed paraformaldehyde (PFA, Thermo Scientific, cat# J61899) at room temperature. After fixation, the cells are washed 3 times with 100 µL D-PBS (5 minutes at room temperature). Thereafter, cells are permeabilized with 50 µL permeabilization buffer, consisting of D-PBS supplemented with 0.1% Tween-20 (in house), for 10 minutes at room temperature. Cells are washed 3 times with 100 µL D-PBS and incubated with 150 µL blocking buffer, consisting of D-PBS supplemented with 2% BSA (Sigma-Aldrich, cat# A3059), for 45 minutes at room temperature. Thereafter, cells are washed and NucBlue™ Fixed Cell ReadyProbes™ Reagent (DAPI) (Molecular Probes, cat# R37606, according to the manufacturer's guideline) is added for 10 minutes at room temperature. Finally, cells are washed 3 times and 150 µL D-PBS / well is added to perform image acquisition using the ImageXpress® device (60x magnification, confocal 50 µm slit). Z-stacks will be acquired spanning the nucleus from top to bottom using 20 planes, 0.5 µm step size and 9.5 µm range (might differ depending on the cell line). All images are analyzed, and data are generated using the software from ImageXpress®. Nuclear co-localization of AF647-labelled ISVD compounds with NLS or Mal-Ala is illustrated using an overlay of the DAPI (blue) and CY5 (red) channels.
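The Z-stack settings quoted above are mutually consistent: 20 planes separated by 0.5 µm span (20 - 1) x 0.5 µm = 9.5 µm. A minimal sketch of that arithmetic is given below; the function names are hypothetical and the code is unrelated to the ImageXpress® software.

```python
def z_stack_offsets(n_planes=20, step_um=0.5):
    """Return plane offsets in micrometres, measured from the first plane."""
    if n_planes < 1 or step_um <= 0:
        raise ValueError("n_planes must be >= 1 and step_um must be > 0")
    return [i * step_um for i in range(n_planes)]


def z_stack_range(n_planes=20, step_um=0.5):
    """Return the distance between the first and the last plane."""
    return (n_planes - 1) * step_um


offsets = z_stack_offsets()
print(len(offsets), "planes, range", z_stack_range(), "um")
```

If a cell line needs a different range, either the number of planes or the step size has to change accordingly.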
Example 14: Internalization and endosomal escape
The constructs prepared in Example 11 are further conjugated with cell penetrating peptides. For this, site-specific conjugation of the CPP maleimide-CMA-1 (SEQ ID NO.: 220) on the -SH group present on the side chain of solvent-accessible cysteines is performed, as described in Example 8. In particular, 5 mg of the above molecule (SEQ ID NO.: 226) are conjugated with a 3x excess of ligand (maleimide-CMA-1) vs. the number of engineered cysteines for a fully loaded carrier (DOL=4). For DOL=1 (in a stochastic way on the carrier with 4 free thiols), a 1.2 molar ratio of ligand vs. ISVD is added, and DOL=0 is made via excess Mal-Ala. All conjugation reactions are carried out for 2 h at RT, and any remaining free thiols are capped with extra Mal-Ala as already described above, e.g., in Example 5.
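The ligand amounts implied by these ratios can be estimated with a short calculation. The sketch below assumes that "3x excess vs. the number of engineered cysteines" means 3 molar equivalents per cysteine (12 equivalents per carrier with 4 cysteines). The molar mass used in the usage lines is a purely illustrative placeholder, since the mass of SEQ ID NO.: 226 is not stated in this section, and all function names are hypothetical.

```python
def carrier_nmol(mass_mg, molar_mass_g_per_mol):
    """Convert a carrier mass in mg to an amount in nmol."""
    return mass_mg * 1e-3 / molar_mass_g_per_mol * 1e9


def ligand_nmol_full_loading(mass_mg, molar_mass_g_per_mol,
                             n_cysteines=4, excess_per_cysteine=3.0):
    """Ligand needed for a fully loaded carrier (DOL=4 in the text).

    Assumption: the excess is counted per engineered cysteine.
    """
    return (carrier_nmol(mass_mg, molar_mass_g_per_mol)
            * n_cysteines * excess_per_cysteine)


def ligand_nmol_stochastic(mass_mg, molar_mass_g_per_mol, molar_ratio=1.2):
    """Ligand needed for stochastic DOL=1 at a 1.2:1 ligand:carrier ratio."""
    return carrier_nmol(mass_mg, molar_mass_g_per_mol) * molar_ratio


# Placeholder molar mass (NOT from the source), chosen only for illustration.
PLACEHOLDER_MW = 12500.0
print(carrier_nmol(5.0, PLACEHOLDER_MW))
print(ligand_nmol_full_loading(5.0, PLACEHOLDER_MW))
print(ligand_nmol_stochastic(5.0, PLACEHOLDER_MW))
```

In practice the carrier's actual molar mass has to be substituted for the placeholder before the numbers are meaningful.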
In addition, stochastic conjugation of HiBiT and other Pro-CPP peptides with a 1:3 ratio is carried out. HiBiT is an 11 amino acid peptide tag (VSGWRLFKKIS, SEQ ID NO.: 296) that can be attached to any protein of interest, such as the protein-based building block comprised in the molecule of the present technology, and detected in a few minutes using simple bioluminescent reagents. The detection reagent contains an inactive luciferase subunit, Large BiT (LgBiT), which rapidly binds to HiBiT to produce a highly active luciferase enzyme. See, e.g., Sasaki M. et al., "Development of a rapid and quantitative method for the analysis of viral entry and release using a NanoLuc luciferase complementation assay", Virus Res. 2018, 243:69-74. For stochastic conjugation of HiBiT, the following construct was used (SEQ ID NO.: 300):
VSGWRLFKKIS-GGGGS-Maleimide
Other selected Pro-CPP peptides are LAH4 (Maleimide-GGGKKALLALALHHLAHLALHLALALKKA-NH2, SEQ ID NO.: 292), see, e.g., Moulay G. et al., "Histidine-rich designer peptides of the LAH4 family promote cell delivery of a multitude of cargo", J Pept Sci. 2017 Apr;23(4):320-328, and TAT peptide (YGRKKRRQRRREV, SEQ ID NO.: 277) with a poly-Glu sequence to neutralize the cationic nature in neutral conditions (Maleimide-GG-YGRKKRRQRRREV-Cit-EEEEEE, SEQ ID NO.: 293), see, e.g., Lo SL. and Wang S., "An endosomolytic Tat peptide produced by incorporation of histidine and cysteine residues as a nonviral vector for DNA transfection", Biomaterials, 2008, 29(15):2408-14, Anami Y. et al. "Glutamic acid-valine-citrulline linkers ensure stability and efficacy of antibody-drug conjugates in mice", Nat Commun. 2018 Jun 28;9(1):2512 and Farkhani SM. et al., "Effect of poly-glutamate on uptake efficiency and cytotoxicity of cell penetrating peptides", IET Nanobiotechnol., 2016, 10(2):87-95.
These constructs are evaluated using the HEK293 LgBiT cells (Promega) which are transiently or stably transfected with an expression plasmid encoding human CEACAM5. The HEK293 LgBiT cells express the LgBiT protein intracellularly; upon endosomal escape of molecules containing Pro-CPPs and HiBiT, the HiBiT tag complements the LgBiT protein with high affinity to produce luminescence in the presence of substrate. Endosomal escape of molecules containing Pro-CPPs and HiBiT is evaluated with the Nano-Glo® Live Cell Assay System (Promega), which contains the cell-permeable substrate furimazine.
Items of the present technology
The present technology provides the following items:
1. A molecule comprising at least one protein-based building block, wherein the at least one protein-based building block:
a) comprises at least one, preferably at least two conjugation sites or attachment points;
b) has a molecular mass of about 2.5 to about 70 kDa, preferably of about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, even more preferably of about 2.5 to about 16 kDa;
c) has a globular three-dimensional (3D) structure;
d) has a solubility of 10 mg/mL or more, preferably of 20 mg/mL, preferably of 50 mg/mL or more, and even more preferably of 100 mg/mL, measured in an aqueous solution at room temperature, preferably wherein the aqueous solution is citrate buffer 5 mM or PBS, at pH 7.0 or 7.4;
e) does not specifically bind to any human protein or binds one or more human proteins with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or by surface plasmon resonance (SPR), for instance as described in Ober et al. 2001, Intern. Immunology 13: 1551-1559, or does not specifically bind to any human cell or binds one or more human cells with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
f) optionally, does not specifically bind to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, or binds to any (non-human) molecule which the protein-based carrier building block precursor specifically binds to, such as protein F of RSV, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay or by SPR;
g) optionally, does not specifically bind to any human cell and/or cell type, or binds to a human cell and/or cell type with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay;
h) optionally, does not specifically bind to any microorganism, such as bacteria, fungi, protists, yeast, and/or to any virus, or binds to a microorganism, such as bacteria, fungi, protists, yeast, and/or to a virus with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
i) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein;
j) optionally, does not specifically bind to any biomolecule, including human biomolecules and non-human biomolecules, such as plant biomolecules, virus biomolecules and/or microorganism biomolecules (such as bacteria, fungi, protists and/or yeast), or binds to biomolecules, including human biomolecules and non-human biomolecules, with a KD (KD value) greater than 5×10⁻⁴ mol/litre, preferably as determined by cell-binding assay and/or SPR, as described herein, when it has at least one cargo attached to it (via the at least one conjugation site or attachment point comprised therein);
k) optionally, does not comprise or consist of an amino acid sequence selected from SEQ ID NO.: 1-34 as depicted on Tables A-1 and A-2 of WO 2016/055656 and/or SEQ ID NO.: 1-12 as depicted on Table A-1 of WO 2010/139808; and
l) optionally, does not comprise or consist of the amino acid sequence as defined in SEQ ID NO.: 214,
wherein the molecule further comprises at least one nuclear localization sequence (NLS), covalently linked, directly or by means of a linker, to at least one conjugation site or attachment point comprised in the protein-based carrier building block, wherein the NLS
preferably comprises or consists of SEQ ID NO.: 221, SV40mono NLS (SEQ ID NO.: 256, PKKKRKV), SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV), or NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
2. The molecule of item 1, wherein the at least one, preferably the at least two conjugation site(s) or attachment point(s) are present at solvent-accessible positions in the protein-based building block.
3. The molecule of any of items 1 or 2, wherein the at least one, preferably the at least two attachment point(s) or conjugation site(s) are reactive groups present in the side chain of any amino acid in the protein-based carrier building block, preferably an amino acid present at a solvent-accessible position in the protein-based carrier building block, more preferably are reactive groups present in the side chain of a cysteine and/or in the side chain of a tyrosine, and/or in the side chain of a lysine, and/or in the side chain of a non-natural amino acid, preferably located at solvent-accessible positions in the protein-based carrier building block, and/or are the N-terminal primary amine and/or the C-terminal carboxylic acid of the protein-based building block.
4. The molecule of any of items 1 to 3, wherein at least one of the attachment point(s) or conjugation site(s), preferably at least two, more preferably all of the attachment points or conjugation sites, is (are) an engineered attachment point or conjugation site.
5. The molecule of any of items 1 to 4, wherein the at least one protein-based building block and/or the molecule do not specifically bind crystallizable fragment (Fc) receptors (FcRs), Fc-binding proteins and/or Fc-sensors (or bind FcRs, Fc-binding proteins and/or Fc-sensors with a KD value greater than 5×10⁻⁴ mol/litre).
6. The molecule of any of items 1 to 5, wherein the at least one protein-based building block and/or molecule does not show effector functions of conventional antibodies mediated by the Fc domain.
7. The molecule of any of items 1 to 6, wherein the at least one protein-based building block does not specifically bind the variable domain of the light chain (VL) and/or the variable domain of the heavy chain (VH) of an antibody, such as the VL and/or the VH of a monoclonal antibody (mAb).
8. The molecule of any of items 1 to 7, wherein the at least one protein-based building block does not specifically bind the first constant domain of the heavy chain (CH1) of an antibody, such as the CH1 of a mAb, and/or does not specifically bind the constant domain of the light chain (CL) of an antibody, such as the CL of a mAb; and/or does not specifically bind the third constant domain of the heavy chain (CH3) of an antibody, such as the CH3 of a mAb, and/or does not specifically bind the second constant domain of the heavy chain (CH2) of an antibody, such as the CH2 of a mAb.
9. The molecule of any of items 1 to 8, wherein the at least one protein-based building block does not specifically bind to any non-protein molecule, such as DNA, RNA, lipids (e.g., phosphatidylserine (PS)) or glycans, or binds one or more non-protein molecules with a KD value greater than 5×10⁻⁴ mol/litre.
10. The molecule of any of items 1 to 9, wherein the at least one protein-based building block does not specifically bind the precursor's target.
11. The molecule of any of items 1 to 10, wherein the at least one protein-based building block comprises at least one further cargo attached to at least one attachment point or conjugation site.
12. The molecule of any of items 1 to 11, wherein the at least one protein-based building block is not derived from the crystallizable fragment of an antibody such as the Fc fragment of a mAb, and/or is not derived from the CH2 and/or the CH3 domains of the Fc fragment, and/or is not derived from the CH1 and/or the CL domains comprised in the antigen-binding fragment (Fab) of an antibody, such as the CH1 and/or the CL domains comprised in the Fab of a mAb.
13. The molecule of any of items 1 to 12, wherein the molecule is not (or is not derived from) a crystallizable fragment (Fc) of an antibody, such as a mAb and/or is not (or is not derived from) the Fab of an antibody, such as a mAb.
14. The molecule of any of items 1 to 13, wherein, if there is more than one attachment point or conjugation site in the at least one protein-based building block, the conjugation sites are spatially distant from each other.
15. The molecule of any of items 1 to 14, wherein the at least one, preferably two, more preferably four and even more preferably five conjugation site(s) is(are) selected from a primary amine, a thiol group, a hydroxyl group, a guanidino group, a carboxyl group and/or a thioether group, preferably from a primary amine and/or a thiol group, more preferably a thiol group.
16. The molecule of any of items 1 to 15, wherein the at least one protein-based carrier building block comprises at least two engineered cysteines, preferably located at solvent accessible positions, such as three engineered cysteines, or four engineered cysteines, or six engineered cysteines, or nine engineered cysteines, preferably located at solvent accessible positions, with free or capped thiol groups at their side chains, that are the at least two, such as three, or four, or six, or nine, conjugation sites or attachment points.
17. The molecule of any of items 1 to 16, wherein the at least one protein-based carrier building block comprises at least three engineered cysteines with free or capped thiol groups at their side chain, and at least three lysines with free or capped amine groups at their side chain, preferably located at solvent accessible positions, or wherein the at least one protein-based carrier building block comprises at least two engineered lysines with free or capped amine groups at their side chain, or wherein the at least one protein-based carrier building block comprises at least four cysteines, preferably located at solvent accessible positions, and optionally or additionally a free N-terminal amine.
18. The molecule of any of items 1 to 17, wherein the at least one protein-based building block comprises an N- and/or a C-terminal Cys and/or an N- and/or a C-terminal Tyr, preceded or followed by a (GG) or (G4S1)1-3GG sequence, such as CGG-, -GGC, YGG-, -GGY, -(G4S1)1-3GGY, Y(G4S1)1-3GG-, YGG(S1G4)1-3-, or YGG(G4S1)1-3-.
19. The molecule of any of items 1 to 18, wherein the protein-based building block is a small globular non-human protein-based building block or a small globular human protein-based building block.
20. The molecule of item 19, wherein the small globular non-human protein-based building block is an immunoglobulin single variable domain (ISVD)-based building block, a DARPin-based building block, an affibody-based building block or an affitin-based building block.
21. The molecule of item 19, wherein the small globular human protein-based building block is derived from cyclin-dependent kinase subunit 1 (CKS1) protein.
22. The molecule of any of items 19 to 20, wherein the ISVD-based building block is derived from a VH, a humanized VH, a human VH, a VHH, a humanized VHH or a camelized VH (derived from a heavy-chain ISVD).
23. The molecule of item 22, wherein the ISVD-based building block is derived from an ISVD belonging to the "VH3 class".
24. The molecule of any of items 19 to 20 or 22 to 23, wherein, in the ISVD-derived protein-based carrier building block, the amino acid at position 11 (according to Kabat) is Val or Leu, preferably Val, and/or the amino acid at position 89 (according to Kabat) is Val, Thr or Leu, preferably Leu; and/or the amino acid at position 108 is a Leu or Gln, preferably Leu; and/or the amino acid at position 110 (according to Kabat) is Thr, Lys or Gln, preferably Thr; and/or the amino acid at position 112 (according to Kabat) is Ser, Lys or Gln, preferably Ser; and/or the ISVD-based building block contains a C-terminal extension of 1-5 amino acids chosen from any naturally occurring amino acid, preferably chosen from Ala, Gly and/or Cys.
25. The molecule of any of items 19 to 20 or 22 to 24, wherein the ISVD-derived building block is derived from RSV001A04 (SEQ ID NO.: 179).
26. The molecule of any of items 19 to 20 or 22 to 25, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 186:
X1VX2LX3EX4X5GX6X7X8X9X10X11GX12X13X14IX15CX16AX17X18X19X20LX21X22X23VLGWFRX24AX25X26X27X28X29X30FVAAINX31X32X33X34X35X36X37X38PX39X40VX41X42X43FX44IX45X46X47X48X49X50X51TGX52LX53MX54X55LX56X57X58DX59AX60YX61CGAGX62PX63X64X65X66AYX67X68X69X70SYX71X72X73GX74X75TX76VX77VX78X79X80X81X82
wherein
X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val or any amino acid with a reactive group in its side chain, such as cysteine;
X8 (position 12 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X20: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X23: (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X24: (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X25: (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X26: (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X27: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X28 : (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29: (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X30: (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31: (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X32: (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X33 : (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X35: (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X36 : (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X37: (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X38 : (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X39 : (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X40: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X41: (position 64 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X42: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43: (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X44: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X45: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X46: (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X47: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X48: (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X49: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X50: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X51: (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52: (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X53: (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X54: (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X55: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X56: (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X57: (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X58: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X59: (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X60: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val or any other amino acid with a reactive group in its side chain, such as cysteine;
X61: (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X62: (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X63: (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X64: (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X65: (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X66: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X67: (position 100d according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X68: (position 100e according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X69: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X70: (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X71: (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X72: (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X73: (position 103 according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X74: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X75: (position 106 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X76: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
X77: (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X78: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X79: (position 113 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X80: is absent or Gly;
X81: is absent or Gly;
X82: is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 186, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 186, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
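The identity thresholds recited above (80%, 85%, 90%, 95%, 97%, 99%) can be illustrated with a minimal sketch. The text here does not specify an alignment method, so the code below assumes two sequences that have already been aligned to equal length, with gaps written as '-', and simply counts matching columns. The function names are hypothetical and the gap handling is one possible convention, not a definition taken from this document.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: a column with a gap in exactly one sequence counts as a
    mismatch; columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if the pair reaches the given identity threshold (in percent)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, a single substitution in a ten-residue stretch leaves nine of ten columns identical, which satisfies the 80% and 90% thresholds but not the 95% threshold.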
27. The molecule of any of items 19 to 20 or 22 to 26, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 206:
X1aVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7X7bGX7cLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPX19bDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
X1a: (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X1: (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z1: (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X2: (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3: (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X4: (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X6: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X7: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X7b: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7c: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X8: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X10: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X13: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X14: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X19b: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
Z2: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X20: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
Z3: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
X23: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X24: is absent or Gly;
X25: is absent or Gly;
X26: is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 206, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 206, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
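The size condition recited above (a molecular mass of about 2.5 to about 70 kDa, with narrower preferred ranges) can be estimated from the amino acid sequence alone. The sketch below sums standard average residue masses and adds one water for the free termini. The function names are hypothetical, treating "about" as hard bounds is an assumption made only for illustration, and conjugated payloads or post-translational modifications are not accounted for.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS_DA = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_DA = 18.01528


def mass_da(sequence: str) -> float:
    """Approximate average mass of an unmodified linear polypeptide, in Da."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(RESIDUE_MASS_DA[res] for res in sequence) + WATER_DA


def within_size_range(sequence: str, low_kda: float = 2.5,
                      high_kda: float = 70.0) -> bool:
    """Check the estimated mass against a kDa range (bounds are illustrative)."""
    kda = mass_da(sequence) / 1000.0
    return low_kda <= kda <= high_kda
```

A single-domain building block of roughly 120 residues would be expected to fall well inside the broad range under this estimate; the narrower preferred ranges can be checked by passing different bounds.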
28. The molecule of any of items 19 to 20 or 22 to 27, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 185:
EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
X1: (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z1: (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X2: (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3: (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X4: (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X6: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X7: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X8: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X10: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X13: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X14: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z2: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X20: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
Z3: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
X23: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X24: is absent or Gly;
X25: is absent or Gly;
X26: is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
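The template of SEQ ID NO.: 185 mixes fixed one-letter residues with numbered placeholders (X1 to X26, Z1 to Z3). As a minimal sketch of how such a template could be expanded into a concrete sequence, the code below tokenizes the template and fills each placeholder from a lookup table. The defaults are the first-named residues listed in this item; where the item offers a choice (Z1, Z2, Z3), the value picked here is an arbitrary illustration, and X24 to X26 are taken as absent. All function and variable names are hypothetical.

```python
import re

# Template of SEQ ID NO.: 185 as recited above, written without line breaks.
TEMPLATE_185 = (
    "EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGP"
    "PX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWG"
    "X22GTZ3VTVX23SX24X25X26"
)

# A placeholder is X or Z followed by digits; anything else is a fixed residue.
TOKEN = re.compile(r"[XZ]\d+|[A-Z]")

# First-named residue per placeholder; Z1/Z2/Z3 choices are illustrative only.
DEFAULTS = {
    "X1": "S", "Z1": "L", "X2": "Q", "X3": "S", "X4": "S", "X5": "S",
    "X6": "A", "X7": "S", "X8": "N", "X9": "K", "X10": "E", "X11": "D",
    "X12": "N", "X13": "G", "X14": "T", "X15": "S", "X16": "D", "X17": "A",
    "X18": "K", "X19": "S", "Z2": "V", "X20": "G", "X21": "D", "X22": "R",
    "Z3": "Q", "X23": "S", "X24": "", "X25": "", "X26": "",
}


def instantiate(template: str, residues: dict) -> str:
    """Replace every placeholder in the template with its assigned residue."""
    out = []
    for token in TOKEN.findall(template):
        if len(token) > 1:  # placeholder such as X9 or Z2
            if token not in residues:
                raise KeyError(f"no residue assigned to {token}")
            out.append(residues[token])
        else:  # fixed residue, copied through unchanged
            out.append(token)
    return "".join(out)


# Example: place a cysteine at X9 (Kabat position 43), keep all other defaults.
x9_cys = instantiate(TEMPLATE_185, {**DEFAULTS, "X9": "C"})
```

Under these assumptions the default expansion contains only the two framework cysteines that are fixed in the template, so each substituted placeholder adds exactly one further reactive cysteine.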
29. The molecule of any of items 19 to 20 or 22 to 28, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 185:
EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
(a)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(b)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(c)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(d)
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(e)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(f)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(g)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
(h)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent;
or a sequence which has 80% or more identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa and does not specifically bind to any human protein.
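Alternatives (a) to (h) of this item differ only in which placeholders carry the reactive residue, so they can be summarized as a small table of substitutions. The sketch below encodes that table and expands each alternative into a concrete sequence. It assumes the preferred cysteine at every reactive position and picks Leu for Z1, Val for Z2 and Gln for Z3 purely for illustration, since the item allows either of two residues there. All names in the code are hypothetical.

```python
import re

# Template of SEQ ID NO.: 185 as recited in this item, without line breaks.
TEMPLATE_185 = (
    "EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGP"
    "PX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWG"
    "X22GTZ3VTVX23SX24X25X26"
)

# Residues shared by all alternatives (Z1, Z2, Z3 are illustrative picks).
BASE = {
    "X1": "S", "Z1": "L", "X2": "Q", "X3": "S", "X4": "S", "X5": "S",
    "X6": "A", "X7": "S", "X8": "N", "X9": "K", "X10": "E", "X11": "D",
    "X12": "N", "X13": "G", "X14": "T", "X15": "S", "X16": "D", "X17": "A",
    "X18": "K", "X19": "S", "Z2": "V", "X20": "G", "X21": "D", "X22": "R",
    "Z3": "Q", "X23": "S", "X24": "", "X25": "", "X26": "",
}

# Placeholders changed in each alternative, assuming the preferred cysteine.
VARIANTS = {
    "a": {"X9": "C", "X21": "C", "X22": "C"},
    "b": {"X9": "C", "X18": "C", "X20": "C"},
    "c": {"X5": "C", "X14": "C", "X21": "C"},
    "d": {"X1": "C", "X10": "C", "X11": "C"},
    "e": {"X2": "C", "X16": "C", "X20": "C"},
    "f": {"X2": "C", "X8": "C", "X21": "C"},
    "g": {"X24": "G", "X25": "G", "X26": "C"},  # C-terminal Gly-Gly-Cys tail
    "h": {"X4": "C", "X13": "C", "X19": "C", "X23": "C"},
}


def build_variant(letter: str) -> str:
    """Expand the template for one alternative, (a) through (h)."""
    residues = {**BASE, **VARIANTS[letter]}
    tokens = re.findall(r"[XZ]\d+|[A-Z]", TEMPLATE_185)
    # Placeholders are looked up; fixed residues fall through unchanged.
    return "".join(residues.get(token, token) for token in tokens)


sequences = {letter: build_variant(letter) for letter in VARIANTS}
```

Laid out this way, alternatives (a) to (f) each introduce three reactive positions, (h) introduces four, and (g) leaves the framework unchanged and appends the Gly-Gly-Cys tail instead.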
30. The molecule of any of items 19 to 20 or 22 to 28, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 185:
EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
(a)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
(b)
Xi: (position 7 according to Kabat numbering) is Ser;
Zi: (position 11 according to Kabat numbering) is Leu or Vai;
X2: (position 13 according to Kabat numbering) is Gin;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Xe: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
Xs: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
Xu: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) is Asn;
Xis: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
Xie: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Xis: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Vai or Leu;
X20: (positionlOOa according to Kabat numbering) is Gly;
X21: (position lOOf according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
(c)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(d)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(e)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
(f)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent, or
(g)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
(h)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
(i)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys, or
(j)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is Gly;
X25: is Gly;
X26: is Cys,
or a sequence which has 80% or more identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa and does not specifically bind to any human protein.
31. The molecule of any of items 19 to 20 or 22 to 28, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 185:
EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26
wherein
(a)
X1: (position 7 according to Kabat numbering) is Ser;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X10: (position 44 according to Kabat numbering) is Glu;
X11: (position 55 according to Kabat numbering) is Asp;
X12: (position 62 according to Kabat numbering) is Asn;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X21: (position 100f according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X22: (position 105 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(b)
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X3: (position 17 according to Kabat numbering) is Ser;
X4: (position 19 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) is Asn;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) is Thr;
X15: (position 70 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X18: (position 75 according to Kabat numbering) is Lys;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) is Ser;
X24: is absent;
X25: is absent;
X26: is absent, or
(c)
X1: (position 7 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
Z1: (position 11 according to Kabat numbering) is Leu or Val;
X2: (position 13 according to Kabat numbering) is Gln;
X3: (position 17 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X4: (position 19 according to Kabat numbering) is Ser;
X5: (position 21 according to Kabat numbering) is Ser;
X6: (position 23 according to Kabat numbering) is Ala;
X7: (position 25 according to Kabat numbering) is Ser;
X8: (position 31 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X9: (position 43 according to Kabat numbering) is Lys;
X10: (position 44 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X11: (position 55 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X12: (position 62 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X13: (position 65 according to Kabat numbering) is Gly;
X14: (position 68 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X15: (position 70 according to Kabat numbering) is Ser;
X16: (position 72 according to Kabat numbering) is Asp;
X17: (position 74 according to Kabat numbering) is Ala;
X18: (position 75 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X19: (position 82b according to Kabat numbering) is Ser;
Z2: (position 89 according to Kabat numbering) is Val or Leu;
X20: (position 100a according to Kabat numbering) is Gly;
X21: (position 100f according to Kabat numbering) is Asp;
X22: (position 105 according to Kabat numbering) is Arg;
Z3: (position 108 according to Kabat numbering) is Gln or Leu;
X23: (position 112 according to Kabat numbering) can be any amino acid with a reactive group in its side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine;
X24: is absent;
X25: is absent;
X26: is absent,
or a sequence which has 80% or more identity with SEQ ID NO.: 185, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 185, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa and does not specifically bind to any human protein.
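For illustration only, the way item 31 combines the SEQ ID NO.: 185 template with one set of position definitions can be sketched in Python. The sketch uses option (a) of item 31 and makes two assumptions that the text does not mandate: every position defined as "any amino acid with a reactive group in its side chain" takes the preferred cysteine, and Z1, Z2 and Z3 take the first alternative listed. The names `TEMPLATE`, `OPTION_A` and `fill_template` are hypothetical helpers, not part of the disclosure.

```python
import re

# Template of SEQ ID NO.: 185 as written in item 31 (line-wrap spaces removed).
TEMPLATE = (
    "EVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7GGSLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGP"
    "PX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPDDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWG"
    "X22GTZ3VTVX23SX24X25X26"
)

# Option (a) of item 31 in one-letter codes.
# ASSUMPTIONS: reactive positions take the preferred cysteine ("C");
# Z1/Z2/Z3 take the first listed alternative; "" encodes "absent".
OPTION_A = {
    "X1": "S", "Z1": "L", "X2": "Q", "X3": "S", "X4": "S", "X5": "C",
    "X6": "A", "X7": "S", "X8": "C", "X9": "C", "X10": "E", "X11": "D",
    "X12": "N", "X13": "G", "X14": "C", "X15": "S", "X16": "C", "X17": "A",
    "X18": "K", "X19": "C", "Z2": "V", "X20": "C", "X21": "C", "X22": "C",
    "Z3": "Q", "X23": "S", "X24": "", "X25": "", "X26": "",
}

# A placeholder is X or Z followed by its index; fixed residues never carry digits.
PLACEHOLDER = re.compile(r"[XZ]\d+")


def fill_template(template, choices):
    """Replace every X<n>/Z<n> placeholder with the residue chosen for it."""
    def substitute(match):
        name = match.group(0)
        if name not in choices:
            raise KeyError(f"no residue chosen for {name}")
        return choices[name]
    return PLACEHOLDER.sub(substitute, template)


sequence = fill_template(TEMPLATE, OPTION_A)
print(sequence)
```

Swapping in a different dictionary (for example one built from option (b) or (c)) yields the corresponding variant; a missing placeholder raises an error rather than silently leaving the template unfilled.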
32. The molecule of any of items 19 to 20 or 22 to 31, wherein the ISVD-derived building block additionally comprises an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NO.: 185, 206 or 186, preferably wherein the cysteine is preceded/followed by a flexible tag (sequence), such as a (GG) tag, and preferably wherein the tyrosine is preceded/followed by flexible tags, such as (GG) or (G4S1)1-3GG tags.
33. The molecule of any of items 19 to 20 or 22 to 32, wherein the ISVD-derived building block comprises or, alternatively consists of, one of the polypeptides as defined in SEQ ID NOs.: 80-95, 175 or 222-225, or a sequence which has 80% or more identity with SEQ ID NOs.: 80-95, 175 or 222-225, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NOs.: 80-95, 175 or 222-225.
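The identity thresholds recited in these items (80%, 85%, 90%, 95%, 97%, 99%) can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned to equal length (with "-" for gaps) and that identity is counted as matching columns divided by alignment length; this section does not specify the alignment method or the denominator, so both are assumptions. The function names and the toy sequences are hypothetical.

```python
# Identity thresholds recited in the items, in percent.
THRESHOLDS = (80, 85, 90, 95, 97, 99)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over a pre-computed pairwise alignment.

    ASSUMPTION: identity = identical non-gap columns / alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def thresholds_met(identity):
    """Return the recited thresholds that a given identity value satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


# Toy example (not a sequence from the disclosure): one substitution in ten.
identity = percent_identity("EVQLVESGGG", "EVQLVECGGG")
print(identity, thresholds_met(identity))
```

In practice the alignment itself would come from a dedicated tool; the sketch only shows how a computed identity maps onto the tiered "or more" thresholds.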
34. The molecule of any of items 19 or 20, wherein the protein-based building block is derived from a DARPin protein.
35. The molecule of any of items 19, 20 or 34, wherein the at least one protein-based building block is derived from the polypeptide as defined in SEQ ID NO.: 187.
36. The molecule of any of items 19, 20 or 34 to 35, wherein the at least one protein-based building block comprises or, alternatively, consists of, a polypeptide which has 80% or more identity with SEQ ID NO.: 187, preferably a polypeptide which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 187, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 187:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 187:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 187:
85, 95, 143, 148.
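The three nested position lists of item 36 can be handled programmatically. The following Python sketch parses the lists exactly as recited for SEQ ID NO.: 187 and reports which tier a given 1-based position falls into; the helper names `parse_positions` and `classify` and the tier labels are hypothetical and used for illustration only.

```python
def parse_positions(spec):
    """Expand a list such as "1-2, 4-5, 8" into a set of 1-based positions."""
    positions = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            low, high = (int(x) for x in part.split("-"))
            positions.update(range(low, high + 1))
        else:
            positions.add(int(part))
    return positions


# Position lists as recited in item 36 for SEQ ID NO.: 187.
ALLOWED = parse_positions(
    "1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, "
    "60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, "
    "118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, "
    "154-155"
)
PREFERRED = parse_positions(
    "5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155"
)
MORE_PREFERRED = parse_positions("85, 95, 143, 148")


def classify(position):
    """Return the narrowest tier of item 36 that contains the position."""
    if position in MORE_PREFERRED:
        return "more preferred"
    if position in PREFERRED:
        return "preferred"
    if position in ALLOWED:
        return "allowed"
    return "not listed"


for pos in (85, 5, 1, 3):
    print(pos, classify(pos))
```

Because each narrower list is a subset of the broader one, checking the tiers from narrowest to broadest is sufficient.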
37. The molecule of item 36 wherein the at least one protein-based building block does not specifically bind human KRAS protein and/or wherein the at least one protein-based building block comprises the following point mutations in the polypeptide as defined in SEQ ID NO.: 187: R69A, R102A and R111A, preferably wherein the at least one protein-based building block comprises or, alternatively consists of, a polypeptide as defined in SEQ ID NO.: 180, or a polypeptide which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 180.
38. The molecule of item 37 wherein at least one protein-based building block comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 180:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 180:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 180:
85, 95, 143, 148.
39. The molecule according to any one of items 37 or 38, wherein the at least one protein-based building block comprises or, alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 68, or a polypeptide which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 68, preferably wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine or lysine, or tyrosine, or a non-natural amino acid, preferably cysteine, in at least one of the following positions in SEQ ID NO.: 68:
1-2, 4-5, 8, 11-17, 19-20, 23-25, 27, 29, 31-34, 36, 44-49, 52, 56-58, 60, 62, 64, 66-67, 77-82, 85, 89-91, 93, 95, 97, 99-100, 107, 110-115, 118-119, 121-124, 126-128, 130, 132-135, 138-139, 142-148, 151-152, 154-155, preferably in at least one of the following positions in SEQ ID NO.: 68:
5, 49, 60, 64, 82, 85, 93, 95, 97, 100, 115, 126, 143, 148, 155, more preferably in at least one of the following positions in SEQ ID NO.: 68:
85, 95, 143, 148.
40. The molecule of any of items 19, 20 or 34 to 39, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 188:
X1X2GX3X4LLX5AAX6X7X8X9X10X11X12VX13X14LMX15X16X17AX18VX19AX20X21X22X23GX24TPLHLAAX25X26X27X28X29X30IVX31VLLX32X33X34AX35VX36AX37DX38X39GATPLHLAAX40X41X42X43X44X45IVX46VLLX47X48X49AX50VX51AX52DX53X54GATPLHX55AAX56X57X58X59X60X61IVX62X63LX64X65X66X67AX68X69X70AX71DX72X73X74X75TAX76X77ISX78X79X80X81X82X83X84LAX85X86LX87X88X89X90, wherein
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X58 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X59 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X60 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X61 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X62 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X63 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X64 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X65 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X66 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X67 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X68 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X69 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X70 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X71 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X72 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X73 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X74 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X75 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X76 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X77 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X78 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X79 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X80 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X81 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X82 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X83 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X84 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X85 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X86 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X87 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X88 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X89 can be absent or Leu;
X90 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
41. The molecule of any of items 19, 20 or 34 to 40, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 189,
DLGKX1LLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLX2IVEVLLKNGAX3VNAX4DSYGATPLHLAAMRGHLX5IVX6VLLKYGAX7VX8AX9DEX10GATPLHLAAKAGHLX11IVEVLLKNGAX12VNAQDKFGKTAFDISIX13NGNEX14LAEILQX15X16X17, wherein
X1 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be absent or Leu;
X17 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 189, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 189, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
42. The molecule of any of items 19, 20 or 34 to 41, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 181:
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIX3NGNEX4LAEILQKX5X6, wherein
X1 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be absent or Leu; and
X6 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 181, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 181, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
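A template of the kind recited in item 42 can be turned into a concrete sequence by substituting a residue for each variable position. The following sketch is illustrative only and assumes the template of SEQ ID NO.: 181 exactly as printed in item 42, with six variable positions X1 to X6; the helper names `fill_template` and `placeholder_positions` are assumptions introduced here. It fills the template with the default residues listed in item 42 (the two terminal positions being absent), builds a variant with cysteine at the first four variable positions, and reports the 1-based residue position of each variable position.

```python
import re

# Template of SEQ ID NO.: 181 as printed in item 42; X1..X6 are variable.
TEMPLATE_181 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTA"
    "FDISIX3NGNEX4LAEILQKX5X6"
)

# Default residues from item 42; an empty string means "absent".
DEFAULTS_181 = {1: "E", 2: "N", 3: "D", 4: "D", 5: "", 6: ""}


def fill_template(template, choices=None):
    """Replace each Xn placeholder with the chosen (or default) residue."""
    residues = {**DEFAULTS_181, **(choices or {})}
    return re.sub(r"X(\d+)", lambda m: residues[int(m.group(1))], template)


def placeholder_positions(template):
    """Map each placeholder index to its 1-based position, counting every
    placeholder and every fixed residue as one position."""
    positions = {}
    position = 0
    for token in re.finditer(r"X(\d+)|[A-Z]", template):
        position += 1
        if token.group(1) is not None:
            positions[int(token.group(1))] = position
    return positions


default_sequence = fill_template(TEMPLATE_181)
four_cys = fill_template(TEMPLATE_181, {1: "C", 2: "C", 3: "C", 4: "C"})
print(placeholder_positions(TEMPLATE_181))
```

Under this counting, the first four variable positions fall at residues 85, 95, 143 and 148, which matches the most preferred positions recited in items 36, 38 and 39.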
43. The molecule of item 42, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 181, or variants thereof with sequence identity of 80% or more, and said at least one protein-based building block comprises at least two amino acids with a reactive group in their side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as two amino acids with a reactive group in their side chain, such as two cysteines, or two lysines, or two tyrosines, or two non-natural amino acids, preferably two cysteines in the following solvent-accessible positions (see SEQ ID NO.: 181), and X5 and X6 are absent:
X1 and X2; or
X3 and X4.
44. The molecule of item 42, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 181, or variants thereof with sequence identity of 80% or more, and said protein-based building block comprises at least four amino acids with a reactive group in their side chain, such as a cysteine, or a lysine, or a tyrosine, or a non-natural amino acid, preferably a cysteine, in at least one of the following solvent-accessible positions, such as four cysteines, or four lysines, or four tyrosines, or four non-natural amino acids, preferably four cysteines in the following solvent-accessible positions, and X5 and X6 are absent:
X1, X2, X3 and X4, see SEQ ID NO.: 181.
45. The molecule of any of items 19, 20 or 34 to 44, wherein the DARPin-derived building block additionally comprises an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NOs.: 181 or 188-189, preferably wherein the cysteine is preceded/followed by a flexible tag (sequence), such as a (GG) tag, and preferably wherein the tyrosine is preceded/followed by flexible tags, such as (GG) or (G4S1)1-3GG tags.
46. The molecule of item 45, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 182,
DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGATPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTAFDISIX3NGNEX4LAEILQKC, wherein
X1 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine; and
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine.
47. The molecule of item 45 or 46, wherein the DARPin-derived building block comprises or, alternatively consists of, SEQ ID NO.: 182, or a sequence which has 80% or more identity with SEQ ID NO.: 182, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 182, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
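The identity thresholds recited above (80%, 85%, 90%, 95%, 97%, 99%) can be illustrated with a simple calculation. The sketch below is a simplification under stated assumptions: it compares two sequences of equal length position by position, without gaps, whereas the method actually used to determine sequence identity for these items may rely on an alignment and is not specified in this section. It uses the template of SEQ ID NO.: 182 as printed in item 46; the function names are hypothetical.

```python
import re

# Template of SEQ ID NO.: 182 as printed in item 46 (ends in a fixed Cys).
TEMPLATE_182 = (
    "DLGKKLLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLEIVEVLLKNGADVNADDSYGA"
    "TPLHLAAMRGHLEIVX1VLLKYGADVX2AADEEGATPLHLAAKAGHLEIVEVLLKNGADVNAQDKFGKTA"
    "FDISIX3NGNEX4LAEILQKC"
)

# Default residues for X1..X4 from item 46.
DEFAULTS_182 = {"X1": "E", "X2": "N", "X3": "D", "X4": "D"}


def fill(template, choices):
    """Replace each Xn placeholder with the residue given in `choices`."""
    return re.sub(r"X\d+", lambda m: choices[m.group(0)], template)


def percent_identity(a, b):
    """Ungapped, position-by-position identity of two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("this simplified comparison needs equal lengths")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


reference = fill(TEMPLATE_182, DEFAULTS_182)
four_cys = fill(TEMPLATE_182, {name: "C" for name in DEFAULTS_182})
identity = percent_identity(reference, four_cys)
print(f"{identity:.2f}% identity over {len(reference)} residues")
```

In this simplified comparison, a variant carrying cysteine at all four variable positions differs from the default sequence at 4 of 156 residues, so it would satisfy a 97% threshold but not a 99% threshold.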
48. The molecule of any of items 19, 20 or 34 to 47, wherein the DARPin-derived building block comprises or, alternatively, consists of, one of the polypeptides as defined in SEQ ID NOs.: 96-98, 199 or 208, or a sequence which has 80% or more identity with SEQ ID NOs.: 96-98, 199 or 208, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NOs.: 96-98, 199 or 208.
49. The molecule of any of items 19 or 20, wherein the at least one protein-based building block is an affibody-derived building block or an affitin-derived building block.
50. The molecule of any of items 19 or 21, wherein the at least one protein-based building block is derived from cyclin-dependent kinase subunit 1 (CKS1) protein.
51. The molecule of any of items 19, 21 or 50, wherein the at least one protein-based building block is derived from the polypeptide as defined in SEQ ID NO.: 190.
52. The molecule of any of items 19, 21 or 50 to 51, wherein the at least one protein-based building block comprises, or alternatively, consists of, a polypeptide as defined in SEQ ID NO.: 190, or a sequence which has 80% or more identity with SEQ ID NO.: 190, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 190, wherein the polypeptide comprises at least one amino acid with a reactive group in its side chain, such as cysteine, in at least one of the following positions in SEQ ID NO.: 190:
1-4, 6-7, 9-20, 22, 25-27, 29-30, 32-36, 38-41, 43-44, 46, 48, 50-52, 54, 56-64, 69-78, preferably in at least one of the following positions in SEQ ID NO.: 190:
9-13, 22, 33, 51, 57 and 78, or in at least one of the following positions in SEQ ID NO.: 190:
1, 4, 10, 12, 25, 29, 33, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
53. The molecule of any of items 19, 21 or 50 to 52, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 191:
X1X2X3X4IX5X6SX7X8X9X10X11X12X13X14X15X16X17X18VX19LRX20X21X22AX23X24VX25X23bX24bX25bX26MX27X28X29X30WX31X32LX33VX34QX35X36X37WX38HX39X40X41X42X43X44X45X46X47ILLFX48X49X50X51X52X53X54X55X56X57, wherein
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X23b can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24b can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X25b can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
54. The molecule of any of items 19, 21 or 50 to 53, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 205:
SHKQIYYSX1X2X3X4X5EEFEYRHVX6LPKDIAKLVPX7THLMSESEWRNLGVQQSX8GWVHYX9IHEPEPHILLFRRPLPKKPKX10, wherein
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
55. The molecule of any of items 19, 21 or 50 to 54, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 192:
X1HKX2IYYSDX3YX4DEEFEYRHVMLPX5DIAX6LVPX7THLMSESEWRNLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK, wherein
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine,
or a sequence which has 80% or more identity with SEQ ID NO.: 192, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 192, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
56. The molecule of any of items 19, 21 or 50 to 55, wherein the small globular human protein-derived building block additionally comprises an extra cysteine and/or an extra tyrosine at one or both ends of the polypeptide defined by SEQ ID NOs.: 191-192 or 205, preferably wherein the cysteine is preceded/followed by a flexible tag (sequence), such as a (GG) tag, and preferably wherein the tyrosine is preceded/followed by flexible tags, such as (GG) or (G4S1)1-3GG tags.
57. The molecule of any of items 19, 21 or 50 to 56, wherein the small globular human protein-derived building block comprises or, alternatively consists of, one of the polypeptides as defined in SEQ ID NOs.: 99-105, or a sequence which has 80% or more identity with SEQ ID NOs.: 99-105, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NOs.: 99-105.
58. The molecule of any of items 1 to 57, wherein the at least one protein-based building block comprises or consists of a polypeptide selected from SEQ ID NOs.: 80-105, 175, 199, 208 and/or 222-225.
59. The molecule of any of items 1 to 58, wherein the molecule further comprises at least one (cell)-targeting moiety, such as a tumour-targeting moiety, at least one therapeutic moiety, and/or at least one cell-penetrating peptide directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker.
60. The molecule of item 59, wherein the at least one (cell)-targeting moiety is a single chain variable fragment (scFv), preferably an immunoglobulin single variable domain (ISVD), more preferably wherein the ISVD is a VHH, a humanized VHH, a domain antibody (dAb) or a camelized VH, such as camelized human VH.
61. The molecule of item 60, wherein the molecule comprises (i) two (cell)-targeting moieties directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker, such as two tumor-targeting moieties, preferably two tumor-targeting ISVDs, more preferably selected from SEQ ID NO.: 227 and 228, even more preferably wherein the molecule comprises two tumor-targeting proteins defined by SEQ ID NO.: 227 and 228 directly attached to the at least one proteinbased building block or attached to the at least one protein-based building block through a
linker and/or (ii) at least one, preferably more than one, cell-penetrating peptides directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker, preferably selected from CMA-1 (SEQ ID NO.: 218), LAH4 (SEQ ID NO.:292) and TAT (SEQ ID NO.:277).
62. The molecule of any of items 1 to 61, wherein the molecule comprises at least one further moiety or cargo, preferably wherein the at least one further moiety or cargo is selected from a) a half-life extending (HLE) moiety, such as PEG, and/or an albumin binding ISVD; b) a targeting moiety, such as an EGFR-targeting moiety, e.g., GE11 peptide or an anti-EGFR ISVD, an anti-CEACAM6 ISVD and/or other cell specific binding moieties; c) a therapeutic moiety or precursor thereof, preferably a therapeutic moiety whose target is in the cell nucleus, e.g., a CDK inhibitor; d) an imaging moiety, such as deferoxamine (DFO); e) a toxic moiety; f) nucleic acids such as DNA or ASOs; g) vitamins, such as folate; h) tumor associated glycans; and/or i) lipids.
63. The molecule of item 62, wherein the at least one protein-based carrier building block comprises at least one further cargo attached to one of the conjugation sites or attachment points present in the protein-based building block, preferably to a conjugation site or attachment point which is the side chain of an amino acid preferably located at a solvent-accessible position of the protein-based building block.
64. The molecule of item 63, wherein the at least one protein-based carrier building block is an ISVD-derived building block, preferably derived from a VH, a humanized VH, a human VH, a domain antibody (dAb), a VHH, a humanized VHH or a camelized VH (derived from a heavy-chain ISVD), more preferably derived from an ISVD belonging to the "VH3 class".
65. The molecule of item 64, wherein the at least one further cargo attached to the at least one building block is an ISVD.
66. The molecule of any one of items 62 to 65, wherein the at least one nuclear localization sequence (NLS) and, optionally, the at least one (cell)-targeting moiety, such as a tumour-targeting moiety, the at least one therapeutic moiety, the at least one cell-penetrating peptide, and/or the at least one further cargo are, independently, directly attached to the at least one protein-based building block, or attached to the at least one protein-based building block through a linker.
67. The molecule of any one of items 1 to 66, wherein the linker is a cleavable linker.
68. The molecule of any of items 1 to 67, wherein the linker is a peptide linker.
69. The molecule of any of items 1 to 67, wherein the linker is not a peptide linker.
70. The molecule of any of items 1 to 69, wherein the linker is an amino acid or an amino acid sequence, preferably of between 1 and 50 amino acids, such as for example Gly-Ser linkers ((glyxsery)z), or A3, GS30, GS15, GS9 and GS7 linkers, or a linker as defined in SEQ ID NOs.: 158-169 or 193-196, or 298, more preferably a linker as defined in SEQ ID NO.: 163.
71. The molecule of any of items 1 to 67 and 69, wherein the linker is a linear or branched polyethylene glycol (PEG) moiety, preferably with a molecular weight of about 1-60 kDa, preferably with a weight of about 1-10 kDa, such as 5 kDa or 10 kDa.
72. The molecule of any of items 1 to 67 and 69, wherein the linker is an ELNN polypeptide.
73. The molecule of any one of items 1 to 67 and 69, wherein the linker is an APN-maleimide linker (3-(4-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)phenyl)propiolonitrile, MAPN) or a bis-maleimido-PEG3 (BM(PEG)3) linker (BM(PEG)3 (1,11-bismaleimido-triethyleneglycol)).
74. The molecule of any of items 62 to 73, wherein the further cargo is an (in vivo) half-life extending moiety.
75. The molecule of item 74, wherein the further cargo is a PEG molecule, an ELNN polypeptide or an albumin-binding polypeptide, preferably a PEG molecule, more preferably a PEG molecule with a MW of less than 20 kDa, such as less than 10 kDa, or less than 5 kDa, even more preferably a 1-5 kDa PEG molecule.
76. The molecule of any of items 74 to 75, wherein the cargo is an albumin-binding ISVD.
77. The molecule of any of items 74 to 76, wherein the cargo comprises or, alternatively consists of, a polypeptide as defined in any one of SEQ ID NOs.: 50-64 or 106, preferably SEQ ID NOs.: 63 or 106.
78. The molecule of any of items 1 to 77, wherein the molecule comprises or, alternatively consists of, a polypeptide as defined in any one of SEQ ID NOs.: 107-127, 170-174, 176, 200, 215 or 226.
79. A nucleic acid encoding the molecule as defined in any one of items 1 to 78, part of the molecule as defined in any one of items 1 to 78 and/or the protein-based building block as defined in any one of items 1 to 58.
80. A vector comprising the nucleic acid as defined in item 79.
81. A composition comprising the molecule as defined in any one of items 1 to 78, or the nucleic acid as defined in item 79, such as a pharmaceutical composition.
82. A method for producing the molecule as defined in any one of items 1 to 78, wherein the method comprises:
a) expressing, in a suitable host cell or host organism or in another suitable expression system, a nucleic acid sequence encoding the at least one protein-based carrier building block and/or the molecule or part of the molecule as defined in any one of items 1 to 78; b) optionally isolating and/or purifying the at least one protein-based carrier building block and/or the molecule or part of the molecule expressed in a); c) optionally conjugating one or more (further) cargos to the attachment point(s) or conjugation site(s) of the protein-based carrier building block.
83. A method for producing the molecule as defined in any one of items 1 to 78, wherein the method comprises: a) chemically synthesizing the at least one protein-based carrier building block and/or the molecule or part of the molecule as defined in any one of items 1 to 78, preferably by using solid-phase peptide synthesis; b) optionally isolating and/or purifying the at least one protein-based carrier building block and/or the molecule or part of the molecule synthesized in a); c) optionally conjugating one or more (further) cargos to the attachment point(s) or conjugation site(s) of the protein-based carrier building block.
84. The molecule according to any one of items 1 to 78 or the composition according to item 81 for use in medicine.
85. The molecule according to any one of items 1 to 78 or the composition according to item 81 for use in the prophylactic and/or therapeutic treatment of an autoimmune/inflammatory disease, an infectious disease and/or cancer, such as hematological (blood) and solid tumor cancer disease.
86. The molecule according to any one of items 1 to 78 or the composition according to item 81 for use in entering a therapeutic moiety into the nucleus of a cell and/or for use in intracellular delivery.
87. The molecule according to any one of items 1 to 78 or the composition according to item 81 for use in intranuclear therapy.
88. The molecule according to any one of items 1 to 78 or the composition according to item 81 for use as a vaccine.
89. A vaccine comprising a molecule as defined in any one of items 1 to 78 or the composition as defined in item 81, optionally further comprising an adjuvant.
90. The molecule of item 1 to 78, wherein the molecule comprises at least one further cargo molecule, wherein the further cargo molecule is a therapeutic moiety.
91. The molecule of item 90, wherein the therapeutic moiety is a Death receptor 5 (DR5) antagonist, such as one or more anti-DR5 ISVDs, e.g. such as three anti-DR5 ISVDs.
92. The molecule of item 90, wherein the therapeutic moiety specifically targets the cell nucleus.
93. The molecule of item 1 to 78 or 90-92, wherein the molecule comprises at least one further cargo molecule, wherein the cargo molecule is a toxic moiety.
94. The molecule of item 93, wherein the toxic moiety is a chemotherapeutic agent.
95. The molecule of item 93, wherein the toxic moiety is a topoisomerase inhibitor.
96. The molecule of item 93, wherein the toxic moiety is a cell cycle blocker.
97. The molecule of item 1 to 78 or 90-96, wherein the molecule comprises at least one ISVD-derived building block and at least one cargo molecule, wherein the cargo molecule is a toxic moiety.
98. The molecule of item 97, wherein the toxic moiety is a chemotherapeutic agent.
99. The molecule of item 97, wherein the toxic moiety is a topoisomerase inhibitor.
100. The molecule of item 97, wherein the toxic moiety is a cell cycle blocker.
101. The molecule of item 1 to 78 or 90-100, wherein the molecule comprises at least one further cargo molecule, wherein the further cargo molecule is a targeting moiety.
102. The molecule of item 101, wherein the targeting moiety is an EGFR targeting moiety, such as GE11 peptide or an anti-EGFR ISVD (e.g., an anti-EGFR VHH), or a CEACAM5 targeting moiety, such as an anti-CEACAM5 ISVD.
103. The molecule of item 1 to 78 or 90-102, wherein the molecule comprises at least one further cargo molecule, wherein the further cargo molecule is a nucleic acid.
104. The molecule of item 103, wherein the nucleic acid is an antisense oligonucleotide.
105. The molecule of item 1 to 78 or 90-104, wherein the molecule comprises at least one further cargo molecule, wherein the cargo molecule is a vitamin.
106. The molecule of item 105, wherein the vitamin is folate.
107. The molecule of item 1 to 78 or 90-106, wherein the molecule comprises at least one ISVD-derived building block and at least one cargo molecule, wherein the cargo molecule is a vitamin.
108. The molecule of item 107, wherein the vitamin is folate.
109. The molecule of item 1 to 78 or 90-108, wherein the molecule comprises at least one further cargo molecule, wherein the cargo molecule is a glycan.
110. The molecule of item 109, wherein the glycan is a tumor-associated carbohydrate antigen.
111. The molecule of item 1 to 78 or 90-110, wherein the molecule comprises at least one ISVD-derived building block and at least one cargo molecule, wherein the cargo molecule is a glycan.
112. The molecule of item 111, wherein the glycan is a tumor-associated carbohydrate antigen.
113. The molecule of item 1 to 78 or 90-112, wherein the molecule comprises at least one further cargo molecule, wherein the cargo molecule is a lipid.
114. The molecule of item 113, wherein the lipid is a short-chain fatty acid.
115. The molecule of item 1 to 78 or 90-114, wherein the molecule comprises at least one ISVD-derived building block and at least one cargo molecule, wherein the cargo molecule is a lipid.
116. The molecule of item 115, wherein the lipid is a short-chain fatty acid.
117. The molecule of item 1 to 78 or 90-116, wherein the molecule comprises at least one protein-based building block, at least one nuclear localization sequence (NLS) and at least two different cargo molecules.
118. The molecule of item 1 to 78 or 90-117, wherein the molecule comprises at least one ISVD-derived building block, at least one nuclear localization sequence (NLS), and at least two different cargo molecules.
119. The molecule of items 117 or 118, wherein the two different cargo molecules are a half-life extending moiety, such as PEG, and a radiolabel, such as 89Zr-DFO.
120. The molecule of item 117 or 118, wherein the two different cargo molecules are a half-life extending moiety, such as PEG, and a toxic moiety, such as cryptophycin, DM4 or resiquimod.
121. The molecule of items 117 or 118, wherein the two different cargo molecules are a half-life extending moiety, such as PEG or an albumin binder, and a targeting moiety.
122. The molecule of items 117 or 118, wherein the two different cargo molecules are a half-life extending moiety, such as PEG or an albumin binder, and a therapeutic moiety.
123. The molecule of items 117 or 118, wherein the two different cargo molecules are a therapeutic moiety and a targeting moiety.
124. The molecule of item 1 to 78 or 90-123, wherein the molecule comprises at least one protein-based building block, at least one nuclear localization sequence (NLS) and at least three different cargo molecules.
125. The molecule of item 1 to 78 or 90-124, wherein the molecule comprises at least one ISVD-derived building block, at least one nuclear localization sequence (NLS), and at least three different cargo molecules.
126. The molecule of items 124 or 125, wherein the three different cargo molecules are a half-life extending moiety, such as PEG or an albumin binding polypeptide, a targeting moiety and a therapeutic moiety.
127. The molecule of item 1 to 78, wherein the molecule comprises at least one DARPin-derived building block and at least one further cargo molecule, wherein the further cargo molecule is a targeting moiety.
128. The molecule of item 127, wherein the targeting moiety is an EGFR targeting moiety, such as GE11 peptide or an anti-EGFR ISVD or a CEACAM5 targeting moiety, such as an anti-CEACAM5 ISVD.
129. The molecule of item 1 to 78, wherein the molecule comprises at least one small globular human protein-derived building block, such as a CKS1-derived building block, and at least one further cargo molecule, wherein the cargo molecule is a lipid.
130. The molecule of item 129, wherein the lipid is a short-chain fatty acid.
131. The molecule of item 1 to 78 or 129-130, wherein the molecule comprises at least one small globular human protein-derived building block, such as a CKS1-derived building block, at least one nuclear localization sequence (NLS), and at least two different further cargo molecules.
132. The molecule of item 131, wherein the two different further cargo molecules are a cell penetrating peptide, such as CMA-1, and an imaging moiety, such as a fluorophore, e.g. Alexa 647 or pHAb.
133. The molecule of any one of items 98 to 132, wherein the molecule additionally comprises a cargo molecule that is an (in vivo) half-life extending moiety.
134. The molecule of item 133, wherein the cargo is a PEG molecule, an ELNN polypeptide or an albumin-binding polypeptide, preferably an albumin-binding ISVD.
135. The molecule of any of items 133 to 134, wherein the cargo comprises or, alternatively consists of, a polypeptide as defined in any one of SEQ ID NOs.: 50-64 or 106, preferably SEQ ID NOs.: 63 or 106.
Claims
1. A molecule comprising at least one protein-based building block, wherein the at least one protein-based building block: a) comprises at least two conjugation sites or attachment points; b) has a molecular mass of about 2.5 to about 70 kDa; c) has a globular three-dimensional (3D) structure; d) has a solubility of 10 mg/mL or more, measured in an aqueous solution at room temperature, wherein the aqueous solution is citrate buffer or PBS, at pH 7.0 or 7.4; and e) does not specifically bind to any human protein or binds one or more human proteins with a KD value greater than 5×10⁻⁴ mol/litre, as determined by surface plasmon resonance, for instance as described in Ober et al. 2001, Intern. Immunology 13: 1551-1559, wherein the molecule further comprises at least one, preferably at least two nuclear localization sequences (NLS), covalently linked, directly or by means of a linker, to the at least one, preferably at least two conjugation sites or attachment points comprised in the protein-based carrier building block, wherein the NLS preferably comprises or consists of SEQ ID NO.: 221, SV40mono NLS (SEQ ID NO.: 256, PKKKRKV), SV40tri NLS (SEQ ID NO.: 304, PKKKRKVPKKKRKVPKKKRKV), or NLP NLS (SEQ ID NO.: 305, AVKRPAATKKAGQAKKKKLD).
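As a non-limiting illustration of feature b) and of the NLS sequences recited in claim 1, the following Python sketch estimates the molecular mass of a polypeptide from its one-letter sequence and confirms that the recited SV40tri NLS is three tandem copies of the SV40mono NLS. The residue masses are standard average values, not values taken from this application. The function names and the hard limits of 2.5 and 70 kDa are assumptions introduced here, and the word "about" in the claimed range is not modelled.

```python
# Average residue masses in daltons (standard reference values; an assumption
# introduced for this sketch, not data from the application).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain accounts for the free N- and C-termini


def molecular_mass_kda(seq: str) -> float:
    """Estimate the average molecular mass of an unmodified polypeptide in kDa."""
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


def within_claimed_mass(seq: str, low: float = 2.5, high: float = 70.0) -> bool:
    """Check the estimated mass against the 2.5 to 70 kDa window (hypothetical helper)."""
    return low <= molecular_mass_kda(seq) <= high


def is_tandem_repeat(seq: str, unit: str, copies: int) -> bool:
    """True when seq is exactly `copies` back-to-back repeats of `unit`."""
    return seq == unit * copies


# NLS sequences as recited in claim 1.
SV40_MONO = "PKKKRKV"
SV40_TRI = "PKKKRKVPKKKRKVPKKKRKV"
NLP = "AVKRPAATKKAGQAKKKKLD"

print(is_tandem_repeat(SV40_TRI, SV40_MONO, 3))
print(round(molecular_mass_kda(SV40_MONO), 3))
```

The mass window applies to the protein-based building block, not to the NLS peptides, so the short NLS sequences are expected to fall below it.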
2. The molecule of claim 1, wherein one of the at least two, preferably the at least two conjugation site(s) or attachment point(s) are present at solvent-accessible positions in the protein-based building block.
3. The molecule of any of claims 1 to 2, wherein at least one of the attachment points or conjugation sites, preferably at least two attachment points or conjugation sites, more preferably all of the attachment points or conjugation sites, is(are) an engineered attachment point or conjugation site.
4. The molecule of any of claims 1 to 3, wherein the at least two attachment points or conjugation sites are reactive groups present in the side chain of any amino acid in the protein-based carrier building block, preferably reactive groups present in the side chain of a cysteine and/or in the side chain of a tyrosine, and/or in the side chain of a lysine, and/or in the side chain of a non-natural amino acid.
5. The molecule of any of claims 1 to 4, wherein the at least two conjugation sites are selected from a primary amine, a thiol group, a hydroxyl group, a guanidino group, a carboxyl group and/or a thioether group, preferably from a primary amine and/or a thiol group, more preferably a thiol group.
6. The molecule of any of claims 1 to 5, wherein the at least one protein-based building block does not specifically bind to any non-protein molecule, such as DNA, RNA, lipids or glycans, or binds one or more non-protein molecules with a KD value greater than 5×10⁻⁴ mol/litre.
7. The molecule of any of claims 1 to 6, wherein the at least one protein-based building block comprises at least one further cargo attached to at least one of the attachment points or conjugation sites.
8. The molecule of any of claims 1 to 7, wherein the protein-based building block is a small globular non-human protein-based building block or a small globular human protein-based building block, preferably wherein the small globular non-human protein-based building block is an immunoglobulin single variable domain (ISVD)-based building block, a DARPin-based building block, an affibody-based building block or an affitin-based building block and wherein the small globular human protein-based building block is a cyclin-dependent kinase subunit 1 (CKS1) protein-based building block.
9. The molecule of claim 8, wherein the ISVD-based building block is derived from a VH, a humanized VH, a human VH, a VHH, a humanized VHH or a camelized VH (derived from a heavy-chain ISVD), preferably derived from an ISVD belonging to the "VH3 class".
10. The molecule of any of claims 8 to 9, wherein the ISVD-derived building block is derived from RSV001A04 (SEQ ID NO.: 179).
11. The molecule of any of claims 8 to 10, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 186:
X1VX2LX3EX4X5GX6X7X8X9X10X11GX12X13X14IX15CX16AX17X18X19X20LX21X22X23VLGWFRX24AX25X26X27X28X29X30FVAAINX31X32X33X34X35X36X37X38PX39X40VX41X42X43FX44IX45X46X47X48X49X50X51TGX52LX53MX54X55LX56X57X58DX59AX60YX61CGAGX62PX63X64X65X66AYX67X68X69X70SYX71X72X73GX74X75TX76VX77VX78X79X80X81X82 wherein
X1 (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X2 (position 3 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 5 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5 (position 8 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X6 (position 10 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile, preferably Leu or Val, or any amino acid with a reactive group in its side chain, such as cysteine;
X8 (position 12 according to Kabat numbering) can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X9 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 (position 14 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X11 (position 15 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X12 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X13 (position 18 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 27 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X20: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 30 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X23: (position 32 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X24: (position 39 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X25: (position 41 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X26: (position 42 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X27: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X28: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29: (position 45 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X30: (position 46 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31: (position 52a according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X32: (position 53 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X33: (position 54 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X35: (position 56 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X36: (position 57 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X37: (position 58 according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X38: (position 59 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X39: (position 61 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X40: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X41: (position 64 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X42: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43: (position 66 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X44: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X45: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X46: (position 71 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X47: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X48: (position 73 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X49: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X50: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X51: (position 76 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52: (position 79 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X53: (position 81 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X54: (position 82a according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X55: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X56: (position 83 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X57: (position 84 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X58: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X59: (position 87 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X60: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val, or any other amino acid with a reactive group in its side chain, such as cysteine;
X61: (position 91 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X62: (position 96 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X63: (position 98 according to Kabat numbering) can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X64: (position 99 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X65: (position 100 according to Kabat numbering) can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X66: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X67: (position 100d according to Kabat numbering) can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X68: (position 100e according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X69: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X70: (position 100g according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X71: (position 101 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X72: (position 102 according to Kabat numbering) can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X73: (position 103 according to Kabat numbering) can be Trp or any amino acid with a reactive group in its side chain, such as cysteine;
X74: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X75: (position 106 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X76: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu, or any other amino acid with a reactive group in its side chain, such as cysteine;
X77: (position 110 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X78: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X79: (position 113 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X80: is absent or Gly;
X81: is absent or Gly;
X82: is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 186, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 186, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
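As a non-limiting illustration of the sequence identity thresholds recited in claim 11 (80%, 85%, 90%, 95%, 97% and 99%), the following Python sketch computes a position-by-position percent identity. It assumes the two sequences are already aligned, ungapped and of equal length; the alignment method actually intended by the application is not stated in this passage, so this simplification is an assumption. The function names are hypothetical, and the toy sequences reuse the SV40mono NLS heptapeptide from claim 1 with a single substitution rather than SEQ ID NO.: 186.

```python
# Identity thresholds recited in the claim, in percent.
THRESHOLDS = (80, 85, 90, 95, 97, 99)


def percent_identity(ref: str, query: str) -> float:
    """Percentage of positions carrying identical residues (ungapped, equal length)."""
    if not ref or len(ref) != len(query):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(ref, query))
    return 100.0 * matches / len(ref)


def highest_threshold_met(identity: float):
    """Return the highest recited threshold reached by `identity`, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Toy example: one substitution in a seven-residue peptide (6 of 7 identical).
reference = "PKKKRKV"
variant = "PKKKRKA"
pid = percent_identity(reference, variant)
print(round(pid, 1), highest_threshold_met(pid))
```

For full-length building blocks, a gapped pairwise alignment would normally be computed first and the identity taken over the aligned positions.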
12. The molecule of any of claims 8 to 11, wherein the ISVD-derived building block comprises or, alternatively, consists of SEQ ID NO.: 206:
X1aVQLVEX1GGGZ1VX2AGGX3LX4IX5CX6AX7X7bGX7cLSX8YVLGWFRQAPGX9X10REFVAAINWRGX11ITIGPPX12VEX13RFX14IX15RX16NX17X18NTGYLQMNX19LAPX19bDTAZ2YYCGAGTPLNPX20AYIYX21WSYDYWGX22GTZ3VTVX23SX24X25X26 wherein
X1a (position 1 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X1 (position 7 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
Z1 (position 11 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X2 (position 13 according to Kabat numbering) can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X3 (position 17 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X4 (position 19 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X5: (position 21 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X6: (position 23 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X7: (position 25 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X7b: (position 26 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X7c: (position 28 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X8: (position 31 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9: (position 43 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X10: (position 44 according to Kabat numbering) can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11: (position 55 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12: (position 62 according to Kabat numbering) can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X13: (position 65 according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X14: (position 68 according to Kabat numbering) can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X15: (position 70 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X16: (position 72 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X17: (position 74 according to Kabat numbering) can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X18: (position 75 according to Kabat numbering) can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X19: (position 82b according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X19b: (position 85 according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
Z2: (position 89 according to Kabat numbering) can be Leu, Val, Ser, Met, Trp, Phe, Thr, Gln, Glu, Ala, Arg, Gly, Lys, Tyr, Asn, Pro or Ile; preferably Leu, Val, Ser or Glu, more preferably Leu or Val;
X20: (position 100a according to Kabat numbering) can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X21: (position 100f according to Kabat numbering) can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22: (position 105 according to Kabat numbering) can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
Z3: (position 108 according to Kabat numbering) can be Gln, Leu, Arg, Pro, Glu, Lys, Ser, Thr, Met, Ala or His; preferably Gln or Leu;
X23: (position 112 according to Kabat numbering) can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X24: is absent or Gly;
X25: is absent or Gly;
X26: is absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 206, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 206, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
13. The molecule of claim 8, wherein the DARPin-based building block is derived from the polypeptide as defined in SEQ ID NO.: 187.
14. The molecule of claim 13, wherein at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 188:
X1X2GX3X4LLX5AAX6X7X8X9X10X11X12VX13X14LMX15X16X17AX18VX19AX20X21X22X23GX24TPLHLAAX25X26X27X28X29X30IVX31VLLX32X33X34AX35VX36AX37DX38X39GATPLHLAAX40X41X42X43X44X45IVX46VLLX47X48X49AX50VX51AX52DX53X54GATPLHX55AAX56X57X58X59X60X61IVX62X63LX64X65X66X67AX68X69X70AX71DX72X73X74X75TAX76X77ISX78X79X80X81X82X83X84LAX85X86LX87X88X89X90, wherein
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X58 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X59 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X60 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X61 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X62 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X63 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X64 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X65 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X66 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X67 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X68 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X69 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X70 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X71 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X72 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X73 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X74 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X75 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X76 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X77 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X78 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X79 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X80 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X81 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X82 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X83 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X84 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X85 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X86 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X87 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X88 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X89 can be absent or Leu;
X90 can be absent or Cys or a sequence which has 80% or more identity with SEQ ID NO.: 188, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 188, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
15. The molecule of any of claims 13 to 14, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 189,
DLGKX1LLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLX2IVEVLLKNGAX3VNAX4DSYGATPLHLAAMRGHLX5IVX6VLLKYGAX7VX8AX9DEX10GATPLHLAAKAGHLX11IVEVLLKNGAX12VNAQDKFGKTAFDISIX13NGNEX14LAEILQX15X16X17, wherein
X1 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Ala or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be absent or Leu;
X17 can be absent or Cys, or a sequence which has 80% or more identity with SEQ ID NO.: 189, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 189, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
16. The molecule of claim 8, wherein the CSK1-derived building block is derived from the polypeptide as defined in SEQ ID NO.: 190.
17. The molecule of claim 16, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 191:
X1X2X3X4IX5X6SX7X8X9X10X11X12X13X14X15X16X17X18VX19LRX20X21X22AX23X24VX25X23bX24bX25bX26MX27X28X29X30WX31X32LX33VX34QX35X36X37WX38HX39X40X41X42X43X44X45X46X47ILLFX48X49X50X51X52X53X54X55X56X57, wherein
X1 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X11 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X12 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X13 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X14 can be Phe or any amino acid with a reactive group in its side chain, such as cysteine;
X15 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X16 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X17 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X18 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X19 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X20 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X21 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X22 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X23 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X25 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X23b can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X24b can be Thr or any amino acid with a reactive group in its side chain, such as cysteine;
X25b can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X26 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X27 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X28 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X29 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X30 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X31 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X32 can be Asn or any amino acid with a reactive group in its side chain, such as cysteine;
X33 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X34 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X35 can be Ser or any amino acid with a reactive group in its side chain, such as cysteine;
X36 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X37 can be Gly or any amino acid with a reactive group in its side chain, such as cysteine;
X38 can be Val or any amino acid with a reactive group in its side chain, such as cysteine;
X39 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X40 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X41 can be Ile or any amino acid with a reactive group in its side chain, such as cysteine;
X42 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X43 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X44 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X45 can be Glu or any amino acid with a reactive group in its side chain, such as cysteine;
X46 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X47 can be His or any amino acid with a reactive group in its side chain, such as cysteine;
X48 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X49 can be Arg or any amino acid with a reactive group in its side chain, such as cysteine;
X50 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X51 can be Leu or any amino acid with a reactive group in its side chain, such as cysteine;
X52 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X53 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X54 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X55 can be Pro or any amino acid with a reactive group in its side chain, such as cysteine;
X56 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X57 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine, or a sequence which has 80% or more identity with SEQ ID NO.: 191, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 191, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
18. The molecule of any of claims 16 to 17, wherein the at least one protein-based building block comprises, or alternatively, consists of, SEQ ID NO.: 205:
SHKQIYYSX1X2X3X4X5EEFEYRHVX6LPKDIAKLVPX7THLMSESEWRNLGVQQSX8GWVHYX9IHEPEPHILLFRRPLPKKPKX10, wherein
X1 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X2 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X3 can be Tyr or any amino acid with a reactive group in its side chain, such as cysteine;
X4 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X5 can be Asp or any amino acid with a reactive group in its side chain, such as cysteine;
X6 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X7 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine;
X8 can be Gln or any amino acid with a reactive group in its side chain, such as cysteine;
X9 can be Met or any amino acid with a reactive group in its side chain, such as cysteine;
X10 can be Lys or any amino acid with a reactive group in its side chain, such as cysteine, or a sequence which has 80% or more identity with SEQ ID NO.: 205, preferably a sequence which has 85% or more, 90% or more, 95% or more, 97% or more or 99% or more sequence identity with SEQ ID NO.: 205, provided that the building block has a globular 3D structure, is soluble, has a size (molecular mass) of about 2.5 to about 70 kDa, such as about 2.5 to about 50 kDa, or of about 2.5 to less than 50 kDa, more preferably of about 2.5 to about 30 kDa, such as about 2.5 to about 16 kDa, such as about 5 to about 16 kDa, or about 7 to about 16 kDa, or about 10 to about 16 kDa, and does not specifically bind to any human protein.
19. The molecule of any of claims 1 to 18, wherein the at least one protein-based building block comprises or consists of a polypeptide selected from SEQ ID NO.: 80-105, 175, 199, 208 and/or 222-225.
20. The molecule of any of claims 1 to 19, wherein the molecule further comprises at least one cell-targeting moiety directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker.
21. The molecule of claim 20, wherein the at least one cell-targeting moiety is a single chain variable fragment (scFv), preferably an immunoglobulin single variable domain (ISVD), more preferably wherein the ISVD is a VHH, a humanized VHH, a domain antibody (dAb), or a camelized VH, such as camelized human VH.
22. The molecule of claim 21, wherein the molecule comprises (i) two cell-targeting moieties directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker, such as two tumor-targeting moieties, preferably two tumor-targeting ISVDs, more preferably selected from SEQ ID NO.: 227 and 228, even more preferably wherein the molecule comprises two tumor-targeting moieties defined by SEQ ID NO.: 227 and 228 directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker and/or (ii) at least one, preferably more than one, cell-penetrating peptides directly attached to the at least one protein-based building block or attached to the at least one protein-based building block through a linker, preferably selected from CMA-1 (SEQ ID NO.: 218), LAH4 (SEQ ID NO.: 292) and TAT (SEQ ID NO.: 277).
23. The molecule of any of claims 1 to 22, wherein the molecule comprises at least one further moiety or cargo, preferably wherein the at least one further moiety or cargo is selected from a) a half-life extending (HLE) moiety, such as PEG, and/or an albumin binding ISVD; b) a targeting moiety, preferably an EGFR-targeting moiety such as GE11 peptide or an anti-EGFR ISVD and/or other cell specific binding moieties; and/or c) a therapeutic moiety or precursor therefrom, such as a Toll-like receptor agonist, preferably a therapeutic moiety which target is in the cell nucleus; d) an imaging moiety, such as deferoxamine (DFO); e) a toxic moiety; f) nucleic acids such as DNA or an antisense oligonucleotide; g) vitamins, preferably folate; h) a tumor associated carbohydrate or glycan; i) a lipid; and/or j) Toll-like receptor agonists such as resiquimod.
24. The molecule of claim 23, wherein the at least one further cargo is directly attached to the at least one protein-based building block, or wherein the at least one cargo is attached to the at least one protein-based building block through a linker.
25. The molecule of any of claims 23 to 24, wherein the cargo is an (in vivo) half-life extending moiety, preferably a PEG molecule, preferably a 1-20 kDa PEG molecule, more preferably a 1-10 kDa PEG molecule, even more preferably a 1-5 kDa PEG molecule, an ELNN polypeptide or an albumin-binding polypeptide.
26. The molecule of claim 25, wherein the albumin-binding polypeptide is an albumin-binding ISVD, wherein preferably the albumin-binding ISVD comprises or, alternatively consists of, a polypeptide as defined in any one of SEQ ID NOs.: 50-64 or 106, preferably SEQ ID NOs.: 63 or 106.
27. The molecule of any of claims 1 to 26, wherein the molecule comprises or, alternatively consists of, a polypeptide as defined in any one of SEQ ID NOs.: 107-127, 170-174, 176, 200, 215 or 226.
28. A nucleic acid encoding the molecule as defined in any one of claims 1 to 27, part of the molecule as defined in any one of claims 1 to 27 and/or the protein-based building block as defined in any one of claims 1-19.
29. A vector comprising the nucleic acid as defined in claim 28.
30. A composition comprising the molecule as defined in any one of claims 1 to 27, or the nucleic acid as defined in claim 28, such as a pharmaceutical composition.
31. A method for producing the molecule as defined in any one of claims 1 to 27, wherein the method comprises: a) expressing, in a suitable host cell or host organism or in another suitable expression system, a nucleic acid sequence encoding the at least one protein-based carrier building block and/or the molecule or part of the molecule as defined in any one of claims 1 to 27; b) optionally isolating and/or purifying the at least one protein-based carrier building block and/or the molecule or part of the molecule expressed in a); c) optionally conjugating one or more (further) cargos to the attachment point(s) or conjugation site(s) of the protein-based carrier building block.
32. A method for producing the molecule as defined in any one of claims 1 to 27, wherein the method comprises: a) chemically synthesizing the at least one protein-based carrier building block and/or the molecule or part of the molecule as defined in any one of claims 1 to 27, preferably by using solid-phase peptide synthesis; b) optionally isolating and/or purifying the at least one protein-based carrier building block and/or the molecule or part of the molecule synthesized in a); c) optionally conjugating one or more (further) cargos to the attachment point(s) or conjugation site(s) of the protein-based carrier building block.
33. The molecule according to any one of claims 1 to 27 or the composition according to claim 30 for use in medicine.
34. The molecule according to any one of claims 1 to 27 or the composition according to claim 30 for use in the prophylactic and/or therapeutic treatment of an autoimmune/inflammatory disease, an infectious disease and/or cancer, such as hematological (blood) and solid tumor cancer disease.
35. The molecule according to any one of claims 1 to 27 or the composition according to claim 30 for use as a vaccine.
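Illustrative note (not part of the claims): the sequence templates recited above, for example SEQ ID NO.: 189 in claim 15, can be read as a fixed scaffold with numbered variable positions. The Python sketch below shows one possible way to check a candidate sequence against that template and to list which variable positions carry a cysteine. It rests on simplifying assumptions: cysteine is taken as the only "amino acid with a reactive group in its side chain", the sequence-identity alternative for X17 is ignored, whitespace in the printed template is removed, and the helper names `build` and `cysteine_positions` are hypothetical.

```python
import re

# Template of SEQ ID NO.: 189 as recited in claim 15 (whitespace removed).
TEMPLATE = (
    "DLGKX1LLEAARAGQDDEVRILMANGADVNAHDTFGFTPLHLAALYGHLX2IVEVLLKNGAX3VNAX4DSY"
    "GATPLHLAAMRGHLX5IVX6VLLKYGAX7VX8AX9DEX10GATPLHLAAKAGHLX11IVEVLLKNGAX12VNAQ"
    "DKFGKTAFDISIX13NGNEX14LAEILQX15X16X17"
)

# Default residue at each variable position (one-letter codes), per claim 15.
DEFAULTS = {1: "K", 2: "E", 3: "D", 4: "D", 5: "E", 6: "E", 7: "D", 8: "N",
            9: "A", 10: "E", 11: "E", 12: "D", 13: "D", 14: "D", 15: "K"}

# Positions that may be absent: X16 (absent or Leu), X17 (absent or Cys).
OPTIONAL = {16: "L", 17: "C"}

# Order in which the placeholders occur in the template.
ORDER = [int(n) for n in re.findall(r"X(\d+)", TEMPLATE)]


def _compile(template):
    """Turn the template into a regex with one capture group per X position."""
    parts = re.split(r"X(\d+)", template)  # literal, index, literal, index, ...
    pattern = [re.escape(parts[0])]
    for i in range(1, len(parts), 2):
        idx = int(parts[i])
        if idx in OPTIONAL:
            pattern.append(f"({OPTIONAL[idx]})?")
        else:
            # Assumption: only the default residue or cysteine is accepted.
            pattern.append(f"([{DEFAULTS[idx]}C])")
        pattern.append(re.escape(parts[i + 1]))
    return re.compile("".join(pattern))


PATTERN = _compile(TEMPLATE)


def build(overrides=None):
    """Fill the template with default residues; optional positions are absent
    unless given in overrides (mapping of X index to one-letter code)."""
    choice = {**DEFAULTS, **(overrides or {})}
    return re.sub(r"X(\d+)", lambda m: choice.get(int(m.group(1)), ""), TEMPLATE)


def cysteine_positions(candidate):
    """Return the X indices carrying Cys, or None if the candidate does not
    fit the template under the assumptions stated above."""
    match = PATTERN.fullmatch(candidate)
    if match is None:
        return None
    return [idx for idx, res in zip(ORDER, match.groups()) if res == "C"]


if __name__ == "__main__":
    print(cysteine_positions(build({1: "C", 17: "C"})))  # prints [1, 17]
```

The same approach could be applied to the other templates by swapping in their scaffold and default residues; a real conformity check would additionally need the identity, size, solubility and binding provisos, which this sketch does not attempt.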
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2023/087731 WO2024133935A1 (en) | 2022-12-23 | 2023-12-22 | Protein-based conjugation carriers |
| EPPCT/EP2023/087731 | 2023-12-22 | | |
| US202463661524P | 2024-06-18 | 2024-06-18 | |
| US63/661,524 | 2024-06-18 | | |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2025133253A2 (en) | 2025-06-26 |
| WO2025133253A3 (en) | 2025-08-28 |
Family
ID=94238486
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/EP2024/088112 Pending WO2025133253A2 (en) | 2023-12-22 | 2024-12-20 | Protein-based conjugation carriers for intranuclear delivery |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025133253A2 (en) |
Citations (41)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB335768A (en) | 1929-10-24 | 1930-10-02 | Jacoviac Maurice | Improvements in protecting devices for gramophone disc records |
| WO1994004678A1 (en) | 1992-08-21 | 1994-03-03 | Casterman Cecile | Immunoglobulins devoid of light chains |
| WO1998049185A1 (en) | 1997-04-28 | 1998-11-05 | Fmc Corporation | Lepidopteran gaba-gated chloride channels |
| WO1999023221A2 (en) | 1997-10-27 | 1999-05-14 | Unilever Plc | Multivalent antigen-binding proteins |
| WO1999042077A2 (en) | 1998-02-19 | 1999-08-26 | Xcyte Therapies, Inc. | Compositions and methods for regulating lymphocyte activation |
| EP0967284A1 (en) | 1998-05-28 | 1999-12-29 | Pfizer Limited | Phosphodiesterases |
| WO2000046383A2 (en) | 1999-02-05 | 2000-08-10 | Rijksuniversiteit Leiden | Method of modulating metabolite biosynthesis in recombinant cells |
| WO2000055318A2 (en) | 1999-03-15 | 2000-09-21 | University Of British Columbia | Abc1 polypeptide and methods and reagents for modulating cholesterol levels |
| WO2000078972A2 (en) | 1999-06-18 | 2000-12-28 | Cv Therapeutics, Inc. | Regulation with binding cassette transporter protein abc1 |
| WO2001009300A2 (en) | 1999-08-02 | 2001-02-08 | Keygene N.V. | Method for generating cgmmv resistant plants, genetic constructs, and obtained cgmmv-resistant plants |
| EP1085089A2 (en) | 1999-09-17 | 2001-03-21 | Pfizer Limited | Human cyclic nucleotide phosphodiesterase |
| GB2357768A (en) | 1999-11-18 | 2001-07-04 | Bayer Ag | GABA B receptors |
| WO2004037999A2 (en) | 2002-10-23 | 2004-05-06 | Ludwig Institute For Cancer Research | A34 and a33-like 3 dna, proteins, antibodies thereto and methods of treatment using same |
| WO2004041865A2 (en) | 2002-11-08 | 2004-05-21 | Ablynx N.V. | Stabilized single domain antibodies |
| WO2004060965A2 (en) | 2002-12-31 | 2004-07-22 | Nektar Therapeutics Al, Corporation | Hydrolytically stable maleimide-terminated polymers |
| US6875841B2 (en) | 2001-07-31 | 2005-04-05 | Nof Corporation | Polyoxyalkylene derivative and process of producing the same |
| WO2006040153A2 (en) | 2004-10-13 | 2006-04-20 | Ablynx N.V. | Single domain camelide anti -amyloid beta antibodies and polypeptides comprising the same for the treatment and diagnosis of degenarative neural diseases such as alzheimer's disease |
| WO2006122787A1 (en) | 2005-05-18 | 2006-11-23 | Ablynx Nv | Serum albumin binding proteins |
| WO2006122825A2 (en) | 2005-05-20 | 2006-11-23 | Ablynx Nv | Single domain vhh antibodies against von willebrand factor |
| WO2007118670A1 (en) | 2006-04-14 | 2007-10-25 | Ablynx N.V. | Dp-78-like nanobodies |
| WO2008020079A1 (en) | 2006-08-18 | 2008-02-21 | Ablynx N.V. | Amino acid sequences directed against il-6r and polypeptides comprising the same for the treatment of deseases and disorders associated with il-6-mediated signalling |
| WO2008068280A1 (en) | 2006-12-05 | 2008-06-12 | Ablynx N.V. | Peptides capable of binding to serum proteins |
| WO2009127691A1 (en) | 2008-04-17 | 2009-10-22 | Ablynx N.V. | Peptides capable of binding to serum proteins and compounds, constructs and polypeptides comprising the same |
| WO2009147248A2 (en) | 2008-06-05 | 2009-12-10 | Ablynx N.V. | Amino acid sequences directed against envelope proteins of a virus and polypeptides comprising the same for the treatment of viral diseases |
| WO2010139808A2 (en) | 2009-06-05 | 2010-12-09 | Ablynx Nv | IMPROVED AMINO ACID SEQUENCES DIRECTED AGAINST HUMAN RESPIRATORY SYNCYTIAL VIRUS (hRSV) AND POLYPEPTIDES COMPRISING THE SAME FOR THE PREVENTION AND/OR TREATMENT OF RESPIRATORY TRACT INFECTIONS |
| WO2011095545A1 (en) | 2010-02-05 | 2011-08-11 | Ablynx Nv | Peptides capable of binding to serum albumin and compounds, constructs and polypeptides comprising the same |
| WO2012175400A1 (en) | 2011-06-23 | 2012-12-27 | Ablynx Nv | Serum albumin binding proteins |
| WO2012175741A2 (en) | 2011-06-23 | 2012-12-27 | Ablynx Nv | Techniques for predicting, detecting and reducing aspecific protein interference in assays involving immunoglobulin single variable domains |
| US20140301974A1 (en) | 2009-02-03 | 2014-10-09 | Amunix Operating Inc. | Extended recombinant polypeptides and compositions comprising same |
| WO2015173325A2 (en) | 2014-05-16 | 2015-11-19 | Ablynx Nv | Improved immunoglobulin variable domains |
| WO2016055656A1 (en) | 2014-10-10 | 2016-04-14 | Ablynx N.V. | Methods of treating rsv infections |
| WO2017080850A1 (en) | 2015-11-13 | 2017-05-18 | Ablynx Nv | Improved serum albumin-binding immunoglobulin variable domains |
| WO2017085172A2 (en) | 2015-11-18 | 2017-05-26 | Ablynx Nv | Improved serum albumin binders |
| WO2017089618A1 (en) | 2015-11-27 | 2017-06-01 | Ablynx Nv | Polypeptides inhibiting cd40l |
| WO2018099968A1 (en) | 2016-11-29 | 2018-06-07 | Ablynx N.V. | Treatment of infection by respiratory syncytial virus (rsv) |
| WO2018104444A1 (en) | 2016-12-07 | 2018-06-14 | Ablynx Nv | Improved serum albumin binding immunoglobulin single variable domains |
| WO2018134235A1 (en) | 2017-01-17 | 2018-07-26 | Ablynx Nv | Improved serum albumin binders |
| WO2018134234A1 (en) | 2017-01-17 | 2018-07-26 | Ablynx Nv | Improved serum albumin binders |
| WO2021050554A1 (en) | 2019-09-10 | 2021-03-18 | Synthorx, Inc. | Il-2 conjugates and methods of use to treat autoimmune diseases |
| WO2021072167A1 (en) | 2019-10-10 | 2021-04-15 | The Scripps Research Institute | Compositions and methods for in vivo synthesis of unnatural polypeptides |
| WO2024133935A1 (en) | 2022-12-23 | 2024-06-27 | Ablynx Nv | Protein-based conjugation carriers |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2007035092A2 (en) * | 2005-09-23 | 2007-03-29 | Academisch Ziekenhuis Leiden | Vhh for the diagnosis, prevention and treatment of diseases associated with protein aggregates |
| AU2008219216A1 (en) * | 2007-02-21 | 2008-08-28 | Ablynx N.V. | Amino acid sequences directed against vascular endothelial growth factor and polypeptides comprising the same for the treatment of conditions and diseases characterized by excessive and/or pathological angiogenesis or neovascularization |
| KR102272213B1 (en) * | 2014-07-08 | 2021-07-01 | 삼성전자주식회사 | Fusion protein comprising targeting moiety, cleavage site, and cell membrane penetrating domain, and use thereof |
| WO2018050833A1 (en) * | 2016-09-15 | 2018-03-22 | Ablynx Nv | Immunoglobulin single variable domains directed against macrophage migration inhibitory factor |
| JP2023145812A (en) * | 2020-08-17 | 2023-10-12 | 国立大学法人東海国立大学機構 | ARTIFICIAL PROTEIN, Ras INHIBITOR, AND ANTICANCER AGENT |
- 2024-12-20: WO application PCT/EP2024/088112 (WO2025133253A2), status: Pending
Non-Patent Citations (82)
| Title |
|---|
| "Antibody Engineering", vol. 2, 2010, SPRINGER VERLAG HEIDELBERG, pages: 33 - 51 |
| "Single antibody domains as small recognition units: design and in vitro antigen selection of camelized, human VH domains with improved protein stability", PROT. ENG, vol. 9, no. 6, 1996, pages 531 - 537 |
| ABDICHE ET AL., ANAL. BIOCHEM., vol. 377, 2008, pages 209 - 217 |
| ALAN M. MARMELSTEIN ET AL., JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, vol. 142, no. 11, 2020, pages 5078 - 5086 |
| ALTSCHUL ET AL.: "Basic local alignment search tool", J MOL BIOL., vol. 215, no. 3, 1990, pages 403 - 10, XP002949123, DOI: 10.1006/jmbi.1990.9999 |
| ALVAREZ: "Improving protein pharmacokinetics by genetic fusion to simple amino acid sequences", J. BIOL. CHEM., vol. 279, 2004, pages 3375 - 3381, XP002613037, DOI: 10.1074/JBC.M311356200 |
| AUSUBEL ET AL.: "Current protocols in molecular biology", 1987, GREEN PUBLISHING AND WILEY INTERSCIENCE |
| BRANDL ET AL., JOURNAL OF CONTROLLED RELEASE, vol. 327, 2020, pages 186 - 197 |
| BROADHEAD J, GIBSON M: "Pharmaceutical preformulation and formulation", 2009, INFORMA HEALTHCARE, article "Parenteral dosage forms", pages: 325 - 47 |
| C. S. ET AL.: "Site-specific N-terminal labeling of proteins using sortase-mediated reactions", NATURE PROTOCOLS, vol. 8, no. 9, 2013, pages 1800 - 1807 |
| CAMPANERO-RHODES MA ET AL.: "Microarray strategies for exploring bacterial surface glycans and their interactions with glycan-binding proteins", FRONT MICROBIOL, vol. 10, 2020, pages 2909 |
| CHANG CC, HSIA KC.: "More than a zip code: global modulation of cellular function by nuclear localization signals", FEBS J., vol. 288, no. 19, October 2021 (2021-10-01), pages 5569 - 5585 |
| CHANG-HUI SHEN: "Diagnostic Molecular Biology", 2019, article "Gene Expression: Translation of the Genetic Code" |
| CHAPMAN, NAT. BIOTECHNOL., vol. 54, 2002, pages 531 - 545 |
| D. ALVAREZ DORTA ET AL., CHEM. EUR. J, vol. 26, 2020, pages 14257 |
| DANG CV, LEE WM.: "Identification of the human c-myc protein nuclear translocation signal", MOL CELL BIOL., vol. 8, no. 10, 1988, pages 4048 - 5, XP008138957 |
| DAVIES, RIECHMANN: "'Camelising' human antibody fragments: NMR studies on VH domains", FEBS LETT, vol. 339, 1994, pages 285 - 290 |
| DAVYDOVA M. ET AL.: "Synthesis and bioconjugation of thiol-reactive reagents for the creation of site-selectively modified immunoconjugates", J VIS EXP, 2019, pages 145 |
| DRAKE ET AL.: "Characterizing high-affinity antigen/antibody complexes by kinetic- and equilibrium-based methods", ANAL. BIOCHEM., vol. 328, 2004, pages 35 - 43, XP004501899, DOI: 10.1016/j.ab.2003.12.025 |
| ENGLANDER SW ET AL.: "Hydrogen exchange: the modern legacy of Linderstrøm-Lang", PROTEIN SCI, vol. 6, no. 5, 1997, pages 1101 - 9 |
| ESSAYS BIOCHEM, vol. 63, no. 2, 3 July 2019 (2019-07-03), pages 237 - 266 |
| FANG JL. ET AL.: "Toxicity of high-molecular-weight polyethylene glycols in Sprague Dawley rats", TOXICOL LETT., vol. 359, 2022, pages 22 - 30 |
| FARKHANI SM. ET AL.: "Effect of poly-glutamate on uptake efficiency and cytotoxicity of cell penetrating peptides", IET NANOBIOTECHNOL., vol. 10, no. 2, 2016, pages 87 - 95, XP006056163, DOI: 10.1049/iet-nbt.2015.0030 |
| FEBS LETTERS, vol. 584, 18 June 2010 (2010-06-18), pages 2670 - 2680 |
| FRALEY ET AL.: "The GyrolabTM immunoassay system: a platform for automated bioanalysis and rapid sample turnaround", BIOANALYSIS, vol. 5, 2013, pages 1765 - 74 |
| GONZALES ET AL., TUMOUR BIOL, vol. 26, 2005, pages 31 |
| GREENFIELD NJ: "Using circular dichroism spectra to estimate protein secondary structure", NAT PROTOC., vol. 1, no. 6, 2006, pages 2876 - 90, XP055220113, DOI: 10.1038/nprot.2006.202 |
| GREG T. HERMANSON: "Bioconjugate Techniques", 2013, article "The reactions of bioconjugation" |
| GUIMARAES C. P. ET AL.: "Site-specific C-terminal and internal loop labelling of proteins using sortase-mediated reactions", NATURE PROTOCOLS, vol. 8, no. 9, 2013, pages 1787 - 1799 |
| HAMERS-CASTERMAN ET AL.: "Naturally occurring antibodies devoid of light chains", NATURE, vol. 363, 1993, pages 446 - 448, XP002535892, DOI: 10.1038/363446a0 |
| HARRIS, CHESS, NAT. REV. DRUG. DISCOV., vol. 2, 2003 |
| HUNTER S. A., COCHRAN J. R.: "Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations", METHODS ENZYMOL, vol. 580, 2016, pages 21 - 44 |
| HUNTER SA, COCHRAN JR: "Cell-binding assays for determining the affinity of protein-protein interactions: technologies and considerations", METHODS ENZYMOL., vol. 580, 2016, pages 21 - 44 |
| IRVING ET AL., J. IMMUNOL. METHODS, vol. 248, 2001, pages 31 |
| ISIDRO-LLOBET, A. ET AL.: "Amino acid-protecting groups", CHEM REV., vol. 109, no. 6, 2009, pages 2455 - 504, XP055559012, DOI: 10.1021/cr800323s |
| JOHNSSON ET AL., ANAL. BIOCHEM., vol. 198, 1991, pages 268 - 277 |
| JOHNSSON ET AL., J. MOL. RECOGNIT, vol. 8, 1995, pages 125 - 131 |
| JONES, C: "Circular dichroism of biopharmaceutical proteins in a quality-regulated environment", J PHARM BIOMED ANAL, vol. 219, 2022, pages 114945, XP087150571, DOI: 10.1016/j.jpba.2022.114945 |
| JONSSON ET AL., ANN. BIOL. CLIN, vol. 51, 1993, pages 19 - 26 |
| JONSSON ET AL., BIOTECHNIQUES, vol. 11, 1991, pages 620 - 627 |
| JUN Y. AXUP ET AL.: "Synthesis of site-specific antibody-drug conjugates using unnatural amino acids", PNAS, vol. 109, no. 40, 2012, pages 16101 - 16106 |
| JUNUTULA, J ET AL.: "Site-specific conjugation of a cytotoxic drug to an antibody improves the therapeutic index", NAT BIOTECHNOL, vol. 26, 2008, pages 925 - 932 |
| KALICHUK V. ET AL.: "A novel, smaller scaffold for Affitins: Showcase with binders specific for EpCAM", BIOTECHNOL BIOENG., vol. 115, no. 2, 2018, pages 290 - 299, XP071153733, DOI: 10.1002/bit.26463 |
| KENNETH, A ET AL., CHEMICAL STABILITY OF PHARMACEUTICALS: A HANDBOOK FOR PHARMACISTS |
| KJELDSEN T. ET AL.: "Dually reactive long recombinant linkers for bioconjugations as an alternative to PEG", ACS OMEGA, vol. 5, 2020, pages 19827 - 19833 |
| KRAMER RM. ET AL.: "Toward a molecular understanding of protein solubility: increased negative surface charge correlates with increased solubility", BIOPHYS J, vol. 102, no. 8, 2012, pages 1907 - 15, XP028412916, DOI: 10.1016/j.bpj.2012.01.060 |
| LAIMER J. ET AL.: "MAESTRO--multi agent stability prediction upon point mutations", BMC BIOINFORMATICS, vol. 16, 2015, pages 116, XP021216699, DOI: 10.1186/s12859-015-0548-6 |
| LEHTO T.: "Cell-penetrating peptides for the delivery of nucleic acids", EXPERT OPIN. DRUG DELIV., vol. 9, no. 7, 2012, pages 823 - 36, XP009177242, DOI: 10.1517/17425247.2012.689285 |
| LEVIN, WEISS, MOL. BIOSYST, vol. 2, 2006, pages 49 |
| LEWIN: "Genes II", 1985, JOHN WILEY & SONS |
| LIM S. ET AL.: "Exquisitely specific anti-KRAS biodegraders inform on the cellular prevalence of nucleotide-loaded states", ACS CENT. SCI, vol. 7, no. 2, 2021, pages 274 - 291 |
| LU J. ET AL.: "Types of nuclear localization signals and mechanisms of protein import into the nucleus", CELL COMMUN SIGNAL., vol. 19, no. 1, 2021, pages 60, XP093073531, DOI: 10.1186/s12964-021-00741-y |
| LU, L ET AL.: "Beyond binding: antibody effector functions in infectious diseases", NAT REV IMMUNOL, vol. 18, 2018, pages 46 - 61, XP037923233, DOI: 10.1038/nri.2017.106 |
| M GIBALDI, D PERRON: "Pharmacokinetics", 1982, MARCEL DEKKER |
| MUYLDERMANS ET AL.: "Single domain camel antibodies: current status", J BIOTECHNOL, vol. 74, 2001, pages 277 - 302, XP055277195, DOI: 10.1016/S1389-0352(01)00021-6 |
| MUYLDERMANS S.: "A guide to: generation and design of nanobodies", FEBS J, vol. 288, no. 7, 2021, pages 2084 - 2102, XP055946167, DOI: 10.1111/febs.15515 |
| OBER ET AL., INTERN. IMMUNOLOGY, vol. 13, 2001, pages 1551 - 1559 |
| OLD ET AL.: "Principles of Gene Manipulation: An Introduction to Genetic Engineering", 1981, UNIVERSITY OF CALIFORNIA PRESS |
| PETERS ET AL., PHARMACOKINETIC ANALYSIS: A PRACTICAL APPROACH, 1996 |
| PRESTA, ADV. DRUG DELIV. REV, vol. 58, no. 640, 2006 |
| PURE & APPL. CHEM, vol. 56, no. 5, 1984, pages 595 - 624 |
| RADON ET AL., ADVANCED FUNCTIONAL MATERIALS, vol. 31, no. 2101633, 2021, pages 1 - 33 |
| RAMACHANDER, R, RATHORE, N: "Sterile Product Development, AAPS Advances in the Pharmaceutical Sciences Series", vol. 6, 2013, SPRINGER, article "Molecule and manufacturability assessment leading to robust commercial formulation for therapeutic proteins" |
| RAY ET AL., BIOCONJ. CHEM, vol. 26, no. 6, 2015, pages 1004 - 1007 |
| RIECHMANN L, MUYLDERMANS S.: "Single domain antibodies: comparison of camel VH and camelised human VH domains", J IMMUNOL METHODS, vol. 231, no. 1-2, 1999, pages 25 - 38, XP093118807, DOI: 10.1016/S0022-1759(99)00138-6 |
| ROY ET AL.: "Overcoming the barriers of nuclear-targeted drug delivery using nanomedicine-based strategies for enhanced anticancer therapy", JOURNAL OF DRUG DELIVERY SCIENCE AND TECHNOLOGY, 2023, pages 83 |
| SCHELLENBERGER ET AL., NAT BIOTECHNOL, vol. 27, no. 12, 2009, pages 1186 - 90 |
| SCHMITZ ET AL., PLACENTA, vol. 21, 2000, pages 106 |
| SCHRODINGER, LLC, 2021 |
| SLEEP D. ET AL.: "Albumin as a versatile platform for drug half-life extension", BIOCHIM BIOPHYS ACTA, vol. 1830, no. 12, 2013, pages 5526 - 34, XP028740219, DOI: 10.1016/j.bbagen.2013.04.023 |
| SPEARS R. J. ET AL.: "Cysteine protecting groups: applications in peptide and protein science", CHEM. SOC. REV, vol. 50, 2021, pages 11098 - 11155, XP055880017, DOI: 10.1039/D1CS00271F |
| SPICER C. D. ET AL.: "Achieving controlled biomolecule-biomaterial conjugation", CHEM REV., vol. 118, no. 16, 2018, pages 7702 - 7743, XP055751612, DOI: 10.1021/acs.chemrev.8b00253 |
| STUMPP MT ET AL.: "DARPins: A new generation of protein therapeutics", DRUG DISCOVERY TODAY, vol. 13, no. 15-16, 2008, pages 695 - 701, XP023440383, DOI: 10.1016/j.drudis.2008.04.013 |
| SU, Z. ET AL.: "Antibody-drug conjugates: Recent advances in linker chemistry", ACTA PHARMACEUTICA SINICA B, vol. 11, no. 796073-69-3, 2021, pages 3889 - 3907, XP093087740, DOI: 10.1016/j.apsb.2021.03.042 |
| SWIERCZEWSKA M. ET AL.: "What is the future of PEGylated therapies?", EXPERT OPIN EMERG DRUGS., vol. 20, no. 4, 2015, pages 531 - 536, XP093097371, DOI: 10.1517/14728214.2015.1113254 |
| VERONESE, HARRIS, ADV. DRUG DELIV. REV., vol. 54, 2003, pages 453 - 456 |
| WARD ET AL., NATURE, vol. 341, 1989, pages 544 |
| WITTE M. D. ET AL.: "Production of unnaturally linked chimeric proteins using a combination of sortase-catalyzed transpeptidation and click chemistry", NATURE PROTOCOLS, vol. 8, no. 9, 2013, pages 1808 - 1819, XP037547549, DOI: 10.1038/nprot.2013.103 |
| XIE ET AL.: "Cell-penetrating peptides in diagnosis and treatment of human diseases: from preclinical research to clinical application", FRONT. PHARMACOL., vol. 11, 2020, pages 697, XP055965167, DOI: 10.3389/fphar.2020.00697 |
| YANG Y. ET AL.: "Application of peptides in construction of nonviral vectors for gene delivery", NANOMATERIALS (BASEL), vol. 12, no. 22, 2022, pages 4076 |
| YOUNG ET AL.: "Beyond the canonical 20 amino acids: expanding the genetic lexicon", J. OF BIOLOGICAL CHEMISTRY, vol. 285, no. 15, 2010, pages 11039 - 11044, XP055157080, DOI: 10.1074/jbc.R109.091306 |
| ZHANG H. ET AL.: "Recent advances of cell-penetrating peptides and their application as vectors for delivery of peptide and protein-based cargo molecules", PHARMACEUTICS, vol. 15, no. 8, 2023, pages 2093 |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2025133253A3 (en) | 2025-08-28 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| JP6619743B2 (en) | Antibody-drug conjugates and immunotoxins | |
| CN105142675B (en) | Method for preparing immunoligand/effector molecule conjugates by sequence-specific transpeptidases | |
| KR102342934B1 (en) | Antibody-drug conjugates and immunotoxins | |
| US10738115B2 (en) | Humanized antibodies transmigrating the blood-brain barrier and uses thereof | |
| CN107406497A (en) | The nano antibody dimer of cysteine connection | |
| US20230295293A1 (en) | BINDING MOLECULES AGAINST FRa | |
| CN107667113A (en) | The nano antibody dimer that the cysteine transformed by C-terminal connects | |
| CN114828895A (en) | Method for preparing eribulin-based antibody-drug conjugates | |
| CN116761824B (en) | Engineered anti-TROP 2 antibodies and antibody-drug conjugates thereof | |
| JP7448638B2 (en) | Antibody variants and their uses | |
| CN117242091A (en) | Cysteine engineered antibody constructs, conjugates and methods of use | |
| US20240415974A1 (en) | Protein-based conjugation carriers | |
| CN119137154A (en) | Anti-folate receptor alpha antibodies and methods of use | |
| US20250295799A1 (en) | Antibody-drug conjugates targeting glypican-3 and methods of use | |
| WO2025133253A2 (en) | Protein-based conjugation carriers for intranuclear delivery | |
| US20240424127A1 (en) | Il-7 polypeptides, immunocytokines comprising same, and uses thereof | |
| TW202543670A (en) | Protein-based conjugation carriers for intranuclear delivery | |
| EP4638489A1 (en) | Protein-based conjugation carriers | |
| WO2025262150A2 (en) | Antibody-recruiting molecules | |
| US20250236673A1 (en) | Protein-based carriers for site-specific amine conjugation | |
| TW202535941A (en) | Antibody-drug conjugates having a tailor-made drug-to-antibody ratio | |
| TW202542180A (en) | Protein-based carriers for site-specific amine conjugation | |
| CA3029136C (en) | Humanized antibodies transmigrating the blood-brain barrier and uses thereof | |
| KR20250126789A (en) | ALPP-specific variant antigen binding molecule | |
| HK1217296B (en) | Method of producing an immunoligand/payload conjugate by means of a sequence-specific transpeptidase enzyme |