US20100273172A1 - Micrornas expression signature for determination of tumors origin - Google Patents
Micrornas expression signature for determination of tumors origin Download PDFInfo
- Publication number
- US20100273172A1 US20100273172A1 US12/782,067 US78206710A US2010273172A1 US 20100273172 A1 US20100273172 A1 US 20100273172A1 US 78206710 A US78206710 A US 78206710A US 2010273172 A1 US2010273172 A1 US 2010273172A1
- Authority
- US
- United States
- Prior art keywords
- cancer
- mir
- hsa
- expression profile
- brain
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 205
- 230000014509 gene expression Effects 0.000 title claims abstract description 186
- 108091070501 miRNA Proteins 0.000 title description 76
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 109
- 238000000034 method Methods 0.000 claims abstract description 82
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 72
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 72
- 239000000523 sample Substances 0.000 claims description 124
- 210000004556 brain Anatomy 0.000 claims description 71
- 206010073071 hepatocellular carcinoma Diseases 0.000 claims description 65
- 208000003174 Brain Neoplasms Diseases 0.000 claims description 64
- 231100000844 hepatocellular carcinoma Toxicity 0.000 claims description 63
- 201000011510 cancer Diseases 0.000 claims description 57
- 206010027476 Metastases Diseases 0.000 claims description 55
- 210000004185 liver Anatomy 0.000 claims description 50
- 208000009956 adenocarcinoma Diseases 0.000 claims description 43
- 230000002496 gastric effect Effects 0.000 claims description 38
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 31
- 230000000295 complement effect Effects 0.000 claims description 30
- 208000014018 liver neoplasm Diseases 0.000 claims description 30
- 239000012634 fragment Substances 0.000 claims description 27
- 239000012472 biological sample Substances 0.000 claims description 23
- 201000007270 liver cancer Diseases 0.000 claims description 20
- 230000009401 metastasis Effects 0.000 claims description 20
- 210000004072 lung Anatomy 0.000 claims description 15
- 210000001072 colon Anatomy 0.000 claims description 12
- 208000002699 Digestive System Neoplasms Diseases 0.000 claims description 11
- 239000000203 mixture Substances 0.000 claims description 11
- 210000000496 pancreas Anatomy 0.000 claims description 11
- 201000010231 gastrointestinal system cancer Diseases 0.000 claims description 10
- 208000005017 glioblastoma Diseases 0.000 claims description 8
- 201000010133 Oligodendroglioma Diseases 0.000 claims description 7
- 238000007477 logistic regression Methods 0.000 claims description 7
- 230000002441 reversible effect Effects 0.000 claims description 7
- 206010003571 Astrocytoma Diseases 0.000 claims description 5
- 210000001124 body fluid Anatomy 0.000 claims description 5
- 206010006187 Breast cancer Diseases 0.000 claims description 4
- 206010009944 Colon cancer Diseases 0.000 claims description 4
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 claims description 4
- 206010061902 Pancreatic neoplasm Diseases 0.000 claims description 4
- 208000029742 colonic neoplasm Diseases 0.000 claims description 4
- 208000020816 lung neoplasm Diseases 0.000 claims description 4
- 238000012706 support-vector machine Methods 0.000 claims description 4
- 208000026310 Breast neoplasm Diseases 0.000 claims description 3
- 238000003066 decision tree Methods 0.000 claims description 3
- 208000037841 lung tumor Diseases 0.000 claims description 3
- 206010061968 Gastric neoplasm Diseases 0.000 claims description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 claims description 2
- 238000013528 artificial neural network Methods 0.000 claims description 2
- 206010017758 gastric cancer Diseases 0.000 claims description 2
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 claims description 2
- 208000008443 pancreatic carcinoma Diseases 0.000 claims description 2
- 201000011549 stomach cancer Diseases 0.000 claims description 2
- 239000002679 microRNA Substances 0.000 abstract description 147
- 108700011259 MicroRNAs Proteins 0.000 abstract description 80
- 238000011282 treatment Methods 0.000 abstract description 13
- 238000004458 analytical method Methods 0.000 abstract description 11
- 230000008569 process Effects 0.000 abstract description 4
- 238000005457 optimization Methods 0.000 abstract 1
- 238000002560 therapeutic procedure Methods 0.000 abstract 1
- 125000003729 nucleotide group Chemical group 0.000 description 51
- 210000001519 tissue Anatomy 0.000 description 51
- 210000004027 cell Anatomy 0.000 description 48
- 239000002773 nucleotide Substances 0.000 description 43
- 206010061289 metastatic neoplasm Diseases 0.000 description 34
- 238000002493 microarray Methods 0.000 description 22
- 238000011529 RT qPCR Methods 0.000 description 21
- 206010059282 Metastases to central nervous system Diseases 0.000 description 20
- 108090000623 proteins and genes Proteins 0.000 description 19
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 18
- 206010027457 Metastases to liver Diseases 0.000 description 17
- 238000009396 hybridization Methods 0.000 description 17
- 230000001394 metastastic effect Effects 0.000 description 17
- 108091007428 primary miRNA Proteins 0.000 description 17
- 108091068991 Homo sapiens miR-141 stem-loop Proteins 0.000 description 16
- 230000027455 binding Effects 0.000 description 15
- 238000009739 binding Methods 0.000 description 15
- 230000035945 sensitivity Effects 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 13
- 238000000018 DNA microarray Methods 0.000 description 12
- 108091067482 Homo sapiens miR-205 stem-loop Proteins 0.000 description 12
- 238000003745 diagnosis Methods 0.000 description 12
- 206010019695 Hepatic neoplasm Diseases 0.000 description 11
- 210000000481 breast Anatomy 0.000 description 11
- 238000001514 detection method Methods 0.000 description 11
- 201000009030 Carcinoma Diseases 0.000 description 10
- 238000001574 biopsy Methods 0.000 description 10
- 230000007170 pathology Effects 0.000 description 10
- 239000007787 solid Substances 0.000 description 10
- 206010041823 squamous cell carcinoma Diseases 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 239000000090 biomarker Substances 0.000 description 9
- 230000036210 malignancy Effects 0.000 description 9
- 239000000758 substrate Substances 0.000 description 9
- 102000000574 RNA-Induced Silencing Complex Human genes 0.000 description 8
- 108010016790 RNA-Induced Silencing Complex Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 239000011230 binding agent Substances 0.000 description 8
- 239000010432 diamond Substances 0.000 description 8
- 230000002440 hepatic effect Effects 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 7
- 210000003734 kidney Anatomy 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 230000007017 scission Effects 0.000 description 7
- 238000012549 training Methods 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 6
- 108020004414 DNA Proteins 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 230000000955 neuroendocrine Effects 0.000 description 6
- 208000030266 primary brain neoplasm Diseases 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- -1 rRNA Proteins 0.000 description 6
- 238000003753 real-time PCR Methods 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 5
- 210000004369 blood Anatomy 0.000 description 5
- 239000008280 blood Substances 0.000 description 5
- 230000004663 cell proliferation Effects 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000003211 malignant effect Effects 0.000 description 5
- 201000001441 melanoma Diseases 0.000 description 5
- 210000001672 ovary Anatomy 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 210000002784 stomach Anatomy 0.000 description 5
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 208000006265 Renal cell carcinoma Diseases 0.000 description 4
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- 108010090804 Streptavidin Proteins 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 230000001594 aberrant effect Effects 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 125000000524 functional group Chemical group 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 230000035755 proliferation Effects 0.000 description 4
- 125000002652 ribonucleotide group Chemical group 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000011269 treatment regimen Methods 0.000 description 4
- 108091032955 Bacterial small RNA Proteins 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 201000010915 Glioblastoma multiforme Diseases 0.000 description 3
- 206010073069 Hepatic cancer Diseases 0.000 description 3
- 208000005927 Myosarcoma Diseases 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 108010057163 Ribonuclease III Proteins 0.000 description 3
- 206010039491 Sarcoma Diseases 0.000 description 3
- 108700005078 Synthetic Genes Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 238000011888 autopsy Methods 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 210000003169 central nervous system Anatomy 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000011109 contamination Methods 0.000 description 3
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000003748 differential diagnosis Methods 0.000 description 3
- 230000004069 differentiation Effects 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 3
- 238000003384 imaging method Methods 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- 201000000462 keratinizing squamous cell carcinoma Diseases 0.000 description 3
- 206010024627 liposarcoma Diseases 0.000 description 3
- 210000005228 liver tissue Anatomy 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 206010027191 meningioma Diseases 0.000 description 3
- 208000037819 metastatic cancer Diseases 0.000 description 3
- 201000002077 muscle cancer Diseases 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 238000004393 prognosis Methods 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 208000008732 thymoma Diseases 0.000 description 3
- 206010044412 transitional cell carcinoma Diseases 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 238000010200 validation analysis Methods 0.000 description 3
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 206010001233 Adenoma benign Diseases 0.000 description 2
- 206010073128 Anaplastic oligodendroglioma Diseases 0.000 description 2
- 201000003076 Angiosarcoma Diseases 0.000 description 2
- 201000000274 Carcinosarcoma Diseases 0.000 description 2
- 208000005243 Chondrosarcoma Diseases 0.000 description 2
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 2
- 208000006402 Ductal Carcinoma Diseases 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 201000008808 Fibrosarcoma Diseases 0.000 description 2
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 2
- 208000001258 Hemangiosarcoma Diseases 0.000 description 2
- 208000017604 Hodgkin disease Diseases 0.000 description 2
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 2
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 2
- 108091069016 Homo sapiens miR-122 stem-loop Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 108091007780 MiR-122 Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 206010061332 Paraganglion neoplasm Diseases 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 206010035603 Pleural mesothelioma Diseases 0.000 description 2
- 101710124239 Poly(A) polymerase Proteins 0.000 description 2
- 206010036790 Productive cough Diseases 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 208000002669 Sex Cord-Gonadal Stromal Tumors Diseases 0.000 description 2
- 102000039471 Small Nuclear RNA Human genes 0.000 description 2
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 2
- 238000000692 Student's t-test Methods 0.000 description 2
- PPBRXRYQALVLMV-UHFFFAOYSA-N Styrene Chemical compound C=CC1=CC=CC=C1 PPBRXRYQALVLMV-UHFFFAOYSA-N 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 150000003838 adenosines Chemical class 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 210000005013 brain tissue Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 231100000504 carcinogenesis Toxicity 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 208000006990 cholangiocarcinoma Diseases 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000001684 chronic effect Effects 0.000 description 2
- 208000009060 clear cell adenocarcinoma Diseases 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 208000035250 cutaneous malignant susceptibility to 1 melanoma Diseases 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 239000006260 foam Substances 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 125000001475 halogen functional group Chemical group 0.000 description 2
- 201000011066 hemangioma Diseases 0.000 description 2
- 210000003494 hepatocyte Anatomy 0.000 description 2
- 210000003630 histaminocyte Anatomy 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000003364 immunohistochemistry Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 201000010985 invasive ductal carcinoma Diseases 0.000 description 2
- 208000003849 large cell carcinoma Diseases 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 201000005249 lung adenocarcinoma Diseases 0.000 description 2
- 208000012804 lymphangiosarcoma Diseases 0.000 description 2
- 230000000527 lymphocytic effect Effects 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 210000002752 melanocyte Anatomy 0.000 description 2
- 208000011575 metastatic malignant neoplasm Diseases 0.000 description 2
- 108091023127 miR-196 stem-loop Proteins 0.000 description 2
- 238000001531 micro-dissection Methods 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 208000001611 myxosarcoma Diseases 0.000 description 2
- 230000009826 neoplastic cell growth Effects 0.000 description 2
- IDBIFFKSXLYUOT-UHFFFAOYSA-N netropsin Chemical compound C1=C(C(=O)NCCC(N)=N)N(C)C=C1NC(=O)C1=CC(NC(=O)CN=C(N)N)=CN1C IDBIFFKSXLYUOT-UHFFFAOYSA-N 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 231100000590 oncogenic Toxicity 0.000 description 2
- 230000002246 oncogenic effect Effects 0.000 description 2
- 239000012188 paraffin wax Substances 0.000 description 2
- 208000007312 paraganglioma Diseases 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 108090000765 processed proteins & peptides Chemical group 0.000 description 2
- 238000001959 radiotherapy Methods 0.000 description 2
- 238000012340 reverse transcriptase PCR Methods 0.000 description 2
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 208000000649 small cell carcinoma Diseases 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 210000003802 sputum Anatomy 0.000 description 2
- 208000024794 sputum Diseases 0.000 description 2
- 238000013179 statistical model Methods 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 238000012353 t test Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- CALDMMCNNFPJSI-CRCLSJGQSA-N (3r,5s)-5-(hydroxymethyl)pyrrolidin-3-ol Chemical compound OC[C@@H]1C[C@@H](O)CN1 CALDMMCNNFPJSI-CRCLSJGQSA-N 0.000 description 1
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 description 1
- GZEFTKHSACGIBG-UGKPPGOTSA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)-2-propyloxolan-2-yl]pyrimidine-2,4-dione Chemical compound C1=CC(=O)NC(=O)N1[C@]1(CCC)O[C@H](CO)[C@@H](O)[C@H]1O GZEFTKHSACGIBG-UGKPPGOTSA-N 0.000 description 1
- XQCZBXHVTFVIFE-UHFFFAOYSA-N 2-amino-4-hydroxypyrimidine Chemical compound NC1=NC=CC(O)=N1 XQCZBXHVTFVIFE-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- FSVMNZBNUZWLMV-UHFFFAOYSA-N 3,6,7,8-tetrahydropyrrolo[3,2-e]indole-2-carboxylic acid Chemical compound C1=C2NC(C(=O)O)=CC2=C2CCNC2=C1 FSVMNZBNUZWLMV-UHFFFAOYSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- AGFIRQJZCNVMCW-UAKXSSHOSA-N 5-bromouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 AGFIRQJZCNVMCW-UAKXSSHOSA-N 0.000 description 1
- 108020004565 5.8S Ribosomal RNA Proteins 0.000 description 1
- 108020005075 5S Ribosomal RNA Proteins 0.000 description 1
- ASUCSHXLTWZYBA-UMMCILCDSA-N 8-Bromoguanosine Chemical compound C1=2NC(N)=NC(=O)C=2N=C(Br)N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O ASUCSHXLTWZYBA-UMMCILCDSA-N 0.000 description 1
- HDZZVAMISRMYHH-UHFFFAOYSA-N 9beta-Ribofuranosyl-7-deazaadenin Natural products C1=CC=2C(N)=NC=NC=2N1C1OC(CO)C(O)C1O HDZZVAMISRMYHH-UHFFFAOYSA-N 0.000 description 1
- 208000010400 APUDoma Diseases 0.000 description 1
- 208000007876 Acrospiroma Diseases 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 208000010507 Adenocarcinoma of Lung Diseases 0.000 description 1
- 208000000583 Adenolymphoma Diseases 0.000 description 1
- 208000003200 Adenoma Diseases 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 208000005034 Angiolymphoid Hyperplasia with Eosinophilia Diseases 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- 208000035821 Benign schwannoma Diseases 0.000 description 1
- 208000003609 Bile Duct Adenoma Diseases 0.000 description 1
- 206010004593 Bile duct cancer Diseases 0.000 description 1
- 208000000529 Branchioma Diseases 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 206010007270 Carcinoid syndrome Diseases 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 208000007389 Cementoma Diseases 0.000 description 1
- 206010008263 Cervical dysplasia Diseases 0.000 description 1
- 206010008642 Cholesteatoma Diseases 0.000 description 1
- 201000005262 Chondroma Diseases 0.000 description 1
- 201000009047 Chordoma Diseases 0.000 description 1
- 208000006332 Choriocarcinoma Diseases 0.000 description 1
- 208000016216 Choristoma Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 206010010774 Constipation Diseases 0.000 description 1
- 208000009798 Craniopharyngioma Diseases 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- 201000005171 Cystadenoma Diseases 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 208000007033 Dysgerminoma Diseases 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 206010063045 Effusion Diseases 0.000 description 1
- 208000003468 Ehrlich Tumor Carcinoma Diseases 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 206010014967 Ependymoma Diseases 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 206010073306 Exposure to radiation Diseases 0.000 description 1
- 206010053717 Fibrous histiocytoma Diseases 0.000 description 1
- 208000007569 Giant Cell Tumors Diseases 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- 201000005618 Glomus Tumor Diseases 0.000 description 1
- 208000005234 Granulosa Cell Tumor Diseases 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical class C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 208000035773 Gynandroblastoma Diseases 0.000 description 1
- 208000002927 Hamartoma Diseases 0.000 description 1
- 208000002125 Hemangioendothelioma Diseases 0.000 description 1
- 208000006050 Hemangiopericytoma Diseases 0.000 description 1
- 206010061998 Hepatic lesion Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 208000037564 High-grade astrocytoma Diseases 0.000 description 1
- 101001120822 Homo sapiens Putative microRNA 17 host gene protein Proteins 0.000 description 1
- 101000574648 Homo sapiens Retinoid-inducible serine carboxypeptidase Proteins 0.000 description 1
- 108091068957 Homo sapiens miR-194-1 stem-loop Proteins 0.000 description 1
- 241000714259 Human T-lymphotropic virus 2 Species 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 208000009164 Islet Cell Adenoma Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- 201000004462 Leydig Cell Tumor Diseases 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 206010024612 Lipoma Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 206010025219 Lymphangioma Diseases 0.000 description 1
- 208000004138 Lymphangiomyoma Diseases 0.000 description 1
- 208000008095 Malignant Carcinoid Syndrome Diseases 0.000 description 1
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 1
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 1
- 208000000172 Medulloblastoma Diseases 0.000 description 1
- 208000010153 Mesonephroma Diseases 0.000 description 1
- 206010027406 Mesothelioma Diseases 0.000 description 1
- 108091030146 MiRBase Proteins 0.000 description 1
- 108091027977 Mir-200 Proteins 0.000 description 1
- 208000007727 Muscle Tissue Neoplasms Diseases 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- VQAYFKKCNSOZKM-UHFFFAOYSA-N NSC 29409 Natural products C1=NC=2C(NC)=NC=NC=2N1C1OC(CO)C(O)C1O VQAYFKKCNSOZKM-UHFFFAOYSA-N 0.000 description 1
- 108010042309 Netropsin Proteins 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 201000004404 Neurofibroma Diseases 0.000 description 1
- 208000009905 Neurofibromatoses Diseases 0.000 description 1
- 208000005890 Neuroma Diseases 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- CTQNGGLPUBDAKN-UHFFFAOYSA-N O-Xylene Chemical compound CC1=CC=CC=C1C CTQNGGLPUBDAKN-UHFFFAOYSA-N 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 241001440127 Phyllodes Species 0.000 description 1
- 208000007641 Pinealoma Diseases 0.000 description 1
- 208000007452 Plasmacytoma Diseases 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102100026055 Putative microRNA 17 host gene protein Human genes 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 238000012341 Quantitative reverse-transcriptase PCR Methods 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 208000034541 Rare lymphatic malformation Diseases 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 206010057071 Rectal tenesmus Diseases 0.000 description 1
- 206010038802 Reticuloendothelial system stimulated Diseases 0.000 description 1
- 102100025483 Retinoid-inducible serine carboxypeptidase Human genes 0.000 description 1
- 208000005678 Rhabdomyoma Diseases 0.000 description 1
- 102000003661 Ribonuclease III Human genes 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 108010081734 Ribonucleoproteins Proteins 0.000 description 1
- 102000004389 Ribonucleoproteins Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 208000003274 Sertoli cell tumor Diseases 0.000 description 1
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 206010042658 Sweat gland tumour Diseases 0.000 description 1
- 206010043276 Teratoma Diseases 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 108091026822 U6 spliceosomal RNA Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 208000021146 Warthin tumor Diseases 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 229920006397 acrylic thermoplastic Polymers 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 201000004471 adenofibroma Diseases 0.000 description 1
- 201000008395 adenosquamous carcinoma Diseases 0.000 description 1
- 208000018234 adnexal spiradenoma/cylindroma of a sweat gland Diseases 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical group OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000000304 alkynyl group Chemical group 0.000 description 1
- 125000003275 alpha amino acid group Chemical group 0.000 description 1
- 208000010029 ameloblastoma Diseases 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 206010002224 anaplastic astrocytoma Diseases 0.000 description 1
- 208000007502 anemia Diseases 0.000 description 1
- 201000009431 angiokeratoma Diseases 0.000 description 1
- 208000000252 angiomatosis Diseases 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 210000000270 basal cell Anatomy 0.000 description 1
- 208000021592 benign granular cell tumor Diseases 0.000 description 1
- 208000026900 bile duct neoplasm Diseases 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 238000002725 brachytherapy Methods 0.000 description 1
- 201000008274 breast adenocarcinoma Diseases 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 231100000357 carcinogen Toxicity 0.000 description 1
- 239000003183 carcinogenic agent Substances 0.000 description 1
- 208000005761 carcinoid heart disease Diseases 0.000 description 1
- 208000002458 carcinoid tumor Diseases 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000003320 cell separation method Methods 0.000 description 1
- 230000004640 cellular pathway Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 210000003756 cervix mucus Anatomy 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 239000012829 chemotherapy agent Substances 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 201000005217 chondroblastoma Diseases 0.000 description 1
- ZYVSOIYQKUDENJ-WKSBCEQHSA-N chromomycin A3 Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@@H]1OC(C)=O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@@H](O)[C@H](O[C@@H]3O[C@@H](C)[C@H](OC(C)=O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@@H]1C[C@@H](O)[C@@H](OC)[C@@H](C)O1 ZYVSOIYQKUDENJ-WKSBCEQHSA-N 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 230000002079 cooperative effect Effects 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 208000002445 cystadenocarcinoma Diseases 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000011500 cytoreductive surgery Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000013872 defecation Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- CFCUWKMKBJTWLW-UHFFFAOYSA-N deoliosyl-3C-alpha-L-digitoxosyl-MTM Natural products CC=1C(O)=C2C(O)=C3C(=O)C(OC4OC(C)C(O)C(OC5OC(C)C(O)C(OC6OC(C)C(O)C(C)(O)C6)C5)C4)C(C(OC)C(=O)C(O)C(C)O)CC3=CC2=CC=1OC(OC(C)C1O)CC1OC1CC(O)C(O)C(C)O1 CFCUWKMKBJTWLW-UHFFFAOYSA-N 0.000 description 1
- 230000003831 deregulation Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000009274 differential gene expression Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- XNYZHCFCZNMTFY-UHFFFAOYSA-N diminazene Chemical compound C1=CC(C(=N)N)=CC=C1N\N=N\C1=CC=C(C(N)=N)C=C1 XNYZHCFCZNMTFY-UHFFFAOYSA-N 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000002121 endocytic effect Effects 0.000 description 1
- 230000002357 endometrial effect Effects 0.000 description 1
- 210000004696 endometrium Anatomy 0.000 description 1
- 208000023965 endometrium neoplasm Diseases 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000007387 excisional biopsy Methods 0.000 description 1
- 206010016629 fibroma Diseases 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 239000012520 frozen sample Substances 0.000 description 1
- 230000005021 gait Effects 0.000 description 1
- 210000000232 gallbladder Anatomy 0.000 description 1
- 201000008361 ganglioneuroma Diseases 0.000 description 1
- 201000006585 gastric adenocarcinoma Diseases 0.000 description 1
- 230000037440 gene silencing effect Effects 0.000 description 1
- 239000003365 glass fiber Substances 0.000 description 1
- 201000005626 glomangioma Diseases 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 210000004209 hair Anatomy 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 208000010710 hepatitis C virus infection Diseases 0.000 description 1
- 201000005133 hidradenoma Diseases 0.000 description 1
- 201000009379 histiocytoid hemangioma Diseases 0.000 description 1
- 201000000284 histiocytoma Diseases 0.000 description 1
- 201000008298 histiocytosis Diseases 0.000 description 1
- 230000002962 histologic effect Effects 0.000 description 1
- 230000003118 histopathologic effect Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012296 in situ hybridization assay Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000007386 incisional biopsy Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000007917 intracranial administration Methods 0.000 description 1
- 206010073095 invasive ductal breast carcinoma Diseases 0.000 description 1
- 206010073096 invasive lobular breast carcinoma Diseases 0.000 description 1
- 201000002529 islet cell tumor Diseases 0.000 description 1
- DRAVOWXCEBXPTN-UHFFFAOYSA-N isoguanine Natural products NC1=NC(=O)NC2=C1NC=N2 DRAVOWXCEBXPTN-UHFFFAOYSA-N 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 201000010260 leiomyoma Diseases 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 238000012417 linear regression Methods 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 208000020442 loss of weight Diseases 0.000 description 1
- 208000022080 low-grade astrocytoma Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000009607 mammography Methods 0.000 description 1
- 239000011565 manganese chloride Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000001809 melena Anatomy 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 210000002418 meninge Anatomy 0.000 description 1
- 208000004197 mesenchymoma Diseases 0.000 description 1
- 208000011831 mesonephric neoplasm Diseases 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 108091007426 microRNA precursor Proteins 0.000 description 1
- CFCUWKMKBJTWLW-BKHRDMLASA-N mithramycin Chemical compound O([C@@H]1C[C@@H](O[C@H](C)[C@H]1O)OC=1C=C2C=C3C[C@H]([C@@H](C(=O)C3=C(O)C2=C(O)C=1C)O[C@@H]1O[C@H](C)[C@@H](O)[C@H](O[C@@H]2O[C@H](C)[C@H](O)[C@H](O[C@@H]3O[C@H](C)[C@@H](O)[C@@](C)(O)C3)C2)C1)[C@H](OC)C(=O)[C@@H](O)[C@@H](C)O)[C@H]1C[C@@H](O)[C@H](O)[C@@H](C)O1 CFCUWKMKBJTWLW-BKHRDMLASA-N 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 201000004130 myoblastoma Diseases 0.000 description 1
- 208000009091 myxoma Diseases 0.000 description 1
- UPBAOYRENQEPJO-UHFFFAOYSA-N n-[5-[[5-[(3-amino-3-iminopropyl)carbamoyl]-1-methylpyrrol-3-yl]carbamoyl]-1-methylpyrrol-3-yl]-4-formamido-1-methylpyrrole-2-carboxamide Chemical compound CN1C=C(NC=O)C=C1C(=O)NC1=CN(C)C(C(=O)NC2=CN(C)C(C(=O)NCCC(N)=N)=C2)=C1 UPBAOYRENQEPJO-UHFFFAOYSA-N 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000001613 neoplastic effect Effects 0.000 description 1
- 230000010309 neoplastic transformation Effects 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 208000007538 neurilemmoma Diseases 0.000 description 1
- 201000002120 neuroendocrine carcinoma Diseases 0.000 description 1
- 201000011519 neuroendocrine tumor Diseases 0.000 description 1
- 208000029986 neuroepithelioma Diseases 0.000 description 1
- 201000004931 neurofibromatosis Diseases 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 210000004287 null lymphocyte Anatomy 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 208000004128 odontoma Diseases 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000005305 organ development Effects 0.000 description 1
- 208000008798 osteoma Diseases 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 125000004043 oxo group Chemical group O=* 0.000 description 1
- 208000022102 pancreatic neuroendocrine neoplasm Diseases 0.000 description 1
- 201000010198 papillary carcinoma Diseases 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 238000012335 pathological evaluation Methods 0.000 description 1
- XDRYMKDFEDOLFX-UHFFFAOYSA-N pentamidine Chemical compound C1=CC(C(=N)N)=CC=C1OCCCCCOC1=CC=C(C(N)=N)C=C1 XDRYMKDFEDOLFX-UHFFFAOYSA-N 0.000 description 1
- 229960004448 pentamidine Drugs 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 238000000206 photolithography Methods 0.000 description 1
- INAAIJLSXJJHOZ-UHFFFAOYSA-N pibenzimol Chemical compound C1CN(C)CCN1C1=CC=C(N=C(N2)C=3C=C4NC(=NC4=CC=3)C=3C=CC(O)=CC=3)C2=C1 INAAIJLSXJJHOZ-UHFFFAOYSA-N 0.000 description 1
- 208000024724 pineal body neoplasm Diseases 0.000 description 1
- 201000004123 pineal gland cancer Diseases 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 229960003171 plicamycin Drugs 0.000 description 1
- 229920003229 poly(methyl methacrylate) Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001748 polybutylene Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920001184 polypeptide Chemical group 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 229920002635 polyurethane Polymers 0.000 description 1
- 239000004814 polyurethane Substances 0.000 description 1
- 238000010837 poor prognosis Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000002600 positron emission tomography Methods 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 208000037821 progressive disease Diseases 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 125000000168 pyrrolyl group Chemical group 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- UOWVMDUEMSNCAV-WYENRQIDSA-N rachelmycin Chemical compound C1([C@]23C[C@@H]2CN1C(=O)C=1NC=2C(OC)=C(O)C4=C(C=2C=1)CCN4C(=O)C1=CC=2C=4CCN(C=4C(O)=C(C=2N1)OC)C(N)=O)=CC(=O)C1=C3C(C)=CN1 UOWVMDUEMSNCAV-WYENRQIDSA-N 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000002271 resection Methods 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 150000003290 ribose derivatives Chemical group 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 201000007416 salivary gland adenoid cystic carcinoma Diseases 0.000 description 1
- 206010039667 schwannoma Diseases 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 150000003376 silicon Chemical class 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 108091029842 small nuclear ribonucleic acid Proteins 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 108010042747 stallimycin Proteins 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000011272 standard treatment Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 238000011477 surgical intervention Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000035900 sweating Effects 0.000 description 1
- 206010042863 synovial sarcoma Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 208000012271 tenesmus Diseases 0.000 description 1
- ISXSCDLOGDJUNJ-UHFFFAOYSA-N tert-butyl prop-2-enoate Chemical compound CC(C)(C)OC(=O)C=C ISXSCDLOGDJUNJ-UHFFFAOYSA-N 0.000 description 1
- 208000024662 testicular seminoma Diseases 0.000 description 1
- 201000009646 testis seminoma Diseases 0.000 description 1
- 208000001644 thecoma Diseases 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000009752 translational inhibition Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 208000029387 trophoblastic neoplasm Diseases 0.000 description 1
- HDZZVAMISRMYHH-KCGFPETGSA-N tubercidin Chemical compound C1=CC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O HDZZVAMISRMYHH-KCGFPETGSA-N 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 230000036642 wellbeing Effects 0.000 description 1
- 238000010626 work up procedure Methods 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 239000008096 xylene Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/178—Oligonucleotides characterized by their use miRNA, siRNA or ncRNA
Definitions
- the present invention relates to methods for classification of cancers and tumors origin. Specifically the invention relates to microRNA molecules associated with specific cancer, as well as various nucleic acid molecules relating thereto or derived therefrom.
- microRNAs are a family of 18-24 nucleotide long non-coding small RNAs, that suppress translation of target genes by binding to their mRNA, thereby regulating the expression of at least 30% of all human genes.
- mRNAs There are currently about 850 known human microRNAs. Though highly conserved throughout evolution, a significant proportion of them is primate specific. microRNAs association with cancer has been demonstrated and several microRNAs have already been identified as oncogenes and tumor suppressors (He, H., et al., Proc Natl Acad Sci USA, 2005. 102(52): p. 19075-80).
- microRNAs One of the major characteristics of microRNAs is their marked tissue specificity. Many of them also exhibit temporal patterns of expression, suggesting that they play a critical role in specific tissues and in organ development, function and maintenance.
- hepatic lesions that include primary liver tumor and metastatic tumors are a frequent challenge in modern surgical pathology.
- Hepatic malignancies are often found in patients with advanced metastatic cancer. Identifying the origin of these tumors as well as differentiating liver metastases from primary hepatocellular carcinoma (HCC) is frequently required and poses a significant challenge that requires clinical-radiological correlation on top of careful pathological evaluation. Thorough examination assisted by a panel of immunostains is required for the identification of the origin of metastases.
- HCC primary hepatocellular carcinoma
- the present invention provides specific nucleic acid sequences that are used for the identification, classification and diagnosis of cancers and tumor origin.
- the nucleic acid sequences can also be used for the differentiation between primary and metastatic tumors based on the expression pattern of a biological sample.
- the present invention provides a method of classifying a specific cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-33, a fragment thereof, or a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the results of said comparison allows for classification of said specific cancer.
- said cancer is selected from the group consisting of liver cancer, brain cancer and gastrointestinal (GI) cancer.
- said liver cancer is hepatocellular carcimoma (HCC).
- HCC hepatocellular carcimoma
- said GI cancer is selected from the group consisting of colon, pancreas and stomach cancer.
- said brain cancer is selected from the group consisting of glioblastoma, astrocytoma and oligodendroglioma.
- the invention further provides a method for identifying liver cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 13-6, 32-33, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said liver cancer.
- the invention further provides a method to distinguish between hepatocellular carcimoma (HCC) and metastasis to the liver, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 6-13, 32-33, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of hepatocellular carcimoma (HCC) or metastasis to the liver.
- HCC hepatocellular carcimoma
- the nucleic acid sequence is selected from the group consisting of SEQ ID NOS: 6, 8, 10, 12, a fragment thereof and a sequence having at least about 80% identity thereto, wherein relatively high expression levels of any of said nucleic acid sequence, as compared to said reference expression profile, is indicative of metastasis to the liver.
- the liver metastasis is adenocarcinoma.
- the invention further provides a method for identifying brain cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said brain cancer.
- the invention further provides a method to distinguish between primary brain tumor and metastasis to the brain, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary brain tumor or metastasis to the brain.
- the nucleic acid sequence is selected from the group consisting of SEQ ID NOS: 14, 20, a fragment thereof and a sequence having at least about 80% identity thereto, wherein relatively high expression levels of any of said nucleic acid sequence, as compared to said reference expression profile, is indicative of primary brain tumor.
- the invention further provides a method to distinguish between primary brain tumor and other primary cancers, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary brain tumor or other primary cancers.
- the invention further provides a method for identifying a gastrointestinal (GI) cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-5, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said GI cancer.
- GI gastrointestinal
- the invention further provides a method to distinguish between primary GI and non-GI tumor, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-5, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary GI or non-GI tumor.
- said primary GI tumor is selected from the group consisting of colon, pancreas and stomach tumor.
- said non-GI tumor is selected from the group consisting of lung and breast tumor.
- said sample is selected from the group consisting of bodily fluid, a cell line and a tissue sample.
- said tissue is a fresh, frozen, fixed, wax-embedded or formalin fixed paraffin-embedded (FFPE) tissue.
- the classification method of the present invention further comprises a classifier algorithm, said classifier algorithm is selected from the group consisting of logistic regression classifier, linear regression classifier, nearest neighbor classifier (including K nearest neighbors), neural network classifier, Gaussian mixture model (GMM) classifier and Support Vector Machine (SVM) classifier.
- the classifier may use a decision tree structure (including binary tree) or a voting (including weighted voting) scheme to compare one or more models which compare one or more classes to other classes.
- the nucleic acid sequence expression profile is determined by a method selected from the group consisting of nucleic acid hybridization and nucleic acid amplification.
- the nucleic acid hybridization is performed using a solid-phase nucleic acid biochip array or in situ hybridization.
- the nucleic acid amplification method is real-time PCR.
- the real-time PCR method may comprise forward and reverse primers.
- the forward primer comprises a sequence selected from the group consisting of SEQ ID NOS: 34-38.
- the reverse primer comprises SEQ ID NO: 44.
- the real-time PCR method further comprises a probe.
- the probe comprises a sequence selected from the group consisting of a sequence that is complementary to a sequence selected from SEQ ID NOS: 1-33; a fragment thereof and a sequence having at least about 80% identity thereto.
- the probe comprises a sequence selected from the group consisting of SEQ ID NOS: 39-43.
- the present invention provides a kit for cancer classification, said kit comprising a probe comprising a sequence selected from the group consisting of a sequence that is complementary to a sequence selected from SEQ ID NOS: SEQ ID NOS: 1-33; a fragment thereof and a sequence having at least about 80% identity thereto.
- said cancer is selected from the group consisting of liver cancer, brain cancer and gastrointestinal (GI) cancer.
- said cancer is brain cancer.
- said probe comprises a sequence selected from the group consisting of SEQ ID NOS: 39-43.
- said the kit further comprises a forward primer selected from the group consisting of SEQ ID NOS: 34-38.
- said the kit further comprises a reverse primer comprising SEQ ID NO: 44.
- FIG. 1 demonstrates the identification of non-HCC epithelial tumor samples and metastases using microRNA biomarkers.
- Expression level of hsa-miR-122a is high in all samples taken from liver, including primary and metastases.
- Expression level of hsa-miR-200b is high in samples of non-liver origin, both primary and metastases.
- C Expression levels (qRT-PCR data, 50 minus normalized C) of hsa-miR-122a and hsa-miR-200b in 5 HCC samples (squares), 19 non-HCC primary tumors of epithelial origin (diamonds), and 7 samples of adenocarcinoma metastases to the liver of known origin (circles). Expression level of hsa-miR-122a is high in all samples taken from liver, including primary and metastases. D) Expression levels (qRT-PCR) of hsa-miR-141 and hsa-miR-200c in the same samples.
- C RT ⁇ [(hsa-miR-141)+(hsa-miR-200c)] 34.
- FIG. 2 demonstrates the identification of gastro-intestinal tumors and metastases using a combination of two microRNA biomarkers.
- A) Expression levels of hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4) in 24 primary tumor samples of gastrointestinal (GI) origin (squares) and 39 primary tumors from breast or lung (diamonds). The dashed gray line marks the line where (hsa-miR-205) (hsa-miR-194)/2 (see methods).
- FIG. 3 demonstrates the identification of non-HCC epithelial tumor samples and metastases using a combination of two microRNA biomarkers.
- FIG. 4 demonstrates the identification of metastatic brain tumors using microRNA microarray data.
- the expression levels of hsa-miR-124 and hsa-miR-219-5p in metastatic samples span a wide range on both sides of the separating line.
- FIG. 5 demonstrates the expression levels of microRNA as detected by microarray in 15 primary brain tumors (squares), 187 primary tumors from other tissues (diamonds), and 50 brain metastases from various tissue origins (circles).
- FIG. 6 demonstrates the identification of metastatic brain tumors using microRNA qRT-PCR data.
- Expression levels of hsa-miR-124 and hsa-miR-9 are higher in brain primary tumors compared to primary tumors from other tissues.
- the expression levels of hsa-miR-124 in metastatic samples span a wide range and are more similar to brain primary tumors; the expression levels of hsa-miR-9 in metastatic samples are more similar to the non-brain primary tumors.
- C RT* 100-[C t (hsa-miR-9*)+C t (hsa-miR-92b)] 39.9, a threshold which was fit to the training set half of the data.
- the test-set samples (dark squares/circles/diamonds) were accurately classified by this threshold, with one outlier.
- the invention is based on the discovery that specific nucleic acids (SEQ ID NOS: 1-44) may be used for the classification of cancers.
- SEQ ID NOS: 1-44 may be used for the classification of cancers.
- the present invention provides a sensitive, specific and accurate method which may be used to distinguish between different tumor origins and between primary and metastatic malignancies.
- Metastatic tumors account for the overwhelming majority of all hepatic malignancies in the non cirrhotic liver. In the cirrhotic liver, however, primary hepatic malignancies are more common than metastatic tumors. Carcinomas of the lung, breast, colon and pancreas are the most common primary sites in adults. The distinction between primary and metastatic malignancy in the liver is of both therapeutic and prognostic significance.
- liver tumor includes hepatocellular carcinoma versus metastatic tumor.
- Primary extrahepatic carcinomas of the stomach and colon and carcinoma of sex cord-stromal tumors of the ovary may closely resemble HCC in both their morphology and immunoexpression of CEA and alfafetoprotein.
- Metastatic tumors tend to recapitulate their appearance in the primary organ and specific tumor types generally maintain consistent cytologic appearance.
- Adenocarcinoma although frequently recognizable as an entity, presents the greatest difficulty for those attempting to make a specific diagnosis as to site of origin.
- microRNA biomarkers to differentiate HCC from metastatic liver tumors of various origins.
- the expression of human microRNAs in 144 hepatic and non-hepatic tumors was examined by microRNA microarray.
- Similar results were obtained by using the combination of hsa-miR-200a (SEQ ID NO: 10) and hsa-miR-200b (SEQ ID NO: 12).
- hsa-miR-141, hsa-miR-200a, hsa-miR-200b and hsa-miR-200c are part of one predicted polycistronic pri-microRNA, in an intronic region of a transcription unit (EST with no ORF) on Chr.12p13.31 (Landgraf, P., et al., Cell, 2007. 129(7): p. 1401-14).
- Hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4) were differentially expressed in primary GI and non-GI tumors (e.g., lung or breast adenocarcinomas) (p-value ⁇ 0.01 for hsa-miR-205 and p-value ⁇ 10 ⁇ -11 for hsa-miR-194), and may serve as potent biomarkers for the identification of tumors of GI origin.
- non-GI tumors e.g., lung or breast adenocarcinomas
- the combination of another pair of microRNAs, hsa-miR-124a (SEQ ID NO: 16) and hsa-miR-219 (SEQ ID NO: 24) was capable of differentiating normal brain and primary brain tumors from other primary tumors (p-value ⁇ 2e-44 for each) or from brain metastases (p-value ⁇ 6e-6 for each).
- the present invention provides diagnostic assays and methods, both quantitative and qualitative for detecting, diagnosing, monitoring, staging and prognosticating cancers by comparing levels of the specific microRNA molecules of the invention. Such levels are preferably measured in at least one of biopsies, tumor samples, cells, tissues and/or bodily fluids, including determination of normal and abnormal levels.
- the present invention provides methods for diagnosing the presence of a specific cancer by analyzing for changes in levels of said microRNA molecules in biopsies, tumor samples, cells, tissues or bodily fluids.
- determining the presence of said microRNA levels in biopsies, tumor samples, cells, tissues or bodily fluid is particularly useful for discriminating between different cancers.
- All the methods of the present invention may optionally include measuring levels of other cancer markers.
- Other cancer markers in addition to said microRNA molecules, useful in the present invention will depend on the cancer being tested and are known to those of skill in the art.
- Assay techniques that can be used to determine levels of gene expression, such as the nucleic acid sequence of the present invention, in a sample derived from a patient are well known to those of skill in the art.
- Such assay methods include, without limitation, radioimmunoassays, reverse transcriptase PCR (RT-PCR) assays, immunohistochemistry assays, in situ hybridization assays, competitive-binding assays, Northern Blot analyses, ELISA assays and biochip analysis.
- correlations and/or hierarchical clustering can be used to assess the similarity of the expression level of the nucleic acid sequences of the invention between a specific sample and different exemplars of cancer samples, by setting an arbitrary threshold for assigning a sample or cancer sample to one of two groups.
- the threshold for assignment is treated as a parameter, which can be used to quantify the confidence with which samples are assigned to each class.
- the threshold for assignment can be scaled to favor sensitivity or specificity, depending on the clinical scenario.
- the correlation value to the reference data generates a continuous score that can be scaled.
- aberrant proliferation means cell proliferation that deviates from the normal, proper, or expected course.
- aberrant cell proliferation may include inappropriate proliferation of cells whose DNA or other cellular components have become damaged or defective.
- Aberrant cell proliferation may include cell proliferation whose characteristics are associated with an indication caused by, mediated by, or resulting in inappropriately high levels of cell division, inappropriately low levels of apoptosis, or both.
- Such indications may be characterized, for example, by single or multiple local abnormal proliferations of cells, groups of cells, or tissue(s), whether cancerous or non-cancerous, benign or malignant.
- “Attached” or “immobilized” as used herein to refer to a probe and a solid support may mean that the binding between the probe and the solid support is sufficient to be stable under conditions of binding, washing, analysis, and removal.
- the binding may be covalent or non-covalent. Covalent bonds may be formed directly between the probe and the solid support or may be formed by a cross linker or by inclusion of a specific reactive group on either the solid support or the probe or both molecules.
- Non-covalent binding may be one or more of electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of a biotinylated probe to the streptavidin. Immobilization may also involve a combination of covalent and non-covalent interactions.
- Bio sample as used herein may mean a sample of biological tissue or fluid that comprises nucleic acids. Such samples include, but are not limited to, tissue or fluid isolated from subjects. Biological samples may also include sections of tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, blood, plasma, serum, sputum, stool, tears, mucus, hair, and skin. Biological samples also include explants and primary and/or transformed cell cultures derived from animal or patient tissues.
- Biological samples may also be blood, a blood fraction, urine, effusions, ascitic fluid, amniotic fluid, saliva, cerebrospinal fluid, cervical secretions, vaginal secretions, endometrial secretions, gastrointestinal secretions, bronchial secretions, sputum, cell line, tissue sample, or secretions from the breast.
- a biological sample may be provided by removing a sample of cells from a subject but can also be accomplished by using previously isolated cells (e.g., isolated by another person, at another time, and/or for another purpose), or by performing the methods described herein in vivo.
- Archival tissues such as those having treatment or outcome history, may also be used.
- cancer is meant to include all types of cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness.
- cancers include but are nor limited to solid tumors and leukemias, including: apudoma, choristoma, branchioma, malignant carcinoid syndrome, carcinoid heart disease, carcinoma (e.g., Walker, basal cell, basosquamous, Brown-Pearce, ductal, Ehrlich tumor, non-small cell lung, oat cell, papillary, bronchiolar, bronchogenic, squamous cell, and transitional cell), histiocytic disorders, leukemia (e.g., B cell, mixed cell, null cell, T cell, T-cell chronic, HTLV-II-associated, lymphocytic acute, lymphocytic chronic, mast cell, and myeloid), histiocytosis malignant,
- classification refers to a procedure and/or algorithm in which individual items are placed into groups or classes based on quantitative information on one or more characteristics inherent in the items (referred to as traits, variables, characters, features, etc) and based on a statistical model and/or a training set of previously labeled items.
- a “classification tree” is a decision tree that places categorical variables into classes.
- Ct signals represent the first cycle of PCR where amplification crosses a threshold (cycle threshold) of fluorescence. Accordingly, low values of Ct represent high abundance or expression levels of the microRNA.
- the PCR Ct signal is normalized such that the normalized Ct remains inversed from the expression level. In other embodiments the PCR Ct signal may be normalized and then inverted such that low normalized-inverted Ct represents low abundance or expression levels of the microRNA.
- “Complement” or “complementary” as used herein to refer to a nucleic acid may mean Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules.
- a full complement or fully complementary may mean 100% complementary base pairing between nucleotides or nucleotide analogs of nucleic acid molecules.
- a “data processing routine” refers to a process that can be embodied in software that determines the biological significance of acquired data (i.e., the ultimate results of an assay or analysis). For example, the data processing routine can make determination of tissue of origin based upon the data collected. In the systems and methods herein, the data processing routine can also control the data collection routine based upon the results determined. The data processing routine and the data collection routines can be integrated and provide feedback to operate the data acquisition, and hence provide assay-based judging methods.
- data set refers to numerical values obtained from the analysis, These numerical values associated with analysis may be values such as peak height and area under the curve.
- data structure refers to a combination of two or more data sets, applying one or more mathematical manipulations to one or more data sets to obtain one or more new data sets, or manipulating two or more data sets into a form that provides a visual illustration of the data in a new way.
- An example of a data structure prepared from manipulation of two or more data sets would be a hierarchical cluster.
- Detection means detecting the presence of a component in a sample. Detection also means detecting the absence of a component. Detection also means measuring the level of a component, either quantitatively or qualitatively.
- “Differential expression” means qualitative or quantitative differences in the temporal and/or cellular gene expression patterns within and among cells and tissue.
- a differentially expressed gene may qualitatively have its expression altered, including an activation or inactivation, in, e.g., normal versus disease tissue.
- Genes may be turned on or turned off in a particular state, relative to another state thus permitting comparison of two or more states.
- a qualitatively regulated gene may exhibit an expression pattern within a state or cell type which may be detectable by standard techniques. Some genes may be expressed in one state or cell type, but not in both.
- the difference in expression may be quantitative, e.g., in that expression is modulated, either up-regulated, resulting in an increased amount of transcript, or down-regulated, resulting in a decreased amount of transcript.
- the degree to which expression differs need only be large enough to quantify via standard characterization techniques such as expression arrays, quantitative reverse transcriptase PCR, northern analysis, real-time PCR, in situ hybridization and RNase protection.
- expression profile is used broadly to include a genomic expression profile, e.g., an expression profile of microRNAs. Profiles may be generated by any convenient means for determining a level of a nucleic acid sequence e.g. quantitative hybridization of microRNA, labeled microRNA, amplified microRNA, cRNA, etc., quantitative PCR, ELISA for quantitation, and the like, and allow the analysis of differential gene expression between two samples.
- a subject or patient tumor sample e.g., cells or collections thereof, e.g., tissues, is assayed. Samples are collected by any convenient method, as known in the art.
- Nucleic acid sequences of interest are nucleic acid sequences that are found to be predictive, including the nucleic acid sequences provided above, where the expression profile may include expression data for 5, 10, 20, 25, 50, 100 or more of, including all of the listed nucleic acid sequences.
- expression profile means measuring the abundance of the nucleic acid sequences in the measured samples.
- “Expression ratio” as used herein refers to relative expression levels of two or more nucleic acids as determined by detecting the relative expression levels of the corresponding nucleic acids in a biological sample.
- “Fragment” is used herein to indicate a non-full length part of a nucleic acid. Thus, a fragment is itself also a nucleic acid.
- Gene used herein may be a natural (e.g., genomic) or synthetic gene comprising transcriptional and/or translational regulatory sequences and/or a coding region and/or non-translated sequences (e.g., introns, 5′- and 3′-untranslated sequences).
- the coding region of a gene may be a nucleotide sequence coding for an amino acid sequence or a functional RNA, such as tRNA, rRNA, catalytic RNA, siRNA, miRNA or antisense RNA.
- a gene may also be an mRNA or cDNA corresponding to the coding regions (e.g., exons and miRNA) optionally comprising 5′- or 3′-untranslated sequences linked thereto.
- a gene may also be an amplified nucleic acid molecule produced in vitro comprising all or a part of the coding region and/or 5′- or 3′-untranslated sequences linked thereto.
- “Groove binder” and/or “minor groove binder” may be used interchangeably and refer to small molecules that fit into the minor groove of double-stranded DNA, typically in a sequence-specific manner.
- Minor groove binders may be long, flat molecules that can adopt a crescent-like shape and thus, fit snugly into the minor groove of a double helix, often displacing water.
- Minor groove binding molecules may typically comprise several aromatic rings connected by bonds with torsional freedom such as furan, benzene, or pyrrole rings.
- Minor groove binders may be antibiotics such as netropsin, distamycin, berenil, pentamidine and other aromatic diamidines, Hoechst 33258, SN 6999, aureolic anti-tumor drugs such as chromomycin and mithramycin, CC-1065, dihydrocyclopyrroloindole tripeptide (DPI 3 ), 1,2-dihydro-(3H)-pyrrolo[3,2-e]indole-7-carboxylate (CDPI 3 ), and related compounds and analogues, including those described in Nucleic Acids in Chemistry and Biology, 2d ed., Blackburn and Gait, eds., Oxford University Press, 1996, and PCT Published Application No.
- antibiotics such as netropsin, distamycin, berenil, pentamidine and other aromatic diamidines, Hoechst 33258, SN 6999, aureolic anti-tumor drugs such as chromomycin and
- a minor groove binder may be a component of a primer, a probe, a hybridization tag complement, or combinations thereof. Minor groove binders may increase the T m of the primer or a probe to which they are attached, allowing such primers or probes to effectively hybridize at higher temperatures.
- Host cell used herein may be a naturally occurring cell or a transformed cell that may contain a vector and may support replication of the vector.
- Host cells may be cultured cells, explants, cells in vivo, and the like.
- Host cells may be prokaryotic cells such as E. coli , or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells, such as CHO and HeLa.
- “Identical” or “identity” as used herein in the context of two or more nucleic acids or polypeptide sequences may mean that the sequences have a specified percentage of residues that are the same over a specified region. The percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity.
- the residues of single sequence are included in the denominator but not the numerator of the calculation.
- thymine (T) and uracil (U) may be considered equivalent.
- Identity may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.
- “In situ detection” as used herein means the detection of expression or expression levels in the original site hereby meaning in a tissue sample such as biopsy.
- k-nearest neighbor refers to a classification method that classifies a point by calculating the distances between the point and points in the training data set. Then it assigns the point to the class that is most common among its k-nearest neighbors (where k is an integer).
- Label as used herein may mean a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical means.
- useful labels include 32 P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and other entities which can be made detectable.
- a label may be incorporated into nucleic acids and proteins at any position.
- liver cancer means malignancy of the liver, either a primary cancer or metastasized cancer.
- liver cancer includes, but is not limited to, cancer arising from hepatocytes, such as, for example, hepatomas and hepatocellular carcinomas; fibrolamellar; and cholangiocarcinomas (or bile duct cancer).
- Logistic regression is part of a category of statistical models called generalized linear models. Logistic regression can allows one to predict a discrete outcome, such as group membership, from a set of variables that may be continuous, discrete, dichotomous, or a mix of any of these. The dependent or response variable can be dichotomous, for example, one of two possible types of cancer. Logistic regression models the natural log of the odds ratio, i.e. the ratio of the probability of belonging to the first group (P) over the probability of belonging to the second group (1-P), as a linear combination of the different expression levels (in log-space).
- the logistic regression output can be used as a classifier by prescribing that a case or sample will be classified into the first type is P is greater than 0.5 or 50%. Alternatively, the calculated probability P can be used as a variable in other contexts such as a 1D or 2D threshold classifier.
- “1D/2D threshold classifier” used herein may mean an algorithm for classifying a case or sample such as a cancer sample into one of two possible types such as two types of cancer.
- the decision is based on one variable and one predetermined threshold value; the sample is assigned to one class if the variable exceeds the threshold and to the other class if the variable is less than the threshold.
- a 2D threshold classifier is an algorithm for classifying into one of two types based on the values of two variables.
- a threshold may be calculated as a function (usually a continuous or even a monotonic function) of the first variable; the decision is then reached by comparing the second variable to the calculated threshold, similar to the 1D threshold classifier.
- Metalastasis means the process by which cancer spreads from the place at which it first arose as a primary tumor to other locations in the body.
- the metastatic progression of a primary tumor reflects multiple stages, including dissociation from neighboring primary tumor cells, survival in the circulation, and growth in a secondary location.
- Nucleic acid or “oligonucleotide” or “polynucleotide” used herein may mean at least two nucleotides covalently linked together.
- the depiction of a single strand also defines the sequence of the complementary strand.
- a nucleic acid also encompasses the complementary strand of a depicted single strand.
- Many variants of a nucleic acid may be used for the same purpose as a given nucleic acid.
- a nucleic acid also encompasses substantially identical nucleic acids and complements thereof.
- a single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions.
- a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions.
- Nucleic acids may be single stranded or double stranded, or may contain portions of both double stranded and single stranded sequence.
- the nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine and isoguanine Nucleic acids may be obtained by chemical synthesis methods or by recombinant methods.
- a nucleic acid will generally contain phosphodiester bonds, although nucleic acid analogs may be included that may have at least one different linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages and peptide nucleic acid backbones and linkages.
- Other analog nucleic acids include those with positive backbones; non-ionic backbones, and non-ribose backbones, including those described in U.S. Pat. Nos. 5,235,033 and 5,034,506, which are incorporated by reference.
- Nucleic acids containing one or more non-naturally occurring or modified nucleotides are also included within one definition of nucleic acids.
- the modified nucleotide analog may be located for example at the 5′-end and/or the 3′-end of the nucleic acid molecule.
- Representative examples of nucleotide analogs may be selected from sugar- or backbone-modified ribonucleotides. It should be noted, however, that also nucleobase-modified ribonucleotides, i.e. ribonucleotides, containing a non-naturally occurring nucleobase instead of a naturally occurring nucleobase such as uridines or cytidines modified at the 5-position, e.g.
- the 2′-OH-group may be replaced by a group selected from H, OR, R, halo, SH, SR, NH 2 , NHR, NR 2 or CN, wherein R is C 1 -C 6 alkyl, alkenyl or alkynyl and halo is F, Cl, Br or I.
- Modified nucleotides also include nucleotides conjugated with cholesterol through, e.g., a hydroxyprolinol linkage as described in Krutzfeldt et al., Nature 438:685-689 (2005), Soutschek et al., Nature 432:173-178 (2004), and U.S. Patent Publication No. 20050107325, which are incorporated herein by reference.
- Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments, to enhance diffusion across cell membranes, or as probes on a biochip.
- the backbone modification may also enhance resistance to degradation, such as in the harsh endocytic environment of cells.
- the backbone modification may also reduce nucleic acid clearance by hepatocytes, such as in the liver and kidney. Mixtures of naturally occurring nucleic acids and analogs may be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.
- Probe as used herein may mean an oligonucleotide capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. Probes may bind target sequences lacking complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. There may be any number of base pair mismatches which will interfere with hybridization between the target sequence and the single stranded nucleic acids described herein. However, if the number of mutations is so great that no hybridization can occur under even the least stringent of hybridization conditions, the sequence is not a complementary target sequence.
- a probe may be single stranded or partially single and partially double stranded. The strandedness of the probe is dictated by the structure, composition, and properties of the target sequence. Probes may be directly labeled or indirectly labeled such as with biotin to which a streptavidin complex may later bind.
- the phrase “reference expression profile” refers to a criterion expression value to which measured values are compared in order to determine the detection of a subject with lung cancer.
- the reference expression profile may be based on the abundance of the nucleic acids, or may be based on a combined metric score thereof.
- sensitivity used herein may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example how frequently it correctly classifies a cancer into the correct type out of two possible types.
- the sensitivity for class A is the proportion of cases that are determined to belong to class “A” by the test out of the cases that are in class “A”, as determined by some absolute or gold standard.
- “specificity” used herein may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example how frequently it correctly classifies a cancer into the correct type out of two possible types.
- the sensitivity for class A is the proportion of cases that are determined to belong to class “not A” by the test out of the cases that are in class “not A”, as determined by some absolute or gold standard.
- Stringent hybridization conditions used herein may mean conditions under which a first nucleic acid sequence (e.g., probe) will hybridize to a second nucleic acid sequence (e.g., target), such as in a complex mixture of nucleic acids. Stringent conditions are sequence-dependent and will be different in different circumstances. Stringent conditions may be selected to be about 5-10° C. lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The T m may be the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are occupied at equilibrium).
- T m thermal melting point
- Stringent conditions may be those in which the salt concentration is less than about 1.0 M sodium ion, such as about 0.01-1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., about 10-50 nucleotides) and at least about 60° C. for long probes (e.g., greater than about 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive signal may be at least 2 to 10 times background hybridization.
- Exemplary stringent hybridization conditions include the following: 50% formamide, 5 ⁇ SSC, and 1% SDS, incubating at 42° C., or, 5 ⁇ SSC, 1% SDS, incubating at 65° C., with wash in 0.2 ⁇ SSC, and 0.1% SDS at 65° C.
- “Substantially complementary” used herein may mean that a first sequence is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical to the complement of a second sequence over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more nucleotides, or that the two sequences hybridize under stringent hybridization conditions.
- “Substantially identical” used herein may mean that a first and second sequence are at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more nucleotides or amino acids, or with respect to nucleic acids, if the first sequence is substantially complementary to the complement of the second sequence.
- the term “subject” refers to a mammal, including both human and other mammals.
- the methods of the present invention are preferably applied to human subjects.
- Target as used herein may mean a polynucleotide that may be bound by one or more probes under stringent hybridization conditions.
- threshold expression profile refers to a criterion expression profile to which measured values are compared in order to classify a cancer.
- tissue sample is tissue obtained from a tissue biopsy using methods well known to those of ordinary skill in the related medical arts.
- the phrase “suspected of being cancerous” as used herein means a cancer tissue sample believed by one of ordinary skill in the medical arts to contain cancerous cells. Methods for obtaining the sample from the biopsy include gross apportioning of a mass, microdissection, laser-based microdissection, or other art-known cell-separation methods.
- Tumor refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.
- “Variant” used herein to refer to a nucleic acid may mean (i) a portion of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto.
- wild type sequence refers to a coding, non-coding or interface sequence is an allelic form of sequence that performs the natural or normal function for that sequence. Wild type sequences include multiple allelic forms of a cognate sequence, for example, multiple alleles of a wild type sequence may encode silent or conservative changes to the protein sequence that a coding sequence encodes.
- the present invention employs miRNAs and related nucleic acids for the identification, classification and diagnosis of specific cancers.
- a gene coding for a miRNA may be transcribed leading to production of a miRNA primary transcript known as the pri-miRNA.
- the pri-miRNA may comprise a hairpin with a stem and loop.
- the stem of the hairpin may comprise mismatched bases.
- the pri-miRNA may comprise several hairpins in a polycistronic structure.
- the hairpin structure of the pri-miRNA may be recognized by Drosha, which is an RNase III endonuclease. Drosha may recognize terminal loops in the pri-miRNA and cleave approximately two helical turns into the stem to produce a 60-70 nt precursor known as the pre-miRNA. Drosha may cleave the pri-miRNA with a staggered cut typical of RNase III endonucleases yielding a pre-miRNA stem loop with a 5′ phosphate and ⁇ 2 nucleotide 3′ overhang. Approximately one helical turn of stem ( ⁇ 10 nucleotides) extending beyond the Drosha cleavage site may be essential for efficient processing. The pre-miRNA may then be actively transported from the nucleus to the cytoplasm by Ran-GTP and the export receptor Ex-portin-5.
- the pre-miRNA may be recognized by Dicer, which is also an RNase III endonuclease. Dicer may recognize the double-stranded stem of the pre-miRNA. Dicer may also off the terminal loop two helical turns away from the base of the stem loop leaving an additional 5′ phosphate and ⁇ 2 nucleotide 3′ overhang.
- the resulting siRNA-like duplex which may comprise mismatches, comprises the mature miRNA and a similar-sized fragment known as the miRNA*.
- the miRNA and miRNA* may be derived from opposing arms of the pri-miRNA and pre-miRNA. mRNA* sequences may be found in libraries of cloned miRNAs but typically at lower frequency than the miRNAs.
- RISC RNA-induced silencing complex
- the miRNA* When the miRNA strand of the miRNA:miRNA* duplex is loaded into the RISC, the miRNA* may be removed and degraded.
- the strand of the miRNA:miRNA* duplex that is loaded into the RISC may be the strand whose 5′ end is less tightly paired. In cases where both ends of the miRNA:miRNA* have roughly equivalent 5′ pairing, both miRNA and miRNA* may have gene silencing activity.
- the RISC may identify target nucleic acids based on high levels of complementarity between the miRNA and the mRNA, especially by nucleotides 2-7 of the miRNA. Only one case has been reported in animals where the interaction between the miRNA and its target was along the entire length of the miRNA. This was shown for mir-196 and Hox B8 and it was further shown that mir-196 mediates the cleavage of the Hox B8 mRNA (Yekta et al 2004, Science 304-594). Otherwise, such interactions are known only in plants (Bartel & Bartel 2003, Plant Physiol 132-709).
- the target sites in the mRNA may be in the 5′ UTR, the 3′ UTR or in the coding region.
- multiple miRNAs may regulate the same mRNA target by recognizing the same or multiple sites.
- the presence of multiple miRNA binding sites in most genetically identified targets may indicate that the cooperative action of multiple RISCs provides the most efficient translational inhibition.
- miRNAs may direct the RISC to downregulate gene expression by either of two mechanisms: mRNA cleavage or translational repression.
- the miRNA may specify cleavage of the mRNA if the mRNA has a certain degree of complementarity to the miRNA. When a miRNA guides cleavage, the cut may be between the nucleotides pairing to residues 10 and 11 of the miRNA.
- the miRNA may repress translation if the miRNA does not have the requisite degree of complementarity to the miRNA. Translational repression may be more prevalent in animals since animals may have a lower degree of complementarity between the miRNA and binding site.
- any pair of miRNA and miRNA* there may be variability in the 5′ and 3′ ends of any pair of miRNA and miRNA*. This variability may be due to variability in the enzymatic processing of Drosha and Dicer with respect to the site of cleavage. Variability at the 5′ and 3′ ends of miRNA and miRNA* may also be due to mismatches in the stem structures of the pri-miRNA and pre-miRNA. The mismatches of the stem strands may lead to a population of different hairpin structures. Variability in the stem structures may also lead to variability in the products of cleavage by Drosha and Dicer.
- nucleic acids are provided herein.
- the nucleic acid may comprise the sequence of SEQ ID NOS: 1-44 or variants thereof.
- the variant may be a complement of the referenced nucleotide sequence.
- the variant may also be a nucleotide sequence that is substantially identical to the referenced nucleotide sequence or the complement thereof.
- the variant may also be a nucleotide sequence which hybridizes under stringent conditions to the referenced nucleotide sequence, complements thereof, or nucleotide sequences substantially identical thereto.
- the nucleic acid may have a length of from 10 to 250 nucleotides.
- the nucleic acid may have a length of at least 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200 or 250 nucleotides.
- the nucleic acid may be synthesized or expressed in a cell (in vitro or in vivo) using a synthetic gene described herein.
- the nucleic acid may be synthesized as a single strand molecule and hybridized to a substantially complementary nucleic acid to form a duplex.
- the nucleic acid may be introduced to a cell, tissue or organ in a single- or double-stranded form or capable of being expressed by a synthetic gene using methods well known to those skilled in the art, including as described in U.S. Pat. No. 6,506,559 which is incorporated by reference.
- the nucleic acid may further comprise one or more of the following: a peptide, a protein, a RNA-DNA hybrid, an antibody, an antibody fragment, a Fab fragment, and an aptamer.
- the nucleic acid may comprise a sequence of a pri-miRNA or a variant thereof.
- the pri-miRNA sequence may comprise from 45-30,000, 50-25,000,100-20,000, 1,000-1,500 or 80-100 nucleotides.
- the sequence of the pri-miRNA may comprise a pre-miRNA, miRNA and miRNA*, as set forth herein, and variants thereof.
- the sequence of the pri-miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- the pri-miRNA may comprise a hairpin structure.
- the hairpin may comprise a first and second nucleic acid sequence that are substantially complimentary.
- the first and second nucleic acid sequence may be from 37-50 nucleotides.
- the first and second nucleic acid sequence may be separated by a third sequence of from 8-12 nucleotides.
- the hairpin structure may have a free energy less than ⁇ 25 Kcal/mole as calculated by the Vienna algorithm with default parameters, as described in Hofacker et al., Monatshefte f. Chemie 125: 167-188 (1994), the contents of which are incorporated herein.
- the hairpin may comprise a terminal loop of 4-20, 8-12 or 10 nucleotides.
- the pri-miRNA may comprise at least 19% adenosine nucleotides, at least 16% cytosine nucleotides, at least 23% thymine nucleotides and at least 19% guanine nucleotides.
- the nucleic acid may also comprise a sequence of a pre-miRNA or a variant thereof.
- the pre-miRNA sequence may comprise from 45-90, 60-80 or 60-70 nucleotides.
- the sequence of the pre-miRNA may comprise a miRNA and a miRNA* as set forth herein.
- the sequence of the pre-miRNA may also be that of a pri-miRNA excluding from 0-160 nucleotides from the 5′ and 3′ ends of the pri-miRNA.
- the sequence of the pre-miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- the nucleic acid may also comprise a sequence of a miRNA (including miRNA*) or a variant thereof.
- the miRNA sequence may comprise from 13-33, 18-24 or 21-23 nucleotides.
- the miRNA may also comprise a total of at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39 or 40 nucleotides.
- the sequence of the miRNA may be the first 13-33 nucleotides of the pre-miRNA.
- the sequence of the miRNA may also be the last 13-33 nucleotides of the pre-miRNA.
- the sequence of the miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- a probe is also provided comprising a nucleic acid described herein. Probes may be used for screening and diagnostic methods, as outlined below. The probe may be attached or immobilized to a solid substrate, such as a biochip.
- the probe may have a length of from 8 to 500, 10 to 100 or 20 to 60 nucleotides.
- the probe may also have a length of at least 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280 or 300 nucleotides.
- the probe may further comprise a linker sequence of from 10-60 nucleotides.
- the probe may comprise a nucleic acid that is complementary to a sequence selected from the group consisting of SEQ ID NOS: 1-27, 32-33; a fragment thereof, and a sequence having at least about 80% identity thereto.
- a biochip is also provided.
- the biochip may comprise a solid substrate comprising an attached probe or plurality of probes described herein.
- the probes may be capable of hybridizing to a target sequence under stringent hybridization conditions.
- the probes may be attached at spatially defined addresses on the substrate. More than one probe per target sequence may be used, with either overlapping probes or probes to different sections of a particular target sequence.
- the probes may be capable of hybridizing to target sequences associated with a single disorder appreciated by those in the art.
- the probes may either be synthesized first, with subsequent attachment to the biochip, or may be directly synthesized on the biochip.
- the solid substrate may be a material that may be modified to contain discrete individual sites appropriate for the attachment or association of the probes and is amenable to at least one detection method.
- substrates include glass and modified or functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses and plastics.
- the substrates may allow optical detection without appreciably fluorescing.
- the substrate may be planar, although other configurations of substrates may be used as well. For example, probes may be placed on the inside surface of a tube, for flow-through sample analysis to minimize sample volume.
- the substrate may be flexible, such as a flexible foam, including closed cell foams made of particular plastics.
- the biochip and the probe may be derivatized with chemical functional groups for subsequent attachment of the two.
- the biochip may be derivatized with a chemical functional group including, but not limited to, amino groups, carboxyl groups, oxo groups or thiol groups.
- the probes may be attached using functional groups on the probes either directly or indirectly using a linker.
- the probes may be attached to the solid support by either the 5′ terminus, 3′ terminus, or via an internal nucleotide.
- the probe may also be attached to the solid support non-covalently.
- biotinylated oligonucleotides can be made, which may bind to surfaces covalently coated with streptavidin, resulting in attachment.
- probes may be synthesized on the surface using techniques such as photopolymerization and photolithography.
- diagnosis refers to classifying a pathology or a symptom, determining a severity of the pathology (grade or stage), monitoring pathology progression, forecasting an outcome of a pathology and/or prospects of recovery.
- the phrase “subject in need thereof” refers to an animal or human subject who is known to have cancer, at risk of having cancer [e.g., a genetically predisposed subject, a subject with medical and/or family history of cancer, a subject who has been exposed to carcinogens, occupational hazard, environmental hazard] and/or a subject who exhibits suspicious clinical signs of cancer [e.g., blood in the stool or melena, unexplained pain, sweating, unexplained fever, unexplained loss of weight up to anorexia, changes in bowel habits (constipation and/or diarrhea), tenesmus (sense of incomplete defecation, for rectal cancer specifically), anemia and/or general weakness].
- cancer e.g., a genetically predisposed subject, a subject with medical and/or family history of cancer, a subject who has been exposed to carcinogens, occupational hazard, environmental hazard
- a subject who exhibits suspicious clinical signs of cancer e.g.
- the subject in need thereof can be a healthy human subject undergoing a routine well-being check up.
- the subject has a primary tumor.
- the subject has metastatic cancer.
- the subject has cancer of unknown primary (CUP).
- Analyzing presence of malignant or pre-malignant cells can be effected in-vivo or ex-vivo, whereby a biological sample (e.g., biopsy) is retrieved.
- a biological sample e.g., biopsy
- Such biopsy samples comprise cells and may be an incisional or excisional biopsy. Alternatively the cells may be retrieved from a complete resection.
- treatment regimen refers to a treatment plan that specifies the type of treatment, dosage, schedule and/or duration of a treatment provided to a subject in need thereof (e.g., a subject diagnosed with a pathology).
- the selected treatment regimen can be an aggressive one which is expected to result in the best clinical outcome (e.g., complete cure of the pathology) or a more moderate one which may relieve symptoms of the pathology yet results in incomplete cure of the pathology. It will be appreciated that in certain cases the treatment regimen may be associated with some discomfort to the subject or adverse side effects (e.g., a damage to healthy cells or tissue).
- the type of treatment can include a surgical intervention (e.g., removal of lesion, diseased cells, tissue, or organ), a cell replacement therapy, an administration of a therapeutic drug (e.g., receptor agonists, antagonists, hormones, chemotherapy agents) in a local or a systemic mode, an exposure to radiation therapy using an external source (e.g., external beam) and/or an internal source (e.g., brachytherapy) and/or any combination thereof.
- a surgical intervention e.g., removal of lesion, diseased cells, tissue, or organ
- a cell replacement therapy e.g., an administration of a therapeutic drug (e.g., receptor agonists, antagonists, hormones, chemotherapy agents) in a local or a systemic mode
- an exposure to radiation therapy using an external source e.g., external beam
- an internal source e.g., brachytherapy
- the dosage, schedule and duration of treatment can vary, depending on the severity of pathology and the selected type of treatment, and those
- a method of diagnosis comprises detecting an expression level of a specific cancer-associated nucleic acid in a biological sample.
- the sample may be derived from a patient. Diagnosis of a specific cancer state in a patient may allow for prognosis and selection of therapeutic strategy. Further, the developmental stage of cells may be classified by determining temporarily expressed specific cancer-associated nucleic acids.
- In situ hybridization of labeled probes to tissue arrays may be performed.
- the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is further understood that the nucleic acid sequence which indicate the diagnosis may differ from those which indicate the prognosis and molecular profiling of the condition of the cells may lead to distinctions between responsive or refractory conditions or may be predictive of outcomes.
- kits may comprise a nucleic acid described herein together with any or all of the following: assay reagents, buffers, probes and/or primers, and sterile saline or another pharmaceutically acceptable emulsion and suspension base.
- the kits may include instructional materials containing directions (e.g., protocols) for the practice of the methods described herein.
- the kit may further comprise a software package for data analysis of expression profiles.
- the kit may be a kit for the amplification, detection, identification or quantification of a target nucleic acid sequence.
- the kit may comprise a poly(T) primer, a forward primer, a reverse primer, and a probe.
- astrocytomas glioblastoma multiforme
- Standard treatments include cytoreductive surgery followed by radiation therapy or chemotherapy. There is no cure, and virtually all patients ultimately succumb to recurrent or progressive disease.
- the overall survival for grade IV astrocytomas is poor, with 50% of patients dying in the first year after diagnosis.
- brain tumors were directly compared to a wide range of epithelial tumors and metastases to the brain.
- microarray data it was found that elevated expression of just two microRNAs, hsa-miR-92b (SEQ ID NO: 14) and hsa-miR-9*(SEQ ID NO: 20), is sufficient to distinguish brain primary tumors from tumors derived from non-brain tissues, and most significantly for diagnostic purposes, from metastases located in the brain.
- This assay was translated to a qRT-PCR platform, using additional samples as a training set to develop a classifier.
- Liver cancer Primary liver cancer is the fifth most common cancer worldwide. Hepatocellular carcinoma (HCC) accounts for 80% of all liver cancer and the rates of HCC have increased by over 70% in the last two decades in the U.S. The fatality ratio (mortality/incidence) of liver cancer is approximately 1, indicating that the majority of patients live less than a year. Late diagnosis due to lack of clinical symptoms is one of the main reasons for the high fatality ratio. Liver cancer can result from both viral infection and chemical exposure. Known risk factors include hepatitis B and C virus infection. It is not known whether distinct routes to liver cancer affect the same or different cellular pathways. No mutational model has yet been developed for liver cancer as it has been for other cancers. The molecular events that precede neoplastic transformation of the liver are not well understood. With no clearly identified cause, successful treatment options are lacking.
- liver metastasis the most common cause of death from colorectal cancer is liver metastasis.
- liver metastases Up to 50% of liver metastases are of colorectal cancer origin, while the remainder metastasizes from a wide variety of primary cancer sites including sarcomas, breast and kidney, as well as neuroendocrine tumors.
- HCC may be solitary or multicentric, and it may mimic liver metastases. Furthermore hemangiomas and liver metastases are often confused in imaging methods. In general, the imaging appearances of liver metastases are nonspecific, and biopsy specimens are required for histological diagnosis. Various biochemical markers have been proposed to indicate liver metastases. However, the diagnostic accuracy of tumor markers has not yet been defined.
- FFPE formalin-fixed paraffin embedded
- FFPE samples were evaluated by a pathologist for histological type, grade and tumor percentages based on hematoxilin-eosin (H&E) stained slides, performed on the first and/or last sections of the sample.
- H&E hematoxilin-eosin
- Custom microRNA microarrays were produced by printing DNA oligonucleotide probes representing ⁇ 650 DNA oligonucleotide probes representing microRNAs (Sanger database, version 9 and additional Rosetta validated and predicted miRNAs). Each probe, printed in triplicate, carries up to 22-nt linker at the 3′ end of the microRNA's complement sequence in addition to an amine group used to couple the probes to coated glass slides. 20 ⁇ M of each probe were dissolved in 2 ⁇ SSC+0.0035% SDS and spotted in triplicate on Slide E coated microarray slides (Schott Nexterion, Mainz, Germany) using the BioRobotics MicroGrid II microarrater (Genomic Solutions, Ann Arbor, Mich.) according to the manufacturer's directions.
- RNA samples were designed using the sense sequences of different microRNAs.
- Two groups of positive control probes were designed to hybridize to miRdicatorTM array (i) synthetic small RNA were spiked to the RNA before labeling to verify the labeling efficiency and (ii) probes for abundant small RNA (e.g. small nuclear RNAs (U43, U49, U24, Z30, U6, U48, U44), 5.8s and 5s ribosomal RNA) are spotted on the array to verify RNA quality.
- the slides were blocked in a solution containing 50 mM ethanolamine, 1M Tris (pH9.0) and 0.1% SDS for 20 min at 50° C., then thoroughly rinsed with water and spun dry.
- RNA-linker Up to 5 ⁇ g (mean: 4.5 ⁇ g) of total RNA were labeled by ligation of an RNA-linker, p-rCrU-Cy/dye (Dharmacon, Lafayette, Colo.), to the 3′-end with Cy3 or Cy5.
- the labeling reaction contained total RNA, spikes (0.1-20 fmoles), 300 ng RNA-linker-dye, 15% DMSO, 1 ⁇ ligase buffer and 20 units of T4 RNA ligase (NEB New England Biolabs, Ipswich, Mass.) and proceeded at 4° C. for 1 hr followed by 1 hr at 37° C.
- Arrays were scanned using the Agilent DNA Microarray Scanner Bundle (Agilent Technologies, Santa Clara, Calif.) at resolution of 10 ⁇ m at 100% power. Array images were analyzed using the SpotReader software (Niles Scientific, Portola Valley, Calif.).
- the initial data set consisted of signals measured for multiple probes for every sample. Triplicate spots were combined to one signal by taking the logarithmic mean of the reliable spots. All data was log-transformed (natural base) and the analysis was performed in log-space.
- a reference data vector for normalization R was calculated by taking the median expression level for each probe across all samples in each dataset. For each sample data vector S, a 2nd degree polynomial F was found so as to provide the best fit between the sample data and the reference data, such that R ⁇ F(S). Remote data points (“outliers”) were not used for fitting the polynomial F.
- Data in FIGS. 1 and 2 was translated back to linear-space by taking the exponent.
- PAP poly (A) polymerase
- An oligo-dT primer harboring a consensus sequence was used for reverse transcription reaction. The primer is first annealed to the poly A-RNA and then subjected to a reverse transcription reaction of SuperScript II RT (Invitrogen).
- the cDNA was then amplified by real-time PCR reaction, using a miRNA-specific forward primer, TaqMan probe and universal reverse primer. The reactions were incubated for 10 min. at 95° C. followed by 42 cycles of 95° C. for 15 s and 60° C. for 1 min. Normalizing the C t values (per sample) by the C t of either U6 snRNA, the C t of hsa-miR-24, or their average C t , shifted at most one sample from each side in the test-set classification predictions.
- the combined metric “L” was defined as the mean log 2 expression of two microRNAs, L ⁇ [log 2(hsa-miR-141 signal)+log 2(hsa-miR-200c signal)]/2. This metric could be used to identify the non-liver primary tumor samples with near-perfect accuracy.
- the receiver operating characteristic curve (ROC curve) plots the sensitivity against one minus the specificity, and is a measure of classification performance.
- a random classifier has an area under the curve (AUC) of 0.5, and an optimal classifier with perfect sensitivity and specificity of 100% has an area of 1.
- the ROC curve of the combination of hsa-miR-141 and hsa-miR-200c has an AUC of 0.999.
- the decision rule “classify as non-liver when L>10” identified samples as non-liver primary tumors with a sensitivity of 98% and a specificity of 100%, with one pancreatic exocrine adenocarcinoma incorrectly classified as HCC.
- the simple decision rule “classify as GI when the expression of hsa-miR-205 is smaller than half the expression of hsa-miR-194” ( FIG. 2A ) is accurate in all but one case of a stomach primary tumor that is misclassified as non-GI by this rule.
- the metric (expression of hsa-miR-205)/(expression of hsa-miR-194) had an AUC of 0.989.
- miR name miR name MID HID hsa-miR-194 1 2, 3 hsa-miR-205 4 5 hsa-miR-141 6 7 hsa-miR-200c 8 9 hsa-miR-200a 10 11 hsa-miR-200b 12 13 hsa-miR-92b 14 15 hsa-miR-124a 16 17-19 hsa-miR-9* 20 21-23 hsa-miR-219 24 25-26 hsa-miR-9 27 21-23 hsa-miR-128a 28 29 hsa-miR-128b 30 31 hsa-miR- 122a 32 33 miR name: is the miRBase registry name (release 9.1). MID: is the SEQ ID NO of the mature microRNA. HID: is the SEQ ID NO of the microRNA hairpin precursor (Pre
- microRNAs are Able to Distinguish Between Primary Non-Hepatic and Hepatic Tumors
- microRNA expression levels were profiled in 144 tumor samples including 30 primary HCC samples, 63 primary tumors from epithelial origins, 46 liver metastases from epithelial origins, and 5 adenocarcinoma metastases to the liver from unknown origin.
- the primary HCC samples were compared to the other primary tumors and to the liver metastases samples.
- Hsa-miR-122a SEQ ID NO: 32
- this microRNA is also found at high levels in the RNA extracted from liver metastases ( FIG.
- the microRNA family of hsa-miR-200a,b,c (SEQ ID NOS: 8, 10 and 12) and hsa-miR-141 (SEQ ID NO: 6) were identified as strongly expressed in primary tumors from epithelial origins of common liver metastases, but are not expressed in liver primary tumors (p-value ⁇ 1e-11, AUC>0.9 for each).
- microRNAs are not expressed in the liver background, they are also useful in distinguishing between primary HCC tumors and metastatic tumors to the liver (p-value ⁇ 1e-11, AUC>0.9 for each).
- the expression level in the liver metastases is similar to the expression level in the non-HCC primary tumors, in sharp contrast to their expression level in the liver primary tumors ( FIG. 1A ).
- microRNAs Of this family of microRNAs, the strongest effect was found for hsa-miR-200c and hsa-miR-141 ( FIG. 1B ). Each of these microRNAs could be used to distinguish between primary HCC tumors and metastatic tumors to the liver with very high accuracy (AUC>0.98).
- the expression level of these two microRNAs can be combined to create a powerful classifier.
- the combined metric L is defined as the sum of the logarithm (base 2) of the signals of the two microRNAs, providing robustness to the classifier by adding two signals.
- a more conservative classifier can be defined by allowing a margin for uncertainty, of factor 4 above or below the threshold (equivalent to two cycles in qRT-PCR measurements).
- the classification rule “classify as HCC if L ⁇ 16, classify as non-HCC if L>20, leave unidentified if 16 ⁇ L ⁇ 20” leaves 7 samples of 144 ( ⁇ 5%) unclassified, and correctly classifies all other samples, including 5 cases of metastatic liver adenocarcinoma of unknown origin ( FIG. 1B ).
- FIG. 3A-B Similar results were obtained by using the combination of hsa-miR-200a (SEQ ID NO: 10) and hsa-miR-200b (SEQ ID NO: 12) ( FIG. 3A-B , p-value ⁇ 2*10 ⁇ -12 for each comparing hepatocellular carcinoma samples to non-hepatic primary tumor samples or to metastatic liver adenocarcinoma of a known origin or to both together).
- HCC primary liver tumors vs.: Compared to other Compared to metastases Compared to both other primary tumors in liver primary and metastases microRNA or metric: p- fold- p- fold- p- fold- value ⁇ chng ⁇ AUC value ⁇ chng ⁇ AUC value ⁇ chng ⁇ AUC hsa-miR-122 ⁇ 1.4E ⁇ 38 91.1 1.0000 2.8E ⁇ 01 1.1 0.5623 6.3E ⁇ 09 56.7 0.8193 hsa-miR-200a 4.8E ⁇ 13 1/5.7 0.9185 7.3E ⁇ 14 1/4.8 0.9492 1.3E ⁇ 17 1/5.1 0.9277 hsa-miR-200b 1.5E ⁇ 15 1/5.7 0.9339 1.9E ⁇ 12 1/4.3 0.9297 2.7E ⁇ 19 1/4.6 0.9249 hsa-mi
- ⁇ “fold-chng” is the fold change, calculated by the median signal in HCC divided by the median signal in other tissues. For hsa-miR-122a, fold change is greater than 1 indicating a higher expression in HCC. For all other rows, signal is lower in HCC.
- fold-chng is the fold change, calculated by the median signal (2 Ct ) in HCC divided by the median signal (2 Ct ) in other tissues.
- fold change is greater than 1 indicating a higher expression in HCC.
- signal is lower in HCC.
- microRNAs are Able to Distinguish Between Primary GI Tumors and Non-GI Primary Tumors
- microRNA expression can provide further information on the possible origin of liver metastases.
- Another pair of microRNAs, hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4) had significant different expression (p-value ⁇ 1e-12 for each) in primary tumors from gastrointestinal (GI) origin (14 colon, 5 pancreas, 5 stomach) compared to primary tumors of non-GI epithelial origin (24 lung, 15 breast).
- GI gastrointestinal
- non-GI epithelial origin 24 lung, 15 breast.
- the ratio of these expression levels could be used to accurately identify primary tumors from non-GI origin: the decision rule “classify as non-GI primary when the expression of hsa-miR-205 is greater than half the expression of hsa-miR-194” ( FIG.
- microRNAs are Able to Distinguish Between Primary Brain Tumors, Brain Metastases, Non-Brain Primary Tumors and Normal Brain Samples
- microRNA expression levels were profiled on a microarray platform in 252 tumor samples including 15 brain primary tumor samples, 187 non-brain primary tumors, 50 brain metastases from various tissue origins and 2 normal brain samples.
- the brain primary tumor samples were compared to the other primary tumor samples, to normal brain samples and to samples of brain-located metastases (Table 6).
- Hsa-miR-124 SEQ ID NO: 16
- a combination of hsa-miR-124 and hsa-miR-219 (SEQ ID NO: 24) (B 0 , see methods) could be used to distinguish brain primary tumors from non-brain primary tumors with 100% accuracy ( FIG. 4A ).
- hsa-miR-124, hsa-miR-219-5p, hsa-miR-128a and hsa-miR-128b hsa-miR-9* (SEQ ID NO: 20) and hsa-miR-92b (SEQ ID NO: 14) are expressed specifically in brain tumors and not expressed in other tumor types ( FIG. 4B , AUC>0.99).
- these two microRNAs also differentiate accurately between brain primary tumors and metastatic tumors located in the brain (p-value ⁇ 3e-18, AUC>0.99 for each) or normal brain samples.
- the classification rule “classify as brain primary if B 1 >20, classify as other if B 1 ⁇ 18, leave unidentified if 18 ⁇ B 1 ⁇ 20” leaves only 2 samples out of 252 ( ⁇ 1%) as unclassified ( FIG. 4B ), and classifies correctly all other samples.
- hsa-miR-124 was not a good candidate for identifying metastatic tumors to the brain.
- hsa-miR-9 SEQ ID NO: 27
- hsa-miR-9* SEQ ID NO: 20
- hsa-miR-92b SEQ ID NO: 14
- Combinations of hsa-miR-92b with either hsa-miR-9 (B RT ) or with hsa-miR-9* (B RT *) were defined by summing their qRT-PCR C t values (see Methods).
- a threshold was selected for classification for each combination using half of the samples as a training set. The classification accuracy was then tested on the second half of the data set which was used as a test set. The classifications on the test set were near perfect with one outlier of 23 samples, reaching 100% accuracy in identifying non-brain primary tumors from brain primary tumors, and 88% sensitivity with 100% specificity in identifying metastatic brain tumors from brain primary tumors, for both B RT ( FIG. 6B ) and B RT *. Indeed, these combinations show significant differences in expression that can be used to classify primary from metastatic brain tumors (Table 7).
- hsa-miR-9/9* and hsa-miR-92b represent new biomarkers that can be used to classify brain malignancies—primary versus secondary.
- Hsa-miR-92b is found on chromosome 1 and differs by only one nucleotide in its first 20 from hsa-miR-92a, a member of the oncogenic miR-17-92 cluster.
- the expression patterns of hsa-miR-92a correlates only very weakly with that of and hsa-miR-92b and does not enable classification of brain malignancies (Table 6).
- ⁇ P-values are calculated on log-signal of microRNAs, and on B 0 and B 1 , which are in log-space. Less that 1000 probes were tested, and even after the more severe Bonferroni correction (multiplying each p-value by ⁇ 1000), the p-values remain highly significant.
- ⁇ “fold-chng” is the fold change, calculated by dividing the median signal in brain primary tumors by the median signal in other tissues.
- ⁇ “fold-chng” is the fold change, calculated by converting the data to linear space (by taking the exponent base 2) and dividing the median signal in brain primary tumors by the median signal in other tissues.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Hospice & Palliative Care (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
- The present application is a Continuation-In-Part of Ser. No. 12/532,941, filed Sep. 24, 2009, which is the U.S. National Stage of PCT/IL2008/00396, filed Mar. 20, 2008, which claims priority to U.S. Provisional applications 60/907,266, filed Mar. 27, 2007, 60/929,244, filed Jun. 19, 2007, and 61/024,565, filed Jan. 30, 2008. The present application is also a Continuation-In Part of PCT/IL2008/001525, filed Nov. 20, 2008, which claims priority to U.S. Provisional Application Nos. 60/989,458, filed Nov. 21, 2008, 61/043,407, filed Apr. 9, 2008, and 61/073,774, filed Jun. 19, 2008. All of the priority applications are herein incorporated by reference in their entirety.
- The present invention relates to methods for classification of cancers and tumors origin. Specifically the invention relates to microRNA molecules associated with specific cancer, as well as various nucleic acid molecules relating thereto or derived therefrom.
- microRNAs (miRs, miRNAs) are a family of 18-24 nucleotide long non-coding small RNAs, that suppress translation of target genes by binding to their mRNA, thereby regulating the expression of at least 30% of all human genes. There are currently about 850 known human microRNAs. Though highly conserved throughout evolution, a significant proportion of them is primate specific. microRNAs association with cancer has been demonstrated and several microRNAs have already been identified as oncogenes and tumor suppressors (He, H., et al., Proc Natl Acad Sci USA, 2005. 102(52): p. 19075-80).
- One of the major characteristics of microRNAs is their marked tissue specificity. Many of them also exhibit temporal patterns of expression, suggesting that they play a critical role in specific tissues and in organ development, function and maintenance.
- The differential diagnosis of hepatic lesions that include primary liver tumor and metastatic tumors is a frequent challenge in modern surgical pathology. Hepatic malignancies are often found in patients with advanced metastatic cancer. Identifying the origin of these tumors as well as differentiating liver metastases from primary hepatocellular carcinoma (HCC) is frequently required and poses a significant challenge that requires clinical-radiological correlation on top of careful pathological evaluation. Thorough examination assisted by a panel of immunostains is required for the identification of the origin of metastases. Extensive work-up using modern pathological tools (immunohistochemistry, electron microscopy and molecular diagnosis) and advanced imaging technology (computed tomography (CT), mammography and positron emission tomography (PET)) have resulted in some improvements in diagnosis. However, the primary site remains unknown in many patients, even on autopsy. The appropriate management of such patients is unclear and there is a high variability in clinical approaches, accompanied by poor prognosis in most cases.
- The pathological characterization of brain malignancies remains a diagnostic challenge. Despite the advent of various high throughput genomic level technologies, which allow multiple DNA sequences, mRNAs or proteins to be evaluated simultaneously and systematically, these have had little impact on clinical procedures.
- Differentiation between primary and metastatic tumors in the brain is often encountered in pathological practice, since metastatic tumors to the brain are quite frequent. The most common tumors to metastasize to the brain originate in the lung, breast and skin (melanomas); their respective contributions to all central nervous system (CNS) metastases are 30%, 20% and 10%. Although rare, choriocarcinoma disseminates to the brain with a particularly high frequency. In autopsy studies, 24% of cancer patients exhibited metastatic tumors in the CNS. Indeed, surgical pathologists are regularly presented with specimens from patients with a history of systemic neoplasia but with findings that suggest a primary intracranial tumor.
- Therefore, there is a need for efficient and effective methods for the differentiation between primary and metastatic tumors.
- The present invention provides specific nucleic acid sequences that are used for the identification, classification and diagnosis of cancers and tumor origin. The nucleic acid sequences can also be used for the differentiation between primary and metastatic tumors based on the expression pattern of a biological sample.
- According to one aspect, the present invention provides a method of classifying a specific cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-33, a fragment thereof, or a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the results of said comparison allows for classification of said specific cancer.
- According to certain embodiments, said cancer is selected from the group consisting of liver cancer, brain cancer and gastrointestinal (GI) cancer.
- According to one embodiment, said liver cancer is hepatocellular carcimoma (HCC).
- According to certain embodiments, said GI cancer is selected from the group consisting of colon, pancreas and stomach cancer.
- According to certain embodiments, said brain cancer is selected from the group consisting of glioblastoma, astrocytoma and oligodendroglioma.
- The invention further provides a method for identifying liver cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 13-6, 32-33, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said liver cancer.
- The invention further provides a method to distinguish between hepatocellular carcimoma (HCC) and metastasis to the liver, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 6-13, 32-33, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of hepatocellular carcimoma (HCC) or metastasis to the liver.
- According to some embodiments the nucleic acid sequence is selected from the group consisting of SEQ ID NOS: 6, 8, 10, 12, a fragment thereof and a sequence having at least about 80% identity thereto, wherein relatively high expression levels of any of said nucleic acid sequence, as compared to said reference expression profile, is indicative of metastasis to the liver.
- According to one embodiment, the liver metastasis is adenocarcinoma.
- The invention further provides a method for identifying brain cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said brain cancer.
- The invention further provides a method to distinguish between primary brain tumor and metastasis to the brain, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary brain tumor or metastasis to the brain.
- According to some embodiments the nucleic acid sequence is selected from the group consisting of SEQ ID NOS: 14, 20, a fragment thereof and a sequence having at least about 80% identity thereto, wherein relatively high expression levels of any of said nucleic acid sequence, as compared to said reference expression profile, is indicative of primary brain tumor.
- The invention further provides a method to distinguish between primary brain tumor and other primary cancers, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 14-27, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary brain tumor or other primary cancers.
- The invention further provides a method for identifying a gastrointestinal (GI) cancer, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-5, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile allows for the identification of said GI cancer.
- The invention further provides a method to distinguish between primary GI and non-GI tumor, the method comprising: obtaining a biological sample from a subject; determining an expression profile of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1-5, a fragment thereof and a sequence having at least about 80% identity thereto from said sample; and comparing said expression profile to a reference expression profile, wherein the comparison of said expression profile to said reference expression profile is indicative of primary GI or non-GI tumor.
- According to some embodiments, said primary GI tumor is selected from the group consisting of colon, pancreas and stomach tumor.
- According to other embodiments, said non-GI tumor is selected from the group consisting of lung and breast tumor.
- According to some embodiments, said sample is selected from the group consisting of bodily fluid, a cell line and a tissue sample. According to other embodiments, said tissue is a fresh, frozen, fixed, wax-embedded or formalin fixed paraffin-embedded (FFPE) tissue.
- The classification method of the present invention further comprises a classifier algorithm, said classifier algorithm is selected from the group consisting of logistic regression classifier, linear regression classifier, nearest neighbor classifier (including K nearest neighbors), neural network classifier, Gaussian mixture model (GMM) classifier and Support Vector Machine (SVM) classifier. The classifier may use a decision tree structure (including binary tree) or a voting (including weighted voting) scheme to compare one or more models which compare one or more classes to other classes.
- According to some embodiments the nucleic acid sequence expression profile is determined by a method selected from the group consisting of nucleic acid hybridization and nucleic acid amplification. According to some embodiments the nucleic acid hybridization is performed using a solid-phase nucleic acid biochip array or in situ hybridization.
- According to some embodiments the nucleic acid amplification method is real-time PCR. The real-time PCR method may comprise forward and reverse primers. According to some embodiments the forward primer comprises a sequence selected from the group consisting of SEQ ID NOS: 34-38. According to some embodiments the reverse primer comprises SEQ ID NO: 44.
- According to additional embodiments the real-time PCR method further comprises a probe. According to some embodiments the probe comprises a sequence selected from the group consisting of a sequence that is complementary to a sequence selected from SEQ ID NOS: 1-33; a fragment thereof and a sequence having at least about 80% identity thereto. According to additional embodiments the probe comprises a sequence selected from the group consisting of SEQ ID NOS: 39-43.
- According to another aspect, the present invention provides a kit for cancer classification, said kit comprising a probe comprising a sequence selected from the group consisting of a sequence that is complementary to a sequence selected from SEQ ID NOS: SEQ ID NOS: 1-33; a fragment thereof and a sequence having at least about 80% identity thereto.
- According to certain embodiments, said cancer is selected from the group consisting of liver cancer, brain cancer and gastrointestinal (GI) cancer.
- According to some embodiments, said cancer is brain cancer. According to certain embodiments, said probe comprises a sequence selected from the group consisting of SEQ ID NOS: 39-43. According to other embodiments, said the kit further comprises a forward primer selected from the group consisting of SEQ ID NOS: 34-38. According to other embodiments, said the kit further comprises a reverse primer comprising SEQ ID NO: 44.
- These and other embodiments of the present invention will become apparent in conjunction with the figures, description and claims that follow.
-
FIG. 1 demonstrates the identification of non-HCC epithelial tumor samples and metastases using microRNA biomarkers. A) Expression levels (microarray data) of hsa-miR-122a (SEQ ID NO: 32) and hsa-miR-200b (SEQ ID NO: 12) in 30 HCC samples (squares), 63 non-HCC primary tumors of epithelial origin (diamonds), 46 samples of adenocarcinoma metastases to the liver of known origin (circles) and 5 samples of adenocarcinoma metastases to the liver of unknown origin (stars). Expression level of hsa-miR-122a is high in all samples taken from liver, including primary and metastases. Expression level of hsa-miR-200b is high in samples of non-liver origin, both primary and metastases. B) Expression levels (microarray data) of hsa-miR-141 (SEQ ID NO: 6) and hsa-miR-200c (SEQ ID NO: 8) in the same samples. The solid line marks the line where C′1≡[log 2(hsa-miR-141)+log 2(hsa-miR-200c)]=18, the dashed lines mark C1=20 (upper line) and C1=16 (lower line). All samples with C1<16 are HCC, and all samples with C1>20 are non-HCC. The sharp threshold at C1=18 is accurate in >97% of 144 samples. C) Expression levels (qRT-PCR data, 50 minus normalized C) of hsa-miR-122a and hsa-miR-200b in 5 HCC samples (squares), 19 non-HCC primary tumors of epithelial origin (diamonds), and 7 samples of adenocarcinoma metastases to the liver of known origin (circles). Expression level of hsa-miR-122a is high in all samples taken from liver, including primary and metastases. D) Expression levels (qRT-PCR) of hsa-miR-141 and hsa-miR-200c in the same samples. The solid line marks the line where CRT≡[(hsa-miR-141)+(hsa-miR-200c)]=34. With the exception of one renal cell carcinoma metastasis to the liver, all samples with CRT<34 are HCC, and all samples with CRT>34 are non-HCC. -
FIG. 2 demonstrates the identification of gastro-intestinal tumors and metastases using a combination of two microRNA biomarkers. A) Expression levels of hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4) in 24 primary tumor samples of gastrointestinal (GI) origin (squares) and 39 primary tumors from breast or lung (diamonds). The dashed gray line marks the line where (hsa-miR-205)=(hsa-miR-194)/2 (see methods). B) Expression levels of hsa-miR-194 and hsa-miR-205 in 42 samples of liver metastases from GI origin (circles) and 4 metastases to the liver of non-GI origin (stars). -
FIG. 3 demonstrates the identification of non-HCC epithelial tumor samples and metastases using a combination of two microRNA biomarkers. A) Expression levels of hsa-miR-200a (SEQ ID NO: 10) and hsa-miR-200b (SEQ ID NO: 12) in 30 HCC samples (squares) and 63 non-HCC primary tumors of epithelial origin (diamonds). The solid gray line marks the line where Cab≡[log 2(hsa-miR-200a)+log 2(hsa-miR-200b)]=17.7. Only four HCC samples have Cab>17.7, and only three non-HCC lung tumor samples have Cab<17.7. B) Expression levels of hsa-miR-200a and hsa-miR-200b in 46 samples of adenocarcinoma metastases to the liver of known origin (circles) and 5 samples of adenocarcinoma metastases to the liver of unknown origin (stars). The gray line marks Cab=17.7. Only three adenocarcinoma metastases samples (one from breast cancer origin and two from unknown origin) have Cab<17.7. -
FIG. 4 demonstrates the identification of metastatic brain tumors using microRNA microarray data. A) Expression levels of hsa-miR-124a (SEQ ID NO: 16) and hsa-miR-219-5p (SEQ ID NO: 24) in 15 brain primary tumors including GBM (squares), astrocytoma (triangles), oligodendroglioma (upside down triangles); 187 primary tumors from other tissues (diamonds), 50 brain metastases originating from various tissues (circles) and 2 normal brain samples (stars). Expression levels of hsa-miR-124 and hsa-miR-219-5p are higher in brain primary tumors compared to primary tumors from other tissues. The solid line marks the line where C0[log 2(hsa-miR-124)+log 2(hsa-miR-219-5p)]=16.8, and provides perfect separation between brain primary and other primary tumors. The expression levels of hsa-miR-124 and hsa-miR-219-5p in metastatic samples span a wide range on both sides of the separating line. B) Expression levels of hsa-miR-9* (SEQ ID NO: 20) and hsa-miR-92b (SEQ ID NO: 14) in the same samples. Expression levels of these microRNAs are high in brain primary tumors but are low in all other samples. The solid line marks the line where C1≡[log 2(hsa-miR-9*)+log 2(hsa-miR-92b)]=19, and provides perfect separation between brain primary tumors and other samples, including other primary tumors and metastases to the brain. The dashed lines mark a confidence range offactor 2 above or below, C1=20 (upper line) and C1=18 (lower line). Only 2 of the samples (<1%) fall within the low-confidence range. -
FIG. 5 demonstrates the expression levels of microRNA as detected by microarray in 15 primary brain tumors (squares), 187 primary tumors from other tissues (diamonds), and 50 brain metastases from various tissue origins (circles). A) Hsa-miR-128a (SEQ ID NO: 28) and hsa-miR-128b (SEQ ID NO: 30) have highly correlated expression values. These microRNAs are high in brain primary tumors, low in other primary tumors, and intermediate in brain metastasis samples. B) In contrast to hsa-miR-128a, hsa-miR-92b (SEQ ID NO: 14) is specifically expressed in brain primary tumors, and is lower in primary tumors from other tissues and in their brain metastases. -
FIG. 6 demonstrates the identification of metastatic brain tumors using microRNA qRT-PCR data. A) Expression levels (50-Ct) of hsa-miR-124 (SEQ ID NO: 16) and hsa-miR-9 (SEQ ID NO: 27) in 16 brain primary tumors (squares), 15 primary tumors from other tissues (diamonds) and 16 brain metastases originating from various tissues (circles). Expression levels of hsa-miR-124 and hsa-miR-9 are higher in brain primary tumors compared to primary tumors from other tissues. The expression levels of hsa-miR-124 in metastatic samples span a wide range and are more similar to brain primary tumors; the expression levels of hsa-miR-9 in metastatic samples are more similar to the non-brain primary tumors. B) Expression levels (50-Ct) of hsa-miR-9* (SEQ ID NO: 20) and hsa-miR-92b (SEQ ID NO: 14) in the same samples. Expression levels of these microRNAs are high in brain primary tumors and lower in all other samples. The solid line marks CRT*100-[Ct(hsa-miR-9*)+Ct(hsa-miR-92b)]=39.9, a threshold which was fit to the training set half of the data. The test-set samples (dark squares/circles/diamonds) were accurately classified by this threshold, with one outlier. Data points with Ct larger than 40 are shown with Ct=40, at (50-Ct)=10. - The invention is based on the discovery that specific nucleic acids (SEQ ID NOS: 1-44) may be used for the classification of cancers. The present invention provides a sensitive, specific and accurate method which may be used to distinguish between different tumor origins and between primary and metastatic malignancies.
- Metastatic tumors account for the overwhelming majority of all hepatic malignancies in the non cirrhotic liver. In the cirrhotic liver, however, primary hepatic malignancies are more common than metastatic tumors. Carcinomas of the lung, breast, colon and pancreas are the most common primary sites in adults. The distinction between primary and metastatic malignancy in the liver is of both therapeutic and prognostic significance.
- In many cases the differential diagnosis of liver tumor includes hepatocellular carcinoma versus metastatic tumor. Some metastases, as from renal origin, can mimic hepatocellular carcinoma, and metastatic tumor may invade liver-cell plates, giving a false impression of primary carcinoma arising within them. Primary extrahepatic carcinomas of the stomach and colon and carcinoma of sex cord-stromal tumors of the ovary may closely resemble HCC in both their morphology and immunoexpression of CEA and alfafetoprotein.
- Metastatic tumors tend to recapitulate their appearance in the primary organ and specific tumor types generally maintain consistent cytologic appearance. Adenocarcinoma, although frequently recognizable as an entity, presents the greatest difficulty for those attempting to make a specific diagnosis as to site of origin.
- According to the present invention, the ability of microRNA biomarkers to differentiate HCC from metastatic liver tumors of various origins was demonstrated. The expression of human microRNAs in 144 hepatic and non-hepatic tumors was examined by microRNA microarray. The expression of hsa-miR-141 (SEQ ID NO: 6) and hsa-miR-200c (SEQ ID NO: 8) was significantly higher in non-hepatic primary tumors compared to HCC (p-value<10̂-9 for each) and allows highly accurate differential diagnosis of hepatocellular carcinoma from metastatic adenocarcinoma (sensitivity=98%; specificity=93%). Similar results were obtained by using the combination of hsa-miR-200a (SEQ ID NO: 10) and hsa-miR-200b (SEQ ID NO: 12).
- hsa-miR-141, hsa-miR-200a, hsa-miR-200b and hsa-miR-200c are part of one predicted polycistronic pri-microRNA, in an intronic region of a transcription unit (EST with no ORF) on Chr.12p13.31 (Landgraf, P., et al., Cell, 2007. 129(7): p. 1401-14).
- Hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4) were differentially expressed in primary GI and non-GI tumors (e.g., lung or breast adenocarcinomas) (p-value<0.01 for hsa-miR-205 and p-value<10̂-11 for hsa-miR-194), and may serve as potent biomarkers for the identification of tumors of GI origin.
- The combination of another pair of microRNAs, hsa-miR-92b (SEQ ID NO: 14) and hsa-miR-9* (SEQ ID NO: 20), was capable of identifying primary brain tumors (p-value<2.5e-4 for each compared to normal brain, p-value<5e-25 for each compared to other primary and metastatic tumors). The combination of another pair of microRNAs, hsa-miR-124a (SEQ ID NO: 16) and hsa-miR-219 (SEQ ID NO: 24), was capable of differentiating normal brain and primary brain tumors from other primary tumors (p-value<2e-44 for each) or from brain metastases (p-value<6e-6 for each).
- The present invention provides diagnostic assays and methods, both quantitative and qualitative for detecting, diagnosing, monitoring, staging and prognosticating cancers by comparing levels of the specific microRNA molecules of the invention. Such levels are preferably measured in at least one of biopsies, tumor samples, cells, tissues and/or bodily fluids, including determination of normal and abnormal levels. The present invention provides methods for diagnosing the presence of a specific cancer by analyzing for changes in levels of said microRNA molecules in biopsies, tumor samples, cells, tissues or bodily fluids.
- In the present invention, determining the presence of said microRNA levels in biopsies, tumor samples, cells, tissues or bodily fluid, is particularly useful for discriminating between different cancers.
- All the methods of the present invention may optionally include measuring levels of other cancer markers. Other cancer markers, in addition to said microRNA molecules, useful in the present invention will depend on the cancer being tested and are known to those of skill in the art.
- Assay techniques that can be used to determine levels of gene expression, such as the nucleic acid sequence of the present invention, in a sample derived from a patient are well known to those of skill in the art. Such assay methods include, without limitation, radioimmunoassays, reverse transcriptase PCR (RT-PCR) assays, immunohistochemistry assays, in situ hybridization assays, competitive-binding assays, Northern Blot analyses, ELISA assays and biochip analysis.
- In some embodiments of the invention, correlations and/or hierarchical clustering can be used to assess the similarity of the expression level of the nucleic acid sequences of the invention between a specific sample and different exemplars of cancer samples, by setting an arbitrary threshold for assigning a sample or cancer sample to one of two groups. Alternatively, in a preferred embodiment, the threshold for assignment is treated as a parameter, which can be used to quantify the confidence with which samples are assigned to each class. The threshold for assignment can be scaled to favor sensitivity or specificity, depending on the clinical scenario. The correlation value to the reference data generates a continuous score that can be scaled.
- Before the present compositions and methods are disclosed and described, it is to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting. It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.
- Aberrant Proliferation
- As used herein, the term “aberrant proliferation” means cell proliferation that deviates from the normal, proper, or expected course. For example, aberrant cell proliferation may include inappropriate proliferation of cells whose DNA or other cellular components have become damaged or defective. Aberrant cell proliferation may include cell proliferation whose characteristics are associated with an indication caused by, mediated by, or resulting in inappropriately high levels of cell division, inappropriately low levels of apoptosis, or both. Such indications may be characterized, for example, by single or multiple local abnormal proliferations of cells, groups of cells, or tissue(s), whether cancerous or non-cancerous, benign or malignant.
- About
- As used herein, the term “about” refers to +/−10%.
- Attached
- “Attached” or “immobilized” as used herein to refer to a probe and a solid support may mean that the binding between the probe and the solid support is sufficient to be stable under conditions of binding, washing, analysis, and removal. The binding may be covalent or non-covalent. Covalent bonds may be formed directly between the probe and the solid support or may be formed by a cross linker or by inclusion of a specific reactive group on either the solid support or the probe or both molecules. Non-covalent binding may be one or more of electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of a biotinylated probe to the streptavidin. Immobilization may also involve a combination of covalent and non-covalent interactions.
- Biological Sample
- “Biological sample” as used herein may mean a sample of biological tissue or fluid that comprises nucleic acids. Such samples include, but are not limited to, tissue or fluid isolated from subjects. Biological samples may also include sections of tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, blood, plasma, serum, sputum, stool, tears, mucus, hair, and skin. Biological samples also include explants and primary and/or transformed cell cultures derived from animal or patient tissues. Biological samples may also be blood, a blood fraction, urine, effusions, ascitic fluid, amniotic fluid, saliva, cerebrospinal fluid, cervical secretions, vaginal secretions, endometrial secretions, gastrointestinal secretions, bronchial secretions, sputum, cell line, tissue sample, or secretions from the breast. A biological sample may be provided by removing a sample of cells from a subject but can also be accomplished by using previously isolated cells (e.g., isolated by another person, at another time, and/or for another purpose), or by performing the methods described herein in vivo. Archival tissues, such as those having treatment or outcome history, may also be used.
- Cancer
- The term “cancer” is meant to include all types of cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness. Examples of cancers include but are nor limited to solid tumors and leukemias, including: apudoma, choristoma, branchioma, malignant carcinoid syndrome, carcinoid heart disease, carcinoma (e.g., Walker, basal cell, basosquamous, Brown-Pearce, ductal, Ehrlich tumor, non-small cell lung, oat cell, papillary, bronchiolar, bronchogenic, squamous cell, and transitional cell), histiocytic disorders, leukemia (e.g., B cell, mixed cell, null cell, T cell, T-cell chronic, HTLV-II-associated, lymphocytic acute, lymphocytic chronic, mast cell, and myeloid), histiocytosis malignant, Hodgkin disease, immunoproliferative small, non-Hodgkin lymphoma, plasmacytoma, reticuloendotheliosis, melanoma, chondroblastoma, chondroma, chondrosarcoma, fibroma, fibrosarcoma, giant cell tumors, histiocytoma, lipoma, liposarcoma, mesothelioma, myxoma, myxosarcoma, osteoma, osteosarcoma, Ewing sarcoma, synovioma, adenofibroma, adenolymphoma, carcinosarcoma, chordoma, craniopharyngioma, dysgerminoma, hamartoma, mesenchymoma, mesonephroma, myosarcoma, ameloblastoma, cementoma, odontoma, teratoma, thymoma, trophoblastic tumor, adeno-carcinoma, adenoma, cholangioma, cholesteatoma, cylindroma, cystadenocarcinoma, cystadenoma, granulosa cell tumor, gynandroblastoma, hepatoma, hidradenoma, islet cell tumor, Leydig cell tumor, papilloma, Sertoli cell tumor, theca cell tumor, leiomyoma, leiomyosarcoma, myoblastoma, myosarcoma, rhabdomyoma, rhabdomyosarcoma, ependymoma, ganglioneuroma, glioma, medulloblastoma, meningioma, neurilemmoma, neuroblastoma, neuroepithelioma, neurofibroma, neuroma, paraganglioma, paraganglioma nonchromaffin, angiokeratoma, angiolymphoid hyperplasia with eosinophilia, angioma sclerosing, angiomatosis, glomangioma, hemangioendothelioma, hemangioma, hemangiopericytoma, hemangiosarcoma, lymphangioma, lymphangiomyoma, lymphangiosarcoma, pinealoma, carcinosarcoma, chondrosarcoma, cystosarcoma, phyllodes, fibrosarcoma, hemangiosarcoma, leimyosarcoma, leukosarcoma, liposarcoma, lymphangiosarcoma, myosarcoma, myxosarcoma, ovarian carcinoma, rhabdomyosarcoma, sarcoma (e.g., Ewing, experimental, Kaposi, and mast cell), neurofibromatosis, and cervical dysplasia, and other conditions in which cells have become immortalized or transformed.
- Classification
- The term classification refers to a procedure and/or algorithm in which individual items are placed into groups or classes based on quantitative information on one or more characteristics inherent in the items (referred to as traits, variables, characters, features, etc) and based on a statistical model and/or a training set of previously labeled items. A “classification tree” is a decision tree that places categorical variables into classes.
- Ct
- Ct signals represent the first cycle of PCR where amplification crosses a threshold (cycle threshold) of fluorescence. Accordingly, low values of Ct represent high abundance or expression levels of the microRNA.
- In some embodiments the PCR Ct signal is normalized such that the normalized Ct remains inversed from the expression level. In other embodiments the PCR Ct signal may be normalized and then inverted such that low normalized-inverted Ct represents low abundance or expression levels of the microRNA.
- Complement
- “Complement” or “complementary” as used herein to refer to a nucleic acid may mean Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules. A full complement or fully complementary may mean 100% complementary base pairing between nucleotides or nucleotide analogs of nucleic acid molecules.
- Data Processing Routine
- As used herein, a “data processing routine” refers to a process that can be embodied in software that determines the biological significance of acquired data (i.e., the ultimate results of an assay or analysis). For example, the data processing routine can make determination of tissue of origin based upon the data collected. In the systems and methods herein, the data processing routine can also control the data collection routine based upon the results determined. The data processing routine and the data collection routines can be integrated and provide feedback to operate the data acquisition, and hence provide assay-based judging methods.
- Data Set
- As use herein, the term “data set” refers to numerical values obtained from the analysis, These numerical values associated with analysis may be values such as peak height and area under the curve.
- Data Structure
- As used herein the term “data structure” refers to a combination of two or more data sets, applying one or more mathematical manipulations to one or more data sets to obtain one or more new data sets, or manipulating two or more data sets into a form that provides a visual illustration of the data in a new way. An example of a data structure prepared from manipulation of two or more data sets would be a hierarchical cluster.
- Detection
- “Detection” means detecting the presence of a component in a sample. Detection also means detecting the absence of a component. Detection also means measuring the level of a component, either quantitatively or qualitatively.
- Differential Expression
- “Differential expression” means qualitative or quantitative differences in the temporal and/or cellular gene expression patterns within and among cells and tissue. Thus, a differentially expressed gene may qualitatively have its expression altered, including an activation or inactivation, in, e.g., normal versus disease tissue. Genes may be turned on or turned off in a particular state, relative to another state thus permitting comparison of two or more states. A qualitatively regulated gene may exhibit an expression pattern within a state or cell type which may be detectable by standard techniques. Some genes may be expressed in one state or cell type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in that expression is modulated, either up-regulated, resulting in an increased amount of transcript, or down-regulated, resulting in a decreased amount of transcript. The degree to which expression differs need only be large enough to quantify via standard characterization techniques such as expression arrays, quantitative reverse transcriptase PCR, northern analysis, real-time PCR, in situ hybridization and RNase protection.
- Expression Profile
- The term “expression profile” is used broadly to include a genomic expression profile, e.g., an expression profile of microRNAs. Profiles may be generated by any convenient means for determining a level of a nucleic acid sequence e.g. quantitative hybridization of microRNA, labeled microRNA, amplified microRNA, cRNA, etc., quantitative PCR, ELISA for quantitation, and the like, and allow the analysis of differential gene expression between two samples. A subject or patient tumor sample, e.g., cells or collections thereof, e.g., tissues, is assayed. Samples are collected by any convenient method, as known in the art. Nucleic acid sequences of interest are nucleic acid sequences that are found to be predictive, including the nucleic acid sequences provided above, where the expression profile may include expression data for 5, 10, 20, 25, 50, 100 or more of, including all of the listed nucleic acid sequences. According to some embodiments, the term “expression profile” means measuring the abundance of the nucleic acid sequences in the measured samples.
- Expression Ratio
- “Expression ratio” as used herein refers to relative expression levels of two or more nucleic acids as determined by detecting the relative expression levels of the corresponding nucleic acids in a biological sample.
- FDR
- When performing multiple statistical tests, for example in comparing the signal between two groups in multiple data features, there is an increasingly high probability of obtaining false positive results, by random differences between the groups that can reach levels that would otherwise be considered as statistically significant. In order to limit the proportion of such false discoveries, statistical significance is defined only for data features in which the differences reached a p-value (by two-sided t-test) below a threshold, which is dependent on the number of tests performed and the distribution of p-values obtained in these tests.
- Fragment
- “Fragment” is used herein to indicate a non-full length part of a nucleic acid. Thus, a fragment is itself also a nucleic acid.
- Gene
- “Gene” used herein may be a natural (e.g., genomic) or synthetic gene comprising transcriptional and/or translational regulatory sequences and/or a coding region and/or non-translated sequences (e.g., introns, 5′- and 3′-untranslated sequences). The coding region of a gene may be a nucleotide sequence coding for an amino acid sequence or a functional RNA, such as tRNA, rRNA, catalytic RNA, siRNA, miRNA or antisense RNA. A gene may also be an mRNA or cDNA corresponding to the coding regions (e.g., exons and miRNA) optionally comprising 5′- or 3′-untranslated sequences linked thereto. A gene may also be an amplified nucleic acid molecule produced in vitro comprising all or a part of the coding region and/or 5′- or 3′-untranslated sequences linked thereto.
- Groove Binder/Minor Groove Binder (MGB)
- “Groove binder” and/or “minor groove binder” may be used interchangeably and refer to small molecules that fit into the minor groove of double-stranded DNA, typically in a sequence-specific manner. Minor groove binders may be long, flat molecules that can adopt a crescent-like shape and thus, fit snugly into the minor groove of a double helix, often displacing water. Minor groove binding molecules may typically comprise several aromatic rings connected by bonds with torsional freedom such as furan, benzene, or pyrrole rings. Minor groove binders may be antibiotics such as netropsin, distamycin, berenil, pentamidine and other aromatic diamidines, Hoechst 33258, SN 6999, aureolic anti-tumor drugs such as chromomycin and mithramycin, CC-1065, dihydrocyclopyrroloindole tripeptide (DPI3), 1,2-dihydro-(3H)-pyrrolo[3,2-e]indole-7-carboxylate (CDPI3), and related compounds and analogues, including those described in Nucleic Acids in Chemistry and Biology, 2d ed., Blackburn and Gait, eds., Oxford University Press, 1996, and PCT Published Application No. WO 03/078450, the contents of which are incorporated herein by reference. A minor groove binder may be a component of a primer, a probe, a hybridization tag complement, or combinations thereof. Minor groove binders may increase the Tm of the primer or a probe to which they are attached, allowing such primers or probes to effectively hybridize at higher temperatures.
- Host Cell
- “Host cell” used herein may be a naturally occurring cell or a transformed cell that may contain a vector and may support replication of the vector. Host cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells, such as CHO and HeLa.
- Identity
- “Identical” or “identity” as used herein in the context of two or more nucleic acids or polypeptide sequences may mean that the sequences have a specified percentage of residues that are the same over a specified region. The percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity. In cases where the two sequences are of different lengths or the alignment produces one or more staggered ends and the specified region of comparison includes only a single sequence, the residues of single sequence are included in the denominator but not the numerator of the calculation. When comparing DNA and RNA, thymine (T) and uracil (U) may be considered equivalent. Identity may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.
- In Situ Detection
- “In situ detection” as used herein means the detection of expression or expression levels in the original site hereby meaning in a tissue sample such as biopsy.
- k-Nearest Neighbor
- The phrase “k-nearest neighbor” refers to a classification method that classifies a point by calculating the distances between the point and points in the training data set. Then it assigns the point to the class that is most common among its k-nearest neighbors (where k is an integer).
- Label
- “Label” as used herein may mean a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and other entities which can be made detectable. A label may be incorporated into nucleic acids and proteins at any position.
- Liver Cancer
- “Liver cancer” means malignancy of the liver, either a primary cancer or metastasized cancer. In certain embodiments, liver cancer includes, but is not limited to, cancer arising from hepatocytes, such as, for example, hepatomas and hepatocellular carcinomas; fibrolamellar; and cholangiocarcinomas (or bile duct cancer).
- Logistic Regression
- Logistic regression is part of a category of statistical models called generalized linear models. Logistic regression can allows one to predict a discrete outcome, such as group membership, from a set of variables that may be continuous, discrete, dichotomous, or a mix of any of these. The dependent or response variable can be dichotomous, for example, one of two possible types of cancer. Logistic regression models the natural log of the odds ratio, i.e. the ratio of the probability of belonging to the first group (P) over the probability of belonging to the second group (1-P), as a linear combination of the different expression levels (in log-space). The logistic regression output can be used as a classifier by prescribing that a case or sample will be classified into the first type is P is greater than 0.5 or 50%. Alternatively, the calculated probability P can be used as a variable in other contexts such as a 1D or 2D threshold classifier.
- 1D/2D Threshold Classifier
- “1D/2D threshold classifier” used herein may mean an algorithm for classifying a case or sample such as a cancer sample into one of two possible types such as two types of cancer. For a 1D threshold classifier, the decision is based on one variable and one predetermined threshold value; the sample is assigned to one class if the variable exceeds the threshold and to the other class if the variable is less than the threshold. A 2D threshold classifier is an algorithm for classifying into one of two types based on the values of two variables. A threshold may be calculated as a function (usually a continuous or even a monotonic function) of the first variable; the decision is then reached by comparing the second variable to the calculated threshold, similar to the 1D threshold classifier.
- Metastasis
- “Metastasis” means the process by which cancer spreads from the place at which it first arose as a primary tumor to other locations in the body. The metastatic progression of a primary tumor reflects multiple stages, including dissociation from neighboring primary tumor cells, survival in the circulation, and growth in a secondary location.
- Nucleic Acid
- “Nucleic acid” or “oligonucleotide” or “polynucleotide” used herein may mean at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a nucleic acid also encompasses the complementary strand of a depicted single strand. Many variants of a nucleic acid may be used for the same purpose as a given nucleic acid. Thus, a nucleic acid also encompasses substantially identical nucleic acids and complements thereof. A single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions. Thus, a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions.
- Nucleic acids may be single stranded or double stranded, or may contain portions of both double stranded and single stranded sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine and isoguanine Nucleic acids may be obtained by chemical synthesis methods or by recombinant methods.
- A nucleic acid will generally contain phosphodiester bonds, although nucleic acid analogs may be included that may have at least one different linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages and peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with positive backbones; non-ionic backbones, and non-ribose backbones, including those described in U.S. Pat. Nos. 5,235,033 and 5,034,506, which are incorporated by reference. Nucleic acids containing one or more non-naturally occurring or modified nucleotides are also included within one definition of nucleic acids. The modified nucleotide analog may be located for example at the 5′-end and/or the 3′-end of the nucleic acid molecule. Representative examples of nucleotide analogs may be selected from sugar- or backbone-modified ribonucleotides. It should be noted, however, that also nucleobase-modified ribonucleotides, i.e. ribonucleotides, containing a non-naturally occurring nucleobase instead of a naturally occurring nucleobase such as uridines or cytidines modified at the 5-position, e.g. 5-(2-amino)propyl uridine, 5-bromo uridine; adenosines and guanosines modified at the 8-position, e.g. 8-bromo guanosine; deaza nucleotides, e.g. 7-deaza-adenosine; O- and N-alkylated nucleotides, e.g. N6-methyl adenosine are suitable. The 2′-OH-group may be replaced by a group selected from H, OR, R, halo, SH, SR, NH2, NHR, NR2 or CN, wherein R is C1-C6 alkyl, alkenyl or alkynyl and halo is F, Cl, Br or I. Modified nucleotides also include nucleotides conjugated with cholesterol through, e.g., a hydroxyprolinol linkage as described in Krutzfeldt et al., Nature 438:685-689 (2005), Soutschek et al., Nature 432:173-178 (2004), and U.S. Patent Publication No. 20050107325, which are incorporated herein by reference. Additional modified nucleotides and nucleic acids are described in U.S. Patent Publication No. 20050182005, which is incorporated herein by reference. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments, to enhance diffusion across cell membranes, or as probes on a biochip. The backbone modification may also enhance resistance to degradation, such as in the harsh endocytic environment of cells. The backbone modification may also reduce nucleic acid clearance by hepatocytes, such as in the liver and kidney. Mixtures of naturally occurring nucleic acids and analogs may be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.
- Probe
- “Probe” as used herein may mean an oligonucleotide capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. Probes may bind target sequences lacking complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. There may be any number of base pair mismatches which will interfere with hybridization between the target sequence and the single stranded nucleic acids described herein. However, if the number of mutations is so great that no hybridization can occur under even the least stringent of hybridization conditions, the sequence is not a complementary target sequence. A probe may be single stranded or partially single and partially double stranded. The strandedness of the probe is dictated by the structure, composition, and properties of the target sequence. Probes may be directly labeled or indirectly labeled such as with biotin to which a streptavidin complex may later bind.
- Reference Expression Profile
- As used herein, the phrase “reference expression profile” refers to a criterion expression value to which measured values are compared in order to determine the detection of a subject with lung cancer. The reference expression profile may be based on the abundance of the nucleic acids, or may be based on a combined metric score thereof.
- Sensitivity
- “sensitivity” used herein may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example how frequently it correctly classifies a cancer into the correct type out of two possible types. The sensitivity for class A is the proportion of cases that are determined to belong to class “A” by the test out of the cases that are in class “A”, as determined by some absolute or gold standard.
- Specificity
- “specificity” used herein may mean a statistical measure of how well a binary classification test correctly identifies a condition, for example how frequently it correctly classifies a cancer into the correct type out of two possible types. The sensitivity for class A is the proportion of cases that are determined to belong to class “not A” by the test out of the cases that are in class “not A”, as determined by some absolute or gold standard.
- Stringent Hybridization Conditions
- “Stringent hybridization conditions” used herein may mean conditions under which a first nucleic acid sequence (e.g., probe) will hybridize to a second nucleic acid sequence (e.g., target), such as in a complex mixture of nucleic acids. Stringent conditions are sequence-dependent and will be different in different circumstances. Stringent conditions may be selected to be about 5-10° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tm may be the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions may be those in which the salt concentration is less than about 1.0 M sodium ion, such as about 0.01-1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., about 10-50 nucleotides) and at least about 60° C. for long probes (e.g., greater than about 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive signal may be at least 2 to 10 times background hybridization. Exemplary stringent hybridization conditions include the following: 50% formamide, 5×SSC, and 1% SDS, incubating at 42° C., or, 5×SSC, 1% SDS, incubating at 65° C., with wash in 0.2×SSC, and 0.1% SDS at 65° C.
- Substantially Complementary
- “Substantially complementary” used herein may mean that a first sequence is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical to the complement of a second sequence over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more nucleotides, or that the two sequences hybridize under stringent hybridization conditions.
- Substantially Identical
- “Substantially identical” used herein may mean that a first and second sequence are at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% identical over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more nucleotides or amino acids, or with respect to nucleic acids, if the first sequence is substantially complementary to the complement of the second sequence.
- Subject
- As used herein, the term “subject” refers to a mammal, including both human and other mammals. The methods of the present invention are preferably applied to human subjects.
- Target “Target” as used herein may mean a polynucleotide that may be bound by one or more probes under stringent hybridization conditions.
- Threshold Expression Profile
- As used herein, the phrase “threshold expression profile” refers to a criterion expression profile to which measured values are compared in order to classify a cancer.
- Tissue Sample
- As used herein, a tissue sample is tissue obtained from a tissue biopsy using methods well known to those of ordinary skill in the related medical arts. The phrase “suspected of being cancerous” as used herein means a cancer tissue sample believed by one of ordinary skill in the medical arts to contain cancerous cells. Methods for obtaining the sample from the biopsy include gross apportioning of a mass, microdissection, laser-based microdissection, or other art-known cell-separation methods.
- Tumor
- “Tumor” as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.
- Variant
- “Variant” used herein to refer to a nucleic acid may mean (i) a portion of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto.
- Wild Type
- As used herein, the term “wild type” sequence refers to a coding, non-coding or interface sequence is an allelic form of sequence that performs the natural or normal function for that sequence. Wild type sequences include multiple allelic forms of a cognate sequence, for example, multiple alleles of a wild type sequence may encode silent or conservative changes to the protein sequence that a coding sequence encodes.
- The present invention employs miRNAs and related nucleic acids for the identification, classification and diagnosis of specific cancers.
- microRNA Processing
- A gene coding for a miRNA may be transcribed leading to production of a miRNA primary transcript known as the pri-miRNA. The pri-miRNA may comprise a hairpin with a stem and loop. The stem of the hairpin may comprise mismatched bases. The pri-miRNA may comprise several hairpins in a polycistronic structure.
- The hairpin structure of the pri-miRNA may be recognized by Drosha, which is an RNase III endonuclease. Drosha may recognize terminal loops in the pri-miRNA and cleave approximately two helical turns into the stem to produce a 60-70 nt precursor known as the pre-miRNA. Drosha may cleave the pri-miRNA with a staggered cut typical of RNase III endonucleases yielding a pre-miRNA stem loop with a 5′ phosphate and ˜2
nucleotide 3′ overhang. Approximately one helical turn of stem (−10 nucleotides) extending beyond the Drosha cleavage site may be essential for efficient processing. The pre-miRNA may then be actively transported from the nucleus to the cytoplasm by Ran-GTP and the export receptor Ex-portin-5. - The pre-miRNA may be recognized by Dicer, which is also an RNase III endonuclease. Dicer may recognize the double-stranded stem of the pre-miRNA. Dicer may also off the terminal loop two helical turns away from the base of the stem loop leaving an additional 5′ phosphate and ˜2
nucleotide 3′ overhang. The resulting siRNA-like duplex, which may comprise mismatches, comprises the mature miRNA and a similar-sized fragment known as the miRNA*. The miRNA and miRNA* may be derived from opposing arms of the pri-miRNA and pre-miRNA. mRNA* sequences may be found in libraries of cloned miRNAs but typically at lower frequency than the miRNAs. - Although initially present as a double-stranded species with miRNA*, the miRNA may eventually become incorporated as a single-stranded RNA into a ribonucleoprotein complex known as the RNA-induced silencing complex (RISC). Various proteins can form the RISC, which can lead to variability in specificity for miRNA/miRNA* duplexes, binding site of the target gene, activity of miRNA (repress or activate), and which strand of the miRNA/miRNA* duplex is loaded in to the RISC.
- When the miRNA strand of the miRNA:miRNA* duplex is loaded into the RISC, the miRNA* may be removed and degraded. The strand of the miRNA:miRNA* duplex that is loaded into the RISC may be the strand whose 5′ end is less tightly paired. In cases where both ends of the miRNA:miRNA* have roughly equivalent 5′ pairing, both miRNA and miRNA* may have gene silencing activity.
- The RISC may identify target nucleic acids based on high levels of complementarity between the miRNA and the mRNA, especially by nucleotides 2-7 of the miRNA. Only one case has been reported in animals where the interaction between the miRNA and its target was along the entire length of the miRNA. This was shown for mir-196 and Hox B8 and it was further shown that mir-196 mediates the cleavage of the Hox B8 mRNA (Yekta et al 2004, Science 304-594). Otherwise, such interactions are known only in plants (Bartel & Bartel 2003, Plant Physiol 132-709).
- A number of studies have looked at the base-pairing requirement between miRNA and its mRNA target for achieving efficient inhibition of translation (reviewed by Bartel 2004, Cell 116-281). In mammalian cells, the first 8 nucleotides of the miRNA may be important (Doench & Sharp 2004 GenesDev 2004-504). However, other parts of the microRNA may also participate in mRNA binding. Moreover, sufficient base pairing at the 3′ can compensate for insufficient pairing at the 5′ (Brennecke et al, 2005 PLoS 3-e85). Computation studies, analyzing miRNA binding on whole genomes have suggested a specific role for bases 2-7 at the 5′ of the miRNA in target binding but the role of the first nucleotide, found usually to be “A” was also recognized (Lewis et al 2005 Cell 120-15). Similarly, nucleotides 1-7 or 2-8 were used to identify and validate targets by Krek et al (2005, Nat Genet 37-495).
- The target sites in the mRNA may be in the 5′ UTR, the 3′ UTR or in the coding region. Interestingly, multiple miRNAs may regulate the same mRNA target by recognizing the same or multiple sites. The presence of multiple miRNA binding sites in most genetically identified targets may indicate that the cooperative action of multiple RISCs provides the most efficient translational inhibition.
- miRNAs may direct the RISC to downregulate gene expression by either of two mechanisms: mRNA cleavage or translational repression. The miRNA may specify cleavage of the mRNA if the mRNA has a certain degree of complementarity to the miRNA. When a miRNA guides cleavage, the cut may be between the nucleotides pairing to
10 and 11 of the miRNA. Alternatively, the miRNA may repress translation if the miRNA does not have the requisite degree of complementarity to the miRNA. Translational repression may be more prevalent in animals since animals may have a lower degree of complementarity between the miRNA and binding site.residues - It should be noted that there may be variability in the 5′ and 3′ ends of any pair of miRNA and miRNA*. This variability may be due to variability in the enzymatic processing of Drosha and Dicer with respect to the site of cleavage. Variability at the 5′ and 3′ ends of miRNA and miRNA* may also be due to mismatches in the stem structures of the pri-miRNA and pre-miRNA. The mismatches of the stem strands may lead to a population of different hairpin structures. Variability in the stem structures may also lead to variability in the products of cleavage by Drosha and Dicer.
- Nucleic acids are provided herein. The nucleic acid may comprise the sequence of SEQ ID NOS: 1-44 or variants thereof. The variant may be a complement of the referenced nucleotide sequence. The variant may also be a nucleotide sequence that is substantially identical to the referenced nucleotide sequence or the complement thereof. The variant may also be a nucleotide sequence which hybridizes under stringent conditions to the referenced nucleotide sequence, complements thereof, or nucleotide sequences substantially identical thereto.
- The nucleic acid may have a length of from 10 to 250 nucleotides. The nucleic acid may have a length of at least 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200 or 250 nucleotides. The nucleic acid may be synthesized or expressed in a cell (in vitro or in vivo) using a synthetic gene described herein. The nucleic acid may be synthesized as a single strand molecule and hybridized to a substantially complementary nucleic acid to form a duplex. The nucleic acid may be introduced to a cell, tissue or organ in a single- or double-stranded form or capable of being expressed by a synthetic gene using methods well known to those skilled in the art, including as described in U.S. Pat. No. 6,506,559 which is incorporated by reference.
- Nucleic Acid Complexes
- The nucleic acid may further comprise one or more of the following: a peptide, a protein, a RNA-DNA hybrid, an antibody, an antibody fragment, a Fab fragment, and an aptamer.
- Pri-miRNA
- The nucleic acid may comprise a sequence of a pri-miRNA or a variant thereof. The pri-miRNA sequence may comprise from 45-30,000, 50-25,000,100-20,000, 1,000-1,500 or 80-100 nucleotides. The sequence of the pri-miRNA may comprise a pre-miRNA, miRNA and miRNA*, as set forth herein, and variants thereof. The sequence of the pri-miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- The pri-miRNA may comprise a hairpin structure. The hairpin may comprise a first and second nucleic acid sequence that are substantially complimentary. The first and second nucleic acid sequence may be from 37-50 nucleotides. The first and second nucleic acid sequence may be separated by a third sequence of from 8-12 nucleotides. The hairpin structure may have a free energy less than −25 Kcal/mole as calculated by the Vienna algorithm with default parameters, as described in Hofacker et al., Monatshefte f. Chemie 125: 167-188 (1994), the contents of which are incorporated herein. The hairpin may comprise a terminal loop of 4-20, 8-12 or 10 nucleotides. The pri-miRNA may comprise at least 19% adenosine nucleotides, at least 16% cytosine nucleotides, at least 23% thymine nucleotides and at least 19% guanine nucleotides.
- Pre-miRNA
- The nucleic acid may also comprise a sequence of a pre-miRNA or a variant thereof. The pre-miRNA sequence may comprise from 45-90, 60-80 or 60-70 nucleotides. The sequence of the pre-miRNA may comprise a miRNA and a miRNA* as set forth herein. The sequence of the pre-miRNA may also be that of a pri-miRNA excluding from 0-160 nucleotides from the 5′ and 3′ ends of the pri-miRNA. The sequence of the pre-miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- miRNA
- The nucleic acid may also comprise a sequence of a miRNA (including miRNA*) or a variant thereof. The miRNA sequence may comprise from 13-33, 18-24 or 21-23 nucleotides. The miRNA may also comprise a total of at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39 or 40 nucleotides. The sequence of the miRNA may be the first 13-33 nucleotides of the pre-miRNA. The sequence of the miRNA may also be the last 13-33 nucleotides of the pre-miRNA. The sequence of the miRNA may comprise the sequence of SEQ ID NOS: 1-33 or variants thereof.
- A probe is also provided comprising a nucleic acid described herein. Probes may be used for screening and diagnostic methods, as outlined below. The probe may be attached or immobilized to a solid substrate, such as a biochip.
- The probe may have a length of from 8 to 500, 10 to 100 or 20 to 60 nucleotides. The probe may also have a length of at least 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280 or 300 nucleotides. The probe may further comprise a linker sequence of from 10-60 nucleotides. The probe may comprise a nucleic acid that is complementary to a sequence selected from the group consisting of SEQ ID NOS: 1-27, 32-33; a fragment thereof, and a sequence having at least about 80% identity thereto.
- A biochip is also provided. The biochip may comprise a solid substrate comprising an attached probe or plurality of probes described herein. The probes may be capable of hybridizing to a target sequence under stringent hybridization conditions. The probes may be attached at spatially defined addresses on the substrate. More than one probe per target sequence may be used, with either overlapping probes or probes to different sections of a particular target sequence. The probes may be capable of hybridizing to target sequences associated with a single disorder appreciated by those in the art. The probes may either be synthesized first, with subsequent attachment to the biochip, or may be directly synthesized on the biochip.
- The solid substrate may be a material that may be modified to contain discrete individual sites appropriate for the attachment or association of the probes and is amenable to at least one detection method. Representative examples of substrates include glass and modified or functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses and plastics. The substrates may allow optical detection without appreciably fluorescing.
- The substrate may be planar, although other configurations of substrates may be used as well. For example, probes may be placed on the inside surface of a tube, for flow-through sample analysis to minimize sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed cell foams made of particular plastics.
- The biochip and the probe may be derivatized with chemical functional groups for subsequent attachment of the two. For example, the biochip may be derivatized with a chemical functional group including, but not limited to, amino groups, carboxyl groups, oxo groups or thiol groups. Using these functional groups, the probes may be attached using functional groups on the probes either directly or indirectly using a linker. The probes may be attached to the solid support by either the 5′ terminus, 3′ terminus, or via an internal nucleotide.
- The probe may also be attached to the solid support non-covalently. For example, biotinylated oligonucleotides can be made, which may bind to surfaces covalently coated with streptavidin, resulting in attachment. Alternatively, probes may be synthesized on the surface using techniques such as photopolymerization and photolithography.
- As used herein the term “diagnosing” refers to classifying a pathology or a symptom, determining a severity of the pathology (grade or stage), monitoring pathology progression, forecasting an outcome of a pathology and/or prospects of recovery.
- As used herein the phrase “subject in need thereof” refers to an animal or human subject who is known to have cancer, at risk of having cancer [e.g., a genetically predisposed subject, a subject with medical and/or family history of cancer, a subject who has been exposed to carcinogens, occupational hazard, environmental hazard] and/or a subject who exhibits suspicious clinical signs of cancer [e.g., blood in the stool or melena, unexplained pain, sweating, unexplained fever, unexplained loss of weight up to anorexia, changes in bowel habits (constipation and/or diarrhea), tenesmus (sense of incomplete defecation, for rectal cancer specifically), anemia and/or general weakness]. Additionally or alternatively, the subject in need thereof can be a healthy human subject undergoing a routine well-being check up. According to some embodiments, the subject has a primary tumor. According to other embodiments, the subject has metastatic cancer. According to another embodiment, the subject has cancer of unknown primary (CUP).
- Analyzing presence of malignant or pre-malignant cells can be effected in-vivo or ex-vivo, whereby a biological sample (e.g., biopsy) is retrieved. Such biopsy samples comprise cells and may be an incisional or excisional biopsy. Alternatively the cells may be retrieved from a complete resection.
- While employing the present teachings, additional information may be gleaned pertaining to the determination of treatment regimen, treatment course and/or to the measurement of the severity of the disease.
- As used herein the phrase “treatment regimen” refers to a treatment plan that specifies the type of treatment, dosage, schedule and/or duration of a treatment provided to a subject in need thereof (e.g., a subject diagnosed with a pathology). The selected treatment regimen can be an aggressive one which is expected to result in the best clinical outcome (e.g., complete cure of the pathology) or a more moderate one which may relieve symptoms of the pathology yet results in incomplete cure of the pathology. It will be appreciated that in certain cases the treatment regimen may be associated with some discomfort to the subject or adverse side effects (e.g., a damage to healthy cells or tissue). The type of treatment can include a surgical intervention (e.g., removal of lesion, diseased cells, tissue, or organ), a cell replacement therapy, an administration of a therapeutic drug (e.g., receptor agonists, antagonists, hormones, chemotherapy agents) in a local or a systemic mode, an exposure to radiation therapy using an external source (e.g., external beam) and/or an internal source (e.g., brachytherapy) and/or any combination thereof. The dosage, schedule and duration of treatment can vary, depending on the severity of pathology and the selected type of treatment, and those of skills in the art are capable of adjusting the type of treatment with the dosage, schedule and duration of treatment.
- A method of diagnosis is also provided. The method comprises detecting an expression level of a specific cancer-associated nucleic acid in a biological sample. The sample may be derived from a patient. Diagnosis of a specific cancer state in a patient may allow for prognosis and selection of therapeutic strategy. Further, the developmental stage of cells may be classified by determining temporarily expressed specific cancer-associated nucleic acids.
- In situ hybridization of labeled probes to tissue arrays may be performed. When comparing the fingerprints between individual samples the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is further understood that the nucleic acid sequence which indicate the diagnosis may differ from those which indicate the prognosis and molecular profiling of the condition of the cells may lead to distinctions between responsive or refractory conditions or may be predictive of outcomes.
- A kit is also provided and may comprise a nucleic acid described herein together with any or all of the following: assay reagents, buffers, probes and/or primers, and sterile saline or another pharmaceutically acceptable emulsion and suspension base. In addition, the kits may include instructional materials containing directions (e.g., protocols) for the practice of the methods described herein. The kit may further comprise a software package for data analysis of expression profiles.
- For example, the kit may be a kit for the amplification, detection, identification or quantification of a target nucleic acid sequence. The kit may comprise a poly(T) primer, a forward primer, a reverse primer, and a probe.
- Detectable Malignancies
- Brain cancer: Each year, approximately 15,000 cases of high grade astrocytomas (glioblastoma multiforme) are diagnosed in the United States. The number is growing in both pediatric and adult populations. Standard treatments include cytoreductive surgery followed by radiation therapy or chemotherapy. There is no cure, and virtually all patients ultimately succumb to recurrent or progressive disease. The overall survival for grade IV astrocytomas (glioblastoma multiforme) is poor, with 50% of patients dying in the first year after diagnosis.
- According to the present invention, brain tumors were directly compared to a wide range of epithelial tumors and metastases to the brain. Using microarray data, it was found that elevated expression of just two microRNAs, hsa-miR-92b (SEQ ID NO: 14) and hsa-miR-9*(SEQ ID NO: 20), is sufficient to distinguish brain primary tumors from tumors derived from non-brain tissues, and most significantly for diagnostic purposes, from metastases located in the brain. This assay was translated to a qRT-PCR platform, using additional samples as a training set to develop a classifier. Validating on an independent set of test samples, it was found that the simple combination of hsa-miR-92b and hsa-miR-9 (SEQ ID NO: 27) (or hsa-miR-9*) can identify brain metastases from brain primary tumors with sensitivity of 88% and specificity of 100%. Thus, economical and relatively easy evaluation of hsa-miR-92b and hsa-miR-9/9* expression, which can be performed robustly using either fresh frozen or fixed materials in the clinical setting, reveals whether neoplastic tissue excised from the brain is brain-derived or represents a metastasis from another tissue. Taken together, the expression data concerning hsa-miR-92b and hsa-miR-9/9* suggest a connection between deregulation of microRNAs, pluripotency, and tumorigenesis.
- Liver cancer: Primary liver cancer is the fifth most common cancer worldwide. Hepatocellular carcinoma (HCC) accounts for 80% of all liver cancer and the rates of HCC have increased by over 70% in the last two decades in the U.S. The fatality ratio (mortality/incidence) of liver cancer is approximately 1, indicating that the majority of patients live less than a year. Late diagnosis due to lack of clinical symptoms is one of the main reasons for the high fatality ratio. Liver cancer can result from both viral infection and chemical exposure. Known risk factors include hepatitis B and C virus infection. It is not known whether distinct routes to liver cancer affect the same or different cellular pathways. No mutational model has yet been developed for liver cancer as it has been for other cancers. The molecular events that precede neoplastic transformation of the liver are not well understood. With no clearly identified cause, successful treatment options are lacking.
- Nearly any primary tumor site can deposit metastases in the liver, since the liver filters blood from throughout the body. Most discussions related to the treatment of metastatic tumors in the liver focus on those originating from the colon. In fact, the most common cause of death from colorectal cancer is liver metastasis.
- Up to 50% of liver metastases are of colorectal cancer origin, while the remainder metastasizes from a wide variety of primary cancer sites including sarcomas, breast and kidney, as well as neuroendocrine tumors.
- HCC may be solitary or multicentric, and it may mimic liver metastases. Furthermore hemangiomas and liver metastases are often confused in imaging methods. In general, the imaging appearances of liver metastases are nonspecific, and biopsy specimens are required for histological diagnosis. Various biochemical markers have been proposed to indicate liver metastases. However, the diagnostic accuracy of tumor markers has not yet been defined.
- The following examples are presented in order to more fully illustrate some embodiments of the invention. They should, in no way be construed, however, as limiting the broad scope of the invention.
- 27 fresh frozen and 141 formalin-fixed paraffin embedded (FFPE) tumor samples obtained from several sources (Sheba Medical Center, Tel-Hashomer, Israel; ABS Inc., Wilmington, Del.; Seoul National University College of Medicine, Seoul, South Korea; Indivumed GmbH, Hamburg, Germany; Soroka University Medical Center, Beer-Sheva, Israel) were used for comparing liver tumors to non-liver tumors and liver metastases. 2 fresh frozen brain normal samples (obtained from Ambion Inc.), 3 fresh-frozen liver tumor samples (obtained from Seoul National University College of Medicine, Seoul, South Korea) and 285 FFPE tumor samples (obtained from Sheba Medical Center, Tel-Hashomer, Israel; Soroka University Medical Center, Beer-Sheva, Israel; Beilinson Hospital, Rabin Medical Center, Petah-Tikva, Israel; ABS Inc., Wilmington, Del.; Tel Aviv Sourasky Medical Center, Tel Aviv, Israel; Bnai Zion Medical Center, Haifa, Israel) were used for comparing brain tumors to normal brain, non-brain tumors and brain metastases. The study protocol was approved by the Research Ethics Board of each of the contributing institutes. Each of the FFPE samples was evaluated by a pathologist for histological type, grade and tumor percentages based on hematoxilin-eosin (H&E) stained slides, performed on the first and/or last sections of the sample. The tumor content was ≧50% in 85% of the FFPE samples. For frozen samples, information was extracted from medical records.
- 252 of the samples were profiled by microRNA microarray. 14 of these samples and 59 additional samples were profiled by qRT-PCR. Histological classification of the study samples is summarized in Table 1a-b.
-
TABLE 1a Summary of sample types, numbers and histology used for comparing liver tumors to non-liver tumors and liver metastases. N Sample category Detail 30 Liver Primary Tumor 30 Liver (7 FFPE, 23 fresh) 63 Non-Liver Primary Tumor 15 Breast (FFPE, 1 identified as adenocarcinoma) 14 Colon (FFPE, 10 identified as adenocarcinoma) 24 Lung (FFPE, 7 adenocarcinoma, 8 squamous cell carcinoma, 1 large cell carcinoma, 3 NSCLC, 1 neuroendocrine SCLC, 1 mixed adeno-squamous carcinoma) 5 Pancreas (FFPE, 4 exocrine adenocarcinoma and 1 pancreatic ducr adenocarcinoma) 5 Stomach (FFPE, adenomcarcinoma) 46 Liver Metastasis of Known Origin 3 Breast (FFPE, adenocarcinoma) 36 Colon (35 FFPE and 1 fresh, adenocarcinoma) 1 Lung (FFPE, adenocarcinoma) 2 Pancreas (FFPE, adenocarcinoma) 3 Rectum (fresh, adenomcarcinoma) 1 Stomach (FFPE, adenomcarcinoma) 5 Liver Metastasis of Unknown Origin 5 Unknown (FFPE, adenocarcinoma) Additional samples in qRT-PCR N validation set, by category Detail 5 Liver Primary Tumor 5 Liver (FFPE) 18 Non-Liver Primary Tumor 2 Ovary and 16 Lung (FFPE) 1 Liver Metastasis of Known Origin 1 Kidney (FFPE) -
TABLE 1b Summary of sample types, numbers and histology used for comparing brain tumors to normal brain, non-brain tumors and brain metastases Samples in microarray N data -by category Detail 15 Brain primary tumors anaplastic astrocytoma (2), anaplastic oligodendroglioma (1), glioblastoma multiforme (7), low grade astrocytoma (3), oligodendroglioma (2) 187 Other primary tumors adipose liposarcoma (4), bladder (1 transitional cell carcinoma), breast (3 including 1 infiltrating lobular carcinoma), cervix (3 adenocarcinoma, 2 squamous cell carcinoma), colon (4 adenocarcinoma), endometrium (7 adenocarcinoma), esophagus (2 adenocarcinoma, 5 squamous cell carcinoma), esophagus-stomach (7 adenocarcinoma), gallbladder (3 adenocarcinoma), kidney (6 renal cell carcinoma), larynx (4 squamous cell carcinoma), liver (2 hepatocellular carcinoma), lung (7 neuroendocrine carcinoid, 1 neuroendocrine large cell, 1 neuroendocrine; mix small cell-large cell, 7 neuroendocrine small cell, 8 non-small cell adenocarcinoma, 3 non-small large cell carcinoma, 8 non-small squamous cell carcinoma, 7 pleura mesothelioma), lymphocytes (10 hodgkin's lymphoma), melanocytes (3 malignant melanoma), meninges (8 meningioma, 1 atypical meningioma), mouth (4 squamous cell carcinoma, 1 keratinizing squamous cell carcinoma), nose (4 squamous cell carcinoma, 1 keratinizing squamous cell carcinoma), ovary (7 serous papillary cancer), pancreas (3 adenocarcinoma, 2 ductal adenocarcinoma, 2 exocrine adenocarcinoma), prostate (7 samples including 2 bph samples) small intestine (7 stromal tumor, 1 adenocarcinoma), stomach adenocarcinoma (5), testis seminoma (3), thymus thymoma (3 type b2, 4 type b3), thyroid (4 carcinoma, 3 papillary carcinoma, 1 papillary tall cell carcinoma), tongue (2 squamous cell carcinoma, 8 keratinizing squamous cell carcinoma), 50 Metastases in brain bladder (1 transitional cell carcinoma), breast (2 carcinoma, 2 adenocarcinoma, 4 ductal carcinoma, 5 infiltrating ductal carcinoma), colon (5 adenocarcinoma), endometrial tumor (1), kidney (2 clear cell carcinoma, 1 renal cell carcinoma), lung (10 including 1 carcinoma, 1 neuroendocrine small-cell carcinoma, 6 non-small cell adenocarcinoma, 1 non-small squamous cell carcinoma), melanocytes (4 melanoma, 2 malignant melanoma), unknown (3 carcinoma, 5 adenocarcinoma, 1 small cell carcinoma, 2 sarcoma), Additional samples in N qRT-PCR validation set Detail 15 Brain primary tumors anaplastic oligodendroglioma (1), astrocytoma (5), glioblastoma qRT-PCR validation multiforme (2), oligodendroglioma (7) set 8 Other primary tumors bladder (1 transitional cell carcinoma), kidney (1 renal cell carcinoma), liver (1 hepatocellular carcinoma), lung (2 including 1 adenocarcinoma, 1 pleura mesothelioma), ovary (1 adenocarcinoma), pancreas (3 adenocarcinoma, 2 ductal adenocarcinoma, 2 exocrine adenocarcinoma), pancreas (1 neuroendocrine carcinoma), thymus thymoma (1 type b2) 10 Metastases in brain breast (2 ductal carcinoma), kidney (3 adenocarcinoma), lung (3 including 1 adenocarcinoma, 2 non-small squamous cell carcinoma), ovary (2 adenocarcinoma) - Total RNA was extracted from both the frozen and the FFPE tissues. From the frozen tissues, a sample of approximately 0.5 cm3 was used per case. Total RNA was extracted using the miRvana miRNA isolation kit (Ambion) according to the manufacturer's instructions. Briefly, the sample was homogenized in a denaturing lysis solution followed by an acid-phenol:chloroform extraction and purification on a glass-fiber filter.
- From the FFPE samples, total RNA was isolated from seven to ten 10-micron-thick tissue sections per case using the miRdictor™ extraction protocol developed at Rosetta Genomics. Briefly, the sample was incubated few times in xylene at 57° C. to remove paraffin excess, followed by ethanol washes. Proteins were degraded by proteinase K solution at 45° C. for few hours. The RNA was extracted with acid phenol:chloroform followed by ethanol precipitation and DNAse digestion. Total RNA quantity and quality was measured by Nanodrop ND-1000 (NanoDrop Technologies, Wilmington, Del.).
- 3. miRdicator™ Array Platform
- Custom microRNA microarrays were produced by printing DNA oligonucleotide probes representing ˜650 DNA oligonucleotide probes representing microRNAs (Sanger database,
version 9 and additional Rosetta validated and predicted miRNAs). Each probe, printed in triplicate, carries up to 22-nt linker at the 3′ end of the microRNA's complement sequence in addition to an amine group used to couple the probes to coated glass slides. 20 μM of each probe were dissolved in 2×SSC+0.0035% SDS and spotted in triplicate on Slide E coated microarray slides (Schott Nexterion, Mainz, Germany) using the BioRobotics MicroGrid II microarrater (Genomic Solutions, Ann Arbor, Mich.) according to the manufacturer's directions. 54 negative control probes were designed using the sense sequences of different microRNAs. Two groups of positive control probes were designed to hybridize to miRdicator™ array (i) synthetic small RNA were spiked to the RNA before labeling to verify the labeling efficiency and (ii) probes for abundant small RNA (e.g. small nuclear RNAs (U43, U49, U24, Z30, U6, U48, U44), 5.8s and 5s ribosomal RNA) are spotted on the array to verify RNA quality. The slides were blocked in a solution containing 50 mM ethanolamine, 1M Tris (pH9.0) and 0.1% SDS for 20 min at 50° C., then thoroughly rinsed with water and spun dry. - 4. Cy-Dye Labeling of miRNA for Microarray
- Up to 5 μg (mean: 4.5 μg) of total RNA were labeled by ligation of an RNA-linker, p-rCrU-Cy/dye (Dharmacon, Lafayette, Colo.), to the 3′-end with Cy3 or Cy5. The labeling reaction contained total RNA, spikes (0.1-20 fmoles), 300 ng RNA-linker-dye, 15% DMSO, 1× ligase buffer and 20 units of T4 RNA ligase (NEB New England Biolabs, Ipswich, Mass.) and proceeded at 4° C. for 1 hr followed by 1 hr at 37° C. The labeled RNA was mixed with 3× hybridization buffer (Ambion, Austin, Tex.), heated to 95° C. for 3 min and than added on top of the miR microarray. Slides were hybridized 12-16 hr in 42° C., followed by two washes in room temperature with 1×SSC and 0.2% SDS and a final wash with 0.1×SSC.
- Arrays were scanned using the Agilent DNA Microarray Scanner Bundle (Agilent Technologies, Santa Clara, Calif.) at resolution of 10 μm at 100% power. Array images were analyzed using the SpotReader software (Niles Scientific, Portola Valley, Calif.).
- The initial data set consisted of signals measured for multiple probes for every sample. Triplicate spots were combined to one signal by taking the logarithmic mean of the reliable spots. All data was log-transformed (natural base) and the analysis was performed in log-space. A reference data vector for normalization R was calculated by taking the median expression level for each probe across all samples in each dataset. For each sample data vector S, a 2nd degree polynomial F was found so as to provide the best fit between the sample data and the reference data, such that R≈F(S). Remote data points (“outliers”) were not used for fitting the polynomial F. For each probe in the sample (element Si in the vector S), the normalized value (in log-space) Mi is calculated from the initial value Si by transforming it with the polynomial function F, so that Mi=F(Si). Data in
FIGS. 1 and 2 was translated back to linear-space by taking the exponent. - 6. qRT-PCR
- One microgram of total RNA was subjected to polyadenylation reaction as described before (Shi and Chiang, 2005, Biotechniques, 39(4):519-25). Briefly, RNA was incubated in the presence of poly (A) polymerase (PAP) (Takara-2180A), MnCl2, and ATP for 1 h at 37° C. Reverse transcription was performed on the poly-adenylated product. An oligo-dT primer harboring a consensus sequence (complementary to the reverse primer) was used for reverse transcription reaction. The primer is first annealed to the poly A-RNA and then subjected to a reverse transcription reaction of SuperScript II RT (Invitrogen). The cDNA was then amplified by real-time PCR reaction, using a miRNA-specific forward primer, TaqMan probe and universal reverse primer. The reactions were incubated for 10 min. at 95° C. followed by 42 cycles of 95° C. for 15 s and 60° C. for 1 min. Normalizing the Ct values (per sample) by the Ct of either U6 snRNA, the Ct of hsa-miR-24, or their average Ct, shifted at most one sample from each side in the test-set classification predictions.
- Normalized expression values for each of the samples of the miRs were calculated. P-values were calculated using two-sided t-test on the logarithm of the signal.
- The combined metric “L” was defined as the
mean log 2 expression of two microRNAs, L≡[log 2(hsa-miR-141 signal)+log 2(hsa-miR-200c signal)]/2. This metric could be used to identify the non-liver primary tumor samples with near-perfect accuracy. The receiver operating characteristic curve (ROC curve) plots the sensitivity against one minus the specificity, and is a measure of classification performance. A random classifier has an area under the curve (AUC) of 0.5, and an optimal classifier with perfect sensitivity and specificity of 100% has an area of 1. The ROC curve of the combination of hsa-miR-141 and hsa-miR-200c has an AUC of 0.999. The decision rule “classify as non-liver when L>10” identified samples as non-liver primary tumors with a sensitivity of 98% and a specificity of 100%, with one pancreatic exocrine adenocarcinoma incorrectly classified as HCC. A more “conservative” cutoff at L=9.5 had a sensitivity of 98% and a specificity of 93% (FIG. 1A ), with two HCC samples incorrectly identified as non-HCC primary tumors. The same combined metric had AUC=0.997 in identifying metastatic liver adenocarcinoma of a known origin from primary HCC (FIG. 1B ), with sensitivity=98% and specificity of 93%. For the qRT-PCR data, LRT≡[(hsa-miR-141 normalized Ct)+(hsa-miR-200c normalized Ct)]/2 had AUC=1. - For comparing GI and non-GI primary tumors, the simple decision rule “classify as GI when the expression of hsa-miR-205 is smaller than half the expression of hsa-miR-194” (
FIG. 2A ) is accurate in all but one case of a stomach primary tumor that is misclassified as non-GI by this rule. The metric (expression of hsa-miR-205)/(expression of hsa-miR-194) had an AUC of 0.989. - For comparing brain tumors, the combined metric B0 was defined as the summed
log 2 expression measured by microarray of hsa-miR-124 and hsa-miR-219-5p: Bo[log 2(hsa-miR-124 signal)+log 2(hsa-miR-219-5p signal)], and had AUC=1 when used to identify primary brain tumors from other primary tumors, but had AUC=0.8987 when used to identify brain primary tumors from brain metastases. The combined metric B1 was defined as the summedlog 2 expression measured by microarray of hsa-miR-9* and hsa-miR-92b: B1≡[log 2(hsa-miR-9* signal)+log 2(hsa-miR-92b signal)], and had AUC=1 when used to identify primary brain tumors from other primary tumors or from brain metastases. - The combined metric BRT* was defined as the summed
log 2 expression levels measured by qRT-PCR data (the Ct values) of hsa-miR-9 and hsa-miR-92b: BRT100-[Ct(hsa-miR-9)+Ct(hsa-miR-92b signal)], had AUC=1 in the training set data and one error in the test-set data when used to identify primary brain tumors from other primary tumors or from brain metastases. The combined metric BRT* was defined as the summed qRT-PCR Ct values of hsa-miR-9* and hsa-miR-92b: BRT*100-[Ct(hsa-miR-9*)+Ct(hsa-miR-92b signal)], had AUC=1 in the training set data and one error in the test-set data when used to identify primary brain tumors from other primary tumors or from brain metastases. -
TABLE 2 miR and hairpin SEQ ID NOS: miR name MID HID hsa-miR-194 1 2, 3 hsa-miR-205 4 5 hsa-miR-141 6 7 hsa-miR- 200c 8 9 hsa-miR- 200a 10 11 hsa-miR- 200b 12 13 hsa-miR- 92b 14 15 hsa-miR- 124a 16 17-19 hsa-miR-9* 20 21-23 hsa-miR-219 24 25-26 hsa-miR-9 27 21-23 hsa-miR-128a 28 29 hsa-miR- 128b 30 31 hsa-miR- 122a 32 33 miR name: is the miRBase registry name (release 9.1). MID: is the SEQ ID NO of the mature microRNA. HID: is the SEQ ID NO of the microRNA hairpin precursor (Pre-microRNA). -
TABLE 3 Primers and probes SEQ ID NOS: Fwd Primer MGB probe Rev Primer Target Sequence Sequence Sequence miR name SEQ ID NO: SEQ ID NO: SEQ ID NO: hsa-miR-124 34 39 44 hsa-miR-9 35 40 hsa-miR-9* 36 41 hsa-miR-92b 37 42 U6 38 43 - microRNA expression levels were profiled in 144 tumor samples including 30 primary HCC samples, 63 primary tumors from epithelial origins, 46 liver metastases from epithelial origins, and 5 adenocarcinoma metastases to the liver from unknown origin. The primary HCC samples were compared to the other primary tumors and to the liver metastases samples. Hsa-miR-122a (SEQ ID NO: 32), which is a highly liver-specific microRNA, had the strongest effect when comparing primary HCC tumors to other primary tumors with a fold-change>90, and could identify HCC from other primary tumors (p-value=1.4e-38, AUC=1). However, this microRNA is also found at high levels in the RNA extracted from liver metastases (
FIG. 1A ), ostensibly due to contamination from the adjacent normal liver tissue, and is not a good marker for identifying liver metastases (fold-change of medians 1.1, p-value=0.28, AUC=0.56). By using high-throughput profiling, the microRNA family of hsa-miR-200a,b,c (SEQ ID NOS: 8, 10 and 12) and hsa-miR-141 (SEQ ID NO: 6) were identified as strongly expressed in primary tumors from epithelial origins of common liver metastases, but are not expressed in liver primary tumors (p-value<1e-11, AUC>0.9 for each). Because these microRNAs are not expressed in the liver background, they are also useful in distinguishing between primary HCC tumors and metastatic tumors to the liver (p-value<1e-11, AUC>0.9 for each). For these microRNAs, unlike hsa-miR-122a, the expression level in the liver metastases is similar to the expression level in the non-HCC primary tumors, in sharp contrast to their expression level in the liver primary tumors (FIG. 1A ). - Of this family of microRNAs, the strongest effect was found for hsa-miR-200c and hsa-miR-141 (
FIG. 1B ). Each of these microRNAs could be used to distinguish between primary HCC tumors and metastatic tumors to the liver with very high accuracy (AUC>0.98). The expression level of these two microRNAs can be combined to create a powerful classifier. The combined metric L is defined as the sum of the logarithm (base 2) of the signals of the two microRNAs, providing robustness to the classifier by adding two signals. A simple decision rule, “classify as HCC if L≦18, classify as non-HCC if L>18”, has a sensitivity of 98% in identifying non-HCC samples and a specificity of 93%, with only 4 errors of 144 samples (AUC=0.9980). A more conservative classifier can be defined by allowing a margin for uncertainty, offactor 4 above or below the threshold (equivalent to two cycles in qRT-PCR measurements). The classification rule “classify as HCC if L<16, classify as non-HCC if L>20, leave unidentified if 16≦L≦20” leaves 7 samples of 144 (<5%) unclassified, and correctly classifies all other samples, including 5 cases of metastatic liver adenocarcinoma of unknown origin (FIG. 1B ). - These findings were validated by qRT-PCR, measuring the expression levels of these microRNAs in 31 samples including 24 new samples. The qRT-PCR data showed an identical pattern, with hsa-miR-122a high in all samples from the liver (
FIG. 1C ), and the miR-200 family specifically high in samples of non-liver origin (FIG. 1C ). Again, a simple combination of hsa-miR-200c and hsa-miR-141 could identify primary from metastatic liver tumors with near-perfect accuracy (FIG. 1D ). - Similar results were obtained by using the combination of hsa-miR-200a (SEQ ID NO: 10) and hsa-miR-200b (SEQ ID NO: 12) (
FIG. 3A-B , p-value<2*10̂-12 for each comparing hepatocellular carcinoma samples to non-hepatic primary tumor samples or to metastatic liver adenocarcinoma of a known origin or to both together). -
TABLE 4 microRNA expression in HCC primary tumors compared to their expression in other primary tumors and metastases, from microarray data. HCC primary liver tumors vs.: Compared to other Compared to metastases Compared to both other primary tumors in liver primary and metastases microRNA or metric: p- fold- p- fold- p- fold- value† chng‡ AUC value† chng‡ AUC value† chng‡ AUC hsa-miR-122‡ 1.4E−38 91.1 1.0000 2.8E−01 1.1 0.5623 6.3E−09 56.7 0.8193 hsa-miR-200a 4.8E−13 1/5.7 0.9185 7.3E−14 1/4.8 0.9492 1.3E−17 1/5.1 0.9277 hsa-miR-200b 1.5E−15 1/5.7 0.9339 1.9E−12 1/4.3 0.9297 2.7E−19 1/4.6 0.9249 hsa-miR-200c 1.7E−34 1/25 0.9947 8.9E−27 1/23 0.9877 1.0E−46 1/25 0.9921 hsa-miR-141 1.7E−39 1/44 0.9979 1.2E−27 1/27 0.9905 1.5E−48 1/35 0.9950 L† 5.7E−38 1/989 0.9984 4.2E−29 1/627 0.9971 2.1E−50 1/866 0.9980 †P-values are calculated on log-signal of microRNA expression, measure by microarray, and on L which is in log-space (methods). ‡“fold-chng” is the fold change, calculated by the median signal in HCC divided by the median signal in other tissues. For hsa-miR-122a, fold change is greater than 1 indicating a higher expression in HCC. For all other rows, signal is lower in HCC. -
TABLE 5 microRNA expression in HCC primary tumors compared to their expression in other primary tumors and metastases, from qRT-PCR data. HCC primary liver tumors vs.: Compared to other Compared to metastases Compared to both other primary tumors in liver primary and metastases microRNA or metric: p- fold- p- fold- p- fold- value† chng‡ AUC value† chng‡ AUC value† chng‡ AUC hsa-miR-122‡ 1.5E−10 5E+4 1.0000 8.3E−01 1.0 0.4857 2.6E−03 3E+4 0.8615 hsa-miR-200a 4.4E−09 1/883 1.0000 5.3E−03 1/89 0.9286 8.1E−07 1/542 0.9808 hsa-miR-200b 2.5E−09 1/80 1.0000 1.5E−03 1/16 1.0000 4.0E−08 1/60 1.0000 hsa-miR-200c 9.9E−14 1/643 1.0000 3.5E−05 1/678 1.0000 1.2E−12 1/661 1.0000 hsa-miR-141 2.3E−11 1/323 1.0000 5.7E−04 1/164 1.0000 8.1E−10 1/305 1.0000 LRT † 3.0E−13 1/319 1.0000 1.1E−04 1/230 1.0000 1.2E−11 1/279 1.0000 †P-values are calculated on normalized Ct values of microRNA expression by qRT-PCR, and LRT, which is the average Ct of hsa-miR-200c and hsa-miR-141 (see methods). ‡“fold-chng” is the fold change, calculated by the median signal (2Ct) in HCC divided by the median signal (2Ct) in other tissues. For hsa-miR-122a, fold change is greater than 1 indicating a higher expression in HCC. For all other rows, signal is lower in HCC. - The microRNA expression can provide further information on the possible origin of liver metastases. Another pair of microRNAs, hsa-miR-194 (SEQ ID NO: 1) and hsa-miR-205 (SEQ ID NO: 4), had significant different expression (p-value<1e-12 for each) in primary tumors from gastrointestinal (GI) origin (14 colon, 5 pancreas, 5 stomach) compared to primary tumors of non-GI epithelial origin (24 lung, 15 breast). The ratio of these expression levels could be used to accurately identify primary tumors from non-GI origin: the decision rule “classify as non-GI primary when the expression of hsa-miR-205 is greater than half the expression of hsa-miR-194” (
FIG. 2A ; dashed line marks the decision boundary) had a sensitivity of 100% and specificity of 96% (AUC=0.9893). In the liver metastases, despite the small number of samples, this trend was maintained (FIG. 2B ). However, since hsa-miR-194 is also highly expressed in liver tissue, the contamination of these metastases by the surrounding liver tissue raised significantly the expression of hsa-miR-194, and thereby reduced the significance of its differential expression (p-value=0.0011 for hsa-miR-205, but only p-value=0.15 for hsa-miR-194). Nevertheless, a very high ratio of expression of hsa-miR-194 to hsa-miR-205 is observed only in metastases of GI origin and can thus be used for their identification (AUC=0.8988). - microRNA expression levels were profiled on a microarray platform in 252 tumor samples including 15 brain primary tumor samples, 187 non-brain primary tumors, 50 brain metastases from various tissue origins and 2 normal brain samples. The brain primary tumor samples were compared to the other primary tumor samples, to normal brain samples and to samples of brain-located metastases (Table 6). Hsa-miR-124 (SEQ ID NO: 16), which is highly specific to the nervous system, displayed the greatest disparity in expression when comparing brain primary tumors to other primary tumors, with a fold-change of ˜100 (p-value=5.1e-57, AUC=0.9976, see Table 6). A combination of hsa-miR-124 and hsa-miR-219 (SEQ ID NO: 24) (B0, see methods) could be used to distinguish brain primary tumors from non-brain primary tumors with 100% accuracy (
FIG. 4A ). Other brain-specific microRNAs such as hsa-miR-128a (SEQ ID NO: 28) and hsa-miR-128b (SEQ ID NO: 30) also showed very strong differential expression between brain primary tumors and other primary tumors (p-value<4e-28, AUC=0.9932, seeFIG. 5A ). However, these four microRNAs are highly expressed in normal brain and are also found at high levels in RNA extracted from brain metastases (FIG. 4A ). This latter effect, ostensibly due to contamination from the adjacent normal brain tissue, limits the utility of these microRNAs to serve as biomarkers for differentiating between brain primary tumors and brain-located metastases (AUC of 0.85˜0.95, see Table 6). - In addition to the aforementioned four microRNAs (hsa-miR-124, hsa-miR-219-5p, hsa-miR-128a and hsa-miR-128b) hsa-miR-9* (SEQ ID NO: 20) and hsa-miR-92b (SEQ ID NO: 14) are expressed specifically in brain tumors and not expressed in other tumor types (
FIG. 4B , AUC>0.99). Importantly, these two microRNAs also differentiate accurately between brain primary tumors and metastatic tumors located in the brain (p-value<3e-18, AUC>0.99 for each) or normal brain samples. Indeed, using a combination of hsa-miR-9* and hsa-miR-92b expression (B1, see methods) it is possible to distinguish brain primary tumor samples from all other samples with 100% accuracy in the microarray data (FIG. 4B ). A simple decision rule, “classify as primary brain tumor if B1>19, classify as other if B1<19”, identifies correctly all samples. A more conservative classifier can be defined by allowing a margin for uncertainty offactor 2 above or below the threshold (equivalent to one cycle in qRT-PCR measurements). The classification rule “classify as brain primary if B1>20, classify as other if B1<18, leave unidentified if 18≦B1≦20” leaves only 2 samples out of 252 (<1%) as unclassified (FIG. 4B ), and classifies correctly all other samples. - To validate these findings, 14 of these samples and 33 additional samples were profiled by qRT-PCR, for four potential biomarkers: hsa-miR-124, hsa-miR-9, hsa-miR-9* and hsa-miR-92b (Table 7), and two controls: hsa-miR-24, which was found to be relatively constantly expressed in the microarray data, and snRNA U6. These microRNAs showed the same pattern as observed in the microarray data (Table 7). Hsa-miR-124 showed strong expression in the brain primary tumors, weak expression in other primary tumors, and intermediate expression in the metastases (
FIG. 6A ). Thus, hsa-miR-124 was not a good candidate for identifying metastatic tumors to the brain. On the other hand, hsa-miR-9 (SEQ ID NO: 27), hsa-miR-9* (SEQ ID NO: 20), and hsa-miR-92b (SEQ ID NO: 14) showed specific strong expression in primary brain tumors with lower expression in other tumors and in metastases to the brain (FIG. 6 ), with significant differences and strong separability between brain primary tumors and brain metastases (Table 7). - Combinations of hsa-miR-92b with either hsa-miR-9 (BRT) or with hsa-miR-9* (BRT*) were defined by summing their qRT-PCR Ct values (see Methods). A threshold was selected for classification for each combination using half of the samples as a training set. The classification accuracy was then tested on the second half of the data set which was used as a test set. The classifications on the test set were near perfect with one outlier of 23 samples, reaching 100% accuracy in identifying non-brain primary tumors from brain primary tumors, and 88% sensitivity with 100% specificity in identifying metastatic brain tumors from brain primary tumors, for both BRT (
FIG. 6B ) and BRT*. Indeed, these combinations show significant differences in expression that can be used to classify primary from metastatic brain tumors (Table 7). - Based on these data, it is proposed that hsa-miR-9/9* and hsa-miR-92b, and their combination, represent new biomarkers that can be used to classify brain malignancies—primary versus secondary.
- The gene encoding hsa-miR-9/9* appears in the human genome in three places, in
1, 5, and 14, each an identical copy. Hsa-miR-92b is found on chromosome 1 and differs by only one nucleotide in its first 20 from hsa-miR-92a, a member of the oncogenic miR-17-92 cluster. However, the expression patterns of hsa-miR-92a correlates only very weakly with that of and hsa-miR-92b and does not enable classification of brain malignancies (Table 6).chromosomes -
TABLE 6 Comparison between microRNA expression in primary brain tumors and expression in other primary tumors or expression in brain metastases, based on microRNA microarray data Primary brain vs. Other primary tumors Brain metastases microRNA or metric: fold- fold- p-value† chng‡ AUC p-value† chng‡ AUC hsa-miR-124 1.4E−54 97.1 0.9975 5.4E−06 12.6 0.8600 hsa-miR-219-5p 9.7E−43 10.0 0.9679 4.1E−09 6.9 0.8840 B0 † 1.8E−49 293.0 1.0000 9.0E−09 27.7 0.8987 hsa-miR-128a 5.4E−27 9.3 0.9929 4.5E−11 4.2 0.9507 hsa-miR-128b 4.6E−31 9.0 0.9932 1.7E−11 4.5 0.9520 hsa-miR-9* 1.4E−64 31.3 1.0000 9.1E−22 18.9 0.9933 hsa-miR-92b 1.8E−26 7.3 0.9993 2.1E−18 5.8 1.0000 B1 † 1.7E−57 205.9 1.0000 3.3E−26 128.7 1.0000 See Material and Methods for definitions. †P-values are calculated on log-signal of microRNAs, and on B0 and B1, which are in log-space. Less that 1000 probes were tested, and even after the more severe Bonferroni correction (multiplying each p-value by ~1000), the p-values remain highly significant. ‡“fold-chng” is the fold change, calculated by dividing the median signal in brain primary tumors by the median signal in other tissues. -
TABLE 7 Comparison between microRNA expression in primary brain tumors and expression in other primary tumors or expression in brain metastases, based on microRNA qRT-PCR data. Primary brain vs. Other primary tumors Brain metastases microRNA or metric: fold- fold- p-value† chng‡ AUC p-value† chng‡ AUC hsa-miR-124 4.7E−9 2144 1.0000 1.4E−4 48 0.8633 hsa-miR-9 2.3E−11 17648 1.0000 2.0 E− 11543 0.9833 hsa-miR-9* 1.5E−11 1887 1.0000 1.4 E− 12415 0.9922 hsa-miR-92b 1.7E−6 16 0.9542 7.2E−7 8 0.9219 BRT 1.1E−10 2.9E+5 1.0000 7.7E−12 9993 0.9961 BRT* 2.8 E− 1011868 1.0000 6.4E−12 2428 1.0000 †P-values are calculated on measured Ct values and on BRT and BRT*, which are in log-space. Here only the listed 4 potential biomarkers and two combinations were tested, and no correction for multiple hypothesis testing is needed. ‡“fold-chng” is the fold change, calculated by converting the data to linear space (by taking the exponent base 2) and dividing the median signal in brain primary tumors by the median signal in other tissues. - The foregoing description of the specific embodiments so fully reveals the general nature of the invention that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without undue experimentation and without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.
- It should be understood that the detailed description and specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
Claims (20)
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/782,067 US20100273172A1 (en) | 2007-03-27 | 2010-05-18 | Micrornas expression signature for determination of tumors origin |
| US14/284,256 US9803247B2 (en) | 2007-03-27 | 2014-05-21 | MicroRNAs expression signature for determination of tumors origin |
Applications Claiming Priority (10)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US90726607P | 2007-03-27 | 2007-03-27 | |
| US92924407P | 2007-06-19 | 2007-06-19 | |
| US98945807P | 2007-11-21 | 2007-11-21 | |
| US2456508P | 2008-01-30 | 2008-01-30 | |
| PCT/IL2008/000396 WO2008117278A2 (en) | 2007-03-27 | 2008-03-20 | Gene expression signature for classification of cancers |
| US4340708P | 2008-04-09 | 2008-04-09 | |
| US7377408P | 2008-06-19 | 2008-06-19 | |
| PCT/IL2008/001525 WO2009066291A2 (en) | 2007-11-21 | 2008-11-20 | Micrornas expression signature for determination of tumors origin |
| US53294009A | 2009-09-24 | 2009-09-24 | |
| US12/782,067 US20100273172A1 (en) | 2007-03-27 | 2010-05-18 | Micrornas expression signature for determination of tumors origin |
Related Parent Applications (4)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/IL2008/000396 Continuation-In-Part WO2008117278A2 (en) | 2007-03-27 | 2008-03-20 | Gene expression signature for classification of cancers |
| US12/532,940 Continuation-In-Part US20100178653A1 (en) | 2007-03-27 | 2008-03-20 | Gene expression signature for classification of cancers |
| PCT/IL2008/001525 Continuation-In-Part WO2009066291A2 (en) | 2007-03-27 | 2008-11-20 | Micrornas expression signature for determination of tumors origin |
| US53294009A Continuation-In-Part | 2007-03-27 | 2009-09-24 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/284,256 Continuation US9803247B2 (en) | 2007-03-27 | 2014-05-21 | MicroRNAs expression signature for determination of tumors origin |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20100273172A1 true US20100273172A1 (en) | 2010-10-28 |
Family
ID=42992485
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/782,067 Abandoned US20100273172A1 (en) | 2007-03-27 | 2010-05-18 | Micrornas expression signature for determination of tumors origin |
| US14/284,256 Active US9803247B2 (en) | 2007-03-27 | 2014-05-21 | MicroRNAs expression signature for determination of tumors origin |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/284,256 Active US9803247B2 (en) | 2007-03-27 | 2014-05-21 | MicroRNAs expression signature for determination of tumors origin |
Country Status (1)
| Country | Link |
|---|---|
| US (2) | US20100273172A1 (en) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120258442A1 (en) * | 2011-04-09 | 2012-10-11 | bio Theranostics, Inc. | Determining tumor origin |
| WO2013017861A2 (en) | 2011-07-29 | 2013-02-07 | University Of East Anglia | Analysing sequencing bias |
| US20130259839A1 (en) * | 2007-03-27 | 2013-10-03 | Ranit Aharonov | Gene expression signature for classification of tissue of origin of tumor samples |
| WO2014071082A3 (en) * | 2012-10-31 | 2014-06-26 | Institute For Systems Biology | Detection of brain cancer types |
| WO2015056195A1 (en) * | 2013-10-15 | 2015-04-23 | Warszawski Uniwersytet Medyczny | Use of microrna markers for diagnosis of liver lesions |
| US10538816B2 (en) | 2004-06-04 | 2020-01-21 | Biotheranostics, Inc. | Identification of tumors |
| US11430544B2 (en) | 2005-06-03 | 2022-08-30 | Biotheranostics, Inc. | Identification of tumors and tissues |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8338109B2 (en) | 2006-11-02 | 2012-12-25 | Mayo Foundation For Medical Education And Research | Predicting cancer outcome |
| EP2806054A1 (en) | 2008-05-28 | 2014-11-26 | Genomedx Biosciences Inc. | Systems and methods for expression-based discrimination of distinct clinical disease states in prostate cancer |
| US10407731B2 (en) | 2008-05-30 | 2019-09-10 | Mayo Foundation For Medical Education And Research | Biomarker panels for predicting prostate cancer outcomes |
| US9495515B1 (en) | 2009-12-09 | 2016-11-15 | Veracyte, Inc. | Algorithms for disease diagnostics |
| US10236078B2 (en) | 2008-11-17 | 2019-03-19 | Veracyte, Inc. | Methods for processing or analyzing a sample of thyroid tissue |
| US9074258B2 (en) | 2009-03-04 | 2015-07-07 | Genomedx Biosciences Inc. | Compositions and methods for classifying thyroid nodule disease |
| US8669057B2 (en) | 2009-05-07 | 2014-03-11 | Veracyte, Inc. | Methods and compositions for diagnosis of thyroid conditions |
| US10446272B2 (en) | 2009-12-09 | 2019-10-15 | Veracyte, Inc. | Methods and compositions for classification of samples |
| US20130267443A1 (en) | 2010-11-19 | 2013-10-10 | The Regents Of The University Of Michigan | ncRNA AND USES THEREOF |
| EP2791359B1 (en) | 2011-12-13 | 2020-01-15 | Decipher Biosciences, Inc. | Cancer diagnostics using non-coding transcripts |
| CA2881627A1 (en) | 2012-08-16 | 2014-02-20 | Genomedx Biosciences Inc. | Cancer diagnostics using biomarkers |
| US10526655B2 (en) | 2013-03-14 | 2020-01-07 | Veracyte, Inc. | Methods for evaluating COPD status |
| US11976329B2 (en) | 2013-03-15 | 2024-05-07 | Veracyte, Inc. | Methods and systems for detecting usual interstitial pneumonia |
| US12297505B2 (en) | 2014-07-14 | 2025-05-13 | Veracyte, Inc. | Algorithms for disease diagnostics |
| US20170335396A1 (en) | 2014-11-05 | 2017-11-23 | Veracyte, Inc. | Systems and methods of diagnosing idiopathic pulmonary fibrosis on transbronchial biopsies using machine learning and high dimensional transcriptional data |
| AU2017315425B2 (en) | 2016-08-24 | 2023-11-09 | The Regents Of The University Of Michigan | Use of genomic signatures to predict responsiveness of patients with prostate cancer to post-operative radiation therapy |
| EP3571322B9 (en) | 2017-01-20 | 2023-10-04 | VERACYTE SD, Inc. | Molecular subtyping, prognosis, and treatment of bladder cancer |
| EP3593140A4 (en) | 2017-03-09 | 2021-01-06 | Decipher Biosciences, Inc. | SUBTYPING PROSTATE CANCER TO PREDICT RESPONSE TO HORMONE THERAPY |
| US11078542B2 (en) | 2017-05-12 | 2021-08-03 | Decipher Biosciences, Inc. | Genetic signatures to predict prostate cancer metastasis and identify tumor aggressiveness |
| US11217329B1 (en) | 2017-06-23 | 2022-01-04 | Veracyte, Inc. | Methods and systems for determining biological sample integrity |
Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5034506A (en) * | 1985-03-15 | 1991-07-23 | Anti-Gene Development Group | Uncharged morpholino-based polymers having achiral intersubunit linkages |
| US5235033A (en) * | 1985-03-15 | 1993-08-10 | Anti-Gene Development Group | Alpha-morpholino ribonucleoside derivatives and polymers thereof |
| US6506559B1 (en) * | 1997-12-23 | 2003-01-14 | Carnegie Institute Of Washington | Genetic inhibition by double-stranded RNA |
| US20030225526A1 (en) * | 2001-11-14 | 2003-12-04 | Golub Todd R. | Molecular cancer diagnosis using tumor gene expression signature |
| US20050182005A1 (en) * | 2004-02-13 | 2005-08-18 | Tuschl Thomas H. | Anti-microRNA oligonucleotide molecules |
| WO2005118806A2 (en) * | 2004-05-28 | 2005-12-15 | Ambion, Inc. | METHODS AND COMPOSITIONS INVOLVING MicroRNA |
| US20070065844A1 (en) * | 2005-06-08 | 2007-03-22 | Massachusetts Institute Of Technology | Solution-based methods for RNA expression profiling |
| US20080269072A1 (en) * | 2004-10-21 | 2008-10-30 | Hart Ronald P | Rational Probe Optimization for Detection of MicroRNAs |
| US20080306018A1 (en) * | 2006-01-05 | 2008-12-11 | The Ohio State University | Micro-Rna Expression Abnormalities of Pancreatic, Endocrine and Acinar Tumors |
| US7745608B2 (en) * | 2003-04-17 | 2010-06-29 | Alnylam Pharmaceuticals, Inc. | Modified iRNA agents |
| US20100286044A1 (en) * | 2005-12-29 | 2010-11-11 | Exiqon A/S | Detection of tissue origin of cancer |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7354710B2 (en) * | 2001-07-11 | 2008-04-08 | Genomics Usa, Inc. | Methods and devices based upon a novel form of nucleic acid duplex on a surface |
| EP2284265B1 (en) * | 2004-11-12 | 2015-01-07 | Asuragen, Inc. | Methods and compositions involving miRNA and miRNA inhibitor molecules |
| CA2595716A1 (en) | 2005-01-25 | 2006-08-03 | Rosetta Inpharmatics Llc | Methods for quantitating small rna molecules |
| US20070092882A1 (en) | 2005-10-21 | 2007-04-26 | Hui Wang | Analysis of microRNA |
| CA2672246A1 (en) | 2006-06-30 | 2008-03-13 | Rosetta Genomics Ltd | Detecting nucleic acids |
-
2010
- 2010-05-18 US US12/782,067 patent/US20100273172A1/en not_active Abandoned
-
2014
- 2014-05-21 US US14/284,256 patent/US9803247B2/en active Active
Patent Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5034506A (en) * | 1985-03-15 | 1991-07-23 | Anti-Gene Development Group | Uncharged morpholino-based polymers having achiral intersubunit linkages |
| US5235033A (en) * | 1985-03-15 | 1993-08-10 | Anti-Gene Development Group | Alpha-morpholino ribonucleoside derivatives and polymers thereof |
| US6506559B1 (en) * | 1997-12-23 | 2003-01-14 | Carnegie Institute Of Washington | Genetic inhibition by double-stranded RNA |
| US20030225526A1 (en) * | 2001-11-14 | 2003-12-04 | Golub Todd R. | Molecular cancer diagnosis using tumor gene expression signature |
| US7745608B2 (en) * | 2003-04-17 | 2010-06-29 | Alnylam Pharmaceuticals, Inc. | Modified iRNA agents |
| US20050182005A1 (en) * | 2004-02-13 | 2005-08-18 | Tuschl Thomas H. | Anti-microRNA oligonucleotide molecules |
| WO2005118806A2 (en) * | 2004-05-28 | 2005-12-15 | Ambion, Inc. | METHODS AND COMPOSITIONS INVOLVING MicroRNA |
| US20080269072A1 (en) * | 2004-10-21 | 2008-10-30 | Hart Ronald P | Rational Probe Optimization for Detection of MicroRNAs |
| US20070065844A1 (en) * | 2005-06-08 | 2007-03-22 | Massachusetts Institute Of Technology | Solution-based methods for RNA expression profiling |
| US20100286044A1 (en) * | 2005-12-29 | 2010-11-11 | Exiqon A/S | Detection of tissue origin of cancer |
| US20080306018A1 (en) * | 2006-01-05 | 2008-12-11 | The Ohio State University | Micro-Rna Expression Abnormalities of Pancreatic, Endocrine and Acinar Tumors |
Non-Patent Citations (2)
| Title |
|---|
| Notterman et al, in Microarrays and Cancer Research, 2002, Warrington et al (eds.), Eaton Publishing, Westborough, MA, pp. 81-111 * |
| Strausberg et al, in Microarrays and Cancer Research, 2002, Warrington et al (eds.), Eaton Publishing, Westborough, MA, pp. xi-xvi * |
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9670553B2 (en) | 2004-06-04 | 2017-06-06 | Biotheranostics, Inc. | Determining tumor origin |
| US10538816B2 (en) | 2004-06-04 | 2020-01-21 | Biotheranostics, Inc. | Identification of tumors |
| US11430544B2 (en) | 2005-06-03 | 2022-08-30 | Biotheranostics, Inc. | Identification of tumors and tissues |
| US20130259839A1 (en) * | 2007-03-27 | 2013-10-03 | Ranit Aharonov | Gene expression signature for classification of tissue of origin of tumor samples |
| US9096906B2 (en) * | 2007-03-27 | 2015-08-04 | Rosetta Genomics Ltd. | Gene expression signature for classification of tissue of origin of tumor samples |
| US20120258442A1 (en) * | 2011-04-09 | 2012-10-11 | bio Theranostics, Inc. | Determining tumor origin |
| WO2013017861A2 (en) | 2011-07-29 | 2013-02-07 | University Of East Anglia | Analysing sequencing bias |
| WO2014071082A3 (en) * | 2012-10-31 | 2014-06-26 | Institute For Systems Biology | Detection of brain cancer types |
| WO2015056195A1 (en) * | 2013-10-15 | 2015-04-23 | Warszawski Uniwersytet Medyczny | Use of microrna markers for diagnosis of liver lesions |
Also Published As
| Publication number | Publication date |
|---|---|
| US9803247B2 (en) | 2017-10-31 |
| US20160115546A1 (en) | 2016-04-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9803247B2 (en) | MicroRNAs expression signature for determination of tumors origin | |
| US20190241966A1 (en) | Gene Expression Signature for Classification of Tissue of Origin of Tumor Samples | |
| US9133522B2 (en) | Compositions and methods for the diagnosis and prognosis of mesothelioma | |
| US9096906B2 (en) | Gene expression signature for classification of tissue of origin of tumor samples | |
| US20150099665A1 (en) | Methods for distinguishing between specific types of lung cancers | |
| US20100178653A1 (en) | Gene expression signature for classification of cancers | |
| WO2010073248A2 (en) | Gene expression signature for classification of tissue of origin of tumor samples | |
| WO2007148235A2 (en) | Cancer-related nucleic acids | |
| US9068232B2 (en) | Gene expression signature for classification of kidney tumors | |
| EP2643479A2 (en) | Methods and materials for classification of tissue of origin of tumor samples | |
| US9914972B2 (en) | Methods for lung cancer classification | |
| US9834821B2 (en) | Diagnosis and prognosis of various types of cancers | |
| WO2009066291A2 (en) | Micrornas expression signature for determination of tumors origin | |
| WO2010004562A2 (en) | Methods and compositions for detecting colorectal cancer | |
| US9340823B2 (en) | Gene expression signature for classification of kidney tumors | |
| WO2011039757A2 (en) | Compositions and methods for prognosis of renal cancer |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: ROSETTA GENOMICS LTD., ISRAEL Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ROSENFELD, NITZAN;ROSENWALD, SHAI;BARSHACK, IRIS;AND OTHERS;SIGNING DATES FROM 20100620 TO 20100624;REEL/FRAME:024690/0176 Owner name: TEL HASHOMER MEDICAL RESEARCH INFRASTRUCTUE AND SE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ROSENFELD, NITZAN;ROSENWALD, SHAI;BARSHACK, IRIS;AND OTHERS;SIGNING DATES FROM 20100620 TO 20100624;REEL/FRAME:024690/0176 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |