Summary of the invention
The technical matters that the present invention will solve provides medical field degree of depth answering method and medical retrieval system, improves question and answer precision as a result.
According to an aspect of the present invention, a kind of degree of depth answering method is provided, comprises:
Data are putd question in step 1, reception;
Step 2, in repository database, carry out first the retrieval; Said repository database comprises the factual information in the encyclopaedia; With
Step 3, concentrate in self-defining data and to carry out the 3rd retrieval, comprising:
Step 3.1, based on the sentence pattern set of patterns, utilize the method for machine learning to give said problem classification, the sentence pattern pattern of problem identificatioin; With
Step 3.2, use the sentence pattern pattern matching problem, obtain second type of key word, with second type of key search self-defining data collection.
Optional, described degree of depth answering method also comprises:
Step 4, carry out second retrieval in to database in question and answer; Said question and answer comprise that to database verified question and answer accurately are to information.
Optional, the generation method of the sentence pattern set of patterns of step 3.1 comprises:
Step 3.1.1, set up the self-defining data collection;
The sentence pattern set of patterns is set up in step 3.1.2a, manual work; And/or
Step 3.1.2b, through artificial labeled data, training data obtains the sentence pattern set of patterns.
Optional, said step 3 also comprises:
Step 3.3, utilize second type of keyword retrieval repository database.
Optional, step 1 comprises: from the text message of said enquirement extracting data problem.
Optional, step 2 comprises:
Step 2.1, be unit, each sentence is carried out word segmentation processing with the sentence in the problem;
Step 2.2, from the word segmentation processing result, extract first kind keyword; With
Step 2.3, utilize first kind keyword retrieval repository database.
Optional, step 4 comprises:
Step 4.1, retrieval question and answer are calculated said problem and the question and answer similarity to the record in the database to database; With
If step 4.2 exists similarity to reach the record of certain threshold value, according to the size of similarity said record is carried out rank, obtain the top n matching result, N is a natural number.
According to a further aspect of the present invention, a kind of medical retrieval system is provided, comprises:
Load module is used to receive the input from the user;
First retrieval module is used for the retrieval knowledge library database; Said repository database comprises the factual information in the encyclopaedia; With
The 3rd retrieval module is used to retrieve the self-defining data collection;
Wherein, said the 3rd retrieval module also comprises:
Sentence pattern pattern analysis module is used for based on the sentence pattern set of patterns, utilizes the method for machine learning to give said problem classification, the sentence pattern pattern of problem identificatioin; With
The second generic key word extracts and retrieval module, is used to utilize the sentence pattern pattern to come matching problem to obtain second type of key word, uses second type of key search self-defining data collection then.
Optional, said medical retrieval system also comprises:
Second retrieval module is used to retrieve question and answer to database; Said question and answer comprise that to database verified question and answer accurately are to information.
Optional, said second retrieval module also comprises:
Similarity calculation module is used to retrieve question and answer to database, calculates said problem and the question and answer similarity to the record in the database; With
Feedback module if exist similarity to reach the record of certain threshold value, carries out rank according to the size of similarity to said record as a result, obtains the top n matching result, and N is a natural number.
Optional, said load module also is used for from the text message of said enquirement extracting data problem.
Optional, said first retrieval module also comprises:
Word-dividing mode, the sentence that is used for problem is a unit, and each sentence is carried out word segmentation processing; With
First kind keyword extraction and retrieval module are used for extracting first kind keyword from the word segmentation processing result, utilize first kind keyword retrieval repository database then.
Compared with prior art, the invention has the advantages that: set up three kinds of databases, distinct type data-base is carried out the retrieval of distinct methods, improved the precision of result for retrieval; Wherein, extract the precision of all right further raising of attribute result for retrieval such as problem domain.
Embodiment
In order to make the object of the invention, technical scheme and advantage clearer, below in conjunction with accompanying drawing, to further explain of the present invention.Should be appreciated that specific embodiment described herein only in order to explanation the present invention, and be not used in qualification the present invention.
(Question Answering System QA) is a kind of advanced form of information retrieval system to question answering system.It can use accurate, succinct natural language to answer the problem that the user proposes with natural language.The main cause that its research is risen is that people are to obtain the demand of information quickly and accurately.Question answering system is research direction that receives much attention and have the broad development prospect in present artificial intelligence and the natural language processing field.
The inventor finds after deliberation, based on the personal information and the health of patient's input, through in extensive text/data, excavating similar state of an illness information and returning the highest result of similarity (for example returning first three), can improve the question and answer precision.Wherein large-scale data comprises: (1) obtains factual information from encyclopaedia, the artificial knowledge hierarchy of setting up, set up " knowledge base "; (2) from community question and answer website, obtain question and answer, set up " question and answer are to database " information; (3) set up " self-defining data collection " by other data.
(1) repository database is called for short knowledge base, from the artificial knowledge hierarchy of setting up of encyclopaedia and other, obtains factual information, and with forming knowledge base in these information importing non-relational databases.Because the content of need clear in structure, mating easily when information such as importing encyclopaedia, is main with structured messages such as business card, form, tabulations mainly.The notion of some repetitions can be put in order through manual method; Be fused in the tangible system; For example: disease is divided into internal medicine, surgery, gynemetrics, paediatrics; Tell concrete disease below each subject again in detail; Every kind of disease all can have concrete symptom, every kind of symptom or disease be lack that which nutrient causes or infect that which virus causes or because other what reasons cause, every kind of disease generally be when, weather, which kind of environment morbidity be in the majority, need with what medicine or the like.All information of this method storage all are accurately, if mate successfully, the answer accuracy rate can be very high.Because not absolute contact between a lot of information in knowledge base the inside, generally without relational database, and use non-relational database in the time of storage, for example MongoDB.
(2) question and answer are from there are the information such as community of medical science relevant issues and answer community question and answer website etc., to obtain to the data in the database.Answer this complicacy of disease, highly professional problem; Best bet is exactly borrower's a strength, and the problem of in community, dishing out is answered by the user who knows the answer; Query person selects oneself satisfied answer, and the user of similar demand is recommended in these answers.This method has improved the accuracy of answer, if can the data centralization in these websites be got up, returns suitable answer to the user quickly and easily, then can very practical reference be provided for the user.
Question and answer are that question and answer are right to the data in the database, and the form that question and answer are right is: problem+optimum answer+other answers.When question and answer are set up database, at first will mark problem, for example, the mark problem belongs to which kind of problem in " internal medicine, surgery, gynemetrics, paediatrics ", and the classification label symbol is as shown in table 1.Then, machine can produce series of parameters according to the word and the classification of manual work mark, and these parameters have been represented the relation of word and contextual relation, word and classification.After manual work mark, machine generated parameter, for the word of unknown classification, system just can automatically identify the classification of word.
Because the relevant disease under the such section office of internal medicine, surgery, gynemetrics, paediatrics, medicine etc. have remarkable difference; If so can before retrieval, just can confirm the affiliated field of customer problem through automatic identification; Can dwindle range of search, get rid of extraneous data, make the result more accurate.
Table 1
(3) data of self-defining data collection can from: (a) retrieve, wherein comprise based on existing encyclopaedia language material: universal experience knowledge and objective fact etc., scope is wider, can be used for common knowledge question and medical diagnosis; (b) extract based on the answer of medical science voluminous dictionary, can be used for professional knowledge inquiry and medical diagnosis; (c) extract based on the answer of nutrition knowledge collection, comprise relevant knowledge such as books handbook etc., can be used to expand knowledge query and medical diagnosis, the suggestion of medical science voluminous dictionary; (d) open retrieval promptly at existing network retrieval system (for example Google) input keyword, is returned relevant documentation, in document, extracts the bigger part of probability as related data, can be used for the question and answer in open field, and is relevant with daily life.
The file layout of self-defining data intensive data is complicated, comprising: (a), the encyclopaedia language material of html format.Knowledge base before is that information is stored in the non-relational database with textual form, is with the original HTML stores of information at this moment.(b), medical science voluminous dictionary and nutrition knowledge collection all be with the textual form storage, comprises that form is like TXT, PDF, WORD.
Based on above-mentioned database,, a kind of degree of depth answering method is provided according to one embodiment of the invention.As shown in Figure 1, this method comprises:
Step S10, reception user's input (promptly puing question to data); These enquirement data can be text input, phonetic entry, image input etc.; Being input as example with text below describes;
Step S20, in repository database, retrieve (be also referred to as first retrieval); If first retrieval obtains required answer, finish; If first retrieval does not obtain required answer, carry out step S30;
Step S30, retrieve (being also referred to as second retrieval) in to database in question and answer; If second retrieval obtains required answer, finish; If second retrieval does not obtain required answer, carry out step S40;
Step S40, concentrate in self-defining data and to retrieve (being also referred to as the 3rd retrieval); If the 3rd retrieval obtains required answer, finish; If the 3rd retrieval does not obtain required answer, then guide the user to use other problems to describe, for example, the guiding user further or from different perspectives (for example other symptoms, patient's medical history, once make a definite diagnosis result etc.) describe.
According to one embodiment of present invention; In step S10; The user also can put question to or put question to through image through voice, in these cases, and after the enquirement data that receive the user; Carry out speech recognition (or speech recognition reciprocal process) or image recognition (or image recognition reciprocal process), confirm text message.
According to one embodiment of present invention,, also can utilize the keyword in the question sentence to carry out open retrieval, promptly utilize existing commercial search engine to retrieve if step S40 does not obtain required answer.
According to a further embodiment of the invention, as shown in Figure 2, step S20 first retrieval also comprises:
S201 is a unit to put question to the sentence in the data, and each sentence is carried out word segmentation processing (being also referred to as entryization);
S202 extracts keyword from the word segmentation processing result; Field under all right problem identificatioin;
S203 utilizes the keyword retrieval repository database; With
S204 if find required answer, is converted into natural language with answer and returns.
According to one embodiment of present invention, in step S201, word segmentation processing can utilize the Chinese word segmentation assembly of increasing income to carry out, for example: Pan Gu's participle, MMSEG4J, Paoding, CC-CEDICT, IK, ICTCLAS etc.Word segmentation processing also can be carried out based on the segmenting method of statistics, and combines named entity recognition, word sense disambiguation; Named entity recognition is to the identification like speech such as name, place name, mechanism's name, place name, ProductName, trade (brand) name, abbreviation, ellipsis; The effect of word sense disambiguation is to reduce wrong understanding.At last, sentence is cut into independent one by one speech.
According to one embodiment of present invention, in step S202, from word segmentation result, extract keyword and can carry out in two ways:
First kind of mode, remove stop words (speech such as ",, what ") after, stay the speech in the dictionaries such as named entity.For example, stop words, common subject (as you, I etc.) do not belong to the speech in the dictionary; The corresponding synon colloquial style saying of named entity, common specialized word (as, headache, aspirin etc.) belong to the speech in the dictionary.
The second way; After confirming the sentence pattern pattern, can also be corresponding with the position in the sentence pattern pattern speech suitable in the sentence, the extracting section that selection needs is come out; For example; " subject+predicate+illness ", the selected ci poem corresponding " illness " comes out as keyword, and the second way will be further explained below.
According to one embodiment of present invention, step S202 can further include:
S202a, keyword expansion; Keyword expansion has method in common, and for example " painful around the temple " " fever " can expand " headache " speech such as " fevers ", needs to use the synonym in the dictionary to expand;
S202b, part-of-speech tagging; Part-of-speech tagging has method in common, identifies noun in the sentence, verb, adjective etc.; With
S202c, the synonym expansion; Synonym expansion is close a bit with keyword expansion, represents a meaning but synonym is meant two speech, can mutual alternative, and for example " headache " can replace with " headache ".
Above-mentioned substep S202a~S202c can produce more multi-key word.
In step S203, the above-mentioned keyword that utilizes step S202 to generate comes the retrieval knowledge library database.If above-mentioned steps S203 does not find required answer, carry out step S30.
In step S204, the answer of returning in the knowledge base is simple, stiff, can become than the complex natural linguistic form.If the sentence pattern pattern of problem is made up of " statement+problem ", can replace problem with answer, become " statement+answer " and return to the user.
In other embodiment of the present invention; In order further to improve the accuracy of understanding to customer problem; Not all sentence pattern all only extracts key word with regard to (promptly can not ignore the effect of other speech in the sentence fully) much of that; To the speech of the normal association area of using of user in the reality test, also can such speech be listed in the lists of keywords.
Another embodiment is as shown in Figure 3 according to the present invention, and step S30 further comprises:
S301 retrieval question and answer are to database; When retrieving question and answer to database; Calculate the similarity of sentence; In the present embodiment, adopt support vector machine method, whether inspection customer problem and question and answer are enough big to the problem similarity in the database; Promptly whether reach preset threshold value, if enough greatly just return the corresponding answer of problem in this database.
It will be understood by those skilled in the art that then similarity can improve if comprise keyword in the sentence.
In addition, because the probability that matees in practical application fully is very low, so can utilize machine learning algorithm, searching and question sentence similarity maximum and similarity are returned the corresponding answer of this sentence greater than the sentence of certain threshold value.Can improve matching rate like this, more tally with the actual situation simultaneously.
If the problem that S302 retrieves in to database in question and answer and the similarity of customer problem reach certain threshold value, according to the size of similarity result for retrieval is carried out rank, obtain the top n matching result.
S303 is converted into natural language with the top n matching result and returns; If do not find required answer, execution in step S40.
Another embodiment is as shown in Figure 4 according to the present invention, and step S40 further comprises:
S401, problem identificatioin field and answer type; Through qualification answer field, reject the irrelevant candidate answers of a part (those records that promptly its domain attribute of eliminating and determined problem domain the database had nothing to do) from question and answer, can further improve recall precision.
Utilize part-of-speech tagging to confirm which word is keyword in the sentence, can confirm the field of keyword, thereby obtain the field of problem in conjunction with dictionary.The answer type also can be passed through key word recognition, for example, if occur in the keyword " how much ", the problem identificatioin type is a quantity.
S402, based on the sentence pattern set of patterns, utilize the method for machine learning to give question sentence (being problem) classification, the sentence pattern pattern of problem identificatioin.
Concrete, according to the classification in the sentence pattern set of patterns (being the sentence pattern pattern), give the problem mark classification of language material, learn to obtain series of parameters with the method for machine learning.These parameters have been represented the relation of speech and speech, can confirm through these parameters which sentence pattern pattern is new problem belong to.The foundation of sentence pattern set of patterns will go through below.
S403, use the sentence pattern pattern matching problem, obtain new key word, with new key search self-defining data collection.
Concrete, replace the keyword in the target sentences with specific forms, make the expression that similar sentence can corresponding some template (can utilize regular expression to carry out rule match.For example; I+any word+have+any word+illness; Regular expression can be expressed as " I ([]+?) have ([]+?) iDisease ", the speech (iDisease) of wherein representing illness is all in dictionary, as long as the speech that this dictionary the inside has occurs; Just be matched to merit at last, this speech of this position is exactly the speech that needs).For example, " I have a headache recently ", utilize " I ([]+?) iDisease arranged " pattern; Can match " recently " and " headache " two speech; Contrast with the word in the dictionary, can mate successfully with " headache ", that headache is exactly a keyword.
If S404 concentrates in self-defining data and retrieves data matching, according to the degree of correlation (being similarity) result for retrieval is carried out rank, obtain the top n matching result.
The sentence and the speech of answer position in S405, the selector matched moulds plate.
Go on foot N the matching result that obtains, the keyword in the matching template (being the sentence pattern pattern) and corresponding answer type, speech and sentence after obtaining mating to last one.
S406, select the high answer of the frequency of occurrences and return.
To speech and the sentence that a last step obtains, add up its frequency of occurrences, the reasonable distribution weight sorts, and the answer that the selection frequency of occurrences is the highest is also returned.
S204 is similar with step, also can return answer with the natural language form.
The keyword that the result of sentence pattern pattern match obtains generally is the subclass of the keyword of above-mentioned first method (S202) extraction; (retrieval knowledge library database for example in retrieving; The self-defining data collection); The weight of the keyword in this subclass is higher than the weight of other keywords, can improve recall precision and accuracy; The key word that the key word additional step S20 that promptly utilizes step S40 to obtain obtains comes databases such as retrieval knowledge library database, self-defining data collection, can improve recall precision and accuracy.
According to a further embodiment of the invention, as shown in Figure 5, a kind of generation method of sentence pattern set of patterns is provided, it comprises:
S501, set up the self-defining data collection;
The sentence pattern set of patterns is set up in S502a, manual work; And/or
S502b, through artificial labeled data, training data obtains the sentence pattern set of patterns.
In step S501, as stated, the data of self-defining data collection can comprise: (1) is retrieved based on existing encyclopaedia language material; (2) extract based on the answer of medical science voluminous dictionary; (3) extract based on the answer of nutrition knowledge collection; The webpage (document that comprises various forms) that obtains when (4) opening retrieval with various keywords.
In step S502a; Rule template is set up in manual work needs check, and several kinds of templates of general design are earlier tested on existing sentence; If being arranged, sentence can't confirm that perhaps sentence pattern is omitted; Omitted these or need improved sentence pattern schema modification to come, tested once more, till including most of sentence pattern.
In step S502b; At first get the problem of a plurality of (for example 10,000) medical aspect, mark into a kind of sentence pattern (for example artificial mark) to each problem, the sentence that these had marked is learnt with maximum entropy model; Obtain a plurality of parameters, these parameters are used for the sentence pattern pattern of problem identificatioin.
Further; Choose a plurality of (for example 500) again and marked but the sentence of not learning (being that it doesn't matter for Determination of Parameters and these sentences), through the classification of these sentences of parameter prediction before, corresponding two classification of each sentence at this moment; Whether the classification of contrast machine is consistent with the classification that marks before; If inconsistent, increase the quantity of learning sample, reach predetermined threshold value up to the quantity of unanimity.Can further guarantee to utilize these parameters to generate the accuracy of the sentence pattern pattern of question sentence like this, thereby improve the precision of result for retrieval.
According to another embodiment of the invention; After above-mentioned searching step S20, S30 and S40 return optimum answer, can also recommend the user whether to go large hospital to make a definite diagnosis, use which kind of medicine, points for attention etc. through the illness, nutrient, the medicine that relate in the optimum answer.
In the above-described embodiments, according to the order of accuarcy of the data content of storing in the database that each step was directed against, designed the execution sequence of step S20 to S40.It will be understood by those skilled in the art that according to other retrievals of carrying out in proper order three databases and can realize that also the present invention improves the purpose of result for retrieval precision.In addition; Even carry out S20, S30 or S40 separately,, also can improve the precision of result for retrieval because the present invention has improved search method (will be described in more detail below); Be that present embodiment is a preferred embodiment, whether each step is all carried out and execution sequence in the not qualification method.
And, even a searching step obtains required result, also can continue to carry out follow-up searching step, thereby, perhaps select more accurately for the user provides more selection.
According to one embodiment of the invention, a kind of degree of depth answering method is provided, comprising:
Step T10, reception user's input (promptly puing question to data); These enquirement data can be text input, phonetic entry, image input etc.; Being input as example with text below describes;
Step T20, in repository database, retrieve (above-mentioned first retrieval);
Step T30, concentrate in self-defining data and to retrieve (above-mentioned the 3rd retrieval).
Wherein, step T30 obtains utilizing this keyword to carry out the retrieval of self-defining data collection than the more accurate keyword of the keyword among the step T10 through the sentence pattern pattern of utilizing problem.
Wherein, the more accurate keyword that step T30 obtains can be used for the retrieval knowledge library database once more, for the user provides answer more accurately.
This method can also comprise: retrieve (above-mentioned second retrieval) in to database in question and answer.This step can before step T20 and the T30, between or carry out afterwards.
Based on above-mentioned degree of depth answering method,, a kind of degree of depth question answering system/medical retrieval system is provided according to one embodiment of the invention.As shown in Figure 6, this system 100 comprises: load module 101, first retrieval module 102, second retrieval module 103 and the 3rd retrieval module 104; The repository database 201 that three retrieval modules connect respectively, question and answer are to database 202 and self-defining data collection 203.Wherein, first, second, third retrieval module 102,103 and 104 working method are: if the previous stage retrieval module obtains required result then finishes, do not call the next stage retrieval module and retrieve if previous retrieval module obtains required result.
Concrete, load module 101 is used to receive the input from the user; User's input comprises phonetic entry, literal input etc., and under the situation of phonetic entry, load module 101 utilizes existing speech recognition system that phonetic entry is converted into the literal input.Load module 101 sends to first retrieval module 102 with ready literal input.
After first retrieval module 102 receives the literal input; At first question sentence is carried out word segmentation processing; From the result of word segmentation processing, extract keyword then and keyword is expanded; Utilize the result of expanded keyword in repository database 201, to retrieve at last,, start second retrieval module 103 if do not retrieve required result.The search method of first retrieval module 102 is aforesaid step S20, repeats no more here.
Preferably, first retrieval module 102 also comprises: word-dividing mode, and the sentence that is used for problem is a unit, and each sentence is carried out word segmentation processing; With first kind keyword extraction and retrieval module, be used for extracting first kind keyword from the word segmentation processing result, utilize first kind keyword retrieval repository database then.
Second retrieval module 103 is suitable for calculating question sentence and the question and answer degree of correlation (being similarity) to the problem part of the record in the database 202; For example; Use support vector machine method; Whether inspection customer problem and question and answer reach preset threshold value to the problem similarity in the database 202, if met or exceeded this threshold value then return the answer of this correspondence.If do not retrieve required result, start the 3rd retrieval module 104.The search method of second retrieval module 103 is aforesaid step S30, repeats no more here.
Preferably, second retrieval module 103 also comprises similarity calculation module, is used to retrieve question and answer to database, calculates said problem and the question and answer similarity to the record in the database; With feedback module as a result, if exist similarity to reach the record of certain threshold value, said record is carried out rank according to the size of similarity, obtain the top n matching result, N is a natural number.
The 3rd retrieval module 104 is at first based on the sentence pattern set of patterns; Obtain the sentence pattern pattern of problem sentence; Utilize the sentence pattern pattern to obtain keyword (being called second type of keyword) more accurately then, utilize second type of keyword and sentence pattern pattern retrieval self-defining data collection 203, for example utilize regular expression to carry out rule match; According to the degree of correlation result for retrieval is carried out rank, return the top n matching result.The search method of the 3rd retrieval module 104 is aforesaid step S40, repeats no more here.
Preferably, said the 3rd retrieval module also comprises: sentence pattern pattern analysis module, be used for based on the sentence pattern set of patterns, and utilize the method for machine learning to give said problem classification, the sentence pattern pattern of problem identificatioin; Extract and retrieval module with the second generic key word, be used to utilize the sentence pattern pattern to come matching problem, use second type of key search self-defining data collection then to obtain second type of key word.
In the present embodiment,, designed the execution sequence of retrieval module 102 to 104 according to the order of accuarcy of the data content of storing in the database that each module was directed against.It will be understood by those skilled in the art that carrying out three retrieval modules in proper order according to other can realize that also the present invention improves the purpose of result for retrieval precision.In addition; Even only comprise one or two (second retrieval module 103 and the 3rd retrieval module 104 comprise one at least) in the retrieval module 102 to 104 in the system; Owing to improved search method (as indicated above); Also can improve the precision of result for retrieval, promptly present embodiment is a preferred embodiment, does not limit whether each module all comprises and execution sequence in three modules.
1), method and system provided by the invention reallocates resources, and sets up large scale database compared with prior art:, information is comprehensive, and is easy-to-look-up, helps adjusting parameter, improves accuracy, reduces the effect of interfere information; 2), native system is on the data basis of question and answer community; Adjustment parameter and degree of correlation requirement, and for example return three optimum answers, make answer more clear; When the answer degree of correlation is not enough; The answer of selecting for use additive method to return so also can avoid the user in question and answer community, oneself to sum up error message, for the crowd of amateur background important meaning is arranged more; 3), native system proposes and has used a kind of mark standard; In medical treatment qualification field, better effect is arranged; Be fit to native system more, and at language material of large-scale data mark like this, for from now on further investigation lays the first stone; For the researchist of back provides powerful support, this mark standard and the language material that has marked all are that the high precision of system provides support and prepares.
Though carry out explanation as an example with medical domain in the foregoing description, be appreciated that said method and system can also be applied in the data processing of other field, retrieval and the analysis, for example the long-distance education field to method and system.
The advantage of method and system provided by the invention comprises 1) set up three kinds of databases, distinct type data-base is carried out the retrieval of distinct methods, improved the precision of result for retrieval; Except utilizing keyword retrieves, to database, the similarity that rounds a sentence is mated to question and answer; Wherein, to the self-defining data storehouse, attributes such as the sentence pattern pattern of extraction problem, problem domain are to improve the precision of result for retrieval; 2) information retrieval technique is applied to the automatic question answering field; The inference method of abandoning tradition is from the accuracy of new angle increase automatic question answering, information such as tap/dip deep into medical field relevant professional knowledge, general knowledge; Therefrom excavate correct option, improved the precision of result for retrieval; 3) to the characteristic analysis of medical field, be suitable for being applied in the medical consultation system, realize intelligent medical analysis and diagnosis system based on degree of depth dialogue retrieve.
Though method and system provided by the invention all combines medical domain to describe in the above-described embodiments; But one of ordinary skill in the art will appreciate that; Method and system of the present invention also can be applied to any field; Through domain knowledge being decomposed into different levels, different levels is utilized different search method retrievals, can obtain the higher result of accuracy.
Should be noted that and understand, under the situation that does not break away from the desired the spirit and scope of the present invention of accompanying Claim, can make various modifications and improvement the present invention of above-mentioned detailed description.Therefore, the scope of the technical scheme of requirement protection does not receive the restriction of given any specific exemplary teachings.