Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
The electronic document processing method that the embodiment of the invention provides, as shown in Figure 1, its step comprises:
At least one document content in S101, the selection electronic document.
Here, the document content of document can be the component units of certain one page, certain zone in one page or (group) electronic document of electronic document, for example literal, picture etc.
S102, selected document content is identified, and corresponding key word is set for the document content after the sign.
Here, key word can be artificial the appointment, also can be generated automatically according to content by software, for example uses extractions automatically such as TF (Term Frequency, word frequency)/IDF (Inverse Document Frequency arranges frequency) algorithm.
Step S101, S102 can be provided with by production firm when making electronic document, also can be when the user reads, and manually are provided with by the user.
S103, according to document content of selecting and the corresponding key word of the document content, electronic document is handled.
The electronic document processing method that the embodiment of the invention provides is selected at least one document content in the electronic document; Selected document content is identified, and the key word of correspondence is set for the document content after the sign; According to document content of selecting and the corresponding key word of the document content, electronic document is handled.The mode that marks with the interpolation bookmark of prior art is different, no longer to whole page or leaf labelling document, but selects a part of document content in the document to identify, and handles after the document content of sign is provided with key word.Like this, enriched user's reading method, convenient for users to use, improved user's impression.
The electronic document processing method that another embodiment of the present invention provides; With certain teaching material is example; As shown in Figure 2; Suppose user A (for example teacher) when reading the electronic document of this teaching material, find that the literal in the zone 1 in the document is important formula, thereby select the zone 1 (i.e. document content) of the document; Identify selected regional 1 then, and the key word of correspondence is set: " distributive property " for this zone 1 after the sign.
Subsequently, user A can also select the polylith document content to identify respectively in the same way, and for each document content after the sign key word of correspondence is set.
For example can obtain following " key word " and the corresponding relation of " document content ":
Key word " distributive property ", corresponding document content is " λ (w+z)=λ w+ λ Z "
Key word " associative property ", corresponding document content be " a * (b * c)=(a * b) * c "
Key word " commutative property ", corresponding document content is " a * b=b * a "
…
User A if comprise the key word of setting in the content, then shows the document content that this key word is corresponding when reading the content of this electronic document back.For example, when user A read a certain page of this electronic document, ocr software detected in the page of current reading and has " distributive property ", then can on current page, show " λ (w+z)=λ w+ λ Z "; Perhaps; When user A reads a certain page of this electronic document; Include " distributive property " in this page, if the concrete implication that user A forgets, user A can choose " distributive property "; At this moment, can demonstrate " λ (w+z)=λ w+ λ Z " on the current page.Here, the user chooses key word, can be through the user mouse to be moved on this key word, or click modes such as this key word and realize, but be not limited to this.
This shows that present embodiment is different with the mode that the interpolation bookmark of prior art marks,, but select at least one document content in the electronic document no longer to whole page or leaf labelling document; Selected document content is identified, and the key word of correspondence is set for the document content after the sign; When corresponding key word occurring once more in the document, document file page will automatically or demonstrate the corresponding document content of this key word under user's operation, thereby reaches the purpose that helps the user to recall.Therefore convenient for users to use, improved user's impression.
In addition, the demonstration of the document content that key word is corresponding, presetting in the time of can dispatching from the factory according to software producer shows, also can show according to user's setting.For example:
(1) if the view mode that the user is provided with is with civilian pattern; The content model that is provided with is a fixed format; When then the user chooses key word; The corresponding document content of this key word can the identical format of content format with sign the time be presented at the correct position of current page, for example shows the peripheral region of this key word etc., shown in the zone 2 of Fig. 3.
(2) if the view mode that the user is provided with is the contrastive pattern; The content model that is provided with is a fixed format; When then the user chooses key word; The corresponding document content of this key word can the identical format of content format with sign the time be presented in the specific viewing area in the current page, for example is presented in the left margin zone of the page, shown in the zone 3 as shown in Figure 4.
(3) if the view mode that the user is provided with is with civilian pattern; The content model that is provided with is set to reset format; When then the user chose key word, the corresponding document content of key word can be set type according to the characteristics of the viewing area of current page again, and is presented at the place of current page.
(4) if the view mode that the user is provided with is the contrastive pattern; The content model that is provided with is the reset formula; When then the user chose key word, the corresponding document content of key word can be set type according to the characteristics of the viewing area of current page again, and is presented in the specific viewing area in the current page.
Through the setting of above-mentioned display mode flexibly, can meet the needs that the user reads more on the one hand, it is attractive in appearance to guarantee that on the other hand document shows.
Further, after user A (teacher) sets key word and its corresponding document content, both can oneself read and use, also can this key word and its corresponding document content have been derived with prescribed form, for example will
“distributive?property”“λ(w+z)=λw+λZ”
“associative?property”“a×(b×c)=(a×b)×c”
“commutative?property”“a×b=b×a”
…
With deriving of regulation, obtain the xx.xml file like the .xml form.User B (for example student) can import this xx.xml file afterwards.User B need not oneself and selects document content, key word is set after importing this document, the content that just can utilize user A to set, and user B only need be provided with the view mode and the content model of demonstration.If when reading, comprise the key word of setting in the content of this electronic document, then with the display mode that is provided with automatically or the operation through user B show the document content that this key word is corresponding.
The view mode that is provided with according to user B and content model display document content and above-mentioned user A that display mode is set is identical, repeat no more.
Need to prove that the prescribed form that file is derived does not limit, can be according to the specific requirement concrete regulation of each ocr software.
Mode through this derivation imports can help the experience that the descendant learns to use for reference forefathers so that the mark of one piece of document is utilized by many people, and is convenient for users to use, improved user's impression.
In addition, after the user chooses the corresponding key word of document content and the document content, read the key word that treating apparatus can extract the document content and the document content correspondence.Afterwards, the user can search with the search key of user oneself input in the key word of above-mentioned selected document content and the document content correspondence.For example exist
“distributive?property”“λ(w+z)=λw+λZ”
“associative?property”“a×(b×c)=(a×b)×c”
“commutative?property”“a×b=b×a”
…
In search " distributive property ", rather than in the entire chapter document, search, can access the very strong Query Result of specific aim like this, in full text, search and might obtain a lot of junk information, unfavorable user uses.
Have, the user can also make amendment or deletes to the key word of above-mentioned selected document content and the document content correspondence again.For example, the user can select deletion, and all comprise the mark of " formula " key word, and this moment, system can search in key word, and with the deletion of items of all couplings.Also can in batches " formula " with in all key words replace with " theorem " etc.Can also manually select a mark, the document content of its appointment is made amendment, such as its corresponding character, picture page area are dwindled some etc.
The electronic document processing method that the embodiment of the invention provides is selected at least one document content in the electronic document; Selected document content is identified, and the key word of correspondence is set for the document content after the sign; According to document content of selecting and the corresponding key word of the document content, electronic document is handled.The mode that marks with the interpolation bookmark of prior art is different, no longer to whole page or leaf labelling document, but selects a part of document content in the document to identify, and the document content of sign is provided with key word.Like this, the user just can find required particular document content according to key word at an easy rate, has improved user's impression.
The electronic document processing apparatus 50 that the embodiment of the invention provides, as shown in Figure 5, comprising:
Selected cell 501 is used for selecting at least one document content of electronic document.
Unit 502 is set, is used for selected document content is identified, and the key word of correspondence is set for the said document content after the sign.
Processing unit 503 is used for according to document content of selecting and the corresponding key word of the document content this electronic document being handled.
The electronic document processing apparatus that the embodiment of the invention provides is selected at least one document content in the electronic document; Selected document content is identified, and the key word of correspondence is set for the document content after the sign; According to document content of selecting and the corresponding key word of the document content, electronic document is handled.The mode that marks with the interpolation bookmark of prior art is different, no longer to whole page or leaf labelling document, but selects a part of document content in the document to identify, and handles after the document content of sign is provided with key word.Like this, enriched user's reading method, convenient for users to use, improved user's impression.
Further, as shown in Figure 6, processing unit 503 comprises:
Display module 5031 is used for when reading electronic document, if comprise the key word of setting in the content of electronic document, then shows the document content that this key word is corresponding.
Thus, when corresponding key word occurring once more in the document, display module 5031 will show the document content that this key word is corresponding, thereby reaches the purpose that helps the user to recall.Therefore convenient for users to use, improved user's impression.
Derive module 5032, be used for document content and the corresponding key word of the document content are derived with prescribed form.
Import module 5033, be used to import document content and the corresponding key word of the document content that prescribed form forms, so that aid reading.
Mode through this derivation imports can help the experience that the descendant learns to use for reference forefathers so that the mark of one piece of document is utilized by many people, and is convenient for users to use, improved user's impression.
Extraction module 5034 is used to extract the document content and the corresponding key word of the document content.
Search module 5035, be used for document content and the corresponding key word of the document content, search with the search key of user's input in extraction module 5034 extractions.
Like this, the user searches certain keyword and need not in the entire chapter document, to search, and can access the very strong Query Result of specific aim like this, in full text, searches and might obtain a lot of junk information, and unfavorable user uses.
Removing module 5036 is used for document content, the corresponding key word of the document content are deleted.
Modified module 5037 is used for document content, the corresponding key word of the document content are made amendment.
Through above-mentioned module, the electronic document processing apparatus that present embodiment provides meets people's use habit more, in the use this user bring more convenient.
One of ordinary skill in the art will appreciate that: all or part of step that realizes said method embodiment can be accomplished through the relevant hardware of programmed instruction; Aforesaid program can be stored in the computer read/write memory medium; This program the step that comprises said method embodiment when carrying out; And aforesaid storage medium comprises: various media that can be program code stored such as ROM, RAM, magnetic disc or CD.
The above; Be merely embodiment of the present invention, but protection scope of the present invention is not limited thereto, any technician who is familiar with the present technique field is in the technical scope that the present invention discloses; Can expect easily changing or replacement, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of said claim.