
CN102779117A - Electronic document processing method and device - Google Patents

Info

Publication number
CN102779117A
Authority
CN
China
Prior art keywords
document
document content
key word
content
electronic document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011101205236A
Other languages
Chinese (zh)
Inventor
仇睿恒
王毅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Founder Information Industry Holdings Co Ltd
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Original Assignee
Peking University
Founder Information Industry Holdings Co Ltd
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University, Founder Information Industry Holdings Co Ltd, Peking University Founder Group Co Ltd, Beijing Founder Apabi Technology Co Ltd
Priority to CN2011101205236A
Publication of CN102779117A
Legal status: Pending

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

Embodiments of the present invention provide an electronic document processing method and device, which relate to the field of computer information processing and make reading more convenient for users. The method comprises: selecting at least one piece of document content in an electronic document; marking the selected document content and setting a corresponding keyword for the marked document content; and processing the electronic document according to the selected document content and the keyword corresponding to that document content. The embodiments of the present invention are used for reading electronic documents.

Description

Electronic document processing method and device
Technical field
The present invention relates to the field of computer information processing, and in particular to an electronic document processing method and device.
Background
When reading an electronic document, people often encounter references to text, formulas, definitions, concepts, symbols and other content that appeared earlier in the document. On encountering such a reference, the user sometimes cannot remember the content it refers to and must either search backward for that content or rely on bookmarks as a reading aid.
Existing bookmarking marks a whole page of the document; when the user needs to view a bookmarked page, the reader can jump directly to it. Because this approach can only mark whole pages, it offers a single way of handling the document and is inconvenient for the user.
Summary of the invention
Embodiments of the present invention provide an electronic document processing method and device that make reading more convenient for the user.
To achieve the above object, the embodiments of the present invention adopt the following technical solutions:
An electronic document processing method, comprising:
selecting at least one piece of document content in an electronic document;
marking the selected document content, and setting a corresponding keyword for the marked document content;
processing the electronic document according to the selected document content and the keyword corresponding to that document content.
An electronic document processing device, comprising:
a selection unit, configured to select at least one piece of document content in an electronic document;
a setting unit, configured to mark the selected document content and to set a corresponding keyword for the marked document content;
a processing unit, configured to process the electronic document according to the selected document content and the keyword corresponding to that document content.
The electronic document processing method and device provided by the embodiments of the present invention select at least one piece of document content in an electronic document, mark the selected document content, set a corresponding keyword for the marked document content, and process the electronic document according to the selected document content and its corresponding keyword. Unlike prior-art bookmarking, they no longer mark only whole pages: a part of the document content is selected and marked, and processing takes place after a keyword has been set for the marked content. This gives the user more ways to read, is convenient to use, and improves the user experience.
Description of drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of the electronic document processing method provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of an interface of the electronic document processing method provided by an embodiment of the present invention;
Fig. 3 is a schematic diagram of another interface of the electronic document processing method provided by an embodiment of the present invention;
Fig. 4 is a schematic diagram of yet another interface of the electronic document processing method provided by an embodiment of the present invention;
Fig. 5 is a structural block diagram of the electronic document processing device provided by an embodiment of the present invention;
Fig. 6 is another structural block diagram of the electronic document processing device provided by an embodiment of the present invention.
Embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
As shown in Fig. 1, the electronic document processing method provided by an embodiment of the present invention comprises the following steps:
S101. Select at least one piece of document content in an electronic document.
Here, a piece of document content may be a page of the electronic document, a region within a page, or one component (or a group of components) of the electronic document, such as text or a picture.
S102. Mark the selected document content, and set a corresponding keyword for the marked document content.
Here, the keyword may be specified manually or generated automatically by software from the content, for example extracted automatically with an algorithm such as TF (term frequency)/IDF (inverse document frequency).
Steps S101 and S102 may be performed by the publisher when the electronic document is produced, or manually by the user while reading.
S103. Process the electronic document according to the selected document content and the keyword corresponding to that document content.
The electronic document processing method provided by this embodiment selects at least one piece of document content in an electronic document, marks the selected document content, sets a corresponding keyword for the marked document content, and processes the electronic document according to the selected document content and its corresponding keyword. Unlike prior-art bookmarking, it no longer marks whole pages: a part of the document content is selected and marked, and processing takes place after a keyword has been set for the marked content. This gives the user more ways to read, is convenient to use, and improves the user experience.
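The automatic keyword generation mentioned for step S102 can be illustrated with a minimal TF-IDF sketch. The patent names TF/IDF only as one possible algorithm and gives no implementation, so everything here is an assumption made for illustration: the function names `tokenize` and `suggest_keyword`, the choice to treat the selected content and the remaining passages of the document as the corpus, the unsmoothed logarithmic IDF, and the restriction to single-word, Latin-alphabet tokens.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def suggest_keyword(selected, other_passages):
    """Return the token of `selected` with the highest TF-IDF score.

    `other_passages` (hypothetical input) holds the rest of the document,
    split into passages; with `selected` it forms the corpus for the IDF term.
    Returns None when `selected` contains no tokens.
    """
    corpus = [tokenize(selected)] + [tokenize(p) for p in other_passages]
    tf = Counter(corpus[0])
    if not tf:
        return None
    doc_sets = [set(doc) for doc in corpus]
    n_docs = len(corpus)
    total = len(corpus[0])

    def score(term):
        # Document frequency is at least 1 because the term occurs in `selected`.
        df = sum(1 for doc in doc_sets if term in doc)
        return (tf[term] / total) * math.log(n_docs / df)

    # Highest score wins; ties are broken alphabetically for determinism.
    return min(tf, key=lambda term: (-score(term), term))
```

Words that occur in every passage get an IDF of zero under this weighting, so common words such as "the" are not proposed. A real reader would probably extract multi-word phrases and let the user confirm or edit the suggestion.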
In the electronic document processing method provided by another embodiment of the present invention, a textbook is taken as an example. As shown in Fig. 2, suppose that user A (for example, a teacher), while reading the electronic document of the textbook, finds that the text in region 1 of the document is an important formula. User A therefore selects region 1 (that is, a piece of document content), marks it, and sets the corresponding keyword "distributive property" for the marked region 1.
User A can then select further pieces of document content in the same way, mark each of them, and set a corresponding keyword for each.
For example, the following correspondence between keywords and document content may be obtained:
Keyword "distributive property", corresponding document content "λ(w+z) = λw + λz"
Keyword "associative property", corresponding document content "a×(b×c) = (a×b)×c"
Keyword "commutative property", corresponding document content "a×b = b×a"
When user A later reads subsequent content of the electronic document, if that content contains a keyword that has been set, the document content corresponding to the keyword is displayed. For example, when user A reads a page of the electronic document and the reading software detects that "distributive property" occurs on the page being read, "λ(w+z) = λw + λz" can be displayed on the current page. Alternatively, when user A reads a page that contains "distributive property" and has forgotten its exact meaning, user A can select "distributive property", whereupon "λ(w+z) = λw + λz" is displayed on the current page. Here, the user may select a keyword by moving the mouse over it or by clicking it, among other ways, without being limited to these.
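The keyword-to-content correspondence and the automatic detection on the current page can be sketched as follows. This is only an illustration under stated assumptions: the `Annotation` class and both function names are hypothetical, the marked content is reduced to a plain string (the patent also allows pictures and page regions), and matching is a simple case-insensitive substring test.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    keyword: str   # keyword set for the marked content (step S102)
    content: str   # the marked document content, simplified to plain text


def build_index(annotations):
    """Map each case-folded keyword to its marked content."""
    return {a.keyword.casefold(): a.content for a in annotations}


def contents_for_page(page_text, index):
    """Return (keyword, content) pairs for every keyword found on the page."""
    text = page_text.casefold()
    return [(kw, content) for kw, content in index.items() if kw in text]


annotations = [
    Annotation("distributive property", "λ(w+z) = λw + λz"),
    Annotation("associative property", "a×(b×c) = (a×b)×c"),
    Annotation("commutative property", "a×b = b×a"),
]
index = build_index(annotations)
hits = contents_for_page("By the Distributive Property, expand the product.", index)
```

In the user-triggered variant, the reader would call the same lookup with only the keyword under the mouse pointer instead of scanning the whole page.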
It can be seen that, unlike prior-art bookmarking, this embodiment no longer marks whole pages. Instead, at least one piece of document content in the electronic document is selected and marked, and a corresponding keyword is set for it. When the keyword appears again in the document, the page displays the corresponding document content, either automatically or in response to a user action, which helps the user recall it. This is convenient for the user and improves the user experience.
In addition, the document content corresponding to a keyword may be displayed according to defaults preset by the software vendor or according to the user's settings. For example:
(1) If the view mode set by the user is the in-text mode and the content mode is the fixed format, then when the user selects a keyword, the corresponding document content is displayed, in the same layout it had when it was marked, at a suitable position on the current page, for example in the area around the keyword, as shown in region 2 of Fig. 3.
(2) If the view mode set by the user is the contrast mode and the content mode is the fixed format, then when the user selects a keyword, the corresponding document content is displayed, in the same layout it had when it was marked, in a specific display area of the current page, for example in the left margin of the page, as shown in region 3 of Fig. 4.
(3) If the view mode set by the user is the in-text mode and the content mode is the reflow format, then when the user selects a keyword, the corresponding document content is re-typeset according to the characteristics of the display area of the current page and displayed at a suitable position on the current page.
(4) If the view mode set by the user is the contrast mode and the content mode is the reflow format, then when the user selects a keyword, the corresponding document content is re-typeset according to the characteristics of the display area of the current page and displayed in a specific display area of the current page.
These flexible display settings both better meet the user's reading needs and keep the displayed document visually tidy.
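The four cases above are the combinations of two independent settings, a view mode and a content mode. A minimal sketch of that configuration follows; the enum names, the `DisplayPlan` type and the region labels are assumptions chosen for illustration and do not come from the patent.

```python
from dataclasses import dataclass
from enum import Enum


class ViewMode(Enum):
    IN_TEXT = "in-text"      # show the content near the keyword (cases 1 and 3)
    CONTRAST = "contrast"    # show it in a dedicated area, e.g. a margin (cases 2 and 4)


class ContentMode(Enum):
    FIXED = "fixed"          # keep the layout the content had when it was marked
    REFLOW = "reflow"        # re-typeset the content to fit the display area


@dataclass
class DisplayPlan:
    region: str       # where the content is shown on the current page
    retypeset: bool   # whether the content is laid out again before display


def plan_display(view, content):
    """Combine the two independent settings into one display decision."""
    region = "near keyword" if view is ViewMode.IN_TEXT else "dedicated area"
    return DisplayPlan(region=region, retypeset=(content is ContentMode.REFLOW))
```

Keeping the two settings orthogonal means a further view mode or content mode could be added without enumerating every combination by hand.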
Further, after user A (the teacher) has set the keywords and their corresponding document content, user A can use them for his or her own reading, or can export the keywords and their corresponding document content in a prescribed format. For example, the pairs
"distributive property" "λ(w+z) = λw + λz"
"associative property" "a×(b×c) = (a×b)×c"
"commutative property" "a×b = b×a"
are exported in a prescribed format such as .xml, yielding a file xx.xml. User B (for example, a student) can then import this xx.xml file. After importing it, user B does not need to select document content or set keywords, and can use what user A has set up directly; user B only needs to set the view mode and content mode for display. If, during reading, the content of the electronic document contains a keyword that has been set, the document content corresponding to the keyword is displayed in the configured display mode, either automatically or in response to an action by user B.
Displaying document content according to the view mode and content mode set by user B works in the same way as described above for user A and is not repeated here.
It should be noted that the prescribed export format is not limited and may be defined according to the specific requirements of each reading software product.
This export and import mechanism allows the annotations made on one document to be used by many people, so that later readers can draw on the experience of earlier ones, which is convenient for users and improves the user experience.
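The export and import round trip can be sketched with Python's standard `xml.etree.ElementTree` module. The patent leaves the file format open, so the element name `annotation`, the root element `annotations` and the `keyword` attribute are purely hypothetical, and the marked content is again simplified to text.

```python
import xml.etree.ElementTree as ET


def export_annotations(pairs):
    """Serialize (keyword, content) pairs to an XML string (hypothetical schema)."""
    root = ET.Element("annotations")
    for keyword, content in pairs:
        item = ET.SubElement(root, "annotation", {"keyword": keyword})
        item.text = content
    return ET.tostring(root, encoding="unicode")


def import_annotations(xml_text):
    """Read the pairs back from a string produced by export_annotations."""
    root = ET.fromstring(xml_text)
    return [(item.get("keyword"), item.text or "") for item in root.findall("annotation")]


pairs = [
    ("distributive property", "λ(w+z) = λw + λz"),
    ("associative property", "a×(b×c) = (a×b)×c"),
    ("commutative property", "a×b = b×a"),
]
xml_text = export_annotations(pairs)   # what user A would save as xx.xml
restored = import_annotations(xml_text)  # what user B obtains after importing
```

A real format would also need to record where each piece of content sits in the document (page and region), which this sketch omits.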
In addition, after the user has selected document content and set its corresponding keyword, the reading and processing device can extract the document content and its corresponding keyword. The user can then search, with a search keyword that the user enters, within the selected document content and corresponding keywords. For example, a search for "distributive property" can be run over the pairs
"distributive property" "λ(w+z) = λw + λz"
"associative property" "a×(b×c) = (a×b)×c"
"commutative property" "a×b = b×a"
rather than over the whole document. This yields highly targeted results, whereas a full-text search may return a large amount of irrelevant information that is of little use to the user.
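Restricting the search to the extracted pairs, instead of the full text, can be sketched as below. The function name and the case-insensitive substring matching are assumptions made for illustration; the patent does not specify how matching is done.

```python
def search_annotations(query, pairs):
    """Return the (keyword, content) pairs whose keyword or content contains `query`."""
    q = query.casefold()
    return [(kw, c) for kw, c in pairs if q in kw.casefold() or q in c.casefold()]


pairs = [
    ("distributive property", "λ(w+z) = λw + λz"),
    ("associative property", "a×(b×c) = (a×b)×c"),
    ("commutative property", "a×b = b×a"),
]
result = search_annotations("Distributive", pairs)
```

Because only the annotated pairs are scanned, the result set is bounded by the number of annotations rather than by the length of the document.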
Furthermore, the user can modify or delete the selected document content and its corresponding keywords. For example, the user can choose to delete all annotations whose keyword contains "formula"; the system then searches the keywords and deletes every matching entry. The user can also replace "formula" with "theorem" in all keywords in a single batch operation. A single annotation can also be selected manually and the document content it designates modified, for example by slightly shrinking the page region that covers its text or picture.
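The batch deletion and batch replacement just described can be sketched as two small functions over (keyword, content) pairs. The function names and the sample data are hypothetical and only illustrate the idea; both functions return new lists rather than editing in place.

```python
def delete_matching(pairs, term):
    """Drop every annotation whose keyword contains `term`."""
    return [(kw, c) for kw, c in pairs if term not in kw]


def rename_in_keywords(pairs, old, new):
    """Replace `old` with `new` inside every keyword, leaving the content untouched."""
    return [(kw.replace(old, new), c) for kw, c in pairs]


# Illustrative data only; these annotations are not taken from the patent.
pairs = [
    ("area formula", "A = πr²"),
    ("distributive property", "λ(w+z) = λw + λz"),
]
after_delete = delete_matching(pairs, "formula")
after_rename = rename_in_keywords(pairs, "formula", "theorem")
```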
The electronic document processing method provided by this embodiment selects at least one piece of document content in an electronic document, marks the selected document content, sets a corresponding keyword for the marked document content, and processes the electronic document according to the selected document content and its corresponding keyword. Unlike prior-art bookmarking, it no longer marks whole pages: a part of the document content is selected and marked, and a keyword is set for the marked content. The user can therefore easily find a specific piece of document content by its keyword, which improves the user experience.
As shown in Fig. 5, the electronic document processing device 50 provided by an embodiment of the present invention comprises:
A selection unit 501, configured to select at least one piece of document content in an electronic document.
A setting unit 502, configured to mark the selected document content and to set a corresponding keyword for the marked document content.
A processing unit 503, configured to process the electronic document according to the selected document content and the keyword corresponding to that document content.
The electronic document processing device provided by this embodiment selects at least one piece of document content in an electronic document, marks the selected document content, sets a corresponding keyword for the marked document content, and processes the electronic document according to the selected document content and its corresponding keyword. Unlike prior-art bookmarking, it no longer marks whole pages: a part of the document content is selected and marked, and processing takes place after a keyword has been set for the marked content. This gives the user more ways to read, is convenient to use, and improves the user experience.
Further, as shown in Fig. 6, the processing unit 503 comprises:
A display module 5031, configured to display, when the electronic document is being read and its content contains a keyword that has been set, the document content corresponding to that keyword.
Thus, when a keyword appears again in the document, the display module 5031 displays the corresponding document content, which helps the user recall it. This is convenient for the user and improves the user experience.
An export module 5032, configured to export the document content and its corresponding keyword in a prescribed format.
An import module 5033, configured to import document content and corresponding keywords stored in the prescribed format, as a reading aid.
This export and import mechanism allows the annotations made on one document to be used by many people, so that later readers can draw on the experience of earlier ones, which is convenient for users and improves the user experience.
An extraction module 5034, configured to extract the document content and its corresponding keyword.
A search module 5035, configured to search, using a search keyword entered by the user, within the document content and corresponding keywords extracted by the extraction module 5034.
In this way, the user does not have to search the whole document for a keyword and obtains highly targeted results, whereas a full-text search may return a large amount of irrelevant information that is of little use to the user.
A deletion module 5036, configured to delete document content and its corresponding keyword.
A modification module 5037, configured to modify document content and its corresponding keyword.
With the above modules, the electronic document processing device provided by this embodiment better matches people's usage habits and is more convenient for the user.
A person of ordinary skill in the art will understand that all or part of the steps of the above method embodiments can be carried out by hardware under the control of program instructions. The program can be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as ROM, RAM, a magnetic disk or an optical disc.
The above are merely specific embodiments of the present invention, but the scope of protection of the present invention is not limited thereto. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the scope of protection of the present invention. Therefore, the scope of protection of the present invention shall be determined by the scope of protection of the claims.

Claims (9)

1. An electronic document processing method, characterized by comprising:
selecting at least one piece of document content in an electronic document;
marking the selected document content, and setting a corresponding keyword for the marked document content;
processing the electronic document according to the selected document content and the keyword corresponding to that document content.
2. The electronic document processing method according to claim 1, characterized in that processing the electronic document according to the selected document content and the keyword corresponding to that document content comprises:
when the electronic document is being read, if the content of the electronic document contains a keyword that has been set, displaying the document content corresponding to the keyword.
3. The electronic document processing method according to claim 2, characterized in that displaying the document content corresponding to the keyword comprises:
displaying the document content corresponding to the keyword in a set view mode and content mode, wherein the view mode is one of: contrast mode, in-text mode; and the content mode is one of: fixed format, reflow format.
4. The electronic document processing method according to claim 1, characterized in that processing the electronic document according to the selected document content and the keyword corresponding to that document content comprises:
exporting the document content and the keyword corresponding to that document content in a prescribed format.
5. The electronic document processing method according to claim 1, characterized in that processing the electronic document according to the selected document content and the keyword corresponding to that document content comprises:
extracting the document content and the keyword corresponding to that document content;
searching, with a search keyword entered by the user, within the document content and the keyword corresponding to that document content.
6. An electronic document processing device, characterized by comprising:
a selection unit, configured to select at least one piece of document content in an electronic document;
a setting unit, configured to mark the selected document content and to set a corresponding keyword for the marked document content;
a processing unit, configured to process the electronic document according to the selected document content and the keyword corresponding to that document content.
7. The electronic document processing device according to claim 6, characterized in that the processing unit comprises:
a display module, configured to display, when the electronic document is being read and its content contains a keyword that has been set, the document content corresponding to the keyword.
8. The electronic document processing device according to claim 6, characterized in that the processing unit comprises:
an export module, configured to export the document content and the keyword corresponding to that document content in a prescribed format;
an import module, configured to import document content and corresponding keywords stored in the prescribed format.
9. The electronic document processing device according to claim 6, characterized in that the processing unit comprises:
an extraction module, configured to extract the document content and the keyword corresponding to that document content;
a search module, configured to search, with a search keyword entered by the user, within the document content and corresponding keyword extracted by the extraction module.
CN2011101205236A 2011-05-10 2011-05-10 Electronic document processing method and device Pending CN102779117A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011101205236A CN102779117A (en) 2011-05-10 2011-05-10 Electronic document processing method and device

Publications (1)

Publication Number Publication Date
CN102779117A true CN102779117A (en) 2012-11-14

Family

ID=47124034

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011101205236A Pending CN102779117A (en) 2011-05-10 2011-05-10 Electronic document processing method and device

Country Status (1)

Country Link
CN (1) CN102779117A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110046309A (en) * 2019-04-02 2019-07-23 北京字节跳动网络技术有限公司 Processing method, device, electronic equipment and the storage medium of document input content

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030051214A1 (en) * 1997-12-22 2003-03-13 Ricoh Company, Ltd. Techniques for annotating portions of a document relevant to concepts of interest
CN1920809A (en) * 2005-08-22 2007-02-28 刘畅 Text file storage method
CN101196874A (en) * 2007-12-28 2008-06-11 宇龙计算机通信科技(深圳)有限公司 Method and apparatus for machine aid reading
CN101578575A (en) * 2006-09-22 2009-11-11 Opera软件股份公司 Method and device for selecting and displaying a region of interest in an electronic document
CN101782924A (en) * 2009-01-19 2010-07-21 索尼公司 Information processing method, information processing apparatus, and program

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110046309A (en) * 2019-04-02 2019-07-23 北京字节跳动网络技术有限公司 Processing method, device, electronic equipment and the storage medium of document input content
US11423112B2 (en) 2019-04-02 2022-08-23 Beijing Bytedance Network Technology Co., Ltd. Document input content processing method and apparatus, electronic device, and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20121114