WO2006095409A1 - Dispositif de recuperation d'informations, programme informatique et support d'enregistrement - Google Patents
Dispositif de recuperation d'informations, programme informatique et support d'enregistrement Download PDFInfo
- Publication number
- WO2006095409A1 WO2006095409A1 PCT/JP2005/003928 JP2005003928W WO2006095409A1 WO 2006095409 A1 WO2006095409 A1 WO 2006095409A1 JP 2005003928 W JP2005003928 W JP 2005003928W WO 2006095409 A1 WO2006095409 A1 WO 2006095409A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- search
- user
- registration information
- web page
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Definitions
- the present invention relates to an information search apparatus, an information search method, and a program for causing a computer to execute the method for searching a website on the Internet based on a keyword input by a user.
- Robot-type search engines and directory-type search engines have been generally known as search engines that perform full-text search of Web sites (WEB pages) on the Internet according to user search instructions.
- a robot-type search engine When a keyword related to information to be searched is specified, a robot-type search engine periodically searches information on all WWW servers that can be found on the Internet using a WWW search program called a WEB robot or spider. The information is automatically collected and collected. Since this robot-type search engine automatically collects data from websites around the world by a computer, it has the advantage of a large amount of information.
- Locator is artificially classified by field such as art, business, education, etc., and the user searches by specifying keywords by field.
- this directory-type search engine information is classified by field, so users can use it like an occupational phone book, and it has the advantage that it is convenient when searching for power by deciding a field.
- search robot performs a search, even if different users search, as long as the search is performed with the same key key, only the same search result can be obtained, and all users having different profiles and preferences can be obtained. There is a problem that satisfactory search results are rarely obtained.
- Patent Document 1 a system that provides a search result that matches the user's preference has been proposed (see Patent Document 1).
- the user specifies the information to handle the information.
- categories can be assigned and classified based on user intentions.
- the degree of coincidence between users of the information in each power category the similarity of preference between the user and other users is evaluated and determined, and the other users whose similarity is determined to be high Can be introduced to the user. According to this, since the system power is introduced to the information evaluated by other users with similar preferences, the user can save time and cost for searching for information.
- Patent Document 1 Japanese Patent Application No. 2000-187666
- the provision is received based on the similarity of the user's preference.
- the ability to provide unknown information to users who are searching for keywords When searching for keywords, trends in users with common preferences (for example, which pages a user with a common hobby refers to) It is not possible to select the information to be provided by using the latest trend of the page (for example, whether it has been recently updated or has been particularly popular recently) as an index.
- the present invention has been made in view of such problems, and an object thereof is to make a search result at the time of keyword search more valuable to a user.
- the present invention for solving the above-described problems is based on the search information input by the client terminal! Searches for information on Web pages on the Internet and transmits the search result to the client terminal.
- An information search apparatus for storing registration information storage means for storing registration information relating to a Web page, receiving means for receiving the search information from the client terminal, and searching for a Web page related to a keyword included in the search information
- First search result generating means for searching for information and generating a first search result, and registration information relating to the Web page included in the first search result stored in the registered information storage means From the selection means for selecting the first registration information using setting information relating to the presence / absence of use of the registration information included in the search information, and based on the first registration information,
- Calculation means for calculating the weight value of the Web page included in the first search result for each Web page, and the Web page included in the first search result based on the weight value calculated for each Web page.
- Rank determining means for determining the rank of the page, second search result
- FIG. 1 is a diagram corresponding to an example of the overall configuration of a system corresponding to an embodiment of the present invention.
- FIG. 2 is a diagram corresponding to an example of a configuration of a client 101 corresponding to the embodiment of the present invention.
- FIG. 3 is a diagram corresponding to an example of a configuration of a server 103 corresponding to the embodiment of the present invention.
- FIG. 4A is a diagram showing an example of registration contents of a user database 104 corresponding to the embodiment of the present invention.
- FIG. 4B is a diagram showing an example of registered contents in the bookmark database 105 corresponding to the embodiment of the present invention.
- FIG. 4C is a diagram showing an example of registered contents in the URL database 106 corresponding to the embodiment of the present invention.
- FIG. 4D is a diagram showing another example of registered contents in the URL database 106 corresponding to the embodiment of the present invention.
- FIG. 5 is a flowchart corresponding to an example of processing between the client 101 and the server 103 when an online bookmark is used, corresponding to the embodiment of the present invention.
- FIG. 6A is a diagram showing an example of a display screen displayed on the display 207 of the client 101 corresponding to the embodiment of the present invention.
- FIG. 6B is a diagram showing another example of a display screen displayed on the display 207 of the client 101 corresponding to the embodiment of the present invention.
- FIG. 6C is a diagram showing still another example of the display screen displayed on the display 207 of the client 101 corresponding to the embodiment of the present invention.
- FIG. 7 is a flowchart corresponding to an example of processing between the client 101 and the server 103 when search processing corresponding to the embodiment of the present invention is executed.
- FIG. 8 is a flowchart corresponding to an example of search processing in the server 103, corresponding to the embodiment of the present invention.
- FIG. 1 is a block diagram showing the overall configuration of a search system corresponding to this embodiment.
- Clients 101a, 101b, and 101c (hereinafter collectively referred to as “client 101”) that are operated by a user to browse content, a server 103 that performs search, and a content server 108 that publishes content, respectively.
- client 101 Clients 101a, 101b, and 101c
- server 103 that performs search
- content server 108 that publishes content, respectively.
- a user database 104, a bookmark database 105, a URL database 106, and a search database 107 are connected to the server 103 !.
- the client 101 is an information processing apparatus as a client terminal that can be operated by a user and browse content published on the Internet, and includes a notebook computer, a desktop computer, a portable information terminal, a cellular phone, and the like. . It is assumed that the client 101 is installed with software capable of displaying content described in a predetermined page description language (for example, HTML, XML, PDF, PS, etc.) on the display screen. In the following embodiment, for the sake of simplicity, a case where the so-called Internet browser power S is installed as the software will be described as an example.
- a predetermined page description language for example, HTML, XML, PDF, PS, etc.
- the client 101 is connected to the Internet 102 via a LAN that is an access means including a network cable such as Ethernet (registered trademark).
- the access means to the network 102 is not limited to an Ethernet (registered trademark) cable.
- the access means may be constituted by a communication cable conforming to the RS232C interface or a wireless communication means such as a wireless LAN.
- the server 103 and the content server 108 described later are also provided with similar access means.
- the Internet 102 is a network in which networks all over the world are connected to each other.
- the Internet 102 may be a network that can be connected only within a specific thread and fabric such as an intranet.
- the server 103 serves as an information search device that provides a search service to the client 101.
- the information processing apparatus manages information stored in the user database 104, bookmark database 105, URL database 106, and search database 107.
- the server 103 periodically accesses a Web site that can be browsed on the Internet 102, collects the URL, title, and full-text data included therein, and updates the URL database 106 and the search database 107.
- the update of the bookmark database 105 is monitored, and when a new URL is added to or deleted from the user bookmark, the corresponding URL database 106 is updated. Do. In addition, the selection operation of the URL registered in the bookmark in the client 101 is monitored, and the reference count of each user is recorded.
- the server 103 is connected to the user database 104, the bookmark database 105, the URL database 106, and the search database 107 by, for example, LAN.
- the user database 104, the bookmark database 105, the URL database 106, and the search database 107 are information processing apparatuses in which predetermined database software is installed, and manage various data.
- the user database 104 manages information on user data.
- the bookmark database 105 manages bookmark data for each user.
- the URL database 106 uniformly manages registration information related to pages registered by each user in the bookmark.
- the search database 107 manages information for search processing.
- the content server 108 is an information processing apparatus that provides content such as a web page, and for example, software such as a web server is installed.
- the server 103, the user database 104, the bookmark database 105, the URL database 106, and the search database 107 are each realized by a physically independent information processing device.
- the embodiment of the present invention is not limited to this.
- these may be realized by a single information processing device.
- the user database 104, the bookmark database 105, the URL database 106, and the search database 107 can be described as being connected to the server 103 via a LAN or the like. You may take the form which can communicate. ⁇ Configuration of information processing apparatus>
- FIG. 2 is a block diagram illustrating an example of the hardware configuration of the client 101.
- the user database 104, the bookmark database 105, the URL database 106, and the search database 107 as the information processing apparatus described above may be configured with the same or equivalent hardware configuration.
- reference numeral 200 denotes a CPU, which is stored in a hard disk device (hereinafter referred to as HD) 205 and executes an application program, an operating system (OS), a control program, etc. Control to temporarily store information, files, etc. necessary for execution.
- HD hard disk device
- OS operating system
- control program etc. Control to temporarily store information, files, etc. necessary for execution.
- Reference numeral 201 denotes a ROM that stores therein various data such as a program such as a basic I / O program, font data used for document processing, and template data.
- 202 is a RAM for temporarily storing various data, and functions as the main memory, work area, etc. of the CPU 200.
- [0028] 203 is an external storage drive for realizing the access to the recording medium, it can be force s load media (recording medium) 204 a program stored in such in the computer system.
- media 204 for example, a flexible disk (FD), CD-ROM, CDR, CD-RW, PC card, DVD, IC memory card, MO, memory stick, etc. can be used.
- Reference numeral 205 denotes an external storage device, which uses an HD functioning as a large-capacity memory in this embodiment.
- the HD 205 stores application programs, OS, control programs, related programs, and the like.
- Reference numeral 206 denotes an instruction input device, which corresponds to a keyboard, a pointing device (such as a mouse), and a touch panel. Using the instruction input device 206, the user instructs the client 101 to input a command or the like for controlling the device.
- Reference numeral 207 denotes a display which displays a command input from the instruction input device 206, a response output of the client 101 in response thereto, and the like.
- Reference numeral 209 denotes a system bus, which controls the flow of data in the information processing apparatus.
- 20 8 is an interface (hereinafter referred to as IZF), and exchanges data with external devices via this IZF208.
- IZF interface
- the program may be loaded into the RAM 202 from the HD 205 in which the program is already installed. It is also possible to record the program according to the present embodiment in the ROM 201, configure it as a part of the memory map, and execute it directly by the CPU 200.
- FIG. 3 is a block diagram showing a device configuration of the server 103. Functions and applications of CPU300, ROM301, RAM302, external storage drive 303, media 304, HD305, instruction input device 306, display 307, I / F308, system bus 309, and their relationship are shown in Fig. 2. Are the same as or equivalent to those described above. Note that in FIG. 3, database 310 is connected to system bus 309. Here, the database 310 is a generic term for the user database 104, the bookmark database 105, the URL database 106, and the search database 107.
- FIG. 4A is a diagram for explaining an example of the structure of user data stored in the user database 104.
- User data is composed of various information related to the characteristics and attributes of individual users.
- 401 is a user ID, which is an identifier given to the unique to uniquely identify each user.
- Reference numeral 402 denotes a password, which is used for authentication when each user accesses registered personal information or uses a bookmark.
- 403 is the user name
- 404 is the user's e-mail address
- 405 is the age
- 406 is the gender
- 407 is the region where the user is active or resides
- 408 is the occupation
- 409 is unmarried
- 410 is the residence type
- 411 is the year 412 is private car ownership
- 413 is a hobby.
- Each data is provided in advance by the user and registered as user data. Also, the above content is the user data It is given as an example, and it is not necessary that all of these are registered, and other information that characterizes the user may be registered.
- the bookmark is created and updated by the user. All bookmark data is stored in the bookmark database 105 connected to the server 103 for each user, and when the user starts the web browser on the client 101, It is downloaded to the client 101 and displayed on the web browser.
- a bookmark is composed of a URL and a folder for storing the URL as a specific group in a known directory structure. That is, each URL is registered in a directory tree for each user (hereinafter referred to as a bookmark directory), and may be directly under the bookmark directory or in an arbitrarily created folder.
- FIG. 4B is a diagram showing an example of management of information related to pages registered in bookmarks.
- the user ID is an identifier of the user who uses the bookmark, and corresponds to the user ID 401 in FIG. 4A.
- the user can be associated with the bookmark.
- information for identifying a Web page to be registered is registered.
- the URL database 106 is used to manage information about each page in a unified manner, so the URL of the corresponding page is not stored in each user bookmark. Instead, an identification number (URL—ID) for managing the page is stored in the URL database 106. If the URL database 106 is not used, the URL itself may be registered.
- URL—ID identification number
- Reference numeral 423 denotes a bookmark name given to each page.
- the bookmark name 423 may be a name arbitrarily set when the user registers a bookmark, or may be a name originally assigned to the page.
- Reference numeral 424 indicates the date and time when the page was created (corresponding to the date and time when the page was first published on the network 102).
- 425 indicates the date and time when the page was updated.
- 426 indicates the date and time when the user referred to the page most recently.
- Reference numeral 427 denotes the total number of times the user refers to the page.
- the total number of references refers to the total number of times the user has referred to the page so far.
- Reference numeral 428 indicates the number of times the user has referred to the page recently.
- the most recent reference count is This is the total number of times the page was viewed by the user during the most recent predetermined period (eg, 1 week, 1 month, etc.).
- a directory table is prepared in the bookmark database 105 for the relationship between the pages registered in the bookmark and information on the folder storing the page.
- the server 103 periodically monitors the update of each page registered as a bookmark as described above, and updates the contents of the update date and time of 425 when there is an update.
- FIG. 4C is a diagram illustrating an example of management of information regarding pages in the URL database 106.
- 422 represents an ID (identifier) given to the URL, and corresponds to the URL—ID 422 in FIG. 4B.
- Reference numeral 429 denotes a URL to which a URL—ID422 is given. 43 0 indicates the title inherent in the Web content corresponding to the URL.
- the creation date and update date are the same as in Figure 4B.
- 431 indicates the last reference date and time, and indicates the date and time when one of the bookmark users referred to the page most recently.
- Reference numeral 432 represents the total of the total number of times 427 of the plurality of users, that is, the total number of times the page is referred to by the plurality of users.
- the most recent reference count 433 represents the total of the recent reference counts 428 of a plurality of users, that is, the total number of times that the page has been referred to by a plurality of users within the most recent predetermined period. Therefore, the “predetermined predetermined period” is the same as in FIG. 4B. For example, recently popular pages that are popular, but popular, have a higher recent reference count of 433, so it is possible to determine the most recent degree of popularity for each page. 434 is the number of increased anchors, and indicates the number of anchors added within the most recent predetermined period for the page. In the present embodiment, an anchor refers to link information to another page embedded in a page. Further, “the most recent predetermined period” is the same as described above.
- FIG. 4D is a diagram illustrating another example of management of information related to pages in the URL database 106.
- Reference numeral 422 denotes a URL ID assigned to each page.
- Reference numeral 440 denotes user data, in which individual user data shown in FIG. 4A is registered.
- Reference numeral 441 denotes the total number of times of reference, and the total number of times of reference with respect to the page, which is aggregated for each user data, is registered.
- Reference numeral 442 denotes the number of the latest references, and the total of the number of references for the most recent predetermined period with respect to the URL, which is aggregated for each user data, is registered.
- the “predetermined predetermined period” is the same as described above.
- 443 is the number of bookmarks registered, and a value obtained by adding the number of users who register the page as a bookmark to! / Speak for each user data is registered.
- Fig. 4D the user data 440 is reached! /, Age 405, hobby 413, job type 408 power ⁇ power that describes the registered f column
- age 405 is considered, "10s", " The total number of referrals 441, the latest reference count 442, and the number of bookmark registrations 443 are registered for each age group, such as “20's”.
- the total number of users in their teens who referred to the URL is 100 times
- the total number of users in their 20s who referred to the URL is 300 times.
- the user data 440 includes a plurality of items other than "age”, "hobby", and "job type". For each of these user data, the total number of reference times 441, the latest reference 442 times and 443 bookmarks are registered. The total number of reference times 441, the number of latest reference times 442, and the number of bookmark registrations 443 are as follows: It can be obtained by using the data shown in FIGS. 4A and 4B.
- registration information the individual registered contents of the database illustrated in FIGS. 4A to 4D are referred to as registration information.
- the search database 107 manages information for search processing as search information for searching Web pages related to keywords for which user power is also provided, and is collected from the Web site on the Internet 102.
- the retrieved search data is registered.
- the search data may include other data that mainly consists of the identifier, keyword, attribute, data acquisition time, data acquisition URL, URL title, full text data (Description), etc. of each search data.
- FIG. 5 is a flowchart corresponding to an example of processing between the client 101 and the server 103 when an online bookmark is used in the client 101.
- the bookmark data request is notified to the server 103 together with the user ID 401 (501).
- the bookmark database 105 is searched based on the notified user ID 401, and the bookmark data of the user is selected (step S552).
- the selected bookmark data is transmitted from the server 103 to the client 101 (502).
- the client 101 displays the activated web browser on the display 207, and displays bookmark information based on the bookmark data transmitted from the server 103 in the bookmark display area of the web browser (step S553).
- the client 101 monitors whether or not the user has received an operation on the bookmark, and if the operation is received (“YES” in step S554), the bookmark operation content is displayed to the server 103. Notification is made (503).
- the operation of the bookmark that accepts user power includes, for example, selection of a URL registered in the bookmark, addition of a new URL, editing (including deletion) of registered URL information, and the like.
- the server 103 receives the bookmark operation content. Update each database based on the contents. If the notification is, for example, a notification that a user has selected a URL, the reference date and time 426 for the corresponding URL, the total reference count 427, the latest reference count 428, the last reference date and time 431, the total reference count 432, The latest reference count 433, the total reference count 441, and the latest reference count 442 are updated.
- a termination notification is transmitted from the client 101 to the server 103 (504), and a termination response is returned from the server 103 (505).
- the contents of the bookmark database 105 and the URL database 106 managed by the server 103 are always updated based on the operation accepted by the user.
- FIGS. 6A to 6C are diagrams showing examples of screens displayed on the display 207 when the user power client 101 executes search processing.
- a predetermined URL is specified and a search page provided by the server 103 is accessed.
- the screen first displayed on the display 207 is a screen 600 as shown in FIGS. 6A to 6C.
- the screen 600 is largely divided into a search screen 601 and a bookmark display area 600.
- bookmark display based on the bookmark data provided from the server 103 is performed, and the user can select and browse a page registered in the bookmark.
- the server 103 provides a search service with the name “O ⁇ search”.
- the search screen 601 includes a keyword input area 602, a search execution button 603, an option 604, and A setting button 605 is displayed.
- the keyword input area 602 is an area for receiving an input of a keyword that the user wants to search.
- the search execution button 603 is a button for instructing the server 103 to perform a search process according to the contents of the set option for the input keyword.
- Option 604 indicates whether to sort the search results based on page information or user data, as shown in Figure 6C.
- the user power is also accepted. In other words, when “use” is selected, a search using the content set as an option is performed. On the other hand, if “do not use” is selected, normal search processing is executed without using the content set as an option, and this search result output S client 101 is notified.
- FIG. 6B is a screen displayed on the display 207 when the setting button 605 is selected in FIG. 6A. This screen is a screen for confirming whether or not the user is registered in the user database 104.
- reference numeral 611 denotes a display field for accepting an input regarding the presence / absence of user registration from a user who wants to perform a search.
- “Yes” or “No” can be selected.
- input fields 612 and 613 for accepting input of “user name” and “password” are displayed.
- “None” is selected, these input fields 612 and 613 are not displayed at all as the force to be grayed out.
- Reference numeral 614 denotes a determination button, which is operated when confirming input contents.
- a cancel button 615 is operated when the user wants to return to the screen shown in FIG. 6A. If the user registration 611 is set to “None” when the OK button 614 is operated, the screen is switched to FIG. 6C without transmitting data to the server. On the other hand, if “user name” and “password” are accepted in the setting of the user registration 611 force S “Yes”, the accepted data is transmitted to the server 103.
- the server 103 searches the user database 104 for a user name 403 and a password 402 that match the “user name” and “password” received from the client 101.
- a matching “user name” is registered, it is registered in the user database 104, and the presence / absence of user data and the user ID 401 are notified from the server 103 to the client 101.
- the client 101 Upon receiving the notification from the server 103, the client 101 displays the screen in FIG. 6C.
- an error message is notified from the server 103 to the client 101.
- the client 101 receives the error message from the server 103, the client 101 displays the screen of FIG. 6B and accepts the input of “user name” and “password” again.
- FIG. 6C is a screen displayed on the display 207 after the enter button 614 is selected in FIG. 6B.
- this screen receives setting of option information used when searching in the server 103, more specifically, designation of registration information used for calculating the weight value of each page included in the search result. It is also a screen for attaching.
- the option input screen 620 roughly shows page information 621, user data 622, and manaise 623.
- Items that can be set for the page information 621 include the update date, the total number of references, the number of latest references, the number of increased anchors, and the number of bookmark registrations. These items correspond to the registration information of the update date and time 425, the total reference count 432, the latest reference count 433, the increased anchor count 434, and the bookmark registration count 435 of Fig. 4C registered in the URL database 106, respectively. . Therefore, when any of these items is selected, the corresponding registration information is used when determining the display order for the pages included in the search results in the server 103 (as will be described later with reference to FIG. 8). Used for
- items that can be set for the user data 622 include age, sex, regional attributes, occupation, unmarried, annual income, presence of a car, and hobbies. These items are registered information related to the specified user data among the registered information for the total reference count 441, the latest reference count 442, and the bookmark registration count 443 in FIG. 4D registered in the URL database 106, respectively. It corresponds.
- the generation to which the user belongs is determined based on the age of the user who made the designation out of the age of the user data 440 in FIG. 4D. And corresponding registration information is specified. Therefore, these items When either one is selected, the corresponding registration information is used when determining the display order of pages included in the search result in the server 103 (as will be described later with reference to FIG. 8).
- Omakase 623 can be selected when the user does not specify the page information 621 or the user data 622. If Random 623 is selected, for example, the search may be performed using all option information, or specific page information 621 and user data 622 may be set in advance and You may search using settings
- the power that items that can be set for the page information 621 and the user data 622 are displayed.
- Such a display is registered in the user database 104 and each item of the user data 622 is displayed.
- the user data must be registered for the setting items without omission.
- some users may not be able to be registered in the user database 104.
- a setting is accepted for the setting item of the user data 622, it cannot be used for the search.
- Such a user should have set the user registration 611 to “none” in the screen of FIG. 6B. Therefore, it is preferable that only the page information 621 is displayed on the screen 620 and the user data 622 is grayed out corresponding to the setting contents. According to this, even a user who is not registered in the user database can be searched using at least the page update date and the latest reference count.
- the user can confirm the contents of the option input by the OK button 624.
- the search screen 601 returns to the contents shown in FIG. 6A. If the cancel button 625 is selected, the screen returns to the screen in FIG. 6A. At that time, the option settings may be reset, or the previous input contents may be retained.
- the client 101 when the URL of the search service page provided by the server 103 is selected ("YES" in step S751), the client 101 sends a request to the server 103. A request for search screen information is notified (701). In response to the request for search screen information from the client 101, the server 103 transmits search screen information as search information generation information for generating search information to be described later in the client 101 (702). Upon receiving the search screen information from the server 103, the client 101 displays a search screen as shown in the search screen 601 in FIG. 6A on the display 207 (step S752).
- step S753 an input such as whether or not the user power keyword or option setting is accepted is accepted (step S753), and when option setting is performed, the screen of FIG. It is determined whether or not the force is applied (step S754). If the option is not set, the following steps S753 and S756 are skipped and the process proceeds to step S757.
- step S756 In the screen displayed in FIG. 6B, if the user registration 611 is set to “none”, it means that “user registration is present” is not set (in step S754! / “NO”), proceed to step S756. On the other hand, when the user registration 611 is set as “present” and “user name” and “password” are entered in the input fields 612 and 613, this means that “user registration is present” (step S754). Therefore, the input information is transmitted to the server 103 as user authentication information (703).
- the server 103 includes "user name” and "password” transmitted from the client.
- User authentication information is used to verify user authentication, that is, whether or not the user is registered in the user database 104 (step S755). If the user is registered as a result of the verification, the user authentication result including the user ID 401 and the information on the presence or absence of the user data registered in the user database 104 for the user is sent to the client 101. It is transmitted (704). If the user is not registered, it is transmitted to the user authentication result S client 101 including an error message (704).
- step S756 an option input screen as shown in FIG. 6C is displayed to accept an option input.
- a search execution instruction by selecting the search execution button 603 is accepted (step S757).
- the server 103 receives the contents of the keyword and option settings received in steps S753 and S756, and the user authentication result (704), it will be displayed.
- the search information including the user ID 401 included in is transmitted (705).
- a predetermined code indicating that the user is not registered may be included in the search information instead of the user ID 401 and transmitted.
- the server 103 Upon receiving the search information transmitted from the client 101, the server 103 performs a search process based on the key word, and creates a search result based on the contents of the option settings (step S758). ) And the search result is transmitted to the client 101 (706). When the client 101 receives the search result from the server 103, the client 101 displays the received search result on the search screen 601 of the display 207 (step S759).
- step S751 the processing from step S751 is resumed. If the user finishes browsing the Web browser (“YES” in step S760), the client 101 sends a termination notice to the server 103 (707), and the server 103 returns a termination response. (708).
- FIG. 8 is a flowchart corresponding to an example of search processing in the server 103.
- step S801 the server 103 receives the search information transmitted from the client 101, and at the same time receives the search request from the received search information (the search request).
- the user ID 401 of the requesting user) is extracted.
- step S802 user data is acquired from the user database 104 based on the extracted user ID 401.
- the search requesting user is not registered in the user database 104, the code indicating the unregistered user is included, but the user ID 401 is not included in the search information! /.
- the option setting 604 in FIG. 6A is set to “not use”, the user authentication result (704) in FIG. 7 is not received. Therefore, the user ID 401 is not included in the search information.
- user data is not acquired for these users.
- step S803 the search database 107 is searched by a known method based on the keywords included in the search information, and data matching the search conditions based on the keywords is acquired as the search result 1 in step S804.
- This search result 1 is the search result obtained by the search process. It is the user data (that is, job title and age of the search requesting user) and page information (update date and time, reference count, etc.) Regardless of the criteria, the degree of association with the keyword is determined based on the criteria such as the keyword specified in the page and how many words are similar to the keyword, and the pages are arranged in the order of the association. Obtained as a thing.
- step S805 the contents of the option setting 604 of the search request user are identified based on the search information! /. If the option setting 604 is set to “not use” in the option setting 604, the process proceeds to step S816 and the search result 1 is transmitted to the client 101. On the other hand, if “use” is set, the process proceeds to step S806, and the point Ps of each page is calculated based on the ranking of the search result 1. For example, if search result 1 contains 100 pages! /, Then based on the ranking! /, The head will also be scored as 1 00, 99, 98 ... 1 Power S can be. Only the top 10 points may be given points of 100, 90, 80 ... 10. In any case, the weight based on the ranking of the search result 1 is given to each page included in the search result 1 as Ps.
- step S807 the order of pages included in the search result 1 is set to Nu, and Nu is set to 1.
- step S808 the page with the Nu rank in search result 1 is selected, and the URL data for the page is based on the URL of the page. If information is registered in the base 106, all the registered information is acquired.
- step S809 based on the contents of the option setting set in the search information and the user data acquired in step S802, if there is the user data, from the acquired registration information, Narrow down the registration information used for weighting the page.
- steps S808 and S809 the force of reading out all of the registration information and then narrowing down this is only described as an example.
- the corresponding registration information may be acquired directly from the URL database 106 based on the contents of the option settings set in the search information and the user data.
- the page itself is not registered in the URL database 106, or when the page itself is registered but information corresponding to the narrowed registration information is not registered (for example, the The page may not be accessed by anyone other than a specific hobby.
- this point Pu it is possible to reflect the relationship between the page and the search requesting user and the usefulness of the information on the page in the search result.
- the total reference count 441, the latest reference count 442, and the bookmark registration count 443 it is registered in the page or bookmark that many other users who have selected the same type of job as the search request user refer to. Have Can be ranked higher in the search results.
- Pr can be used as a numerical value as it is when it is counted numerical information such as the total number of times of reference.
- the Pr value can be obtained by obtaining the difference from the current date and time, and further obtaining the reciprocal thereof. In any case, it is only necessary to convert the Pr value so that it becomes closer to the current date and time.
- the weighting value Pw may be given uniformly for all registration information, or may be given a value for specific registration information.
- the weighting value Pw of the total number of reference times 441 is set to 100 and the weighting value of the total number of reference times 432 is set to 10.
- a different weighting value from other registration information may be prepared.
- step S811 the points Pu for each piece of registration information calculated in step S810 are summed to calculate Pust.
- This point Pust is the weight given to each page based on the registration information.
- Puall Ps + Pust ′ N (N is a predetermined number).
- N is a predetermined number.
- the ranking of search result 1 need not be considered at all.
- step S815 based on the Puall value calculated in step S814! /,
- Puall decides the order of the pages in order of size, sorts the order of search result 1 and creates search result 2.
- search result 2 (or search result 1 if directly transferred from step S805) is transmitted to client 101.
- the server 103 corresponding to the present embodiment can perform search processing based on information other than the user data of the search request user and the contents of the page, and create a search result. I'll do it.
- user data to be used can be designated in advance by option setting 604.
- the user himself / herself narrows down user data that should be actively used, and is highly relevant to the keyword originally intended by the user.
- the page is weighted based on the number of most recent references !, so the page is a page whose total number of references is not very large due to recently launched power.
- the number of times of the most recent reference is naturally larger than other pages. The point (Pu) will be higher. Therefore, such pages are ranked higher in the search results.
- it is normal that such a page does not increase in the number of the latest reference. Therefore, by using this number of the latest reference, it is possible to exclude the search result power. It becomes possible.
- the above processing (for example, processing according to the flowchart shown in FIG. 5, FIG. 7 or FIG. 8 above) is stored as a program in a storage medium such as a CD-R, ROM, DVD-ROM, MO, etc.
- This computer can perform the above processing by loading the stored program into the computer (installing or copying), or downloading a predetermined server power via the network. Therefore, it is clear that this storage medium is also within the scope of the present invention.
- the present invention is not limited to the above-described embodiment, and various changes and modifications can be made without departing from the spirit and scope of the present invention. Therefore, in order to make the scope of the present invention public, the following claims are attached.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
La présente invention concerne un dispositif de récupération d'informations destiné à récupérer les informations d'une page Web sur Internet, sur la base des informations de récupération saisies depuis un terminal client et à transmettre le résultat de récupération au terminal client. La valeur pondérée de la page Web est calculée en utilisant les informations enregistrées relatives à la page Web, lorsqu'il apparaît dans un premier résultat de récupération que la page Web relative à un mot-clé a été récupérée à partir des informations de récupération. L'ordre de la page Web contenue dans le premier résultat de récupération est déterminé sur la base de la valeur pondérée calculée, afin de créer un second résultat de récupération. Ce second résultat est transmis au terminal client.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2005/003928 WO2006095409A1 (fr) | 2005-03-07 | 2005-03-07 | Dispositif de recuperation d'informations, programme informatique et support d'enregistrement |
| JP2007506942A JP4527770B2 (ja) | 2005-03-07 | 2005-03-07 | 情報検索装置、コンピュータプログラム及び記憶媒体 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2005/003928 WO2006095409A1 (fr) | 2005-03-07 | 2005-03-07 | Dispositif de recuperation d'informations, programme informatique et support d'enregistrement |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2006095409A1 true WO2006095409A1 (fr) | 2006-09-14 |
Family
ID=36953020
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2005/003928 Ceased WO2006095409A1 (fr) | 2005-03-07 | 2005-03-07 | Dispositif de recuperation d'informations, programme informatique et support d'enregistrement |
Country Status (2)
| Country | Link |
|---|---|
| JP (1) | JP4527770B2 (fr) |
| WO (1) | WO2006095409A1 (fr) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2009003520A (ja) * | 2007-06-19 | 2009-01-08 | Mic 9 Co Ltd | データベースサーバー装置及び検索サービス提供方法及び関連方法 |
| JP2009282593A (ja) * | 2008-05-20 | 2009-12-03 | Yahoo Japan Corp | コンテンツを検索する索引データを管理する方法、サーバ、およびプログラム |
| JP2010506255A (ja) * | 2006-09-29 | 2010-02-25 | グーグル インコーポレイテッド | ブックマークおよびランク付け |
| JP2010508579A (ja) * | 2006-10-25 | 2010-03-18 | マイクロソフト コーポレーション | マクロを使用する個別化検索 |
| JP2011054189A (ja) * | 2005-05-06 | 2011-03-17 | Nhn Corp | 個人化検索方法および個人化検索システム |
| JP2023078847A (ja) * | 2021-11-26 | 2023-06-07 | 三菱電機Itソリューションズ株式会社 | 文書管理装置、文書管理方法及び文書管理プログラム |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002032401A (ja) * | 2000-07-18 | 2002-01-31 | Mitsubishi Electric Corp | 文書検索方法及び文書検索装置及び文書検索方法をコンピュータに実現させるためのプログラムを記録したコンピュータで読取可能な記録媒体 |
| JP2002215671A (ja) * | 2001-01-16 | 2002-08-02 | Nippon Hoso Kyokai <Nhk> | 情報検索装置、および、情報検索方法 |
| JP2002351916A (ja) * | 2001-05-30 | 2002-12-06 | Bevoid Corporation Ltd | Web情報検索装置、web情報検索方法及びその方法をコンピュータに実行させるプログラム |
| JP2003186907A (ja) * | 2001-12-13 | 2003-07-04 | Fujitsu Ltd | プロファイル情報の情報検索方法、プログラム、記録媒体及び装置 |
| JP2003281179A (ja) * | 2002-03-22 | 2003-10-03 | Hitachi Information Systems Ltd | 検索サイトサーバ装置と検索情報表示制御方法およびプログラムと記録媒体 |
-
2005
- 2005-03-07 JP JP2007506942A patent/JP4527770B2/ja not_active Expired - Fee Related
- 2005-03-07 WO PCT/JP2005/003928 patent/WO2006095409A1/fr not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002032401A (ja) * | 2000-07-18 | 2002-01-31 | Mitsubishi Electric Corp | 文書検索方法及び文書検索装置及び文書検索方法をコンピュータに実現させるためのプログラムを記録したコンピュータで読取可能な記録媒体 |
| JP2002215671A (ja) * | 2001-01-16 | 2002-08-02 | Nippon Hoso Kyokai <Nhk> | 情報検索装置、および、情報検索方法 |
| JP2002351916A (ja) * | 2001-05-30 | 2002-12-06 | Bevoid Corporation Ltd | Web情報検索装置、web情報検索方法及びその方法をコンピュータに実行させるプログラム |
| JP2003186907A (ja) * | 2001-12-13 | 2003-07-04 | Fujitsu Ltd | プロファイル情報の情報検索方法、プログラム、記録媒体及び装置 |
| JP2003281179A (ja) * | 2002-03-22 | 2003-10-03 | Hitachi Information Systems Ltd | 検索サイトサーバ装置と検索情報表示制御方法およびプログラムと記録媒体 |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2011054189A (ja) * | 2005-05-06 | 2011-03-17 | Nhn Corp | 個人化検索方法および個人化検索システム |
| JP2010506255A (ja) * | 2006-09-29 | 2010-02-25 | グーグル インコーポレイテッド | ブックマークおよびランク付け |
| JP2010508579A (ja) * | 2006-10-25 | 2010-03-18 | マイクロソフト コーポレーション | マクロを使用する個別化検索 |
| JP2009003520A (ja) * | 2007-06-19 | 2009-01-08 | Mic 9 Co Ltd | データベースサーバー装置及び検索サービス提供方法及び関連方法 |
| JP2009282593A (ja) * | 2008-05-20 | 2009-12-03 | Yahoo Japan Corp | コンテンツを検索する索引データを管理する方法、サーバ、およびプログラム |
| JP2023078847A (ja) * | 2021-11-26 | 2023-06-07 | 三菱電機Itソリューションズ株式会社 | 文書管理装置、文書管理方法及び文書管理プログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2006095409A1 (ja) | 2008-08-14 |
| JP4527770B2 (ja) | 2010-08-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8732169B2 (en) | Lateral search | |
| JP4731479B2 (ja) | 検索システム及び検索方法 | |
| US6718365B1 (en) | Method, system, and program for ordering search results using an importance weighting | |
| US7617191B2 (en) | Search service that accesses and highlights previously accessed local and online available information sources | |
| KR101130420B1 (ko) | 통합 및 혼합 검색을 위한 시스템 및 방법 | |
| US7660781B2 (en) | Method, apparatus and computer-readable medium for searching and navigating a document database | |
| JP4005425B2 (ja) | 検索結果ランキング変更処理プログラム、検索結果ランキング変更処理プログラム記録媒体、およびコンテンツ検索処理方法 | |
| US8577868B1 (en) | Bookmarks | |
| JP5049223B2 (ja) | Webクエリに対する検索要求属性を自動推定する検索装置、検索方法及びプログラム | |
| CN101263493A (zh) | 用于提供搜索结果的系统和方法 | |
| WO2008051692A1 (fr) | Recherche personnalisée utilisant des macroapplications | |
| US20070239692A1 (en) | Logo or image based search engine for presenting search results | |
| JP2003281179A (ja) | 検索サイトサーバ装置と検索情報表示制御方法およびプログラムと記録媒体 | |
| JP2009037501A (ja) | 情報検索装置、情報検索方法およびプログラム | |
| JP4344596B2 (ja) | コンテンツ検索システム、コンテンツ検索プログラムおよび記録媒体 | |
| US20090240691A1 (en) | Recording medium recording object contents search support program, object contents search support method, and object contents search support apparatus | |
| JP4527770B2 (ja) | 情報検索装置、コンピュータプログラム及び記憶媒体 | |
| JP4912384B2 (ja) | 文書検索装置、文書検索方法、および文書検索プログラム | |
| KR101878937B1 (ko) | 개인화된 정보 제공 시스템, 방법 및 그에 대한 기록매체 | |
| JP2013011940A (ja) | 情報提供サーバ、情報検索システム、情報検索方法、および情報検索プログラム | |
| JP2007034772A (ja) | Webサイト検索結果の最適表示システム及びその装置及びその方法及びそのプログラム | |
| US8131752B2 (en) | Breaking documents | |
| WO2006064573A1 (fr) | Dispositif, procede et programme pour fournir des informations du web | |
| JP2010020739A (ja) | 社会的な人気・関心を反映したディレクトリを作成、検索、及び表示する情報管理装置、方法及びプログラム | |
| JP2009093554A (ja) | 検索支援方法、検索支援システム、アプリケーションサーバ、及び検索支援プログラム |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2007506942 Country of ref document: JP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| NENP | Non-entry into the national phase |
Ref country code: RU |
|
| WWW | Wipo information: withdrawn in national office |
Country of ref document: RU |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 05720200 Country of ref document: EP Kind code of ref document: A1 |
|
| WWW | Wipo information: withdrawn in national office |
Ref document number: 5720200 Country of ref document: EP |