TWI595371B - Legal information search system, search processing method and computer-readable recording medium - Google Patents
- Publication number
- TWI595371B, TW105142810A
- Authority
- TW
- Taiwan
- Prior art keywords
- legal
- candidate
- search
- legal data
- keyword
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Description
The present invention relates to a data search system and a search processing method, and more particularly to a legal data search system, a search processing method, and a computer-readable recording medium that search and rank results by relevance to a keyword group entered in a single field.
In general, legal data covers trial-related statutes and regulations, judicial interpretations and rulings, judgments, legal cases, announcements of exclusion judgments (除權判決), and the like. Depending on the category of legal data sought, a user queries it through a legal data search system, for example the Judicial Yuan's legal data retrieval system (司法院法學資料檢索系統, http://jirs.judicial.gov.tw/Index.htm). With a conventional legal data search system, the user usually must first decide the category to query, such as statutes or judgments, and can run the query only after clicking through to the corresponding page.
FIG. 1 is a schematic diagram of the user interface of a conventional legal data search system. As shown in FIG. 1, to look up a judgment through the user interface 1 of a conventional legal data search system, the user must first know the case number, year, court level, and region of the judgment, and must enter keywords of the matching attribute into the fields 10 shown on the user interface 1, such as court name, judgment type, case number, cause of action, judgment date, main text of the judgment, and full-text search terms. Only then does the search return the corresponding legal data. However, a conventional legal data search system returns only data that exactly matches the keywords, and keyword information such as the case number, court level, and region is hard for users to obtain and its syntax is hard to understand. Searches therefore often return "no data found". In other words, a conventional legal data search system is not only complicated to operate but also unable to return legal data that is merely related to the keywords.
For example, when the cause of action sought is 「侵害專利權有關財產權」 (property rights dispute concerning patent infringement), entering 「專利侵權」 (patent infringement) in the cause-of-action field returns nothing. When the case number sought is 「六十三年度上易字第六十一號」, entering 「63年度上易字第61號」 in the case-number field returns nothing. Moreover, in either case, if the wrong court is entered, no legal data is found. Because the keywords must exactly match the content of a legal data record for that record to be found, the desired record is easily missed and related records cannot be retrieved at all. Searching is therefore inconvenient, tedious, and time-consuming, and the system cannot offer a user-friendly experience.
In addition, a conventional legal data search system orders its result list by date and cannot order it by how relevant each judgment is to the keywords. The judgments at the top of the list are therefore often not the most relevant ones, which lowers search precision and forces users to read and filter many results, wasting search and reading time. The same problem exists for searches of trial-related statutes, judicial interpretations, legal cases, and announcements of exclusion judgments. Overall, conventional legal data search systems have low search precision, and the search process is tedious, time-consuming, and inconvenient.
In view of this, there is a need for a legal data search system and a search processing method that solve the problems of the prior art.
An object of the present invention is to provide a legal data search system and a search processing method that return legal data relevant to the keywords, improve search precision, offer a friendly and easy-to-operate user experience, and save users search and reading time.
To achieve the above object, one broader aspect of the present invention provides a legal data search system comprising: a legal database storing a plurality of legal data records; a client device including an intelligent search interface through which a user enters a keyword group; and a search server connected to and in communication with the legal database and the client device and including a processor, wherein the processor performs a legal data search according to the keyword group from the client device. The processor comprises: a keyword tagging and selective expansion module, which tags one or more keyword phrases of the keyword group and selectively expands them to obtain a plurality of candidate keyword phrases, each candidate keyword phrase carrying a tag associated with its category; a keyword search field and weight assignment module, which selects, according to the candidate keyword phrases and their tags, the corresponding search fields and corresponding weighting parameters for each candidate keyword phrase; a search and relevance score calculation module, which searches the legal data records of the legal database according to the candidate keyword phrases to obtain a plurality of candidate legal data records and calculates a relevance score for each candidate legal data record; and a ranking module, which ranks the candidate legal data records by relevance score so that the relevance-ranked candidate legal data records are provided to the client device.
To achieve the above object, another broader aspect of the present invention provides a legal data search processing method for use in a legal data search system to perform legal data retrieval, wherein the legal data search system includes a search server and the search server includes a processor configured to perform the legal data retrieval. The method comprises the steps of: (a) receiving a keyword group, tagging one or more keyword phrases of the keyword group, and selectively expanding them to obtain a plurality of candidate keyword phrases, each candidate keyword phrase carrying a tag associated with its category; (b) selecting, according to the candidate keyword phrases and their tags, the corresponding search fields and corresponding weighting parameters for each candidate keyword phrase; (c) searching the legal data records of a legal database according to the candidate keyword phrases to obtain a plurality of candidate legal data records, and calculating a relevance score for each candidate legal data record; and (d) ranking the candidate legal data records by relevance score so that the relevance-ranked candidate legal data records are provided.
To achieve the above object, a further broader aspect of the present invention provides a legal data search processing method for use in a legal data search system to perform a legal data search, wherein the legal data search system includes a search server and the search server includes a processor configured to perform the legal data search. The method comprises the steps of: (a) receiving a keyword group and expanding one or more keyword phrases of the keyword group to obtain a plurality of candidate keyword phrases; and (b) searching the legal data records of a legal database according to the candidate keyword phrases to obtain a plurality of candidate legal data records relevant to the keyword group.
To achieve the above object, yet another broader aspect of the present invention provides a computer-readable recording medium storing a program, wherein after the search server of a legal data search system loads and executes the program, the search server performs the legal data search processing method described above.
1‧‧‧user interface of conventional legal data search system
10‧‧‧field
2‧‧‧legal data search system
21‧‧‧search server
200‧‧‧host
210‧‧‧processor
211‧‧‧memory
2111‧‧‧program
2112‧‧‧expansion lexicon
2113‧‧‧category-search field-weight mapping database
2114‧‧‧weight parameter database
212‧‧‧communication unit
213‧‧‧input/output unit
214‧‧‧keyword tagging and selective expansion module
216‧‧‧keyword search field and weight assignment module
217‧‧‧search and relevance score calculation module
218‧‧‧ranking module
219‧‧‧data output module
219a‧‧‧highlighting unit
219b‧‧‧content extraction unit
22‧‧‧legal database
221‧‧‧legal data
222‧‧‧data field
23‧‧‧user device
23a‧‧‧smartphone
23b‧‧‧tablet computer
23c‧‧‧notebook computer
23d‧‧‧desktop computer
231‧‧‧intelligent search interface
232‧‧‧search field
233‧‧‧search button
234‧‧‧sort toggle button
S10~S15‧‧‧steps of the search processing method of the legal data search system
S21~S23‧‧‧steps of the relevance score calculation method
FIG. 1 is a schematic diagram of the user interface of a conventional legal data search system.
FIG. 2A is a schematic architecture diagram of one example of the legal data search system of a preferred embodiment of the present invention.
FIG. 2B is a schematic architecture diagram of another example of the legal data search system of the preferred embodiment of the present invention.
FIG. 3 is a schematic diagram of the intelligent search interface of the user device of FIG. 2A.
FIG. 4 is a schematic architecture diagram of the search server of FIG. 2A.
FIG. 5 is a schematic diagram of the legal database of FIG. 2A.
FIG. 6 is a schematic architecture diagram of the processor and memory of the search server of FIG. 4.
FIG. 7 is a flowchart of the search processing method of the legal data search system of the preferred embodiment of the present invention.
FIG. 8 is a schematic diagram of the user device of FIG. 2A displaying legal data search results.
FIG. 9 is a flowchart of the relevance score calculation method of the preferred embodiment of the present invention.
Some typical embodiments that embody the features and advantages of the present invention are described in detail below. It should be understood that the invention can be varied in many ways in its different aspects without departing from its scope, and that the description and drawings are illustrative in nature and are not intended to limit the invention.
FIG. 2A is a schematic architecture diagram of one example of the legal data search system of a preferred embodiment of the present invention, FIG. 2B is a schematic architecture diagram of another example of that system, FIG. 3 is a schematic diagram of the intelligent search interface of the user device of FIG. 2A, FIG. 4 is a schematic architecture diagram of the search server of FIG. 2A, and FIG. 5 is a schematic diagram of the legal database of FIG. 2A. As shown in FIGS. 2A, 2B, 3, 4, and 5, the legal data search system 2 includes a search server 21, a legal database 22, and at least one user device 23. In one embodiment, as shown in FIG. 2A, the search server 21 and the legal database 22 are both located in a host 200 and are connected and communicate through local facilities of the host 200, such as a connection port. In another embodiment, as shown in FIG. 2B, the search server 21 and the legal database 22 are independent, separate physical devices that are connected and communicate through a communication network 20. The search server 21 and the at least one user device 23 are connected and communicate through the communication network 20, which may be, for example but not limited to, the Internet or a mobile communication network. The user device 23 includes, but is not limited to, a network-capable smartphone 23a, tablet computer 23b, notebook computer 23c, or desktop computer 23d. The user device 23 connects to the search server 21 through the communication network 20 and loads and displays an intelligent search interface 231, through which the user enters a keyword group and starts a search. As shown in FIG. 3, in this embodiment the intelligent search interface 231 includes a single search field 232 and a search button 233. The user enters the keyword group for the legal data sought into the search field 232 and clicks the search button 233 to start a search based on that keyword group. As shown in FIG. 5, the legal database 22 includes a plurality of legal data records 221, each of which includes a plurality of data fields 222. As shown in FIG. 4, the search server 21 includes a processor 210, a memory 211, a communication unit 212, and an input/output unit 213. The processor 210 is configured to perform the search and to manage the search server 21. The memory 211 is configured to store the program and the pre-stored data needed to perform the search. The communication unit 212 is configured to connect to and communicate with the user device 23 and the legal database 22. The input/output unit 213 is configured to input and output instructions and data.
FIG. 6 is a schematic architecture diagram of the processor and memory of the search server of FIG. 4, and FIG. 7 is a flowchart of the search processing method of the legal data search system of the preferred embodiment of the present invention. As shown in FIGS. 2A to 7, in this embodiment the processor 210 of the search server 21 includes a keyword tagging and selective expansion module 214, a keyword search field and weight assignment module 216, a search and relevance score calculation module 217, a ranking module 218, and a data output module 219. The memory 211 of the search server 21 includes a program 2111, an expansion lexicon 2112, a category-search field-weight mapping database 2113, and a weight parameter database 2114. The legal data search system 2 and its search processing method are described as follows. First, as shown in step S10, when the processor 210 of the search server 21 receives a keyword group and a request to start a search from the user device 23, it parses the keyword group and begins the legal data search. For example, if the keyword group entered is 「訴字第一四四二號 台南地院」, the processor 210 of the search server 21 parses the keyword group and extracts a plurality of keyword phrases from it, such as 「訴字第一四四二號」 and 「台南地院」.
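Step S10 can be illustrated with a minimal sketch. The description does not say how the keyword group is parsed, so the sketch simply assumes that keyword phrases are separated by whitespace; the function name `split_keyword_group` is hypothetical.

```python
def split_keyword_group(keyword_group):
    """Split a single-field query into keyword phrases.

    Assumption: phrases are separated by whitespace. str.split() with no
    arguments splits on any run of Unicode whitespace, which also covers
    the full-width space (U+3000) common in Chinese input.
    """
    return keyword_group.split()


phrases = split_keyword_group("訴字第一四四二號 台南地院")
print(phrases)
```

A production parser would likely need word segmentation for queries typed without spaces; that is outside this sketch.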
Next, as shown in step S11, the processor 210 executes the keyword tagging and selective expansion module 214 to tag each keyword phrase of the keyword group, so that each keyword phrase becomes a tagged candidate keyword phrase. In this step, tagging may include, for example but not limited to, marking whether each candidate keyword phrase belongs to a category such as court name, case number, or statute. In other words, the tag carried by each candidate keyword phrase is associated with the category of that phrase. In this step, the processor 210 may also selectively expand the keyword phrases of the keyword group, where "selectively" means that the processor 210 either expands a keyword phrase or leaves it unexpanded. The expansion may include, for example but not limited to, synonym expansion, Arabic-numeral-to-Chinese-numeral expansion, Chinese-numeral-to-Arabic-numeral expansion, statute name expansion, article number expansion, case type expansion, case number recognition and expansion, and court recognition and expansion, yielding a plurality of expanded keyword phrases. In some embodiments, the processor 210 expands first, obtaining the expanded keyword phrases, and then tags them so that they become tagged candidate keyword phrases. For example, the keyword phrase 「訴字第一四四二號」 of the above keyword group is expanded into 「訴字第一四四二號」, 「訴字第1442號」, and so on, and these are tagged "case number" to become tagged candidate keyword phrases. The other keyword phrase 「台南地院」 is expanded into 「台南地院」, 「台南地方法院」, 「臺南法院」, 「臺南地方法院」, and so on, and these are tagged "court name" to become tagged candidate keyword phrases. In other embodiments, the processor 210 tags first, so that each keyword phrase of the keyword group carries a tag, and then expands the tagged keyword phrases to obtain a plurality of tagged candidate keyword phrases.
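The tagging and expansion of step S11 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the lexicon contents, the tag names (`court_name`, `case_number`, `full_text`), and the regular expression are all hypothetical, and only positional Chinese numerals such as 一四四二 are converted (forms using 十, such as 六十一, are left untouched).

```python
import re

# Hypothetical digit map for positional Chinese numerals (一四四二 -> 1442).
CN_DIGITS = {"〇": "0", "零": "0", "一": "1", "二": "2", "三": "3", "四": "4",
             "五": "5", "六": "6", "七": "7", "八": "8", "九": "9"}

# Hypothetical stand-in for the expansion lexicon 2112.
COURT_SYNONYMS = {
    "台南地院": ["台南地院", "台南地方法院", "臺南法院", "臺南地方法院"],
}

# Assumed shape of a case number: "<prefix>字第<positional digits>號".
CASE_NUMBER = re.compile(r"^(.*字第)([〇零一二三四五六七八九]+)(號)$")


def tag_and_expand(phrase):
    """Return (tag, candidate phrases) for one keyword phrase."""
    if phrase in COURT_SYNONYMS:
        return "court_name", list(COURT_SYNONYMS[phrase])
    match = CASE_NUMBER.match(phrase)
    if match:
        digits = "".join(CN_DIGITS[ch] for ch in match.group(2))
        arabic = f"{match.group(1)}{digits}{match.group(3)}"
        return "case_number", [phrase, arabic]
    # Anything unrecognized is left unexpanded under a catch-all tag.
    return "full_text", [phrase]


print(tag_and_expand("訴字第一四四二號"))
print(tag_and_expand("台南地院"))
```

Here tagging and expansion happen in one pass; the two orderings described above (expand then tag, or tag then expand) would produce the same candidates for these inputs.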
In some embodiments, the expansion lexicon 2112 of the memory 211 pre-stores the expanded keyword phrases used for expansion, so that when the keyword tagging and selective expansion module 214 performs expansion, it obtains the corresponding expanded keyword phrases from the expansion lexicon 2112. It should be emphasized that the tagging and selective expansion of this step are not limited to the above embodiments and may be varied according to practical needs, so as to broaden the search scope of the keyword group and enable relevance-based retrieval. In this embodiment, the categories of legal data may include, for example but not limited to, court name, criminal judgment, civil judgment, administrative judgment, case number, cause of action, judgment date, main text of the judgment, statute name, or statute text. Because the tag carried by each candidate keyword phrase is associated with that phrase's category, the category of a candidate keyword phrase can be determined from its tag.
Then, as shown in step S12, after tagging is complete, the processor 210 executes the keyword search field and weight assignment module 216 to select search fields and assign weights for each candidate keyword phrase according to the candidate keyword phrases and their respective categories. For example, the tag of the candidate keyword phrase 「訴字第一四四二號」 indicates that its category is "case number". After search field and weight assignment, the data fields of the legal database 22 in which this group of candidate keyword phrases is to be searched are set according to the phrases and their category, and may include, for example but not limited to, the "case number" and "main text of the judgment" data fields of the legal data 221 (see FIG. 5), which broadens the search. In addition, a corresponding weighting parameter is assigned according to the importance of the category and/or field. In some embodiments, the category-search field-weight mapping database 2113 of the memory 211 pre-stores the mapping among categories, search fields, and weights, so that the keyword search field and weight assignment module 216 obtains the mapping from that database when it performs search field and weight assignment. It should be emphasized that the search field and weight assignment of this step are not limited to the above embodiments and may be varied according to practical needs, so as to broaden the search scope of the keyword group and enable relevance-based retrieval.
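The lookup performed in step S12 can be sketched as a small table. The table below is a hypothetical stand-in for the category-search field-weight mapping database 2113; the field names and every weight value are invented for illustration, since the description gives no concrete numbers.

```python
# Hypothetical mapping: tag (category) -> {search field: weighting parameter}.
FIELD_WEIGHTS = {
    "case_number": {"case_number": 3.0, "main_text": 1.0},
    "court_name": {"court_name": 2.0},
}

# Assumed fallback for tags that have no entry in the mapping.
DEFAULT_FIELDS = {"full_text": 1.0}


def assign_fields(tag):
    """Return the search fields and weighting parameters for a tag."""
    # Copy so that callers cannot mutate the shared table.
    return dict(FIELD_WEIGHTS.get(tag, DEFAULT_FIELDS))


print(assign_fields("case_number"))
```

Giving the "case number" field a higher weight than the "main text" field reflects the idea that a match in the more specific field is stronger evidence of relevance; the actual weighting is a design choice the description leaves open.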
Then, as shown in step S13, after search field and weight assignment is complete, the processor 210 executes the search and relevance score calculation module 217 to search the legal data records 221 of the legal database 22 for each candidate keyword phrase in its selected search field or fields, filtering out a plurality of candidate legal data records 221. At the same time, relevance scores are calculated according to the assigned weighting parameters and a specific algorithm. For example, for any legal data record 221 obtained by the search and filtering, if the candidate keyword phrase 「訴字第一四四二號」 appears in the record's "case number" field, the weight value calculated from the weighting parameter wa1 assigned to that field is a1; if it appears in the record's "main text of the judgment" field, the weight value calculated from the weighting parameter wa2 assigned to that field is a2. The total weight value of the candidate keyword phrase 「訴字第一四四二號」 for that record is therefore a1+a2. The total weight values of the other candidate keyword phrases for that record are calculated in the same way. The total weight values of all candidate keyword phrases are then summed to obtain the relevance score of that legal data record 221. The relevance score of each candidate legal data record 221 obtained by the search and filtering is calculated by this method. In this embodiment, the relevance score of each candidate legal data record 221 is the basis for judging how relevant that record is to the keyword group: the higher the relevance score, the more relevant the record is to the keyword group, and the lower the relevance score, the less relevant it is. It should be emphasized that the search and relevance score calculation of this step are not limited to the above embodiment and may be varied according to practical needs.
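The summation described for step S13 can be sketched as follows. The description leaves the "specific algorithm" open, so this sketch makes two labeled assumptions: the weight value of a field equals its weighting parameter whenever any variant of the phrase occurs in that field (simple presence, no term frequency), and the expanded variants of one keyword phrase count only once per field. The record layout and all numbers are hypothetical.

```python
def relevance_score(record, candidates):
    """Sum field weights over all candidate keyword phrases.

    record:     dict mapping field name -> field text
    candidates: list of (phrase_variants, field_weights) pairs, where
                field_weights maps field name -> weighting parameter
    """
    score = 0.0
    for variants, field_weights in candidates:
        for field, weight in field_weights.items():
            text = record.get(field, "")
            # Assumption: presence of any variant earns the full weight once.
            if any(variant in text for variant in variants):
                score += weight
    return score


record = {
    "case_number": "105年度訴字第1442號",
    "main_text": "原告之訴駁回",
    "court_name": "臺南地方法院",
}
candidates = [
    (["訴字第一四四二號", "訴字第1442號"], {"case_number": 3.0, "main_text": 1.0}),
    (["台南地院", "臺南地方法院"], {"court_name": 2.0}),
]
print(relevance_score(record, candidates))  # 3.0 (case number) + 2.0 (court) = 5.0
```

In the terms used above, a1 = 3.0 is earned, a2 = 1.0 is not (the phrase does not occur in the main text), and the court phrase contributes 2.0.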
In some embodiments, if a candidate legal data record 221 is a special, high-value kind of legal data such as a "precedent" (判例) or a "high-value judgment", its relevance score is further boosted according to the nature of the record itself, so that such special, high-value candidate records rank as more relevant. A "precedent" here means a judgment that has taken effect and that courts invoke as a prior example when deciding similar cases. It should be emphasized that the choice of legal data to boost in this step is not limited to "precedents" and "high-value judgments"; other special, high-value legal data may also be included.
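The boost for special records can be sketched in a few lines. The description says only that the score is increased; a multiplicative factor is an assumption made here, and the kind labels and factor values are hypothetical.

```python
# Hypothetical boost factors per record kind; values are illustrative only.
BOOST = {"precedent": 1.5, "high_value": 1.2}


def boosted_score(base_score, record_kind):
    """Scale a relevance score according to the kind of legal data record."""
    # Ordinary records keep their score unchanged (factor 1.0).
    return base_score * BOOST.get(record_kind, 1.0)


print(boosted_score(4.0, "precedent"))
```

An additive bonus would work equally well; a multiplicative factor keeps the boost proportional to how well the record already matches the keyword group.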
Then, as shown in step S14, after the search and relevance score calculation is complete, the ranking module 218 of the search server 21 is executed to rank the candidate legal data records 221 by relevance score in descending order. The candidate legal data record 221 with the highest relevance score (that is, the most relevant) is placed first, the record with the second-highest score is placed second, and so on, producing a search result ordered by relevance.
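The ranking of step S14 reduces to a descending sort on the relevance score. A minimal sketch, assuming each candidate is represented as a hypothetical `(record_id, score)` pair:

```python
def rank_by_relevance(scored_records):
    """Order (record_id, score) pairs from highest to lowest score."""
    # sorted() is stable, so records with equal scores keep their input order.
    return sorted(scored_records, key=lambda pair: pair[1], reverse=True)


ranked = rank_by_relevance([("A", 1.0), ("B", 5.0), ("C", 3.0)])
print(ranked)
```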
Finally, as shown in step S15, after the relevance sorting is completed, the data output module 219 of the server 21 is executed to output the candidate legal data 211 sorted by relevance, and the search result is provided through the communication network 20 to the user device 23 and displayed there for the user to browse, as shown in FIG. 8. In some embodiments, the data output module 219 of the processor 211 further includes a highlighting unit 219a, which presents the positions at which candidate keyword sentences appear in each candidate legal data 211 in a highlighted manner (for example, but not limited to, grayscale, color, bold, or underline marking), so that the user can quickly find the required information. In other embodiments, the data output module 219 of the processor 211 further includes a content extraction unit 219b, which extracts and presents the relatively important passages of each candidate legal data 211 (for example, the fields or paragraphs with higher weight values in that candidate legal data), so that the user can quickly find the required information.
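As one possible illustration of the highlighting unit, the sketch below wraps every occurrence of a candidate keyword sentence in a pair of markers. The function name and the `<b>` markers are assumptions; the description only requires that the positions be visually marked in some way.

```python
import re


def highlight(text, keywords, open_mark="<b>", close_mark="</b>"):
    """Wrap each occurrence of any keyword in text with the given markers."""
    keywords = [k for k in keywords if k]  # ignore empty strings
    if not keywords:
        return text
    # Longer keywords come first so that, at a given position, the
    # alternation prefers the longest keyword that matches there.
    ordered = sorted(keywords, key=len, reverse=True)
    pattern = "|".join(re.escape(k) for k in ordered)
    return re.sub(pattern, lambda m: f"{open_mark}{m.group(0)}{close_mark}", text)
```

`re.escape` keeps keywords containing regular-expression metacharacters from being interpreted as patterns.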
In some embodiments, when the intelligent search interface 11 of the user device 23 displays the query result page, it may further include a sort switch button 234 (as shown in FIG. 8). The user can click the sort switch button 234 to switch to presenting the query result sorted by date, which allows flexible use. In this embodiment, when the processor 211 receives a sort switch request from the user device 23, the sorting module 218 of the processor 211 is executed to sort the plurality of candidate legal data 211 by date (for example, judgment date, publication date, or announcement date), and the date-sorted search result is then provided through the data output module 219 to the user device 23 for display and browsing. In some embodiments, when the intelligent search interface 11 of the user device 23 displays the query result page, it may further include a recommended keyword group link (not shown), which displays at least one search link for keyword groups that may be related. When the query result is not the legal data the user is looking for, the user can click the search link of a recommended keyword group provided by the system to search with another possibly related keyword group, which makes the system more convenient to use. In this embodiment, the keyword tagging and selective expansion module 214 of the processor 211 can provide the candidate keyword groups, as links, to the recommended keyword group link of the intelligent search interface 11 of the user device 23.
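The sort switch can be sketched as a single function that takes the requested ordering as a parameter. The dictionary keys, the mode names, and the choice of newest-first ordering for dates are assumptions; the description does not specify the date direction.

```python
from datetime import date


def sort_results(results, mode="relevance"):
    """Order results by relevance score or by date.

    Each result is a dict with hypothetical keys "score" and "date".
    Date ordering is assumed to be newest first.
    """
    if mode == "date":
        return sorted(results, key=lambda r: r["date"], reverse=True)
    if mode == "relevance":
        return sorted(results, key=lambda r: r["score"], reverse=True)
    raise ValueError(f"unknown sort mode: {mode}")


results = [
    {"title": "A", "score": 9.0, "date": date(2010, 5, 1)},
    {"title": "B", "score": 4.0, "date": date(2015, 3, 2)},
]
by_date = sort_results(results, mode="date")
```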
According to the concept of the present invention, the way the relevance score is calculated is not limited to the above embodiment. FIG. 9 is a flowchart of the relevance score calculation of a preferred embodiment. As shown in FIG. 9, the relevance score calculation method includes the following steps. First, as shown in step S21, each candidate keyword sentence is determined to belong to the judgment class or the statute class according to the tag it carries. For example, the tag carried by the candidate keyword sentence "貪污治罪條例" (Anti-Corruption Act) represents "statute name", so the candidate keyword sentence is determined to belong to the category "statute name" and is further classified into the statute class. In contrast, the tag carried by the candidate keyword sentence "違反貪污治罪條例" (violation of the Anti-Corruption Act) represents "cause of action" (判決案由), so the candidate keyword sentence is determined to belong to the category "cause of action" and is further classified into the judgment class. The above is given by way of example; the manner of determining the category is not limited thereto and may be varied according to actual circumstances. Next, as shown in step S22, the search and relevance score calculation module 217 of the processor 211 applies, according to that determination, the specific relevance score algorithm corresponding to the judgment class or the statute class. In other words, if a candidate keyword sentence is classified into the statute class, a statute relevance calculation procedure is run on the legal data to obtain the relevance score of the candidate keyword sentence for that legal data; if a candidate keyword sentence is classified into the judgment class, a judgment relevance calculation procedure is run on the legal data to obtain the relevance score of the candidate keyword sentence for that legal data. Finally, as shown in step S23, the relevance scores of all candidate keyword sentences for the candidate legal data are summed to obtain the relevance score of that candidate legal data.
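Steps S21 to S23 can be sketched as a classify-dispatch-sum loop. The tag names and the two scorer callables are hypothetical stand-ins; the actual statute and judgment relevance procedures are the ones defined by the formulas of this description.

```python
# Hypothetical tag vocabularies for the two classes.
STATUTE_TAGS = {"statute_name"}
JUDGMENT_TAGS = {"cause_of_action"}


def classify(tag):
    """Step S21: map a tag to the statute class or the judgment class."""
    if tag in STATUTE_TAGS:
        return "statute"
    if tag in JUDGMENT_TAGS:
        return "judgment"
    raise ValueError(f"unrecognized tag: {tag}")


def document_score(tagged_sentences, document, statute_scorer, judgment_scorer):
    """Steps S22 and S23: score each sentence with its class scorer, then sum."""
    total = 0.0
    for sentence, tag in tagged_sentences:
        scorer = statute_scorer if classify(tag) == "statute" else judgment_scorer
        total += scorer(sentence, document)
    return total
```

Passing the scorers in as parameters keeps the dispatch logic independent of how each class computes its score.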
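Referring again to FIG. 9 and continuing from the above, the statute relevance score calculation procedure is computed with the following formula (Formula 1):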
Here, Max(matchWeight) is the maximum score, which applies when the name matches exactly; matchWeight is the score for an ordinary rule match (that is, every match that does not receive the maximum weight); tieBreak is a weighting parameter; and n is the total number of search conditions. This formula yields the relevance score (Score) of the legal data. For example, referring to Table 1, suppose the entered keyword sentence is "刑法" (Criminal Code) and the legal data is "中華民國刑法" (Criminal Code of the Republic of China). After tagging and expansion, the entered keyword sentence "刑法" becomes candidate keyword sentences such as "中華民國刑法" and "刑法". When the search condition is "刑法" or "中華民國刑法", the legal data "中華民國刑法" satisfies the search condition and the name matches exactly, so the maximum score Max(matchWeight) applies. Furthermore, when the search condition is, for instance, "begins with 刑 and ends with 法 or 條例" or "contains 刑 and 法, appearing in that order", the legal data "中華民國刑法" also satisfies the search condition, and the score for an ordinary rule match applies. These scores are combined according to Formula 1 to obtain the relevance score of the legal data "中華民國刑法" (as shown in Table 1).
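Two of the search conditions in the example above, the exact name match and the "contains, in order" rule, can be sketched as follows. The function names and condition labels are hypothetical, and the sketch only reports which conditions a statute name satisfies; it does not assign the matchWeight values or evaluate Formula 1.

```python
def contains_in_order(name, parts):
    """True if every part occurs in name, each after the previous one."""
    pos = 0
    for part in parts:
        idx = name.find(part, pos)
        if idx < 0:
            return False
        pos = idx + len(part)
    return True


def matched_conditions(name, candidate_sentences, ordered_parts):
    """List the (hypothetically labeled) conditions that a statute name meets."""
    matched = []
    if name in candidate_sentences:
        matched.append("exact_name")
    if contains_in_order(name, ordered_parts):
        matched.append("contains_in_order")
    return matched


hits = matched_conditions("中華民國刑法", ["中華民國刑法", "刑法"], ["刑", "法"])
```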
Referring again to FIG. 9 and continuing from the above, the judgment relevance score calculation procedure is computed with the following formula (Formula 2):
Here, queryNorm_{i,j} is the normalized search score, computed as (Formula 3):
tf_{i,j} is the number of times candidate keyword sentence i appears in field j, computed as (Formula 4):
idf_{i,j} is the rarity of candidate keyword sentence i across all legal data, computed as: (Formula 5) $idf_{i,j} = 1 + \log\left(\dfrac{\text{total number of documents}}{\text{number of documents containing the keyword} + 1}\right)$
fieldweight is the weighting parameter of search field j, used to distinguish the importance of the fields. This formula yields the relevance score (Score) of the legal data. For example, referring to Table 2, suppose the entered keyword sentence is "公務員利用職務貪污" (a civil servant using his or her office for corruption) and the legal data is the judgment "臺灣高等法院臺南分院刑事判決99年度上訴字第839號" (Taiwan High Court, Tainan Branch, criminal judgment, year 99, Shang-Su-Zi No. 839). The relevance score is then calculated as follows. The candidate keyword sentences obtained after tagging and expansion may include, for example but not limited to, "公務員利用職務" (civil servant using office) and "貪汙" (corruption), and the search is performed on the fields "案由" (cause of action) and "裁判內文" (judgment text). The candidate keyword sentence "公務員利用職務" does not appear in the search field "案由", so its queryNorm, tf, and idf values are all 0. The candidate keyword sentence "貪汙" appears once in the search field "案由", so Formula 4 gives tf = 1, Formula 3 then gives queryNorm = 0.94229126, and Formula 5 gives the rarity of "貪汙" across all legal data as idf = 6.9492507. Next, the queryNorm, tf, and idf values are calculated for "公務員利用職務" in the search field "裁判內文" and for "貪汙" in the search field "裁判內文"; because the calculation is similar to that described in the first half of this paragraph, it is not repeated here. Finally, once all queryNorm, tf, and idf values have been calculated, Formula 2 yields the relevance score of the legal data (the judgment) "臺灣高等法院臺南分院刑事判決99年度上訴字第839號": Score = 120.235306.
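Formula 5 above can be sketched directly in code. The description does not state the base of the logarithm; the natural logarithm is assumed here, and the function and parameter names are illustrative.

```python
import math


def idf(total_documents, documents_containing):
    """Rarity per Formula 5: 1 + log(total / (containing + 1)).

    The natural logarithm is an assumption; the source does not give the base.
    """
    return 1.0 + math.log(total_documents / (documents_containing + 1))


# A keyword found in few documents is rarer and gets a larger idf
# than one found in many documents.
rare = idf(100000, 9)
common = idf(100000, 9999)
```

The "+ 1" in the denominator keeps the expression defined when no document contains the keyword.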
In summary, the legal data search system and search processing method of the present invention provide legal data relevant to the keywords, improve search precision, offer a friendly and easy-to-operate user experience, and save the user time in searching and reading.
The present invention may be modified in various ways by those skilled in the art without departing from the scope of protection sought by the appended claims.
S10~S15: steps of the search processing method of the legal data search system
Claims (23)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW105142810A TWI595371B (en) | 2016-12-22 | 2016-12-22 | Legal information search system, search processing method and computer readable media recorder |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW105142810A TWI595371B (en) | 2016-12-22 | 2016-12-22 | Legal information search system, search processing method and computer readable media recorder |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TWI595371B (en) | 2017-08-11 |
| TW201824028A (en) | 2018-07-01 |
Family
ID=60189342
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW105142810A TWI595371B (en) | 2016-12-22 | 2016-12-22 | Legal information search system, search processing method and computer readable media recorder |
Country Status (1)
| Country | Link |
|---|---|
| TW (1) | TWI595371B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWI640882B (en) * | 2017-12-26 | 2018-11-11 | 七法股份有限公司 | Legal data exploration system and legal data exploration method |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060248440A1 (en) * | 1998-07-21 | 2006-11-02 | Forrest Rhoads | Systems, methods, and software for presenting legal case histories |
| CN1965313A (en) * | 2003-12-31 | 2007-05-16 | 汤姆森环球资源股份公司 | Systems, methods, software and interfaces for integration of case law with legal briefs, litigation documents, and/or other litigation-support documents |
| TW201128416A (en) * | 2010-02-11 | 2011-08-16 | Alibaba Group Holding Ltd | Search result generating method and information search system |
| TW201131399A (en) * | 2010-03-09 | 2011-09-16 | Alibaba Group Holding Ltd | Indexing method and apparatus |
| TW201502814A (en) * | 2013-04-08 | 2015-01-16 | Hon Hai Prec Ind Co Ltd | System and method for searching information |
- 2016-12-22: TW application TW105142810A, patent TWI595371B (en), active
Also Published As
| Publication number | Publication date |
|---|---|
| TW201824028A (en) | 2018-07-01 |