[go: up one dir, main page]

TWI459221B - The Chinese Address Translation Method of Hierarchical Architecture - Google Patents

The Chinese Address Translation Method of Hierarchical Architecture Download PDF

Info

Publication number
TWI459221B
TWI459221B TW101119973A TW101119973A TWI459221B TW I459221 B TWI459221 B TW I459221B TW 101119973 A TW101119973 A TW 101119973A TW 101119973 A TW101119973 A TW 101119973A TW I459221 B TWI459221 B TW I459221B
Authority
TW
Taiwan
Prior art keywords
level
address translation
string
translation method
keyword
Prior art date
Application number
TW101119973A
Other languages
Chinese (zh)
Other versions
TW201351170A (en
Original Assignee
Chunghwa Telecom Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chunghwa Telecom Co Ltd filed Critical Chunghwa Telecom Co Ltd
Priority to TW101119973A priority Critical patent/TWI459221B/en
Publication of TW201351170A publication Critical patent/TW201351170A/en
Application granted granted Critical
Publication of TWI459221B publication Critical patent/TWI459221B/en

Links

Landscapes

  • Document Processing Apparatus (AREA)

Description

階層式架構之中文地址轉換方法Chinese address translation method for hierarchical architecture

本發明屬於電信加值、BSS平台與技術、資料mining與分析及綜合技術之領域。The invention belongs to the fields of telecom value addition, BSS platform and technology, data mining and analysis and integrated technology.

以往地址轉換大都採用窮舉法,此法必須建立一個字典來涵蓋所有行政區名稱以及其所對應轉址後的新行政區名稱與大量的轉址規則,針對單一地址的轉換,要到字典搜尋其對應的行政區名稱,並根據轉址規則,逐一去除轉換後不滿足的地址,直到得到一個最符合條件的地址。In the past, most of the address conversions used the exhaustive method. This method must establish a dictionary to cover all administrative district names and their new administrative district names and a large number of forwarding rules. For the conversion of a single address, it is necessary to search the dictionary for its correspondence. The name of the administrative district, and according to the forwarding rules, remove the addresses that are not satisfied after the conversion one by one until you get the most qualified address.

窮舉法有兩大缺點,第一個缺點是執行效率差,因字典龐大,對數字與國字的組合方法可能有千萬種組合,然後將地址與眾多規則一一進行比對,直到找到最符合的再進行轉址,因此,針對龐大資料量的轉址作業,作業時間十分可觀;第二個缺點是執行成效不佳,若窮舉法包含的組合不夠完整,勢必導致無法全面性地修正為正確的地址,例如在自由格式下的「台北縣」組合有非常多種,「臺北縣」、「台北縣」、「苔北縣」、「北縣」、「泰北縣」…等,因人工輸入的多樣性導致隨時都能創造出一種新的案例,也因此要將十分完整的案例建置於資料庫中幾乎是一件不可能達成的事情。There are two major shortcomings in the exhaustive method. The first drawback is that the execution efficiency is poor. Because of the huge dictionary, there may be thousands of combinations of the combination of numbers and national characters. Then the address is compared with many rules one by one until it is found. The most suitable re-addressing, therefore, for the transfer of large amounts of data, the operation time is very impressive; the second shortcoming is that the implementation is not effective, if the exhaustive method contains a combination is not complete, it will inevitably lead to the inability to comprehensively Corrected to the correct address, for example, there are many combinations of "Taipei County" in free format, "Taipei County", "Taipei County", "Moss County", "North County", "Taibei County", etc. Because of the diversity of manual input, a new case can be created at any time, and it is almost impossible to put a very complete case in the database.

顯然地,上述窮舉法並未保證轉址後的地址正確性、完整性,而導致在轉址後的地址錯誤,導致信函無法遞送而喪 失使用者和商家的權益。Obviously, the above exhaustive law does not guarantee the correctness and completeness of the address after the transfer, resulting in an incorrect address after the transfer, resulting in the failure of the letter to be delivered. Loss of the rights of users and businesses.

在專利技術092129073客戶地址多重模式輸入及五碼郵遞區號轉換工具中有提及轉址概念但並無處理自由格式地址之作法,且與本系統之自由格式地址做法大不相同,專利技術092129073乃根據先前所選擇的區域逐漸縮小地址的範圍,並透過其所定義好的地址的代碼去進行郵遞區號的轉換。然而,非自由格式的地址在完整性與正確性皆較自由格式地址佳,而本系統的目的在於將先天多樣性的地址修正為更加完整與有效的地址,以彌補自由格式地址填寫不正確的情形。In the patent technology 092129073 customer address multi-mode input and five-code postal area code conversion tool, there is a reference to the concept of forwarding but does not deal with free-form addresses, and is very different from the free-form address practice of this system, patent technology 092129073 is The range of the address is gradually reduced according to the previously selected area, and the zip code is converted by the code of the defined address. However, non-free-form addresses are better in integrity and correctness than free-form addresses, and the purpose of this system is to correct innately diverse addresses to more complete and valid addresses to compensate for the incorrect filling of free-form addresses. situation.

由此可見,上述習用技術仍有不足之處,而亟待加以改良。It can be seen that the above-mentioned conventional techniques still have shortcomings and need to be improved.

本案發明人鑑於上述習用技術所衍生的各項缺點,乃亟思加以改良創新,並經多年苦心孤詣潛心研究後,終於成功研發完成本件以階層式架構轉換中文地址之方法。In view of the shortcomings derived from the above-mentioned conventional technologies, the inventor of the present invention has improved and innovated, and after years of painstaking research, he finally succeeded in researching and developing the method of converting the Chinese address into a hierarchical structure.

本發明之目的係在於提供一種高效率且不需要字典的轉址機制,使得在轉址過程中,不需花費大量的時間對地址進行一一比對,另外再配合機率統計中文地址架構特性方式進行地址關鍵字快速搜索,使得轉址作業可以高速完成。The object of the present invention is to provide a high-efficiency and dictionary-free addressing mechanism, so that in the process of forwarding, it is not necessary to spend a large amount of time to compare addresses one by one, and in addition to the probability of counting Chinese address architecture characteristics. Perform a quick search of address keywords so that the transfer job can be completed at high speed.

達成上述發明目的之設計方法,包括兩大部份:第一部份是先透過中文地址關鍵字快速搜索程序進行每一個區域層級之字串切割處理,其中區域層級與關鍵字之建立可由靜態設定或動態更改;第二部份則是利用轉換規則修正程序對第一 部份取出之字串進行修正,而正確字串值可由目標資料庫或已知資料庫提供。The design method for achieving the above object includes two major parts: the first part is to perform string cutting processing for each region level through the Chinese address keyword quick search program, wherein the establishment of the regional level and the keyword can be statically set. Or dynamically change; the second part is to use the conversion rule to correct the program to the first Some of the extracted strings are corrected, and the correct string values can be provided by the target database or a known database.

一種階層式架構之中文地址轉換方法,其資料來源係為自由之格式,並存放著具有連續且存在階層式關係之字串,包含以下步驟:a.先依統計方法設定當前區域層級之關鍵字出現於字串中各個位置之機率值;b.計算當前該區域層級之各個該關鍵字於每個位置之該機率值總和,若該機率值總和中較高之位置字元包含於當前該區域層級之該關鍵字,該位置係為當前該區域層級之邊界值,否則將繼續尋找該機率值總和次高之該位置且包含於當前該區域層級之該關鍵字;c.最後依據該區域層級之該邊界值進行切割,得出各個該區域層級之字串。A Chinese-language address translation method for hierarchical architecture, the data source is a free format, and stores a string with continuous and hierarchical relationship, including the following steps: a. first setting a keyword of the current regional level according to a statistical method a probability value appearing at each position in the string; b. calculating a sum of the probability values of each of the keywords at the current level of the region at each position, if the higher position character in the sum of the probability values is included in the current region The keyword of the level, which is the boundary value of the current level of the region, otherwise it will continue to search for the location where the probability value is the second highest and is included in the current level of the region; c. Finally, according to the region level The boundary value is cut to obtain a string of each of the region levels.

其中該區域層級係為一種具有階層式關係之關鍵字,該區域層級與該關鍵字之建立係由靜態設定或動態更改,該機率值係為目標資料庫或搭配已知資料庫,該目標資料庫係為存放待處理資料的儲存空間,該已知資料庫係由第三方提供,其中該第三方提供係為郵局地址之資料庫,該目標資料庫之欄位係包含郵遞區號及地址,該已知資料庫中之欄位不限於郵遞區號、地址與區域層級,其中更包含將輸出之各個區域層級字串進行處理,並將字串修正成正確字串值為處理方式,而該正確字串值係由該目標資料庫或是該已知資料庫提供。The region level is a keyword with a hierarchical relationship, and the establishment of the region hierarchy and the keyword is statically set or dynamically changed, and the probability value is a target database or a matching database, and the target data is The library is a storage space for storing the data to be processed, and the known database is provided by a third party, wherein the third party provides a database of post office addresses, and the target database field includes a postal area code and an address. It is known that the fields in the database are not limited to the postal area code, the address and the area level, and further include processing the output layer level string and correcting the string to the correct string value, and the correct word The string value is provided by the target database or the known database.

本發明所提供之方法,與其他習用技術相互比較時,更具備下列優點:The method provided by the present invention has the following advantages when compared with other conventional technologies:

1.本發明之特點在於避免在本地端儲存大量字典以及解決習用技術耗時的轉址問題。1. The invention is characterized by avoiding storing a large number of dictionaries on the local end and solving the time-consuming transfer problems of the conventional technology.

2.本發明可將先天多樣性的地址修正為更加完整與有效的地址,以彌補自由格式地址填寫不正確的情形。2. The present invention can correct the innately versatile address to a more complete and valid address to compensate for the incorrect filling of the free-form address.

3.本發明可降低退信比率以避免多餘郵資花費,提高有效寄件比率。3. The present invention can reduce the bounce rate to avoid excess postage costs and increase the effective mailing ratio.

4.本發明可確保轉址後地址資訊之正確性和完整性,防止信函無法遞送的事件發生,保障使用者和商家的權益。4. The invention can ensure the correctness and integrity of the address information after the transfer, prevent the occurrence of the event that the letter cannot be delivered, and protect the rights of the user and the merchant.

如圖一,本發明設計方法之實施方式可分為中文地址關鍵字快速搜索程序1與轉換規則修正程序2,透過郵遞區號:82442,地址:高市煙曹區角宿村安新路360號為例,係說明如下。As shown in Figure 1, the implementation method of the design method of the present invention can be divided into a Chinese address keyword fast search program 1 and a conversion rule correction program 2, through the postal code: 82442, address: 360 Anxin Road, Jiaosu Village, Yancao District, Gao City As an example, the description is as follows.

請參閱圖二所示,中文地址關鍵字快速搜索程序1,係由搜索索引元件11、邊界值偵測元件12、關鍵字檢查單元13、區域層級檢查單元14及地址切割元件15所組成;為了可正確切割出自由格式下之地址,此程序採用關鍵字位置機率統計的方式進行切割;首先,針對不同之區域層級制訂該區域層級應出現的關鍵字,區域層級係用關鍵字的等級進行劃分,例如:縣、市為相同的區域層級,村、里係相同的區域層級,但縣、村為不相同的區域層級,而不相同之層級存在一個順序性關係,以Keyword(t,i)定義在第t個區域層級中會出現的第i個關鍵字。關鍵字位置機率值可經由目標資料庫或搭配已知資料庫統計出來,目標資料庫可以是存放待處理資料的儲存空間,已知資料庫可以由第三方提供,如郵局地址之資料庫等。其中目標資料庫的欄位至少包含郵遞區號及地址,已知資料庫的欄位不限於郵遞區號、地址與區域層級,由已知資料庫得知各階層所包含的關鍵字,並可透過目標資料庫做為輔助,統計出每個區域層級的關鍵字出現在地址欄位的地址字串位置之機率值,搜索索引元件11以X(t,n,i)表示該區域層級關鍵字的位置出現機率值,如下所示: Referring to FIG. 2, the Chinese address keyword fast search program 1 is composed of a search index component 11, a boundary value detecting component 12, a keyword checking unit 13, a region level checking unit 14, and an address cutting component 15; The address in the free format can be correctly cut out. This program uses the keyword position probability statistics to cut; firstly, the keywords that should appear in the region level are determined for different regional levels, and the regional level is divided by the keyword level. For example, the county and the city are at the same regional level, and the village and the middle are at the same regional level, but the county and the village are different regional levels, and the different levels have a sequential relationship with Keyword(t,i). Define the ith keyword that will appear in the tth region hierarchy. The keyword position probability value can be counted through the target database or with a known database. The target database can be a storage space for storing the pending data. The known database can be provided by a third party, such as a database of post office addresses. The target database has at least a postal code number and an address. The known database field is not limited to the postal area code, address, and regional level. The known database is used to know the keywords included in each level, and the target can be accessed through the target. As a supplement, the database counts the probability value of the address string position of each address level in the address field, and the search index component 11 indicates the position of the area level keyword by X(t, n, i). The probability value appears as follows:

將X(t,n,i)輸出至邊界值偵測元件12,搜尋出第n個位置出現機率最大的邊界值。根據每個Keyword(t,i)在第n個位置的機率值進行加總,以表示關鍵字出現最頻繁的位置,如式(2)所示:將最大的邊界位置輸出至關 鍵字檢查單元13,邊界位置係每個地址區域之長度,判斷邊界值位置之字元是否為關鍵字(Boundary_keyword),其檢查方法如下所示:Boundary_keywordKeyword(t,i)The X(t, n, i) is output to the boundary value detecting component 12, and the boundary value at which the nth position has the highest probability is searched for. According to the probability value of each Keyword(t, i) at the nth position, Indicates the most frequent location of the keyword, as shown in equation (2): Will be the largest boundary position Output to the keyword checking unit 13, the boundary position is the length of each address area, and it is judged whether the character of the boundary value position is a keyword (Boundary_keyword), and the checking method is as follows: Boundary_keyword Keyword(t,i)

若邊界值位置的字元不為本階層之關鍵字,便可去除目前之邊界值位置,且重新執行邊界值偵測元件12,找尋次大邊界值發生機率的位置;若無法在此階層尋找到關鍵字時,將會尋找下一個層級之關鍵字,一旦關鍵字檢查單元13判斷為關鍵字,則進入區域層級檢查單元14,以確保制訂的每一個區域層級皆被檢查過,若未達最大的區域層級值,將區域層級參數進行累加,以便執行下一個區域層級之邊界值偵測元件12,最後,當制定的每一個區域層級皆找出邊界值位置,執行地址切割元件15的處理程序,將此元件的輸出定義為result t ,表示第t個區域層級的切割結果,完成中文地址關鍵字快速搜索程序1。If the character at the position of the boundary value is not the keyword of the same level, the current boundary value position can be removed, and the boundary value detecting component 12 can be re-executed to find the position of the probability of occurrence of the second largest boundary value; When the keyword is reached, the keyword of the next level will be searched. Once the keyword checking unit 13 judges the keyword, the keyword checking unit 14 is entered to ensure that each of the determined regional levels is checked. The maximum area level value is accumulated by the area level parameter to execute the boundary value detecting element 12 of the next area level. Finally, when each of the determined area levels finds the boundary value position, the processing of the address cutting element 15 is performed. The program defines the output of this component as result t , which indicates the cutting result of the t-th region level, and completes the Chinese address keyword fast search procedure 1 .

轉換規則修正程序2可將上述切割結果依據不同目的進行對應之處理方式,將切割字串轉換成正確字串為此程序的處理方式之一。因自由格式的郵遞區號欄位,相較於地址欄位的正確比率相當高,轉換規則修正程序2係將中文地址關鍵字快速搜索程序1的輸出透過郵遞區號輔助修正為正確的結果,再根據已知資料庫的郵遞區號、地址與區域層級欄位, 透過Addr _String (t ,zipcode )表示利用郵遞區號取出第t個區域層級的字串,Addr _String (t ,zipcode )可由目標資料庫或是已知資料庫提供,並將result t 轉換成Addr _String (t ,zipcode )的字串。The conversion rule correction program 2 can perform the above-mentioned cutting result according to different purposes, and convert the cut string into a correct string as one of the processing methods of the program. Because the free-form postal area code field has a relatively high correct ratio compared to the address field, the conversion rule correction program 2 corrects the output of the Chinese address keyword quick search program 1 to the correct result through the postal area code, and then The zip code, address and area level field of the known database are represented by Addr _ String ( t , zipcode ) to retrieve the string of the t-th region level by using the zip code, and the Addr _ String ( t , zipcode ) can be used by the target database. Or a database provided by a known database, and the result t is converted to a string of Addr _ String ( t , zipcode ).

以上述使用者輸入的郵遞區號:82442與地址:高市煙曹區角宿村安新路360號為例,於中文地址關鍵字快速搜索程序1中,假設欲找尋的區域層級到第四層。在第一個區域層級中,關鍵字為「縣」、「市」,關鍵字發生的最大機率位置為第三個位置,取出的關鍵字為「煙」,進入關鍵字檢查單元的檢查結果發現「煙」並不是第一層的關鍵字,演算法會去除第三個之位置,到邊界值偵測元件中12重新尋找次大的機率位置為第二個位置,根據地址的第二個位置的關鍵字為「市」,符合第一個區域層級的關鍵字,因此第一個區域層級的邊界值為二,接著將區域層級參數進行累加,依此類推尋找接下來的區域層級的邊界值,在地址切割元件15中,將各個區域層級的邊界值進行切割,故得出「高市」、「煙曹區」、「角宿村」與「安新路」的切割結果,轉換規則修正程序2由郵遞區號做為輔助修正,其中郵遞區號為82442,取出Addr _String (1,82442)=高雄市,Addr _String (2,82442)=燕巢區,修正結果為「高雄市」、「燕巢區」、「角宿村」與「安新路」。Take the above-mentioned user input zip code: 82442 and address: No. 360, Anxin Road, Jiaosu Village, Yancao District, Gaocheng City. For example, in the Chinese address keyword quick search procedure 1, assume the region level to the fourth layer to be found. . In the first regional level, the keywords are “County” and “City”. The maximum probability position of the keyword is the third position, and the extracted keyword is “smoke”. The result of entering the keyword check unit is found. "Smoke" is not the first layer of keywords, the algorithm will remove the third position, to the boundary value detection component 12 to find the next largest probability position for the second position, according to the second position of the address The keyword is "City", which matches the keyword of the first regional level, so the boundary value of the first regional level is two, then the regional level parameters are accumulated, and so on, to find the boundary value of the next regional level. In the address cutting element 15, the boundary value of each region level is cut, so that the cutting results of "High City", "Yancao District", "Kuansu Village" and "Anxin Road" are obtained, and the conversion rule is corrected. 2 program by the zip code as a secondary correction, in which the zip code is 82442, out Addr _ String (1,82442) = Kaohsiung, Addr _ String (2,82442) = yanchao district, the correction result is "Kaohsiung""Yanchao District , "Spica Village" and "On the new road."

上列詳細說明乃針對本發明之一可行實施例進行具體說明,惟該實施例並非用以限制本發明之專利範圍,凡未脫離本發明技藝精神所為之等效實施或變更,均應包含於本案之專利範圍中。The detailed description of the present invention is intended to be illustrative of a preferred embodiment of the invention, and is not intended to limit the scope of the invention. The patent scope of this case.

綜上所述,本案不僅於技術思想上確屬創新,並具備習用之傳統方法所不及之上述多項功效,已充分符合新穎性及進步性之法定發明專利要件,爰依法提出申請,懇請 貴局核准本件發明專利申請案,以勵發明,至感德便。To sum up, this case is not only innovative in terms of technical thinking, but also has many of the above-mentioned functions that are not in the traditional methods of the past. It has fully complied with the statutory invention patent requirements of novelty and progressiveness, and applied for it according to law. Approved this invention patent application, in order to invent invention, to the sense of virtue.

1‧‧‧中文地址關鍵字快速搜索程序1‧‧‧Chinese address keyword quick search program

2‧‧‧轉換規則修正程序2‧‧‧ Conversion Rule Amendment Procedure

11‧‧‧搜索索引元件11‧‧‧Search index component

12‧‧‧邊界值偵測元件12‧‧‧Boundary value detection component

13‧‧‧關鍵字檢查單元13‧‧‧Keyword checking unit

14‧‧‧區域層級檢查單元14‧‧‧Regional level inspection unit

15‧‧‧地址切割元件15‧‧‧ Address Cutting Components

請參閱有關本發明之詳細說明及其附圖,將可進一步瞭解本發明之技術內容及其目的功效;有關附圖為:圖一係為本發明之階層式架構之中文地址轉換方法之流程說明圖;圖二係為本發明之階層式架構之中文地址轉換方法之中文地址關鍵字快速搜索程序圖。The detailed description of the present invention and the accompanying drawings will be further understood, and the technical contents of the present invention and the functions thereof can be further understood. FIG. 1 is a flow chart of the Chinese address translation method of the hierarchical architecture of the present invention. Figure 2 is a fast search program diagram of Chinese address keywords for the Chinese address translation method of the hierarchical architecture of the present invention.

1‧‧‧中文地址關鍵字快速搜索程序1‧‧‧Chinese address keyword quick search program

2‧‧‧轉換規則修正程序2‧‧‧ Conversion Rule Amendment Procedure

Claims (10)

一種階層式架構之中文地址轉換方法,其資料來源係為自由之格式,並存放著具有連續且存在階層式關係之字串,包含以下步驟:a.先依統計方法設定當前區域層級之關鍵字出現於字串中各個位置之機率值;b.計算當前該區域層級之各個該關鍵字於每個位置之該機率值總和,若該機率值總和中較高之位置字元包含於當前該區域層級之該關鍵字,該位置係為當前該區域層級之邊界值,否則將繼續尋找該機率值總和次高之該位置且包含於當前該區域層級之該關鍵字;c.最後依據該區域層級之該邊界值進行切割,得出各個該區域層級之字串。A Chinese-language address translation method for hierarchical architecture, the data source is a free format, and stores a string with continuous and hierarchical relationship, including the following steps: a. first setting a keyword of the current regional level according to a statistical method a probability value appearing at each position in the string; b. calculating a sum of the probability values of each of the keywords at the current level of the region at each position, if the higher position character in the sum of the probability values is included in the current region The keyword of the level, which is the boundary value of the current level of the region, otherwise it will continue to search for the location where the probability value is the second highest and is included in the current level of the region; c. Finally, according to the region level The boundary value is cut to obtain a string of each of the region levels. 如申請專利範圍第1項所述之階層式架構之中文地址轉換方法,其中該區域層級係為一種具有階層式關係之關鍵字。For example, the Chinese address translation method of the hierarchical structure described in claim 1 is that the regional level is a keyword having a hierarchical relationship. 如申請專利範圍第1項所述之階層式架構之中文地址轉換方法,其中該區域層級與該關鍵字之建立係由靜態設定或動態更改。The Chinese address translation method of the hierarchical structure described in claim 1, wherein the establishment of the regional level and the keyword is statically set or dynamically changed. 如申請專利範圍第1項所述之階層式架構之中文地址轉換方法,其該機率值係為目標資料庫或搭配已知資料庫。For example, the Chinese address translation method of the hierarchical structure described in claim 1 is the target database or the known database. 如申請專利範圍第4項所述之階層式架構之中文地址轉換方法,該目標資料庫係為存放待處理資料的儲存空間。For example, the Chinese address translation method of the hierarchical structure described in the fourth application of the patent scope is a storage space for storing the data to be processed. 如申請專利範圍第4項所述之階層式架構之中文地址轉 換方法,該已知資料庫係由第三方提供。The Chinese address of the hierarchical structure as described in item 4 of the patent application scope In other words, the known database is provided by a third party. 如申請專利範圍第6項所述之階層式架構之中文地址轉換方法,其中該第三方提供係為郵局地址之資料庫。The Chinese address translation method of the hierarchical structure as described in claim 6 wherein the third party provides a database of post office addresses. 如申請專利範圍第4項所述之階層式架構之中文地址轉換方法,其該目標資料庫之欄位係包含郵遞區號及地址。For the Chinese address translation method of the hierarchical structure described in claim 4, the field of the target database includes the zip code and the address. 如申請專利範圍第4項所述之階層式架構之中文地址轉換方法,該已知資料庫中之欄位不限於郵遞區號、地址與區域層級。For the Chinese address translation method of the hierarchical architecture described in claim 4, the fields in the known database are not limited to the postal area code, the address and the regional level. 如申請專利範圍第1項所述之階層式架構之中文地址轉換方法,其中更包含將輸出之各個區域層級字串進行處理,並將字串修正成正確字串值為處理方式,而該正確字串值係由該目標資料庫或是該已知資料庫提供。For example, the Chinese address translation method of the hierarchical structure described in claim 1 further includes processing the output layer level string and correcting the string to the correct string value, and the correct The string value is provided by the target database or the known database.
TW101119973A 2012-06-04 2012-06-04 The Chinese Address Translation Method of Hierarchical Architecture TWI459221B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW101119973A TWI459221B (en) 2012-06-04 2012-06-04 The Chinese Address Translation Method of Hierarchical Architecture

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW101119973A TWI459221B (en) 2012-06-04 2012-06-04 The Chinese Address Translation Method of Hierarchical Architecture

Publications (2)

Publication Number Publication Date
TW201351170A TW201351170A (en) 2013-12-16
TWI459221B true TWI459221B (en) 2014-11-01

Family

ID=50158020

Family Applications (1)

Application Number Title Priority Date Filing Date
TW101119973A TWI459221B (en) 2012-06-04 2012-06-04 The Chinese Address Translation Method of Hierarchical Architecture

Country Status (1)

Country Link
TW (1) TWI459221B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7320020B2 (en) * 2003-04-17 2008-01-15 The Go Daddy Group, Inc. Mail server probability spam filter
TW200933417A (en) * 2008-01-30 2009-08-01 Supergeo Technologies Inc Positioning methdo with multiple-level precisions

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7320020B2 (en) * 2003-04-17 2008-01-15 The Go Daddy Group, Inc. Mail server probability spam filter
TW200933417A (en) * 2008-01-30 2009-08-01 Supergeo Technologies Inc Positioning methdo with multiple-level precisions

Also Published As

Publication number Publication date
TW201351170A (en) 2013-12-16

Similar Documents

Publication Publication Date Title
US7769778B2 (en) Systems and methods for validating an address
Klinkmüller et al. Increasing recall of process model matching by improved activity label matching
KR102067926B1 (en) Apparatus and method for de-identifying personal information contained in electronic documents
US8965877B2 (en) Apparatus and method for automatic assignment of industry classification codes
CN107608963A (en) Chinese error correction method, device and equipment based on mutual information and storage medium
US7599930B1 (en) Concept synonym matching engine
US9996521B2 (en) Validation of formulas with external sources
CN106021410A (en) Source code annotation quality evaluation method based on machine learning
US20080208566A1 (en) Automated word-form transformation and part of speech tag assignment
CN109543410B (en) A malicious code detection method based on semantic mapping association
CN110348020A (en) A kind of English- word spelling error correction method, device, equipment and readable storage medium storing program for executing
CN110741376A (en) Automatic document analysis for different natural languages
CN109948122A (en) Error correction method and device for input text and electronic equipment
WO2009005492A1 (en) Systems and methods for validating an address
CN110837568A (en) Entity alignment method and device, electronic equipment and storage medium
TWI459221B (en) The Chinese Address Translation Method of Hierarchical Architecture
Asano et al. Detecting bad smells of refinement in goal-oriented requirements analysis
Kumar et al. A hybrid named entity recognition system for south Asian languages
CN106569994A (en) Elevator remote control device
CN115730595B (en) Method, device and medium for identifying target objects in the pharmaceutical industry to be identified
CN104317888B (en) A kind of full-text search test data generating method
Abramowicz et al. Linguistic Suite for Polish Cadastral System.
CN111309853B (en) A code search method based on structured information
US8176407B2 (en) Comparing values of a bounded domain
CN105447160B (en) Chinese name sorting method for portable equipment

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees