WO2006103777A1

WO2006103777A1 - Structured data conversion method

Info

Publication number: WO2006103777A1
Application number: PCT/JP2005/006173
Authority: WO
Inventors: Satoshi Nakashima; Junichi Odagiri; Shigeru Yoshida; Takuroh Yamaguchi
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2005-03-30
Filing date: 2005-03-30
Publication date: 2006-10-05
Anticipated expiration: 2007-09-30
Also published as: US20080091699A1; JP4216323B2; JPWO2006103777A1; EP1870821A1; EP1870821A4

Abstract

A structured data conversion device includes: an inverse conversion information generation unit which assumes that first structured data is converted into second structured data described by a language different from the first structured data and extracts data deleted when converting the first structured data into the second structured data and required when inverse-converting the converted second structured data into the first structured data, as inverse conversion information, from the first structured data; and an inverse conversion information storage unit for embedding the inverse conversion information into the second structured data when converting the first structured data into the second structured data.

Description

明細書 Specification

構造化データ変換方式 Structured data conversion method

技術分野 Technical field

[0001] 本発明は、構造化データの他の形式の構造化データへの変換の技術に関し、更に詳しくは、再変換を考慮した構造化データの変換の技術に関する。 [0001] The present invention relates to a technique for converting structured data into structured data of another format, and more particularly, to a technique for converting structured data in consideration of reconversion.

背景技術 Background art

[0002] 近年、インターネットを通して、個人、企業、自治体など、あらゆる種類のシステムが接続され、連携して、 Webサービスや EDI、 ECが行われつつある。このために、幅広い情報交換が必要になっており、データ交換、データ処理に、 XML (extensible Markup Language)力データを構造ィ匕する柔軟な表現能力を持ち、コンピュータ処理に適するため、共通基盤のフォーマットとして注目されている。 [0002] In recent years, various types of systems such as individuals, companies, and local governments are connected via the Internet, and Web services, EDI, and EC are being performed in cooperation. For this reason, a wide range of information exchange is required, and XML (extensible Markup Language) power is used for data exchange and data processing. It has a flexible expression ability to structure data and is suitable for computer processing. Is attracting attention as a format.

[0003] XMLは、 1986年に ISOで標準化された SGML (Standard Generalized Mar kup Language)をインターネットで活用し易くするために、 1998年 2月にその基本仕様 XML1. 0が W3C (World Wide Web Consortium)において策定されたものである。 [0003] In order to facilitate the use of SGML (Standard Generalized Markup Language), which was standardized by ISO in 1986, on the Internet, XML has its basic specification XML1.0 as W3C (World Wide Web Consortium) in February 1998. ).

[0004] Webページ作成言語である HTML (HyperText Markup Language)は、タグが固定で表示に特化したものとなっており、タグ情報を基にコンピュータで情報を処理したいという要件に対応できない問題があった。それに対して、 XMLは、利用者が自由にタグを定義でき、データ中の文字列に意味付けができる言語構造を持ち、コンピュータで情報処理するのに適している。 [0004] HTML (HyperText Markup Language), a Web page creation language, has a fixed tag and is specialized for display, and cannot meet the requirement to process information on a computer based on tag information was there. In contrast, XML has a language structure that allows users to freely define tags and give meaning to character strings in data, and is suitable for information processing on computers.

[0005] 現在、 XMLは広く普及しつつあり、特にサーバ'システム間の連携などで活用されている。 [0005] Currently, XML is becoming widespread, and is used particularly in cooperation between server systems.

XMLによるデータは階層をもつ構造データを柔軟に記述できるフォーマットを持ち、プログラムで機械的に検索や加工などデータ処理しやすいという利点がある。その反面、表示スタイルを記述するための HTML等とは異なり、そのままでは人が見やすく表示することはできなレ、。そのため、各種データ処理はサーバ上で XMLデータによって行ない、それを HTMLデータに変換してクライアント PCに渡し、クライアント PC上ではその HTMLデータを Webブラウザで表示するという使い方が一般的になつている。 XML data has a format that can flexibly describe hierarchical structured data, and has the advantage of being easy to process data such as searching and processing mechanically by a program. On the other hand, unlike HTML for describing the display style, it cannot be displayed easily by humans. For this reason, various data processing is performed on the server using XML data, which is converted to HTML data and passed to the client PC. On the PC, the HTML data is generally displayed on a web browser.

[0006] XMLデータから HTMLデータへの変換には標準 XML変換技術である XSLT等が広く使われている。 [0006] For converting XML data to HTML data, standard XML conversion technology such as XSLT is widely used.

構造ィヒデータの変換についての技術が開示されている文献としては例えば特許文献 1がある。 For example, Patent Document 1 discloses a technique that discloses a technique for converting structural data.

[0007] 特許文献 1に開示されている、宅配の運送状況を示す情報の提供を行なうシステムでは、検索条件が不明でも運送状況を確認できるようにするため、情報提供者とユーザ端末との間に仲介装置を設け、この仲介装置と情報提供者の情報提供装置とのデータのやり取りを行なうに際して、 HTML力 XMLへの変換や XMLから HTML への変換を行なっている。 [0007] In the system for providing information indicating the delivery status of home delivery disclosed in Patent Document 1, in order to be able to confirm the delivery status even if the search condition is unknown, the information provider and the user terminal An intermediary device is provided between them, and when the data is exchanged between the mediation device and the information provider's information provision device, conversion to HTML power XML and conversion from XML to HTML are performed.

[0008] また特許文献 2には、ソートや一部切り出し等の編集処理後に元の構造ィ匕データを再構築できるようにするため、 XMLデータにインデックスと深さを示す属性値を付加する変換を行なう点が開示されている。 [0008] In addition, Patent Document 2 discloses a conversion in which an attribute value indicating an index and depth is added to XML data so that the original structure data can be reconstructed after editing processing such as sorting or partial cutout. The point which performs is disclosed.

[0009] 上記したような XMLや HTML等の構造化データの変換の普及に伴い、 HTMLとして取得し Webブラウザで表示するだけでなぐそのデータを XMLによるデータとして取得してクライアント PC上でユーザが様々なデータ処理を行なレ、たいとレ、うニーズが出てきている。 [0009] Along with the widespread use of structured data such as XML and HTML as described above, the data obtained as HTML and simply displayed on a Web browser is acquired as XML data on the client PC. There is a growing need for users to process various data.

[0010] 図 1 (a)及び (b)に同一内容のデータを表示と情報処理の両方に用いるようにするために現在行なわれてレ、る方式を示す。 [0010] FIGS. 1 (a) and 1 (b) show the methods currently used to use the same data for both display and information processing.

同図（a)の方式は、元となる XMLデータと表示用にこの XMLデータを変換して生成した HTMLデータの両方を Webサーバに持ち、ユーザに公開する方式である。 The method in (a) in the figure is a method in which both the original XML data and the HTML data generated by converting this XML data for display are held on the Web server and made public to the user.

[0011] この方式では、 Webサーバにおいて XMLデータと HTMLデータの二つのファイルを二重管理する必要が生じ、保守コストが大きくなる。 [0011] This method necessitates double management of two files, XML data and HTML data, on the Web server, which increases maintenance costs.

また、ユーザ側では HTMLデータと XMLデータの両方を受け取らねばならず、 2 度の通信が必要となり、その分レスポンスが落ちる。 In addition, the user must receive both HTML data and XML data, which requires two communications, and the response drops accordingly.

[0012] また図 1 (b)の方式は、 XMLデータを変換して HTMLデータを生成する際に、 HT[0012] The method shown in Fig. 1 (b) uses HT when generating XML data by converting XML data.

MLデータの中に XMLデータを丸ごと坦め込み、ユーザ側ではこの HTMLデータ力 XMLデータを取り出して、情報処理を行なう。 The entire XML data is included in the ML data, and this HTML data on the user side Extracts XML data and performs information processing.

[0013] この方式では図 1 (a)の方式と異なり、上記と違い、 Webサーバで管理するファイルは HTMLデータのファイル一つだけになる力この HTMLデータは XMLデータを丸ごと坦め込む分、そのサイズは HTMLデータと XMLデータの両者を併せた以上の大きさとなってしまう。 [0013] Unlike the method shown in Fig. 1 (a), this method differs from the above in that the file managed by the Web server is only one HTML data file. This HTML data contains the entire XML data, The size is larger than the sum of both HTML data and XML data.

[0014] またユーザ側では、坦め込まれた XMLデータを HTMLデータ力抽出する手間を必要とする。 [0014] Further, the user side needs time and effort to extract the embedded XML data.

またこの他の方法として、 HTMLから XMLに逆変換する方法もある。 Another method is reverse conversion from HTML to XML.

[0015] システムの変更を最小限でこれを実現するには、 XML→HTML変換に加え、 HT[0015] To achieve this with minimal system changes, in addition to XML → HTML conversion, HT

ML→XMLへの逆変換ができると良い。しかし、 XSLT等の従来の変換技術では次の理由によりこれができない。 It should be possible to perform reverse conversion from ML to XML. However, conventional conversion technologies such as XSLT cannot do this for the following reasons.

[0016] XMLはデータの構造 ·意味を自由に定義でき、この情報が様々なデータ処理に役立ち重要である。し力し、 HTMLは表示スタイルを記述するものであり、それらの情報（XMLがもつタグと値の関係）は記述できなレ、。従って、 HTMLから XMLへの変換は、重要な情報が欠落してレ、まレ、実現できなレ、。 [0016] XML can freely define the structure and meaning of data, and this information is important for various data processing. However, HTML describes the display style, and that information (the relationship between XML tags and values) cannot be described. Therefore, conversion from HTML to XML cannot be realized because important information is missing.

[0017] 以上の点を踏まえ、本発明は、 HTML力 XMLへの逆変換を可能とした XMLから HTMLへの変換を行なうことが可能な構造化データ変換装置を提供することを課題とする。 [0017] Based on the above points, the present invention has an object to provide a structured data conversion apparatus capable of performing conversion from XML to HTML that enables reverse conversion to HTML power XML. To do.

[0018] またこの逆変換では、元の XMLデータを完全に再現できる XMLへの逆変換を可能とした XMLから HTMLへの変換を行なえる構造化データ変換装置を提供することを課題とする。 [0018] In addition, with this reverse conversion, it is an object to provide a structured data conversion apparatus that can perform conversion from XML to HTML that enables reverse conversion to XML that can completely reproduce the original XML data. To do.

特許文献 1 :特開 2002— 128232号公報 Patent Document 1: JP 2002-128232 A

特許文献 2 :特開 2004— 62600号公報 Patent Document 2: JP 2004-62600 A

発明の開示 Disclosure of the invention

[0019] 本発明による構造化データ変換装置は、第 1の構造化データを、当該第 1の構造化データとは別の言語によって記載された第 2の構造化データに変換することを前提とし、上記問題点を解決するため、逆変換情報生成部、及び逆変換情報格納部を備える。 [0020] 逆変換情報生成部は、前記第 1の構造化データを前記第 2の構造化データに変換する際に削除され、変換した前記第 2の構造化データを前記第 1の構造化データに逆変換する際に必要となるデータを、逆変換情報として前記第 1の構造化データから抽出する。 [0019] The structured data conversion apparatus according to the present invention is based on the premise that the first structured data is converted into the second structured data described in a language different from the first structured data. In order to solve the above problems, an inverse transformation information generation unit and an inverse transformation information storage unit are provided. [0020] The inverse conversion information generation unit is deleted when converting the first structured data into the second structured data, and the converted second structured data is converted into the first structured data. Data necessary for reverse conversion is extracted from the first structured data as reverse conversion information.

[0021] 逆変換情報格納部は、前記第 1の構造化データを前記第 2の構造化データに変換する際、前記逆変換情報を当該第 2の構造化データに坦め込む。 The inverse transformation information storage unit incorporates the inverse transformation information into the second structured data when converting the first structured data into the second structured data.

この構成により、変換後の第 2の構造ィ匕データ内の逆変換情報を用いて、変換前の第 1の構造化データを再現することが出来る。 With this configuration, the first structured data before conversion can be reproduced using the inverse conversion information in the second structure data after conversion.

[0022] また前記第 2の構造ィヒデータに埋め込まれている前記逆変換情報を用いて、前記第 2の構造化データを前記第 1の構造ィヒデータに逆変換する逆変換部を更に備える構成とすることも出来る。 [0022] The information processing apparatus further includes an inverse conversion unit that inversely converts the second structured data into the first structure data using the inverse conversion information embedded in the second structure data. You can also

[0023] この構成では、変換した第 2の構造化データから第 1の構造化データに逆変換を行なうことが出来る。 [0023] With this configuration, it is possible to perform inverse conversion from the converted second structured data to the first structured data.

更に、前記逆変換情報格納部は、変換後の前記第 2の構造化データを Webブラウザで表示した際に見た目に影響を与えない形で前記逆変換情報を坦め込む構成とすることが出来る。 Further, the inverse transformation information storage unit may be configured to include the inverse transformation information in a form that does not affect the appearance when the second structured data after conversion is displayed on a Web browser. I can do it.

[0024] この埋め込みの仕方としては、例えば第 2の構造化データが HTMLによるものであつた場合、アンカータグ内に、或いはコメントタグ内に坦め込む。 As an embedding method, for example, when the second structured data is in HTML, it is embedded in an anchor tag or a comment tag.

これにより、第 2の構造化データの表示に対し、逆変換情報による影響を無くすことが出来る。 As a result, the influence of the inverse transformation information on the display of the second structured data can be eliminated.

[0025] また前記逆変換情報生成部は、前記第 1の構造化データが繰り返し構造を含むとき、当該繰り返し構造全体に対する前記逆変換情報を生成し、前記逆変換情報格納部は、当該逆変換情報を前記繰り返し構造全体に対応させて坦め込む構成とすることが出来る。 [0025] Further, when the first structured data includes a repetitive structure, the reverse conversion information generation unit generates the reverse conversion information for the entire repetitive structure, and the reverse conversion information storage unit A configuration can be adopted in which conversion information is loaded corresponding to the entire repetitive structure.

[0026] この構成により、第 1の構造化データが繰り返し構造を有する場合、より効率的に逆変換データを埋め込むことが出来る。 [0026] With this configuration, when the first structured data has a repetitive structure, the inverse transform data can be embedded more efficiently.

更に、ユーザからの指示に基づいて、前記第 2の構造化データから前記第 1の構造化データの一部を逆変換する部分逆変換部を更に備える構成とすることも出来る。 [0027] この構成により、逆変換を行なうとき、ユーザに指定された一部分のみを逆変換することが出来る。 Furthermore, it may be configured to further include a partial reverse conversion unit that reversely converts a part of the first structured data from the second structured data based on an instruction from the user. [0027] With this configuration, when performing inverse transformation, only a part designated by the user can be inversely transformed.

また前記第 1の構造ィ匕データから変換した前記第 2の構造化データに対し、ユーザ力の検索条件に基づいて検索を行なう検索部を更に備える構成とすることも出来る The second structured data converted from the first structure data may further include a search unit that performs a search based on a search condition of a user force.

[0028] この構成により、変換後の第 2の構造ィヒデータに対しても検索処理を行なうことが出来る。 [0028] With this configuration, it is possible to perform a search process on the converted second structure data.

更に、前記逆変換情報は、前記第 2の構造化データを前記第 1の構造化データに逆変換する際に最低限必要となる情報、例えば、前記第 1の構造化データ内のタグのタグ名及びタグと要素内容との対応関係を示す情報とする構成とすることが出来る Further, the inverse transformation information is information required at a minimum when inversely transforming the second structured data into the first structured data, for example, a tag of the tag in the first structured data It can be configured as information indicating the correspondence between names and tags and element contents.

[0029] この構成により、埋め込む逆変換情報の大きさを最小限の大きさにすることが出来る。 [0029] With this configuration, the size of the inverse transform information to be embedded can be minimized.

また本発明は構造ィヒデータ変換装置のみならず、構造化データ変換方法、プログラム及び記憶媒体もその範囲に含む。 In addition, the present invention includes not only a structural data conversion device but also a structured data conversion method, program and storage medium.

図面の簡単な説明 Brief Description of Drawings

[0030] [図 l] (a)及び (b)は、同一内容のデータを表示と情報処理の両方に用いるようにするために現在行なわれている方式を示す図、（c)は本実施形態による方式を示す図である。 [0030] [Fig. L] (a) and (b) are diagrams showing a method currently used to use the same data for both display and information processing. It is a figure which shows the system by embodiment.

[図 2]本実施形態における構造ィヒデータ変換装置の概略構成図である。 FIG. 2 is a schematic configuration diagram of a structure-rich data conversion device in the present embodiment.

[図 3]本実施形態の構造化データ変換装置で行なわれる構造化文書の変換の例を示す図である。 FIG. 3 is a diagram showing an example of structured document conversion performed by the structured data conversion apparatus of the present embodiment.

[図 4]変換処理部によって行なわれる XMLデータから HTMLデータへの変換処理時の動作処理を示すフローチャートである。 FIG. 4 is a flowchart showing an operation process at the time of conversion processing from XML data to HTML data performed by a conversion processing unit.

[図 5] (a)は項目名表の例を示す図、（b)は項目値表の例を示す図、（c)はメモリ上に展開された HTMLスタイル指定 222の例を示す図、（d)にこの置き換えを行なった H TMLスタイルの例を示す図である。 [Fig. 5] (a) is a diagram showing an example of an item name table, (b) is a diagram showing an example of an item value table, (c) is a diagram showing an example of an HTML style specification 222 expanded in memory, ( FIG. 4D shows an example of an HTML style in which this replacement is performed.

[図 6]XMLメモリ構造の例を模式ィ匕した図である。 [図 7]逆変換処理部によって行なわれる HTMLデータから XMLデータへの逆変換処理時の動作処理を示すフローチャートである。 FIG. 6 is a diagram schematically showing an example of an XML memory structure. FIG. 7 is a flowchart showing an operation process during reverse conversion processing from HTML data to XML data performed by the reverse conversion processing unit.

[図 8]変換元の構造化文書に繰り返し構造を持つ場合の構造化データ変換装置による変換の概要を示す図である。 FIG. 8 is a diagram showing an outline of conversion by the structured data conversion device when the conversion source structured document has a repeated structure.

[図 9]変換元の構造化文書に繰り返し構造を含む場合の変換の例を示す図である。 FIG. 9 is a diagram showing an example of conversion when a conversion source structured document includes a repetitive structure.

[図 10]変換元の構造化文書に繰り返し形式があり、これを表形式の構造化文書に変換する場合の処理を示すフローチャートである。 FIG. 10 is a flowchart showing processing when a conversion source structured document has a repetitive format and this is converted into a tabular structured document.

[図 11] (a)は項目名表の例を示す図、（b)は項目表値の例を示す図、（c)はメモリ上に展開された HTMLスタイル指定の例を示す図、（d)は生成されるアンカータグの例を示す図、（e)は HTML記述の例を示す図である。 [Fig. 11] (a) is an example of item name table, (b) is an example of item table value, (c) is an example of HTML style specification expanded on memory, (d ) Is a diagram showing an example of the generated anchor tag, and (e) is a diagram showing an example of HTML description.

[図 12]本実施形態の構造化データ変換装置によって表形式の変換された構造化文書を逆変換処理部によって逆変換する場合の処理を示すフローチャートである。 FIG. 12 is a flowchart showing a process in the case where the structured document converted in the table format by the structured data conversion apparatus of the present embodiment is reverse-converted by the reverse-conversion processing unit.

[図 13]部分逆変換処理を示す図である。 FIG. 13 is a diagram showing a partial inverse transform process.

[図 14]この部分逆変換処理部による部分逆変換処理を示すフローチャートである。 FIG. 14 is a flowchart showing partial reverse conversion processing by the partial reverse conversion processing unit.

[図 15] (a)及び (b)は、本実施形態の検索処理の説明図である。 FIGS. 15A and 15B are explanatory diagrams of search processing according to the present embodiment.

[図 16]検索処理部によって行なわれる検索処理を示すフローチャートである。 FIG. 16 is a flowchart showing search processing performed by a search processing unit.

[図 17]逆変換情報を HTMLのコメントタグ内に坦め込んだ場合の例を示す図である [Fig.17] A diagram showing an example of reverse conversion information embedded in an HTML comment tag

[図 18]コンピュータのシステム環境図である。 FIG. 18 is a computer system environment diagram.

[図 19]媒体の例を示す図である。 FIG. 19 is a diagram illustrating an example of a medium.

発明を実施するための最良の形態 BEST MODE FOR CARRYING OUT THE INVENTION

[0031] 以下に図面を参照しながら本発明の一実施形態について説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

図 1 (c)は、本実施形態における構造ィ匕データ変換装置による XMLデータの HT Fig. 1 (c) shows the HT of XML data by the structure data converter in this embodiment.

MLデータへの変換を示す図である。 It is a figure which shows conversion to ML data.

[0032] 本実施形態の構造化データ変換装置では、 XMLデータを HTMLデータに変換する際、 HTML変換時に本来なら失われてしまう XMLデータに再変換する際に必要となる情報 (以下逆変換情報という）を効率よく HTMLデータ内に埋め込む。 In the structured data conversion apparatus of this embodiment, when converting XML data to HTML data, information necessary for re-converting to XML data that would otherwise be lost during HTML conversion (hereinafter referred to as reverse conversion) Information) is efficiently embedded in HTML data.

[0033] これにより、図 1 (b)に示したように元の XMLデータを丸ごと埋め込むのに比して、変換後のデータの大きさが増大するのを防ぐことができる。尚この逆変換情報は、逆変換時に最低限必要となるタグ名と要素内容との対応関係を示す情報なので、これを変換結果に埋め込んでも、変換後のデータのサイズに大きな影響はない。 [0033] As a result, compared to embedding the entire original XML data as shown in FIG. It is possible to prevent the data size after conversion from increasing. Note that this reverse conversion information is information indicating the correspondence between the tag name and element contents that are required at the time of reverse conversion. Therefore, even if this information is embedded in the conversion result, the size of the converted data is not significantly affected.

[0034] 図 2は本実施形態における構造化データ変換装置の概略構成図である。 FIG. 2 is a schematic configuration diagram of the structured data conversion apparatus according to this embodiment.

同図において、構造化データ変換装置 1は、変換処理部 11、逆変換処理部 12、部分逆変換処理部 13、及び検索処理部 14を有する。 In FIG. 1, the structured data conversion device 1 includes a conversion processing unit 11, an inverse conversion processing unit 12, a partial reverse conversion processing unit 13, and a search processing unit 14.

[0035] 変換処理部 11は、変換対象である変換元 XMLデータ 2と変換定義 3を読み込み、変換元 XMLデータ 2を変換定義 3に基づいて HTMLデータ 4に変換する。この変換処理部 11による変換結果である HTMLデータ 4には、逆変換情報 41が坦め込まれている。逆変換処理部 12は、 HTMLデータ 4の XMLデータ 2への逆変換を行なうもので、 HTMLデータ 4内の逆変換情報を用いて HTMLデータ 4力 XMLデータ 2を生成する。部分逆変換処理部 13は、変換処理部 11によって生成された HTMLデータ 4の一部を逆変換して、変換元 XMLデータ 2の部分 XMLデータ 6を生成する。尚この部分逆変換処理部 13によるの部分 XMLデータ 6の生成も逆変換情報 41を用いて行なわれる。検索処理部 14は、変換処理部 11による変換結果である HTMLデータ 4に対して検索条件 5に基づいて検索を行ない、検索結果 7を出力する。変換処理部 11によって生成された HTMLデータ 4は逆変換情報 41を内部に持ってレ、るので、 XMLデータの様な検索を行なうことができる。尚この点の詳細については後述する。 The conversion processing unit 11 reads the conversion source XML data 2 and the conversion definition 3 to be converted, and converts the conversion source XML data 2 into HTML data 4 based on the conversion definition 3. The reverse conversion information 41 is included in the HTML data 4 that is the conversion result of the conversion processing unit 11. The reverse conversion processing unit 12 performs reverse conversion of the HTML data 4 to the XML data 2, and generates HTML data 4 power XML data 2 using the reverse conversion information in the HTML data 4. The partial reverse conversion processing unit 13 performs reverse conversion on a part of the HTML data 4 generated by the conversion processing unit 11 to generate partial XML data 6 of the conversion source XML data 2. The generation of the partial XML data 6 by the partial reverse conversion processing unit 13 is also performed using the reverse conversion information 41. The search processing unit 14 performs a search on the HTML data 4 that is the conversion result by the conversion processing unit 11 based on the search condition 5 and outputs a search result 7. Since the HTML data 4 generated by the conversion processing unit 11 has the reverse conversion information 41 inside, it can be searched like XML data. Details of this point will be described later.

[0036] 尚本実施形態における構造化データ変換装置は、その構成要素として変換処理部 11、逆変換処理部 12、部分逆変換処理部 13、及び検索処理部 14の全てを備える構成のみならず、その一部のみ、例えば変換処理部 11のみや変換処理部 11及び逆変換処理部 12のみを備える構成としても実現することができる。 Note that the structured data conversion apparatus according to the present embodiment has only a configuration including all of the conversion processing unit 11, the inverse conversion processing unit 12, the partial reverse conversion processing unit 13, and the search processing unit 14 as its components. Instead, it can be realized as a configuration including only a part thereof, for example, only the conversion processing unit 11 or only the conversion processing unit 11 and the inverse conversion processing unit 12.

[0037] 図 3に、本実施形態の構造化データ変換装置で行なわれる構造化文書の変換の例を示す。同図は、 XMLデータから HTMLデータに変換を行なっている場合を例として示している、 FIG. 3 shows an example of structured document conversion performed by the structured data conversion apparatus of the present embodiment. The figure shows an example of converting XML data to HTML data.

同図では変換元となる XMLデータ 21には、データである要素内容の他に、く製品情報 >、く製品名〉、く価格 >及びく型名 >のタグ名（要素名）を情報として持っている力これらのタグと要素内容との関係は、通常 HTMLデータに変換される際に失われてしまう。 In the figure, in the XML data 21 that is the conversion source, the tag name (element name) of <Product information>, <Product name>, <Price>, and <Type name> is used as information in addition to the element content that is data. Wait The relationship between these tags and element content is usually lost when converted to HTML data.

[0038] 変換処理部 11は、この XMLデータ 21を HTMLデータ 23に変換する際に、タグ名やタグと要素内容との関係など通常失われてしまう情報を変換定義 22による指示に基づレ、て逆変換情報に変換し、またこのとき再変換に必要となる情報を HTMLデータ 23内に坦め込む。 [0038] When converting this XML data 21 into HTML data 23, the conversion processing unit 11 reads information that is normally lost, such as the tag name and the relationship between the tag and the element content, based on the instruction by the conversion definition 22. Then, the information is converted into reverse conversion information, and information necessary for reconversion at this time is included in the HTML data 23.

[0039] 図 3では、変換定義 22は、項目定義 221と HTMLスタイル指定 222を指定しており、そのうち項目定義 221は対象とする XMLデータ 21の中の各要素について、階層構造上での位置が指定されている。例えば XMLデータ 21中の要素であるく製品情報 >下位階層にあるく製品名 >は、項目定義 221では項目 1として指定されてレ、る。また HTMLスタイル指定 222は、変換後の HTMLデータ 23の構造を指定している In FIG. 3, conversion definition 22 specifies item definition 221 and HTML style specification 222, of which item definition 221 is the position in the hierarchical structure for each element in target XML data 21. Is specified. For example, the product information> product information> product name> in the lower hierarchy is specified as item 1 in the item definition 221. The HTML style specification 222 specifies the structure of the converted HTML data 23.

[0040] 変換処理部 11は、変換定義 22の項目定義 221から変換対象である XMLデータ 2 1を構成する各要素の位置についての情報を取得し、 XMLデータ 21からはその位置にあるデータ（要素内容）を取得する。例えば、項目 1の"/製品情報/製品名"という指定から、 XMLデータ 21中の上位階層が"製品"の"製品名"という項目の位置を表していることが分かり、 XMLデータ 21のその位置のデータである" CELSIUS" を取得する。 [0040] The conversion processing unit 11 obtains information on the position of each element constituting the XML data 21 to be converted from the item definition 221 of the conversion definition 22, and the data at the position from the XML data 21. Get (element content). For example, from the designation “/ Product information / Product name” in Item 1, it can be seen that the upper hierarchy in XML data 21 represents the position of the item “Product name” in “Product”. Get “CELSIUS” which is the data of the position.

[0041] そして、 HTMLデータ 23に変換する際には、 HTMLデータとして表示を行なう際に支障がないような形で、各データに対して項目定義 221の情報を、各要素の位置情報を逆変換情報として坦め込む。このための方法としては、例えば、 HTMLのハイパーリンク機能の一つであるアンカー（参照先マーク）タグ内に埋め込むことで、 Web ブラウザで表示しても見た目に影響を与えない。 [0041] Then, when converting to HTML data 23, the information of item definition 221 is reversed for each data and the position information of each element is reversed so that there is no problem in displaying as HTML data. Incorporated as conversion information. As a method for this, for example, by embedding it in an anchor (reference mark) tag which is one of HTML hyperlink functions, even if it is displayed on a Web browser, it does not affect the appearance.

[0042] 図 3の変換結果である HTMLデータ 23では、データ（要素内容）を挟んでいた開始タグと終了タグがそれぞれアンカーに置き換わっている。例えばデータ（要素内容 ) 222に対する開始タグ 221力アンカー 231に、少量タグ 213がアンカー 233に置き換わってデータ 232の前後に配置されている。 In the HTML data 23 that is the conversion result of FIG. 3, the start tag and the end tag that sandwiched the data (element content) are respectively replaced with anchors. For example, the start tag 221 force anchor 231 for the data (element content) 222 is replaced with the anchor 233 and the small amount tag 213 is replaced before and after the data 232.

[0043] このような、 HTMLデータ 23データは Webブラウザで表示すると、タグは表示されないので、表示イメージ 24のように揷入されたアンカーとして埋め込まれた逆変換情報は表示されない。 [0043] When such HTML data 23 data is displayed in a web browser, the tag is displayed. Therefore, the reverse conversion information embedded as an inserted anchor as shown in display image 24 is not displayed.

[0044] またこのような方法で XMLデータ 21から変換された、逆変換情報を内包した HTM [0044] In addition, an HTM including reverse conversion information converted from XML data 21 by such a method.

Lデータ 23は、この逆変換を行なう際にこの逆変換情報を用いることによって元の XThe L data 23 is obtained by using this inverse transformation information when performing this inverse transformation.

MLデータ 21を復元することができる。 ML data 21 can be restored.

[0045] XMLデータ 23に逆変換する際には、埋め込まれた逆変換情報を抽出し、それに従って XMLに変換する。 When performing reverse conversion to XML data 23, embedded reverse conversion information is extracted and converted to XML accordingly.

例えば図 3において、 For example, in Figure 3,

< a name = "/製品情報/価格" > <a name = "/ Product information / Price">

20万円 ¥ 200,000

< a name = "/製品情報/価格— e" > <a name = "/ Product information / Price—e">

とレ、う記述部分 234は、アンカーとして埋め込まれてレ、る逆変換情報から XMLデータ 21のタグを再現し、 The description part 234 is embedded as an anchor and reproduces the tag of the XML data 21 from the reverse conversion information.

<製品情報 > <Product information>

<価格 > 20万円 < /価格 > <Price> 200,000 yen </ Price>

< /製品情報 > </ Product Information>

とレ、う XMLデータの記述に変換できる。 It can be converted into a description of XML data.

[0046] 図 4は、変換処理部 11によって行なわれる XMLデータ 21から HTMLデータ 23への変換処理時の動作処理を示すフローチャートである。 FIG. 4 is a flowchart showing an operation process at the time of conversion processing from XML data 21 to HTML data 23 performed by the conversion processing unit 11.

以下の説明では、図 3に示した変換対象である XMLデータ 21を変換定義 22を用レ、て変換し、変換結果として HTMLデータ 23を得る場合を例として、変換処理部によって行なわれる動作を説明する。 In the following explanation, the operation performed by the conversion processing unit is shown as an example in which the XML data 21 to be converted shown in FIG. 3 is converted using the conversion definition 22 and the HTML data 23 is obtained as the conversion result. Will be explained.

[0047] 同図において処理が開始されると、まず変換対象となる XMLデータ 21と、変換定義 22をメモリ上に読み込んだ後、ステップ S1として変換定義 22を解析し、メモリ上に項目名表を生成する。 [0047] When processing is started in the figure, first, the XML data 21 to be converted and the conversion definition 22 are read into the memory, and then the conversion definition 22 is analyzed as step S1, and the item name table is stored in the memory. Is generated.

[0048] 図 5 (a)にその項目名表の例を示す。同図の項目名表は、各項目とその項目が対応する XMLデータ 21内の要素の位置の関係が表として示されてレ、る。この項目名表は、変換定義 22の項目定義 221から生成される。 [0049] 次に変換処理部 11は、ステップ S2として、変換元である XMLデータ 21を解析し、木構造を示す XMLメモリ構造をメモリ上に展開する。 [0048] Fig. 5 (a) shows an example of the item name table. The item name table in the figure shows the relationship between each item and the position of the element in the XML data 21 to which the item corresponds. This item name table is generated from the item definition 221 of the conversion definition 22. [0049] Next, in step S2, the conversion processing unit 11 analyzes the XML data 21 that is the conversion source, and expands the XML memory structure indicating the tree structure on the memory.

図 6にこの XMLメモリ構造の例を模式化した図を示す。 Figure 6 shows a schematic diagram of this XML memory structure.

[0050] 同図では、 XMLメモリ構造は、タグ名部分 61とタグ内容部分 62から構成され、タグ名部分には各データ（要素内容）の階層位置をタグ名で示しており、タグ内容部分 6[0050] In the figure, the XML memory structure is composed of a tag name portion 61 and a tag content portion 62. The tag name portion indicates the hierarchical position of each data (element content) by a tag name. 6

2には、データ（要素内容）そのものが示されている。 2 shows the data (element content) itself.

[0051] 変換処理部 11は、ステップ S3としてステップ S1で生成した項目名表と、ステップ S[0051] The conversion processing unit 11 uses the item name table generated in step S1 as step S3, and step S3.

2で生成した XMLメモリ構造から、ステップ S3として、項目値表を生成する。 The item value table is generated from the XML memory structure generated in step 2 as step S3.

図 5 (b)に項目値表の例を示す。 Figure 5 (b) shows an example of the item value table.

[0052] この項目値表は、項目名表の各項目と、 XMLデータ 21内のデータ（要素内容）との関係を示すもので、同図の例では項目 1は CELCIUS、項目 2は T1234、項目 3 は 20万円を示している。 [0052] This item value table shows the relationship between each item in the item name table and the data (element content) in XML data 21. In the example in the figure, item 1 is CELCIUS, item 2 is T1234, item 3 indicates 200,000 yen.

[0053] 次に変換処理部 11は、ステップ S4として変換定義 22の HTMLスタイル指定 222 部分をメモリに展開する。 Next, the conversion processing unit 11 expands the HTML style specification 222 part of the conversion definition 22 in the memory as step S4.

図 5 (c)にこのメモリ上に展開された HTMLスタイル指定 222の例を示す。この HT Figure 5 (c) shows an example of the HTML style specification 222 expanded on this memory. This HT

MLスタイル指定 222が、変換後の HTMLデータ 23のひな形となる。 ML style specification 222 becomes the template of converted HTML data 23.

[0054] そして変換処理部 11は、ステップ S5としてステップ S4でメモリ上に展開した HTM[0054] Then, the conversion processing unit 11 executes the HTM expanded on the memory in step S4 as step S5.

Lスタイル指定から文字列「 $項目 +数字」（ $項目 1、 $項目 2、 · · ·）を全て抽出しExtract all character strings “$ item + number” ($ item1, $ item2, ...) from L style specification

、ステップ S6として、抽出した文字列に対応する項目名とその項目名に対応する項目値を項目名表と項目値表から取得し、ステップ S7として項目名の位置情報を HTIn step S6, the item name corresponding to the extracted character string and the item value corresponding to the item name are acquired from the item name table and the item value table, and in step S7, the position information of the item name is set to HT.

MLアンカータグの" name"属性とし、また項目値をタグで挟まれたタグ内容として HThe ML anchor tag's "name" attribute and the item value sandwiched between the tags are the tag contents.

TMLスタイル指定中の文字列と置き換える。 Replace with character string in TML style specification.

[0055] 図 5 (d)にこの置き換えを行なった HTMLスタイルの例を示す。 [0055] Fig. 5 (d) shows an example of an HTML style with this replacement.

図 5 (d)は、項目 1部分を置き換えたものを示しており、記述部分 51では、項目 1の位置情報である"製品情報/製品名"がアンカータグの" name"属性 52となっており Fig. 5 (d) shows the item 1 part replaced. In the description part 51, the "product information / product name" which is the position information of the item 1 becomes the "name" attribute 52 of the anchor tag. Cage

、項目 1の項目値である CELCIUS力アンカータグの開始タグと終了タグに挟まれたタグ内容 53として記載されている。 The CELCIUS force anchor tag, which is the item value of item 1, is described as tag contents 53 sandwiched between the start tag and end tag of the anchor tag.

[0056] このような置き換えを行なった後、ステップ S8として変換処理部 11は、図 5 (d)のような置き換え後の HTMLスタイルのテキストを変換結果の HTMLデータ 23として出力して、本処理を終了する。 [0056] After such replacement, the conversion processing unit 11 performs step S8 as shown in FIG. The HTML style text after such replacement is output as HTML data 23 of the conversion result, and this process ends.

[0057] この様に本実施形態による構造化データ変換装置によるデータ変換では、変換元のデータの逆変換時に必要となる情報である逆変換情報を変換結果に坦め込むことが出来る。またこの逆変換情報が埋め込まれた変換後のデータは、 Webブラウザによって表示を行なっても、逆変換情報は表示されない。 As described above, in the data conversion by the structured data conversion apparatus according to the present embodiment, the reverse conversion information, which is information necessary for the reverse conversion of the conversion source data, can be included in the conversion result. In addition, the converted data in which the reverse conversion information is embedded does not display the reverse conversion information even if it is displayed by the Web browser.

[0058] また逆変換情報として HTMLデータに埋め込まれる情報は、タグの位置情報等なので、元の XMLデータをそのまま埋め込む場合に比して、データサイズを小さく抑えることが出来る。 [0058] The information embedded in the HTML data as the reverse conversion information is the tag position information and the like, so the data size can be reduced compared to the case where the original XML data is embedded as it is.

[0059] 図 7は、逆変換処理部 12によって行なわれる HTMLデータ 23から XMLデータ 21 への逆変換処理時の動作処理を示すフローチャートである。 FIG. 7 is a flowchart showing an operation process at the time of reverse conversion processing from HTML data 23 to XML data 21 performed by the reverse conversion processing unit 12.

以下の説明では、図 3に示した本実施形態の構造化データ変換装置によって変換された HTMLデータ 23を逆変換して、変換結果として XMLデータ 21を得る場合を例として、逆変換処理部 12によって行なわれる動作を説明する。 In the following description, the reverse conversion processing unit 12 is taken as an example in which the HTML data 23 converted by the structured data conversion apparatus of this embodiment shown in FIG. 3 is reversely converted to obtain the XML data 21 as a conversion result. The operation performed by is described.

[0060] 図 7において処理が開始されると、まず逆変換処理部 12は、処理対象である HTM Lデータ 23をメモリから読み出し、ステップ S 11としてこの HTMLデータ 23を解析し、アンカータグ部分を全て抽出する。 [0060] When the processing is started in FIG. 7, the inverse transform processing unit 12 first reads out the HTML data 23 to be processed from the memory, analyzes the HTML data 23 as step S11, and determines the anchor tag portion. Extract all.

[0061] 次にステップ S 12として逆変換処理部 12は、ステップ S 11で抽出したアンカーの "n ame"属性部分の文字列の末尾力 S"__e "となっているアンカーを求め、またステップ S 13として、ステップ S12で求めたアンカーと対となる、 "name"属性部分の文字列力ステップ S 12のアンカーの" name"属性部分の文字列の末尾から" _e "を除いた文字列のアンカーを取得する。そして逆変換処理部 12は、ステップ S 14として、ステツプ S 12とステップ S 13で取得した対のアンカーに囲まれた文字列を取得する。 [0061] Next inverse conversion unit 12 as the step S 12 determines the anchor has become "n ame" trailing force attribute part of the string S "_ _e" anchor extracted in step S 11, also In step S13, the string power of the "name" attribute part paired with the anchor obtained in step S12. The character excluding "_e" from the end of the "name" attribute part of the anchor in step S12 Get the column anchor. In step S14, the inverse conversion processing unit 12 acquires a character string surrounded by the pair of anchors acquired in step S12 and step S13.

[0062] 次に、逆変換分 12は、ステップ S 15としてステップ S 12若しくは S 13で取得したアンカーのアンカーの" name"属性の文字列を、 "/"を区切りとして分解して、図 6に示したような XMLデータのメモリ構造のタグ名部分 61を生成する。また、逆変換部 12 は、ステップ S 16として、ステップ S14で取得したアンカーに囲まれた文字列を要素内容として図 6の XMLデータのメモリ構造に加え、タグ内容部分 62を生成する。 [0063] この様にして生成された XMLデータのメモリ構造を、最後に逆変換分は、ステップ S17として XMLデータに変換しこれを XMLファイルとして出力し、本処理を終了する。 [0062] Next, the inverse transformation portion 12 decomposes the character string of the "name" attribute of the anchor anchor obtained in step S12 or S13 as step S15, with "/" as a delimiter. The tag name part 61 of the memory structure of the XML data as shown in 6 is generated. Further, in step S16, the inverse conversion unit 12 generates a tag content portion 62 in addition to the memory structure of the XML data in FIG. 6 as an element content including the character string surrounded by the anchor acquired in step S14. [0063] The memory structure of the XML data generated in this way is finally converted into XML data in step S17 for the reverse conversion, and this is output as an XML file, and this process is terminated.

[0064] この様に本実施形態における構造化データ変換装置では、 XMLデータから変換した HTMLデータに対して再変換を行なうことによって、元の XMLデータを再現すること力 Sできる。 [0064] As described above, the structured data conversion apparatus according to the present embodiment can reproduce the original XML data by re-converting the HTML data converted from the XML data.

[0065] 次に、変換対象となる変換元データにテーブルやリスト等の繰り返し部分を有する場合の変換及び逆変換について説明する。 Next, conversion and reverse conversion in the case where the conversion source data to be converted has a repetitive portion such as a table or list will be described.

例えば XMLに於いては、その特徴として繰り返し同じ構造のデータを持つ構成を含む場合が多い。 For example, XML often includes a structure with repeated data having the same structure as its characteristics.

[0066] これらの繰返し構造を持つ XMLデータを、 HTMLで表現する場合には表形式 (T ABLEタグ)やリスト形式 (ULタグ)で表されることが多レ、。 [0066] When XML data having these repeating structures is expressed in HTML, it is often expressed in a table format (T ABLE tag) or a list format (UL tag).

繰返し構造は数が多くなりがちなので、上記したように全ての項目に逆変換情報を付加するとサイズが大きくなつてしまう。本実施形態では、このような繰り返し構造を持つ場合において HTMLの表形式等に変換する場合は、より効率的に逆変換情報を埋め込めるよう、表 (若しくはリスト）全体 (繰返し構造全体）に対して一組の逆変換情報を坦め込む。 Since the number of repeated structures tends to be large, adding inverse transformation information to all items as described above increases the size. In this embodiment, when converting to an HTML table format or the like when having such a repeating structure, the entire table (or list) (the entire repeating structure) can be embedded more efficiently so that reverse conversion information can be embedded. On the other hand, it carries a set of inverse transformation information.

[0067] これにより本実施形態の構造化データ変換装置による変換では、変換対象が繰り返し部分を有するものであるとき、変換時に坦め込む逆変換情報をよりコンパクトにすることが出来る。 [0067] With this, in the conversion by the structured data conversion apparatus of the present embodiment, when the conversion target has a repeated portion, the reverse conversion information carried during conversion can be made more compact.

[0068] 図 8は、変換元の構造化文書に繰り返し構造を持つ場合の構造化データ変換装置による変換の概要を示す図である。 FIG. 8 is a diagram showing an outline of conversion by the structured data conversion device when the conversion source structured document has a repetitive structure.

構造化データ変換装置は、変換元の構造化文書 81を解析し、表等の繰り返し構造が存在する場合、これらに対する逆変換情報を 1つにまとめて変換結果となる構造化文書 82に揷入する。 The structured data converter analyzes the source structured document 81, and if there is a repeating structure such as a table, the reverse conversion information for these is put together into a structured document 82 that is the conversion result. To do.

[0069] またこの変換された構造ィ匕文書 82を逆変換する際には、表等の繰り返し構造に対して 1つにまとめて、効率よく組み込まれた逆変換情報を用いて元の構造化文書 81 を生成する。 [0070] 図 9は、変換元の構造化文書に繰り返し構造を含む場合の変換の例を示す。同図も、図 3と同様 XMLデータから HTMLデータに変換を行なっている場合を例として示している、 [0069] When the converted structure document 82 is inversely converted, the original structure is obtained by using the inversely converted information that is efficiently incorporated into the repeated structure such as a table. Generate a document 81. FIG. 9 shows an example of conversion when the conversion source structured document includes a repetitive structure. This figure also shows an example of conversion from XML data to HTML data, as in Figure 3.

同図では、変換元の構造ィ匕文書である XMLデータ 91をユーザによって定義された変換定義 92を用いて HTMLデータ 93に変換する。 In the figure, the XML data 91 which is the structure document of the conversion source is converted into HTML data 93 using the conversion definition 92 defined by the user.

[0071] 図 9の XMLデータ 91は、く型名〉、く名前〉、及びく価格 >部分の繰り返し構造 911を備え、また変換定義 92では、項目定義 921でこれらを項目として定義し、また HTMLスタイル指定 922には" MALE— TABLE"と表形式に変換することが定義されている。尚項目定義の 921の"レコード"部分は、表のレコードとなる部分を示しており、また HTMLスタイル指定 922の" MALE— TABLE"は、引数として持つ項目を行とした表形式のスタイルであることを示している。 [0071] The XML data 91 in FIG. 9 has a repeating structure 911 of type name>, name>, and price> parts, and conversion definition 92 defines these as items in item definition 921. In addition, HTML style specification 922 defines "MALE-TABLE" and conversion to a table format. The “record” part of 921 of the item definition indicates the part that becomes the record of the table, and “MALE—TABLE” of the HTML style specification 922 is a tabular style with the items as arguments as rows. It shows that there is.

[0072] この変換定義 92に基づいて変換処理部 11が、 XMLデータ 91を変換すると、変換結果として HTMLデータ 93が得られる。 When the conversion processing unit 11 converts the XML data 91 based on the conversion definition 92, HTML data 93 is obtained as a conversion result.

この HTMLデータ 93では、変換定義 92で定義されたように各行に XMLデータ 91 のく型名〉、く名前〉、及びく価格〉の要素内容を持つ表を示している。またこの HTMLデータ 93には、表の先頭部分と終わり部分に逆変換情報 931及び 932がァンカーとして埋め込まれてレ、る。 This HTML data 93 shows a table having the element contents of <type name>, <name>, and <price> in XML data 91 as defined in the conversion definition 92. Also, in this HTML data 93, reverse conversion information 931 and 932 are embedded as anchors at the beginning and end of the table.

[0073] この様に、変換元の構造化文書が繰り返し構造を持つ場合であった場合、逆変換に用いる逆変換情報をまとめて効率よく変換結果の構造化文書に埋め込むことが出来る。 [0073] In this way, when the conversion source structured document has a repetitive structure, it is possible to efficiently embed the reverse conversion information used for the reverse conversion and embed it in the structured document of the conversion result.

また表示結果 94は、変換結果である HTMLデータ 93を Webブラウザによって表示したものであるが、逆変換情報はアンカーとして埋め込まれているので、表示には何ら影響を与えない。 The display result 94 is the HTML data 93 that is the conversion result displayed by the Web browser. However, since the reverse conversion information is embedded as an anchor, there is no influence on the display.

[0074] 図 10は、上記したように変換元の構造ィヒ文書に繰り返し形式があり、これを表形式の構造化文書に変換する場合の処理を示すフローチャートである。 [0074] FIG. 10 is a flowchart showing processing when the conversion source structured document has a repetitive format as described above, and this is converted to a tabular structured document.

以下の説明では、図 9に示した変換対象である XMLデータ 91を変換定義 92を用レ、て変換し、変換結果として HTMLデータ 93を得る場合を例として、変換処理部によって行なわれる動作を説明する。 [0075] 同図において処理が開始されると、まず変換対象となる XMLデータ 91と、変換定義 92をメモリ上に読み込んだ後、ステップ S21として変換定義 92を解析し、メモリ上に項目名表及び項目表値を生成する。尚この処理は図 4のステップ Sl、 S2と同じ処理である。 In the following explanation, the operation performed by the conversion processing unit is shown by taking as an example the case of converting the XML data 91 to be converted shown in FIG. 9 using the conversion definition 92 and obtaining the HTML data 93 as the conversion result. Will be explained. [0075] When the processing is started in the figure, first, the XML data 91 to be converted and the conversion definition 92 are read into the memory, then the conversion definition 92 is analyzed as step S21, and the item name table is stored in the memory. And item table values are generated. This process is the same as steps Sl and S2 in Fig. 4.

[0076] 図 11 (a)にこのとき生成される項目名表、同図（b)に項目表値の例を示す。同図（a )の項目名表は、各項目とその項目が対応する XMLデータ 91内の要素の位置の関係が表として示されている。この項目名表は、変換定義 92の項目定義 291から生成される。 FIG. 11A shows an item name table generated at this time, and FIG. 11B shows an example of the item table value. The item name table in FIG. 6A shows the relationship between each item and the position of the element in the XML data 91 to which the item corresponds. This item name table is generated from the item definition 291 of the conversion definition 92.

[0077] この図 11 (a)の項目名表と図 5 (a)の項目名表を比較すると図 11 (a)の項目名表には、項目 1〜3の他に"レコード"が項目として加わっている。 [0077] When comparing the item name table in Fig. 11 (a) with the item name table in Fig. 5 (a), the item name table in Fig. 11 (a) includes "Record" as an item in addition to items 1 to 3. Yes.

また図 5 (b)の項目標値と比較すると、図 11 (b)の項目表値は、変換元の XMLデータ 91が繰り返し構造を持つので、 1つの項目に複数のデータが対応している。 Compared with the target value of the item in Fig. 5 (b), the item table value in Fig. 11 (b) has a repetitive structure in the source XML data 91, so multiple items correspond to one item. ing.

[0078] 次に変換処理部 11は、ステップ S22として、変換定義 92中の HTMLスタイル指定[0078] Next, the conversion processing unit 11 specifies the HTML style in the conversion definition 92 as step S22.

922をメモリ上に展開する。 The 922 is expanded on the memory.

図 11 (c)にこのメモリ上に展開された HTMLスタイル指定 922の例を示す。この H Figure 11 (c) shows an example of HTML style specification 922 expanded on this memory. This H

TMLスタイル指定 922が、変換後の HTMLデータ 93のひな形となる。同図では、 MThe TML style specification 922 becomes a template for the HTML data 93 after conversion. In the figure, M

AKE— TABLE ( $項目 1 , $項目 2, $項目 3)と、項目値表で定義された各項目 1AKE— TABLE ($ item 1, $ item 2, $ item 3) and each item 1 defined in the item value table

〜項目 3のデータを行とする表形式として HTMLデータ 93が形成されることが指定されている。 ~ It is specified that HTML data 93 is formed in a table format with the data of item 3 as rows.

[0079] 変換処理部 11は、ステップ S23として、メモリ上に展開した HTMLスタイル指定の" MAKE_TABLE ( · · · ) "の引数である文字歹 $項目 +数字」（ $項目 1、 $項目 2 、 · · ·）を全て抽出し、ステップ S24として、抽出した文字列に対応する項目名とその項目名に対応する項目値を項目名表と項目値表から取得する。 [0079] In step S23, the conversion processing unit 11 performs the character 歹 $ item + number ”($ item1, $ item2, which is an argument of“ MAKE_TABLE (····)) ”specified in the HTML style expanded in memory. ····) are extracted, and in step S24, the item name corresponding to the extracted character string and the item value corresponding to the item name are obtained from the item name table and the item value table.

[0080] 次に変換処理部 11は、ステップ S25として項目名 92の"レコード"の位置情報（図 9 では"/製品名/製品"）力 HTMLアンカータグの" name"属性の文字列を生成する。 [0080] Next, the conversion processing unit 11 generates the character string of the "name" attribute of the HTML anchor tag in step S25 as the position information of the "record" of the item name 92 ("/ product name / product" in Fig. 9) To do.

[0081] 図 11 (d)にステップ S25で生成されるアンカータグの例を示す。同図中 111が開始タグを表わしており変換後の HTMLデータ 91の表部分の開始位置に坦め込まれる。また同図中 112が終了タグに対応しており、 HTMLデータ 91の表部分の終了位置に坦め込まれる。 FIG. 11 (d) shows an example of the anchor tag generated in step S25. In the figure, 111 represents the start tag, and is loaded at the start position of the table part of the converted HTML data 91. . In addition, 112 in the figure corresponds to the end tag and is loaded at the end position of the table portion of the HTML data 91.

[0082] 次に変換処理部 11は、ステップ S26としてアンカー 111の" s_rowl =header"から、く製品一覧 > <製品 >の下位要素の要素名であるく型名 >、く名前 >、及び <価格 >がヘッダーとしてテーブルの 1行目に来るように、 HTML記述を作成する。そして次にステップ S27として、 HTMLスタイル指定 921の" MAKE_TABLE"中で指定された順序で、ステップ S26で作成した HTML記述の表の 2行目移行に対応する部分に、項目名に対応する項目値を並べて追カ卩し、 HTML記述を作成する。 [0082] Next, in step S26, the conversion processing unit 11 starts from “s_rowl = header” of the anchor 111, and selects the type name that is the element name of the lower element of the product list> <product >>, name>, and Create an HTML description so that <price> comes as the header in the first row of the table. Then, in step S27, in the order specified in “MAKE_TABLE” of HTML style specification 921, the item value corresponding to the item name is displayed in the portion corresponding to the second row of the table of the HTML description created in step S26. Create an HTML description.

[0083] 図 11 (e)にこのステップ S26、 S27によって作成される HTML記述を示す。 [0083] FIG. 11 (e) shows the HTML description created in steps S26 and S27.

同図中 113部分力ステップ S26で作成される表の 1行目のヘッダー部分に対応し、 114部分が表の 2行目以降の部分に対応する。 In the figure, 113 partial force corresponds to the header portion of the first row of the table created in step S26, and 114 portion corresponds to the second and subsequent portions of the table.

[0084] このような HTML記述をメモリ上に作成後、変換処理部 11は、ステップ S28として、図 11 (e)の HTML記述の先頭部分（図 11 (e)の < table >タグの直後）に開始タグ 1 11のアンカーを、最終部分（図 11 (e)のく/ table >タグの直前）に終了タグ 112のアンカーを付加し、この HTML記述を HTMLスタイル指定 922の" MAKE— TABL E ( · · · ) "部分と置き換える。そしてステップ S29として、置き換えた結果を変換結果である HTMLデータ 93としてファイルに出力し、本処理を終了する。 [0084] After creating such an HTML description in the memory, the conversion processing unit 11 performs step S28 as the first part of the HTML description in Fig. 11 (e) (immediately after the <table> tag in Fig. 11 (e)). The start tag 1 11 anchor is added to the final part (Fig. 11 (e) / immediately before the table> tag), and the end tag 112 anchor is added. This HTML description is specified in the HTML style specification 922 "MAKE— TABL E (· · ·) "Replace with part. In step S29, the replacement result is output to a file as HTML data 93, which is the conversion result, and this process ends.

[0085] この様に本実施形態による構造化データ変換装置によるデータ変換では、変換元の構造化文書に繰り返し構造があっても、変換後のデータが大きくならないよう、効率良く逆変換情報を埋め込んだ変換を行なうことができる。 In this way, in the data conversion by the structured data conversion apparatus according to the present embodiment, the reverse conversion information is efficiently converted so that the converted data does not become large even if the conversion source structured document has a repeated structure. You can do embedded conversions.

[0086] 次に、図 10の処理によって変換された構造ィ匕文書の逆変換処理について説明する。 Next, the inverse conversion process of the structure document converted by the process of FIG. 10 will be described.

図 12は、本実施形態の構造化データ変換装置によって表形式の変換された構造化文書を逆変換処理部 12によって逆変換する場合の処理を示すフローチャートである。 FIG. 12 is a flowchart showing a process in the case where the reverse conversion processing unit 12 performs reverse conversion on the structured document converted in the table format by the structured data conversion apparatus of this embodiment.

[0087] 以下の説明では、図 9に示した変換処理部 11によって変換された HTMLデータ 9 3を逆変換し、変換結果として XMLデータ 91を得る場合を例として、逆変換処理部 1 2によって行なわれる動作を説明する。 [0088] 図 12において処理が開始されると、逆変換処理部 12は、処理対象である HTML データ 93をメモリに読み込み、ステップ S31としてこの HTMLデータ 93を解析し、ァンカータグ部分を全て抽出する。 [0087] In the following description, an example in which the HTML data 93 converted by the conversion processing unit 11 shown in FIG. The operation performed will be described. [0088] When the processing is started in FIG. 12, the inverse conversion processing unit 12 reads the HTML data 93 to be processed into the memory, analyzes the HTML data 93 as step S31, and extracts all the anchor tag portions. .

[0089] 次にステップ S32として逆変換処理部 12は、ステップ S31で抽出したアンカーの "n ame"属性部分の文字列の末尾力 S"__e "となっているアンカーを取得する。そしてステツプ S33として、ステップ S32で取得したアンカーの" name"属性部分の文字列から末尾の" _e "を除いた文字列に、 "s_row' 'が続くアンカーの" name"属性を持つアンカーを取得する。そして逆変換処理部 12は、ステップ S34として、このステップ S 32と S33で取得したアンカーに囲まれた文字列である HTML記述を取得する。 [0089] Next inverse conversion unit 12 as step S32 acquires an anchor that is the "n ame" trailing force attribute part of the string S "_ _e" anchor extracted in step S31. Then, as step S33, an anchor having a "name" attribute of an anchor in which "s_row '" follows the character string obtained by removing "_e" at the end from the character string of the "name" attribute part of the anchor acquired in step S32. In step S34, the inverse conversion processing unit 12 acquires an HTML description that is a character string surrounded by the anchors acquired in steps S32 and S33.

[0090] 次に逆変換処理部 12は、ステップ S35として、ステップ S33で取得したアンカーの" name"属性部分の文字列部分の" s— row"と" = header"の間の番号を所得する。本例では" name"属性部分の文字列は"製品一覧/製品— s— rowl =header"なので、この番号は" 1 "となる。 [0090] Next, in step S35, the inverse transformation processing unit 12 obtains a number between "s-row" and "= header" in the character string portion of the "name" attribute portion of the anchor acquired in step S33. . In this example, the character string of the “name” attribute is “product list / product—s—rowl = header”, so this number is “1”.

[0091] 次に逆変換処理部 12は、ステップ S36として、ステップ S35で取得した番号に該当するく table >タグの下位層の行（く tr>タグ部分）について、項目（く td>タグ）の値を XMLタグ値として抽出する。本例の場合このステップで抽出されるのは、 XML タグく型名>、 <名前 >、及びく価格 >となる。 [0091] Next, in step S36, the inverse transformation processing unit 12 sets the item (td> tag) for the row in the lower layer of the table> tag (tr> tag part) corresponding to the number acquired in step S35. The value of is extracted as an XML tag value. In this example, the XML tag type name, <name>, and price are extracted in this step.

[0092] 次に逆変換処理部 12は、ステップ S37として、ステップ S35で取得した番号に該当しなレ、（本例の場合テーブルの 2行目以降に該当する） < table >タグの下位層の行 (く tr>タグ部分）について、項目（く td>タグ）の値を XMLタグ値として抽出する。 [0092] Next, in step S37, the inverse transformation processing unit 12 does not correspond to the number acquired in step S35 (in this example, it corresponds to the second and subsequent rows of the table) <table> lower layer of the tag For the line (<tr> tag part), the value of the item (<td> tag) is extracted as the XML tag value.

[0093] そして、ステップ S38として、ステップ S33、 S36、 S37で抽出したアンカーの" nam e"属性部分、 XMLタグ名、 XMLタグ値から、 XMLデータを生成し、これを逆変換結果である XMLファイルとして出力し、本処理を終了する。 [0093] Then, as step S38, XML data is generated from the "name" attribute part of the anchor extracted in steps S33, S36, and S37, the XML tag name, and the XML tag value, and this is converted into XML as a result of reverse conversion. Output as a file and end this process.

[0094] この様に本実施形態では、表形式を含む形の構造化文書に対して逆変換を行なつても、元の繰り返し構造を持つ構造ィヒ文書を再現することができる。 As described above, in the present embodiment, a structured document having an original repetitive structure can be reproduced even if inverse transformation is performed on a structured document including a tabular format.

尚上記説明では、繰り返し構造を持つ構造化文書を表形式の構造化文書に変換していたが、表形式ではなくリスト形式の構造化文書 (HTMLの場合 <ul > )に変換した場合に於いても、同様に逆変換情報を坦め込むことが出来、また逆変換によって元の構造化文書を再現することができる。 In the above description, a structured document with a repetitive structure has been converted to a tabular structured document. However, when converted to a list structured document (<ul> for HTML) instead of a tabular format. However, the reverse conversion information can be loaded in the same way, The original structured document can be reproduced.

[0095] 次に、部分逆変換処理部 13によって行なわれる部分逆変換処理について説明する。 Next, the partial reverse conversion process performed by the partial reverse conversion processing unit 13 will be described.

この部分逆変換処理は、変換処理部 11によって変換された変換結果の一部分を逆変換するものである。 This partial reverse conversion process is to reverse-convert a part of the conversion result converted by the conversion processing unit 11.

[0096] 部分逆変換処理では変換結果の HTMLデータの中から特定部分のみ抽出し、 X MLデータに逆変換する。この部分逆変換処理は、本実施形態の構造化データ変換装置で変換された HTMLデータから特定の部分のみを逆変換してデータとして欲しレ、とき、例えば、製品情報を表示するページの HTMLデータの中から、製品の仕様詳細は除いて概要情報のみを XMLデータとして取得したい場合などに用いることができる。 [0096] In the partial reverse conversion process, only a specific part is extracted from the HTML data of the conversion result, and is converted back to XML data. This partial reverse conversion processing is performed by reversely converting only a specific part from the HTML data converted by the structured data conversion apparatus of this embodiment, and for example, HTML data of a page displaying product information. It can be used when you want to obtain only summary information as XML data, excluding product specification details.

[0097] 図 13にこの部分逆変換処理を示す。 FIG. 13 shows this partial inverse transform process.

上記したように変換処理部 11は、変換元の構造ィ匕文書 131に対して変換定義 132 に基づいて、変換処理を行ない変換結果 133を生成する。 As described above, the conversion processing unit 11 performs a conversion process on the conversion source structure document 131 based on the conversion definition 132 and generates a conversion result 133.

[0098] そして上記説明では、この変換結果 133の構造化文書をそのまま逆変換を行ない元の構造化文書 131を生成していたが、部分逆変換処理では、構造化文書 133全体を逆変換するのではなぐユーザに抽出部分指定ファイル 134によって一部分を指定させ、指定された部分のみを抽出して逆変換する。 [0098] In the above description, the structured document of the conversion result 133 is inversely converted to generate the original structured document 131. However, in the partial reverse conversion process, the entire structured document 133 is inversely converted. Instead, the user designates a part by the extracted part designation file 134, extracts only the designated part, and performs reverse conversion.

[0099] 例えば図 13では、抽出部分指定ファイル 134によって "Z製品情報 Z製品名"が指定されると、部分逆変換処理部 13は、階層位置が" Z製品情報 Z製品名"に対応する部分を抽出して逆変換を行ない、抽出部分指定ファイル 134によって指定された要素部分を逆変換結果である部分 XMLデータ 135として出力する。 For example, in FIG. 13, when “Z product information Z product name” is specified by the extracted part specification file 134, the partial reverse conversion processing unit 13 corresponds to the hierarchical position “Z product information Z product name”. The part is extracted and reversely converted, and the element part specified by the extracted part specifying file 134 is output as the partial XML data 135 which is the reverse conversion result.

[0100] 図 14は、この部分逆変換処理部 13による部分逆変換処理を示すフローチャートである。 FIG. 14 is a flowchart showing the partial reverse conversion process performed by the partial reverse conversion processing unit 13.

同図において処理が開始されると、部分逆変換処理部 13は、まずステップ S41として、逆変換処理部 12を起動する。 When the process is started in the figure, the partial inverse transform processing unit 13 first activates the inverse transform processing unit 12 as step S41.

[0101] 次に部分逆変換処理部 13は、ステップ S42としてユーザによって抽出部分が指定された抽出部分ファイル 134が存在するかどうかを判断する。そしてその結果、抽出部分ファイル 134が存在しなければ (ステップ S42、 No)、部分逆変換でなく通常の逆変換を行なうので、ステップ S43としてステップ S41で起動した逆変換処理部 12処理を渡し、部分逆変換処理部 13は処理を終了する。このとき、処理を渡された逆変換処理部は、前述した図 7のステップ S11からの処理を開始し、逆変換処理を行なう Next, the partial inverse transform processing unit 13 determines whether or not the extracted partial file 134 in which the extracted part is designated by the user exists in step S42. And as a result, extraction If partial file 134 does not exist (step S42, No), normal reverse conversion is performed instead of partial reverse conversion, so the reverse conversion processing unit 12 process started in step S41 is passed as step S43, and partial reverse conversion processing is performed. Unit 13 ends the process. At this time, the inverse conversion processing unit that has been passed the process starts the process from step S11 of FIG. 7 described above, and performs the inverse conversion process.

[0102] ステップ S42において、抽出部分ファイル 134が存在していれば（ステップ S42、 Y es) ,ユーザによって部分抽出を行なう箇所が指定されているので、ステップ S44として、抽出部分ファイル 134をメモリから読み出し、この抽出部分ファイル 134に指定されている抽出部分指定の文字列を取得する。 [0102] In step S42, if the extracted partial file 134 exists (step S42, Yes), the location where partial extraction is to be performed is specified by the user. Therefore, in step S44, the extracted partial file 134 is stored in the memory. The character string of the extraction part specification specified in this extraction part file 134 is acquired.

[0103] そして次に部分逆変換処理部 13は、ステップ S45として、 "name"属性に抽出部分ファイル 134で指定された抽出部分指定の文字列と同じ文字列を持つアンカーを、変換結果である HTMLデータ 133から抽出する。 [0103] Then, the partial reverse conversion processing unit 13 selects an anchor having the same character string as the extracted part designation character string specified in the extraction part file 134 in the "name" attribute as the conversion result in step S45. Extract from some HTML data 133.

[0104] そして、ステップ S45で抽出したアンカーと共に処理を逆変換処理部 12に渡し、部分逆変換処理部 13は処理を終了する。そして処理を受け渡された逆変換処理部 12 は、部分逆変換処理部 13から受け取ったアンカーを用いて、上記した図 7のステップ S 12以降の処理を行なレ、、結果として生成された XMLの部分ファイルを逆変換結果として出力する。 Then, the process is passed to the inverse transformation processing unit 12 together with the anchor extracted in step S45, and the partial inverse transformation processing unit 13 ends the process. Then, the inverse transformation processing unit 12 that has passed the processing uses the anchor received from the partial inverse transformation processing unit 13 to perform the processing after step S12 in FIG. 7 described above, and is generated as a result. Output XML partial file as reverse transformation result.

[0105] この様に本実施形態の構造化データ変換装置では、変換された HTMLデータを逆変換して XMLデータに戻す際、ユーザから指定されたデータのみを逆変換することが出来る。 As described above, in the structured data conversion apparatus of the present embodiment, when the converted HTML data is reversely converted back to XML data, only the data designated by the user can be reversely converted.

[0106] 次に、検索処理部 14によって行なわれる検索処理について説明する。 Next, the search process performed by the search processing unit 14 will be described.

本実施形態による構造化データ変換装置では、変換結果である HTMLデータに対して、 XMLデータと同様の検索処理を行なうことができる。 In the structured data conversion apparatus according to the present embodiment, the same search processing as XML data can be performed on the HTML data that is the conversion result.

[0107] 通常 HTMLデータは、表示用に特化した言語で、内部のデータについては表示形態についての情報のみを持ち、その内容については検索を行なうことが出来ない。しかし、本実施形態の構造化データ変換装置によって変換された HTMLデータは、内部に逆変換情報を含んでおり、この逆変換情報を用いて内部データに対する検索を行なうことができる。 [0108] 図 15 (a)、図 15 (b)は、本実施形態の検索処理の説明図である。 [0107] Normally, HTML data is a language specialized for display. Internal data has only information about the display form, and the contents cannot be searched. However, the HTML data converted by the structured data conversion apparatus of the present embodiment includes reverse conversion information therein, and the internal data can be searched using this reverse conversion information. FIGS. 15 (a) and 15 (b) are explanatory diagrams of search processing of the present embodiment.

通常 HTMLデータでは、同図（a)に示すように内部データ（同図の「100万円」等）は、何を示したデータであるのかは不明であり、この内部データが予算に対するデータであってもこの HTMLデータに対し、例えば"予算"等のデータの内容を検索する意味的な検索を行なうことは出来ない。 Normally, in HTML data, as shown in (a) in the figure, it is unclear what the internal data (such as “1 million yen” in the figure) indicates, and this internal data is the data for the budget. However, it is not possible to perform a semantic search on this HTML data, such as searching for the contents of data such as "Budget".

[0109] それに対して、同図（b)に示す本実施形態の構造化データ変換装置によって変換された HTMLデータ 152は、内部データに対応づけて逆変換情報がアンカーとして埋め込まれており、この逆変換情報には変換元の XMLデータの要素名が含まれている。 [0109] On the other hand, the HTML data 152 converted by the structured data conversion apparatus of this embodiment shown in (b) of the figure has the inverse conversion information embedded as an anchor in association with the internal data. The reverse conversion information includes the element name of the source XML data.

[0110] 例えば HTMLデータ 152に対して、 2004年度上期の予算を検索したい場合、検索例 153のように、検索キーワード 156として" 2004上期"ど'予算"を検索処理部 14 に入力すると、検索処理部 14は HTMLデータ 152のアンカー内の逆変換情報を検索し、検索キーワード 156と同じ文字列 154a、 154bを検出すると、対応する内部データ 155 (研削機一和戸を含む開始タグど' name"属性の文字列力 S"—_e"で終了する終了タグに挟まれたデータ）を検索結果 157として出力する。 [0110] For example, if you want to search the budget for the first half of fiscal 2004 against HTML data 152, as shown in search example 153, if you enter "2004 first half" and 'budget "as the search keyword 156, When the search processing unit 14 searches the reverse conversion information in the anchor of the HTML data 152 and finds the same character string 154a, 154b as the search keyword 156, the corresponding internal data 155 (start including the grinder The tag's “name” attribute string power S ”—data sandwiched between end tags ending with“ _e ”) is output as search result 157.

[0111] 図 15の場合、 HTMLデータ 152に対する" 2004上期"ど'予算"という文字列を検索キーワードとすると、アンカー中の"/予算/ 2004上期予算"という逆変換情報が見つかるので、そのアンカーに対応する内部データである" 100万円"が検索結果 1 57として得られる。 [0111] In the case of Fig. 15, if the search keyword is the string "2004 1st half" but 'budget "for HTML data 152, the reverse conversion information" / budget / 2004 1st budget "in the anchor is found. The internal data corresponding to the anchor “1 million yen” is obtained as the search result 1 57.

[0112] 図 16は、検索処理部 14によって行なわれる検索処理を示すフローチャートである同図の処理が開始されると、まず検索処理部 14は、ステップ S51として、ユーザに検索条件として検索キーワードを入力させる。 FIG. 16 is a flowchart showing the search process performed by the search processing unit 14. When the process of FIG. 16 is started, the search processing unit 14 first sends a search keyword as a search condition to the user in step S51. Let them enter.

[0113] 次に検索処理部 14は、検索対象となる HTMLデータ 133をメモリから読み出し、ステツプ S52として HTMLデータ 133から逆変換情報に対応するアンカー名（"name" 属性の文字列）を全て抽出する。 [0113] Next, the search processing unit 14 reads out the HTML data 133 to be searched from the memory, and in step S52, selects all the anchor names (character strings of the "name" attribute) corresponding to the reverse conversion information from the HTML data 133. Extract.

[0114] そして検索処理部 14は、ステップ S52で抽出した複数のアンカー名に対してステツプ S 51でユーザに入力させた検索キーワードを照合し、検索キーワードの文字列を含むものを抽出する。 [0114] Then, the search processing unit 14 matches the search keyword input by the user in step S51 against the plurality of anchor names extracted in step S52, and obtains the character string of the search keyword. Extract what it contains.

[0115] その結果、検索キーワードの文字列を含むアンカー名が存在すれば（ステップ S54 、 Yes)、ステップ S56として、該当したアンカー名を含むアンカー及びそのアンカーと対応するアンカー（同一のアンカー名 + "_e "のアンカー名のアンカー）との間に含まれる文字列を検索結果として出力し、本処理を終了する。 [0115] As a result, if there is an anchor name including the search keyword character string (step S54, Yes), as step S56, an anchor including the corresponding anchor name and an anchor corresponding to the anchor (the same anchor name + The character string included between "_e" and the anchor name is output as a search result, and this process ends.

[0116] またステップ S54において、検索キーワードの文字列を含むアンカー名が存在しなければ (ステップ S54、 No)、ステップ S55として検索結果として「該当無し」を出力し、処理を終了する。 In step S54, if there is no anchor name including the character string of the search keyword (step S54, No), “not applicable” is output as the search result as step S55, and the process is terminated.

[0117] この様に本実施形態における構造化データ変換装置によって変換した HTMLデータは、変換元の XMLデータと同様に検索処理を行なうことができる。 [0117] In this way, the HTML data converted by the structured data conversion apparatus in the present embodiment can be searched in the same manner as the conversion source XML data.

尚上記例では、検索処理を例として挙げている力本実施形態における構造化データ変換装置によって変換された変換結果である HTMLデータは、データ抽出や計算処理、独自処理等、検索以外の一般的に XMLデータに対する処理も同様にして行なうことが出来る。 In the above example, the power of the search process is taken as an example. The HTML data, which is the conversion result converted by the structured data conversion apparatus in this embodiment, is used for data extraction, calculation processing, unique processing, etc. In general, XML data can be processed in the same way.

[0118] また上記説明では、逆変換情報はアンカータグとして、変換結果の HTMLデータに坦め込んでいたが、 HTMLデータを Webブラウザによって表示したときに、坦め込んだ逆変換情報が表示されなければ、他の方法で坦め込んでも良い。 [0118] Also, in the above description, the reverse conversion information is included in the HTML data of the conversion result as an anchor tag. However, when the HTML data is displayed by the Web browser, the reverse conversion information embedded is displayed. If not, it may be carried in other ways.

[0119] 図 17は、逆変換情報を HTMLのコメントタグ内に坦め込んだ場合の例を示す図である。 [0119] Fig. 17 is a diagram showing an example of the case where reverse conversion information is embedded in an HTML comment tag.

同図では、変換元の XMLデータ 171を変換定義 172に基づいて XMLデータ 173 に変換する際、埋め込む逆変換情報をコメント 175— 1〜： 175— 6として埋め込んでいる。この場合逆変換情報は、コメントタグ 175中の" < ！ ——"ど'——〉"との間に挟まれたコメント本文として坦め込まれる。 In the figure, when converting the conversion source XML data 171 to the XML data 173 based on the conversion definition 172, the reverse conversion information to be embedded is embedded as comments 175-1 to 175-6. In this case, the reverse conversion information is carried as a comment text sandwiched between "<!-" Do '->> "in the comment tag 175.

[0120] このコメント 175は、 Webブラウザでは読み飛ばされるので、コメント 175を坦め込んでも、表示結果 174には全く影響を与えない。 [0120] Since this comment 175 is skipped by the Web browser, even if the comment 175 is inserted, the display result 174 is not affected at all.

図 18は、本実施形態における構造化データ変換装置の構成要素である、変換処理部 11、逆変換処理部 12、部分逆変換処理部 13、及び検索処理部 14をソフトゥェァ的手法によって実現した場合の、これらの構成要素が実現されるコンピュータのシステム環境図である。 FIG. 18 shows a software processing method for the conversion processing unit 11, the inverse conversion processing unit 12, the partial reverse conversion processing unit 13, and the search processing unit 14, which are components of the structured data conversion apparatus according to this embodiment. The computer system on which these components are realized. FIG.

[0121] 同図のコンピュータは、 CPU181、主記憶装置 182、ハードディスク等の補助記憶装置 183、ディスプレイ、キーボード等の入出力装置（I/O) 184、モデム等のネットワーク接続装置 185及びディスク、磁気テープなどの可搬記憶媒体から記憶内容を読み出す媒体読み取り装置 186を有し、これらが互いにバス 187により接続される構成を備えている。 [0121] The computer shown in the figure includes a CPU 181, a main storage device 182, an auxiliary storage device 183 such as a hard disk, an input / output device (I / O) 184 such as a display and a keyboard, a network connection device 185 such as a modem, a disk, A medium reading device 186 that reads out stored contents from a portable storage medium such as a magnetic tape is provided, and these are connected to each other via a bus 187.

[0122] 図 12のコンピュータでは、媒体読み取り装置 186により磁気テープ、フレキシブルディスク、 CD-ROM, MO等の記憶媒体 188に記憶されているプログラム、データを読み出し、これを主記憶装置 182または補助記憶装置 183にダウンロードする。そしてこのプログラムやデータに基づいて、 CPU181が、上述した変換処理部 11、逆変換処理部 12、部分逆変換処理部 13、及び検索処理部 14の各機能をソフトウェア的に実現する。 In the computer shown in FIG. 12, the medium reading device 186 reads out programs and data stored in a storage medium 188 such as a magnetic tape, a flexible disk, a CD-ROM, and an MO, and stores them in the main storage device 182 or auxiliary storage. Download to device 183. Based on this program and data, the CPU 181 implements the functions of the conversion processing unit 11, the inverse conversion processing unit 12, the partial reverse conversion processing unit 13, and the search processing unit 14 described above in software.

[0123] また、図 18のコンピュータシステムでは、フレキシブルディスク等の記憶媒体 187を用いてアプリケーションソフトの交換が行われる場合がある。よって、本発明は、構造化データ変換装置や変換方法に限らず、コンピュータにより使用されたときに、上述した本発明の実施形態の機能をコンピュータに行なわせるためのコンピュータ読み出し可能な記憶媒体 187として構成することもできる。 [0123] In the computer system of Fig. 18, application software may be exchanged using a storage medium 187 such as a flexible disk. Therefore, the present invention is not limited to a structured data conversion apparatus and conversion method, and when used by a computer, a computer-readable storage medium for causing a computer to perform the functions of the above-described embodiments of the present invention. It can also be configured as.

[0124] この場合、「記憶媒体」には、例えば図 19に示されるように、 CD-ROM,フレキシブルディスク（あるいは MO、 DVD、リムーバブルハードディスク等であってもよレ、）等の媒体駆動装置 197に脱着可能な可搬記憶媒体 196や、ネットワーク回線 193経由で送信される外部の装置 (サーバ等）内の記憶部（データベース等） 192、あるいは情報処理装置 191の本体 194内のメモリ（RAM又はハードディスク等） 195等が含まれる。可搬記憶媒体 196や記憶部（データベース等） 192に記憶されているプロダラムは、本体 194内のメモリ（RAM又はハードディスク等） 195にロードされて、実行される。 [0124] In this case, the "storage medium" is a medium drive such as a CD-ROM, flexible disk (or MO, DVD, removable hard disk, etc.) as shown in FIG. A portable storage medium 196 that can be attached to and detached from the device 197, a storage unit (database, etc.) 192 in an external device (server, etc.) transmitted via the network line 193, or a memory (main unit 194) of the information processing device 191 ( RAM, hard disk, etc.) 195 etc. are included. The programs stored in the portable storage medium 196 and the storage unit (database or the like) 192 are loaded into a memory (RAM or hard disk or the like) 195 in the main body 194 and executed.

[0125] また、既に説明した CD— ROMや DVD— ROM等の記憶媒体には、上記に例として挙げたものの他にも、例えば、 Blu-ray Disc (登録商標）や AOD (Advanced Optical Disc)などの青色レーザーを用いた次世代光ディスク記憶媒体、赤色レーザ一を用いる HD_DVD9、青紫色レーザーを用いる Blue Laser DVDなど、今後開発される種々の大容量記憶媒体を用いて本発明を実施することも可能である。 [0125] In addition to the above-mentioned examples of storage media such as CD-ROM and DVD-ROM, for example, Blu-ray Disc (registered trademark) and AOD (Advanced Optical Disc) ) And other next-generation optical disk storage media using a blue laser It is also possible to implement the present invention using various large-capacity storage media that will be developed in the future, such as HD_DVD9 using the same or Blue Laser DVD using a blue-violet laser.

[0126] 本発明によれば、構造化文書を他のデータ構造に変換する際に、変換先のデザィン (データ構造）を別個に作成しておき、このデザインに殆ど手をいれずに変換に使うことが可能である。このため、データとデザインを分離して管理、再利用することが可能になる。 [0126] According to the present invention, when converting a structured document to another data structure, a conversion destination design (data structure) is created separately, and the design is converted with little work. It can be used for This makes it possible to manage and reuse data and design separately.

[0127] 尚上記例では XML文書と HTML文書を相互変換する場合を例として挙げたが、本実施形態における文書変換活用システムによる変換は、これに限定されるものではなぐ SGML文書から HTML文書への変換等、他の構造化文書間の変換も行なうことができる。 In the above example, the case where the XML document and the HTML document are mutually converted is taken as an example. However, the conversion by the document conversion utilization system in the present embodiment is not limited to this. From the SGML document to the HTML document You can also convert between other structured documents, such as converting to.

Claims

The scope of the claims

[1] A structured data conversion device for converting first structured data into second structured data written in a language different from the first structured data,

Data that is deleted when the first structured data is converted to the second structured data, and that is necessary when the converted second structured data is converted back to the first structured data Is obtained from the first structured data as inverse transformation information,

An inverse transformation information storage unit that embeds the inverse transformation information in the second structured data when the first structured data is transformed into the second structured data;

A structured data conversion apparatus comprising:

[2] The image processing apparatus further includes an inverse conversion unit that inversely converts the second structured data into the first structured data using the inverse transformation information carried in the second structured data. The structured data conversion device according to claim 1, wherein:

[3] The reverse conversion information storage unit includes the reverse conversion information in a form that does not affect the appearance when the second structured data after conversion is displayed on a Web browser. The structured data conversion device according to claim 1.

[4] The request is characterized in that the second structured data is data described in HTML, and the reverse conversion information storage unit embeds the reverse conversion information in an HTML anchor tag. Item 4. The structured data converter according to item 3.

[5] The second structured data is data described in HTML, and the reverse conversion information storage unit embeds the reverse conversion information in an HTML comment tag. Structured data conversion device described in 1.

[6] The reverse conversion information generation unit extracts the reverse conversion information based on a conversion definition including information indicating a position in a hierarchical structure of an element in the first structured data. The structured data conversion device according to claim 1.

[7] The inverse transformation information generation unit generates the inverse transformation information for the entire repetitive structure when the first structure data includes a repetitive structure, and the inverse transformation information storage unit It is characterized by loading information corresponding to the entire repetitive structure The structured data conversion device according to claim 1.

[8] When the first structured data includes a repeated structure, the first structured data is converted into a table when the first structured data is converted into the second structured data. The structured data conversion device according to claim 1.

[9] The method according to claim 1, further comprising a partial reverse conversion unit that reversely converts a part of the first structured data from the second structured data based on an instruction from a user. Structured data converter described in 1.

10. The search unit according to claim 1, further comprising a search unit that searches the second structured data converted from the first structured data based on a search condition of user power. Structured data converter.

[11] The structure according to [1], wherein the inverse transformation information is information required at a minimum when inversely transforming the second structured data into the first structured data Data conversion device.

12. The structured data according to claim 11, wherein the inverse transformation information is information indicating a tag name of the tag in the first structure data and a correspondence relationship between the tag and the element content. Conversion device.

13. The first structured data according to claim 1, wherein the first structured data is data described in XML, and the second structured data is data described in HTML. Structured data converter.

[14] A structured data conversion method for converting first structured data into second structured data written in a language different from the first structured data,

Read the structured data to be converted from memory,

It is deleted when converting the first structured data into the second structured data, and is necessary when converting the converted second structured data back into the first structured data. The first structured data force is extracted as inverse transformation information,

When converting the first structured data into the second structured data, the inverse transformation information is embedded in the second structured data.

A structured data conversion method characterized by the above. When executed on a computer that converts the first structured data into second structured data written in a language different from the first structured data,

Read the structured data to be converted from memory,

A portable storage medium readable by the computer storing a program for causing the computer to execute the above.