WO2009084570A1

WO2009084570A1 - Compiler embedded function adding device

Info

Publication number: WO2009084570A1
Application number: PCT/JP2008/073552
Authority: WO
Inventors: Takahiro Kumura
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2007-12-28
Filing date: 2008-12-25
Publication date: 2009-07-09
Anticipated expiration: 2010-06-28

Abstract

This object aims to provide a device, program or method that adds an embedded function corresponding to an additional command to a compiler in the form that is possible to reflect information of a command word length or latency of the additional command on the compiler. An embedded function adding program (30) reads in an additional command specification description file (40) and a compiler source code (50) for a base processor from a memory device (20), generates a definition sentence of the embedded function corresponding to the additional command described in the additional command specification description file, and outputs to the memorydevice (20) a compiler source code for a base processor to which the definition sentence of the embedded function corresponding to the additional command is added as a compiler source code (60) for an expanded processor where the additional command is provided in a command set.

Description

Built-in function addition device

［関連出願の記載］
　本発明は、日本国特許出願：特願２００７－３４０５３２号（２００７年１２月２８日出願）の優先権主張に基づくものであり、同出願の全記載内容は引用をもって本書に組み込み記載されているものとする。
　本発明は、プロセッサのコンパイラ技術に関し、特に、基本となるプロセッサ（ベースプロセッサ）の命令セットへ追加された新たな命令（追加命令）に一対一に対応した組み込み関数をコンパイラへ追加する技術に関する。 [Description of related applications]
The present invention is based on the priority claim of Japanese patent application: Japanese Patent Application No. 2007-340532 (filed on Dec. 28, 2007), the entire contents of which are incorporated herein by reference. Shall.
The present invention relates to a compiler technique for a processor, and more particularly to a technique for adding an embedded function corresponding to a new instruction (additional instruction) added to an instruction set of a basic processor (base processor) to a compiler.

　プロセッサの設計を効率良く行なうためにさまざまなツールが開発されている。この種のツールの一つにプロセッサの設計仕様からプロセッサのハードウェア構成やソフトウェア開発ツール（コンパイラ、アセンブラ、シミュレータなど）を生成するツールが用いられており、コンピュータ上で動作するプログラムとして提供される。ツールが動作するコンピュータは「プロセッサ設計支援装置」とも呼ばれる。 Various tools have been developed for efficient processor design. One of these types of tools is a tool that generates the hardware configuration of the processor and software development tools (compiler, assembler, simulator, etc.) from the design specifications of the processor, and is provided as a program that runs on a computer. . The computer on which the tool operates is also called a “processor design support device”.

　設計者が定義したプロセッサの命令セットに基づいてプロセッサのためのアセンブラやコンパイラを生成するプロセッサ設計支援装置が、非特許文献１、特許文献１乃至３等に開示されている。 Non-Patent Document 1, Patent Documents 1 to 3 and the like disclose a processor design support apparatus that generates an assembler and compiler for a processor based on a processor instruction set defined by a designer.

　非特許文献１、特許文献１乃至３等に開示されているプロセッサ設計支援装置は、基本となるプロセッサ（「ベースプロセッサ」という）へ新たな命令を追加することができる。ベースプロセッサへ追加された命令に対応する組み込み関数をコンパイラから利用可能にするために、インラインアセンブラで記述された関数を組み込み関数として定義するようなヘッダファイルをプロセッサ設計支援装置が生成する。なお、ヘッダファイルの生成については、特許文献１（段落番号０３２９）、特許文献２（段落番号００５６）、特許文献３（段落番号００５２）等の記載も参照される。 The processor design support apparatus disclosed in Non-Patent Document 1, Patent Documents 1 to 3, and the like can add a new instruction to a basic processor (referred to as a “base processor”). In order to make the built-in function corresponding to the instruction added to the base processor available from the compiler, the processor design support apparatus generates a header file that defines the function described in the inline assembler as the built-in function. Regarding the generation of the header file, the descriptions in Patent Document 1 (paragraph number 0329), Patent Document 2 (paragraph number 0056), Patent Document 3 (paragraph number 0052), and the like are also referred to.

　組み込み関数は、プロセッサの命令に一対一に対応する関数である。 ∙ Built-in functions are functions that correspond one-to-one with processor instructions.

　組み込み関数を使うことによって、プログラム開発者は、コンパイラが通常使用しない命令をコンパイラに使用させることができる。 By using built-in functions, program developers can let the compiler use instructions that the compiler does not normally use.

　コンパイラは、組み込み関数を通常の関数の一つとして扱うため、組み込み関数の引数や戻り値に関するレジスタ割当をコンパイラが行う。 ∙ The compiler handles built-in functions as one of the normal functions, so the compiler performs register allocation for arguments and return values of built-in functions.

　インラインアセンブラは、例えば、Ｃ言語のソースコードの中に記述されたアセンブラ言語である。インラインアセンブラを使うことによって、ソースコードの中にアセンブラ言語を挿入することができる。 The inline assembler is, for example, an assembler language described in C language source code. By using an inline assembler, assembler language can be inserted into the source code.

Tensilica, USP-6477683, “Automated processor generation system for designing a configurable processor and method for the same.”Tensilica, USP-6477683, “Automatedmprocessor generation system for designing a configurable processor and method for the same.” Richard M. Stallman and the GCC Developer Community, “GNU Compiler Collection Internals (GCC),”　インターネット＜URL: http://gcc.gnu.org/onlinedocs/gccint.pdf＞（第２３９頁、“14 Machine Description”、第２４１頁の“14.4 RTL Template”、第３２７頁の“14.19.8 Specifying processor pipeline description”）Richard M. Stallman and the GCC Developer Community, “GNU Compiler Collection Internals (GCC),” Internet <URL: ＜ http://gcc.gnu.org/onlinedocs/gccint.pdf> (page 239, “14 Machine Description”) , Page 241, “14.4 RTL Template”, page 327 “14.19.8 Specifying processor processor, pipeline description”) Free Software Foundation, Inc.,　矢吹洋一訳，　“　ＧＮＵ　コンパイラ集（ＧＣＣ）　の使い方と移植について，”（第１７８頁の“機械記述”、第１８０頁の“ＲＴＬテンプレート”）　インターネット＜URL: http://www.sra.co.jp/public/sra/product/wingnut/gcc/gcc-j.html＞Free Software Foundation, Inc., Yoichi Yabuki, “GNU Compiler Collection (GCC) Usage and Porting” (“Machine Description” on page 178, “RTL Template” on page 180)) Internet <URL: http: //www.sra.co.jp/public/sra/product/wingnut/gcc/gcc-j.html> GNU Project - Free Software Foundation, Inc., “Installing GCC,”インターネット＜URL：http://gcc.gnu.org/install/＞　（第７頁の“Configuration”と第２１頁の“Building”）GNU Project-Free Software Foundation, Inc., "Installing GCC," Internet <URL: http://gcc.gnu.org/install/> ("Configuration" on page 7 and "Building" on page 21) 特表２００３－５１８２８０号公報Special table 2003-518280 gazette 特開２００３－３２３４６３号公報JP 2003-323463 A 特開２００２－２４０２９号公報Japanese Patent Laid-Open No. 2002-24029

　以上の非特許文献１乃至４及び特許文献１乃至３の開示事項は、本書に引用をもって繰り込み記載されているものとする。以下に本発明による関連技術の分析を与える。 The disclosures of Non-Patent Documents 1 to 4 and Patent Documents 1 to 3 described above are incorporated herein by reference. The following is an analysis of the related art according to the present invention.

　特許文献１乃至３等の上記関連技術においては、コンパイラ組み込み関数の追加は、ベースプロセッサのためのコンパイラを何ら変更せずに、ヘッダファイルだけを追加することによって追加命令のための組み込み関数を使用可能にする。 In the related technologies such as Patent Documents 1 to 3, the addition of the compiler built-in function uses the built-in function for the additional instruction by adding only the header file without changing the compiler for the base processor. enable.

　すなわち、ヘッダファイルをコンパイラへ追加するだけなので、コンパイラの変更が必要ないという利点がある。 That is, there is an advantage that no compiler change is required since only the header file is added to the compiler.

　しかしながら、上記関連技術によるコンパイラ組み込み関数の追加においては、追加命令の命令語長や、レイテンシなどの情報はまったくコンパイラに反映されない、という問題がある。 However, when a compiler built-in function is added by the related technology, there is a problem that information such as instruction word length of an additional instruction and latency is not reflected in the compiler at all.

　なお、命令語長は、追加命令が何バイトの命令であるかを表す情報である。 The instruction word length is information indicating how many bytes the additional instruction is.

　レイテンシは、追加命令の演算結果が何サイクル後に得られるかを表す情報である。 Latency is information indicating how many cycles the operation result of the additional instruction is obtained.

　上記関連技術によるコンパイラ組み込み関数の追加において、追加命令の命令語長をコンパイラへ知らせることができない。このため、コンパイラは、追加命令の語長が全て同一で、既知の値をもつと仮定しなければならない。 * When adding a compiler built-in function using the above related technology, the instruction word length of the additional instruction cannot be notified to the compiler. For this reason, the compiler must assume that the word lengths of the additional instructions are all the same and have known values.

　これは語長が異なる追加命令を作成できないことを意味する。 This means that additional commands with different word lengths cannot be created.

　さらに、上記関連技術によるコンパイラ組み込み関数の追加において、追加命令のレイテンシをコンパイラへ知らせることができないことから、コンパイラがその追加命令を効率良くスケジューリングすることができない。対象となるソースコードの制御フローやデータフローの情報を利用して、コンパイラは追加命令を効率良くスケジューリングできる可能性があるが、上記関連技術ではそれが不可能である。 Furthermore, in the addition of the compiler built-in function according to the related technology, the compiler cannot inform the compiler of the latency of the additional instruction, so the compiler cannot schedule the additional instruction efficiently. There is a possibility that the compiler can efficiently schedule additional instructions using the control flow and data flow information of the target source code, but this is not possible with the related technology.

　上記の如く、関連技術によるコンパイラ組み込み関数の追加においては、追加命令の命令語長やレイテンシなどの情報はまったくコンパイラに反映されない。 As described above, when adding a compiler built-in function according to related technology, information such as the instruction word length and latency of the additional instruction is not reflected in the compiler at all.

　したがって、本発明の主たる目的は、コンパイラ組み込み関数の追加において、追加命令の命令語長やレイテンシなどの仕様情報をコンパイラに反映可能とする装置、方法、プログラムを提供することにある。 Therefore, a main object of the present invention is to provide an apparatus, a method, and a program that allow specification information such as the instruction word length and latency of an additional instruction to be reflected in the compiler when a compiler built-in function is added.

　本発明の１つの側面においては、基本となるプロセッサ（ベースプロセッサ）の命令セットへ追加される追加命令について、追加命令に対応する組み込み関数をベースプロセッサのコンパイラへ追加する。本発明においては、追加命令の命令語長やレイテンシの情報をコンパイラが理解できるようにするためのソースコードを、ベースコンパイラのソースコードへ追加することによって、組み込み関数をコンパイラへ追加する。 In one aspect of the present invention, for an additional instruction added to the instruction set of the basic processor (base processor), an embedded function corresponding to the additional instruction is added to the compiler of the base processor. In the present invention, an intrinsic function is added to the compiler by adding source code for enabling the compiler to understand the instruction word length and latency information of the additional instruction to the source code of the base compiler.

　本発明によれば、ベースプロセッサ用のコンパイラのソースコードに対して、ベースプロセッサの命令セットに新たに追加される追加命令の仕様記述に基づいて、その追加命令を備えたベースプロセッサである拡張プロセッサ用のコンパイラのソースコードを生成することにより、該追加命令に対応する組み込み関数をコンパイラへ追加する組み込み関数追加処理を実行する装置が提供される。 According to the present invention, based on the specification description of an additional instruction newly added to the instruction set of the base processor with respect to the source code of the compiler for the base processor, the extended processor which is a base processor having the additional instruction By generating the source code of the compiler for use, an apparatus for executing an embedded function addition process for adding an embedded function corresponding to the additional instruction to the compiler is provided.

　本発明によれば、ベースプロセッサ用のコンパイラのソースコードを入力し、前記ベースプロセッサの命令セットに新たに追加される追加命令の仕様記述に基づき、前記ベースプロセッサ用のコンパイラのソースコードから、前記追加命令を命令セットに備えた拡張プロセッサ用のコンパイラのソースコードを生成し、前記追加命令に対応する組み込み関数を、前記拡張プロセッサ用のコンパイラへ追加する組み込み関数追加処理をコンピュータに実行させるプログラムが提供される。 According to the present invention, the source code of the compiler for the base processor is input, and based on the specification description of the additional instruction newly added to the instruction set of the base processor, the source code of the compiler for the base processor A program for generating a source code of a compiler for an extended processor having an additional instruction in an instruction set and causing the computer to execute an embedded function adding process for adding an embedded function corresponding to the additional instruction to the compiler for the extended processor. Provided.

　本発明によれば、ベースプロセッサ用のコンパイラのソースコードを入力し、前記ベースプロセッサの命令セットに新たに追加される追加命令の仕様記述に基づき、前記ベースプロセッサ用のコンパイラのソースコードから、前記追加命令を命令セットに備えた拡張プロセッサ用のコンパイラのソースコードを生成し、前記追加命令に対応する組み込み関数を、前記拡張プロセッサ用のコンパイラへ追加する組み込み関数追加処理を実行する方法が提供される。 According to the present invention, the source code of the compiler for the base processor is input, and based on the specification description of the additional instruction newly added to the instruction set of the base processor, the source code of the compiler for the base processor A method for generating a compiler source code for an extended processor having an additional instruction in an instruction set and executing an embedded function adding process for adding an embedded function corresponding to the additional instruction to the compiler for the extended processor is provided. The

　本発明において、前記組み込み関数追加処理は、
　追加命令の仕様記述を読み込む処理と、
　前記仕様記述に基づいて追加命令の構成定義文を生成する処理と、
　前記仕様記述に基づいて追加命令とそれに対応する組み込み関数との関係定義文を生成する処理と、
　前記構成定義文と前記関係定義文とを前記ベースプロセッサのためのコンパイラのソースコードへ追加することによって前記拡張プロセッサのためのコンパイラのソースコードを生成する処理と、を含むようにしてもよい。 In the present invention, the built-in function adding process is:
Processing to read the specification description of the additional instruction;
Processing for generating a configuration definition statement of an additional instruction based on the specification description;
Processing for generating a relationship definition statement between an additional instruction and a corresponding built-in function based on the specification description;
Processing for generating a compiler source code for the extension processor by adding the configuration definition statement and the relationship definition statement to the source code of the compiler for the base processor.

　本発明において、前記構成定義文は、コンパイラの中間言語で表現された追加命令のテンプレートと、追加命令の実行に必要なサイクル数を定義するレイテンシ定義文と、から構成され、前記構成定義文を生成する処理が、前記仕様記述に含まれる追加命令の名前と入力オペランドと出力オペランドとに基づいて前記テンプレートを生成し、さらに前記仕様記述に含まれる追加命令の実行に必要なサイクル数に基づいて前記レイテンシ定義文を生成する、構成としても良い。 In the present invention, the configuration definition statement includes an additional instruction template expressed in an intermediate language of a compiler, and a latency definition statement that defines the number of cycles required to execute the additional instruction, and the configuration definition statement is The generating process generates the template based on the name of the additional instruction, the input operand, and the output operand included in the specification description, and further, based on the number of cycles necessary for executing the additional instruction included in the specification description. The latency definition sentence may be generated.

　本発明において、前記関係定義文は、追加命令に対応する組み込み関数のプロトタイプ宣言と、組み込み関数を追加命令へ置き換えるための組み込み関数展開関数と、から構成され、前記関係定義文を生成する処理が、前記仕様記述に含まれる追加命令の名前と入力オペランドと出力オペランドとに基づいて前記組み込み関数のプロトタイプ宣言を生成し、さらにコンパイラの所定の形式に基づいて前記組み込み関数展開関数を生成する、構成としても良い。 In the present invention, the relationship definition statement includes a prototype declaration of an embedded function corresponding to an additional instruction and an embedded function expansion function for replacing the embedded function with the additional instruction, and the process of generating the relationship definition statement includes Generating a prototype declaration of the built-in function based on the name of an additional instruction included in the specification description, an input operand, and an output operand, and further generating the built-in function expansion function based on a predetermined format of a compiler It is also good.

　本発明において、前記構成定義文における前記テンプレートを生成する際に、第一のコンパイラ組み込み関数追加装置は、
　前記仕様記述の追加命令の出力オペランドと入力オペランドとを順番に並べることによって追加命令の中間言語定義文におけるオペランドの番号を決定し、
　前記仕様記述の追加命令の名前に基づいてテンプレートの名前を生成し、
　前記仕様記述の追加命令の出力オペランドの型に基づいてテンプレートの出力オペランドの記述を生成し、
　前記仕様記述の追加命令の入力オペランドの型に基づいてテンプレートの入力オペランドの記述を生成し、
　追加命令がコンパイラにとって不明な演算を実行することを示す演算子をもちいてテンプレートにおける追加命令の動作記述を生成し、
　前記仕様記述の追加命令のシンタックスに含まれるオペランドを前記オペランド番号で置き換えることによってテンプレートのシンタックスを生成し、
　前記仕様記述の追加命令の命令語長に基づいてテンプレートにおける命令語長定義文を生成し、
　前記仕様記述の追加命令の名前に基づいてテンプレートにおけるニモニック定義文を生成する、という一連の処理を行う構成としても良い。 In the present invention, when generating the template in the configuration definition statement, the first compiler built-in function adding device,
Determining the operand number in the intermediate language definition statement of the additional instruction by arranging the output operand and the input operand of the additional instruction of the specification description in order;
Generating a name for the template based on the name of the additional instruction in the specification description;
Generating a description of the output operand of the template based on the type of the output operand of the additional instruction of the specification description;
Generating a description of the input operand of the template based on the type of the input operand of the additional instruction of the specification description;
Generate an action description of the additional instruction in the template using an operator that indicates that the additional instruction performs an operation unknown to the compiler,
Generating a template syntax by replacing an operand included in the syntax of the additional instruction in the specification description with the operand number;
Generating an instruction word length definition statement in the template based on the instruction word length of the additional instruction in the specification description;
A configuration may be adopted in which a series of processes of generating a mnemonic definition sentence in the template based on the name of the additional instruction in the specification description is performed.

　本発明によれば、コンパイラ組み込み関数の追加において、追加命令の命令語長やレイテンシなどの情報をコンパイラに反映することができる。 According to the present invention, when a compiler built-in function is added, information such as instruction word length and latency of the additional instruction can be reflected in the compiler.

本発明の一実施例の構成を示す図である。It is a figure which shows the structure of one Example of this invention. 本発明の一実施例のコンパイラ組み込み関数追加プログラムの入出力関係を示す図である。It is a figure which shows the input / output relationship of the compiler intrinsic | native function addition program of one Example of this invention. 本発明の一実施例の処理手順を示す図である。It is a figure which shows the process sequence of one Example of this invention. 図３におけるＲＴＬテンプレート生成処理の手順を示す図である。It is a figure which shows the procedure of the RTL template production | generation process in FIG. ＲＴＬテンプレート生成処理におけるＲＴＬテンプレートのオペランド番号の決定の手順を示す図である。It is a figure which shows the procedure of the determination of the operand number of the RTL template in a RTL template production | generation process. 図３のレイテンシ定義文生成処理の処理手順を示す図である。It is a figure which shows the process sequence of the latency definition sentence production | generation process of FIG. 追加命令の仕様記述における＜ｏｐｅｒａｎｄ＞タグの属性ｗｉｄｔｈとＲＴＬテンプレートのオペランドの語長を表す文字列ｍｏｄｅｎａｍｅｓｈｏｒｔとの対応関係を示す図である。It is a figure which shows the correspondence of the attribute width of the <operand> tag in the specification description of an additional command, and the character string modemshort which represents the word length of the operand of an RTL template. 組み込み関数のプロトタイプ宣言をコンパイラへ教える関数の名前を定義するための定義文の形式を表す図である。It is a figure showing the format of the definition statement for defining the name of the function which tells a compiler the prototype declaration of a built-in function. 追加命令の仕様記述における＜ｏｐｅｒａｎｄ＞タグの属性ｗｉｄｔｈと組み込み関数のプロトタイプ宣言における戻り値や引数の型指定子ｍｏｄｅｎａｍｅｌｏｎｇ＿ｔｙｐｅ＿ｎｏｄｅとの対応関係を示す図である。It is a figure which shows the correspondence with the attribute width of the <operand> tag in the specification description of an additional instruction, the return value in the prototype declaration of a built-in function, and the argument type specifier modelnamelong_type_node. 組み込み関数のプロトタイプ宣言をコンパイラへ教える関数の内容を示す図である。It is a figure which shows the content of the function which tells a compiler the prototype declaration of an intrinsic function. 組み込み関数プロトタイプ宣言生成処理によって生成される組み込み関数プロトタイプ宣言の定義文の形式を示す図である。It is a figure which shows the format of the definition statement of the intrinsic | native function prototype declaration produced | generated by the intrinsic | native function prototype declaration production | generation process. 組み込み関数の展開関数の名前を定義するための定義文の形式を示す図である。It is a figure which shows the format of the definition sentence for defining the name of the expansion function of a built-in function. 組み込み関数の展開関数の内容を示す図である。It is a figure which shows the content of the expansion function of a built-in function. 本発明の一実施例の追加命令仕様記述ファイルの具体的な内容を示す図である。It is a figure which shows the specific content of the additional instruction specification description file of one Example of this invention. ＲＴＬテンプレート生成処理（ステップ２００）において、図１４の仕様記述に基づいて生成した追加命令ｍａｃ３２のためのＲＴＬテンプレートを示す図である。FIG. 15 is a diagram showing an RTL template for an additional instruction mac32 generated based on the specification description of FIG. 14 in the RTL template generation process (step 200). ＲＴＬテンプレート生成処理（ステップ２００）において、図１４の仕様記述に基づいて生成した追加命令ａｖｇ２のためのＲＴＬテンプレートを示す図である。FIG. 15 is a diagram showing an RTL template for an additional instruction avg2 generated based on the specification description of FIG. 14 in the RTL template generation process (step 200). 図６のステップ３１０とステップ３２０とにおいて、図１４の仕様記述に基づいて生成したレイテンシ定義文を示す図である。FIG. 15 is a diagram showing a latency definition sentence generated based on the specification description of FIG. 14 in step 310 and step 320 of FIG. 6. 図６のステップ３３０において、図１４の仕様記述に基づいて生成した追加命令ｍａｃ３２のためのレイテンシ定義文を示す図である。FIG. 15 is a diagram showing a latency definition statement for an additional instruction mac32 generated based on the specification description of FIG. 14 in step 330 of FIG. 図６のステップ３３０において、図１４の仕様記述に基づいて生成した追加命令ａｖｇ２のレイテンシ定義文を示す図である。FIG. 15 is a diagram showing a latency definition sentence of an additional instruction avg2 generated based on the specification description of FIG. 14 in step 330 of FIG. 組み込み関数プロトタイプ宣言生成処理（ステップ４００）において、図１４の仕様記述に基づいて生成した追加命令ｍａｃ３２のための組み込み関数プロトタイプ宣言を示す図である。FIG. 15 is a diagram showing an embedded function prototype declaration for an additional instruction mac32 generated based on the specification description of FIG. 14 in the embedded function prototype declaration generation process (step 400). 組み込み関数プロトタイプ宣言生成処理（ステップ４００）において、図１４の仕様記述に基づいて生成した追加命令ａｖｇ２のための組み込み関数プロトタイプ宣言を示す図である。FIG. 15 is a diagram showing an embedded function prototype declaration for an additional instruction avg2 generated based on the specification description of FIG. 14 in the embedded function prototype declaration generation process (step 400). 組み込み関数プロトタイプ宣言生成処理（ステップ４００）において、図１４の仕様記述に基づいて生成した関数の内容を示す図である。FIG. 15 is a diagram showing the contents of a function generated based on the specification description of FIG. 14 in the built-in function prototype declaration generation process (step 400).

Explanation of symbols

　１０　ＣＰＵ
　２０　記憶装置
　３０　コンパイラ組み込み関数追加プログラム
　４０　追加命令仕様記述ファイル
　５０　ベースプロセッサ用コンパイラソースコード
　６０　拡張プロセッサ用コンパイラソースコード
　６１　追加命令の構成定義文
　６２　追加命令のテンプレート
　６３　追加命令のレイテンシを定義するレイテンシ定義
　６４　追加命令と組み込み関数との関係定義文
　６５　組み込み関数のプロトタイプ宣言
　６６　組み込み関数展開関数
　７０　コンパイラ組み込み関数追加装置 10 CPU
DESCRIPTION OF SYMBOLS 20 Storage device 30 Compiler built-in function additional program 40 Additional instruction specification description file 50 Base processor compiler source code 60 Extended processor compiler source code 61 Additional instruction configuration definition statement 62 Additional instruction template 63 Latency for defining additional instruction latency Definition 64 Definition statement of relationship between additional instruction and built-in function 65 Prototype declaration of built-in function 66 Built-in function expansion function 70 Compiler built-in function adding device

　本発明の実施の形態について図面を参照して詳細に説明する。本発明のコンパイラ組み込み関数追加装置は、ベースプロセッサへの追加命令に関する仕様記述に基づいて、ベースプロセッサのためのコンパイラへその追加命令に対応する組み込み関数を追加する装置である。 Embodiments of the present invention will be described in detail with reference to the drawings. The compiler built-in function adding device according to the present invention is a device that adds a built-in function corresponding to the additional instruction to the compiler for the base processor based on the specification description regarding the additional instruction to the base processor.

　組み込み関数は、「イントリンジック（ｉｎｔｒｉｎｓｉｃ）」とも呼ばれる。組み込み関数とはプロセッサの命令に一対一に対応する関数である。 The built-in function is also called “intrinsic”. Built-in functions are functions that correspond one-to-one with processor instructions.

　一般的には、プログラムのソースコードの中で組み込み関数を使うと、それは対応する命令へコンパイラによって置き換えられる。 Generally, when an embedded function is used in the source code of a program, it is replaced by the compiler with the corresponding instruction.

　ベースプロセッサへの追加命令に対応した組み込み関数をコンパイラへ追加すると、プログラム開発者は、その追加命令を関数呼び出しのような形式でコンパイラから使用することができる。 When an embedded function corresponding to an additional instruction to the base processor is added to the compiler, the program developer can use the additional instruction from the compiler in the form of a function call.

　本発明の一実施例においては、ベースプロセッサのコンパイラは既に存在しており、該コンパイラのソースコードも利用可能であることを前提としている。 In one embodiment of the present invention, it is assumed that a compiler for a base processor already exists and that the source code of the compiler can also be used.

　かかる前提において、追加命令に対応する組み込み関数をコンパイラが扱えるようにするために、ベースプロセッサのコンパイラのソースコードを修正する。該ソースコードの修正においては、追加命令のための組み込み関数を扱うために必要となるソースコードの断片を、コンパイラのソースコードへ追加する修正が行われる。本発明によれば、追加命令の命令語長やレイテンシなどの情報をコンパイラに反映させることができる。この結果、コンパイラは、追加命令を含むプログラムの語長を正しく計算したり、追加命令を適切にスケジューリングすることが可能となる。 Based on this premise, the source code of the compiler for the base processor is modified so that the compiler can handle the built-in functions corresponding to the additional instructions. In the modification of the source code, a modification is performed in which a fragment of the source code necessary for handling the built-in function for the additional instruction is added to the source code of the compiler. According to the present invention, information such as the instruction word length and latency of the additional instruction can be reflected in the compiler. As a result, the compiler can correctly calculate the word length of the program including the additional instruction and can appropriately schedule the additional instruction.

　図１は、本発明の模範的な一実施例におけるコンパイラ組み込み関数追加装置の構成を示す図である。コンパイラ組み込み関数追加装置７０は、ＣＰＵ１０と、記憶装置２０と、コンパイラ組み込み関数追加手段（コンパイラ組み込み関数追加プログラム）３０と、追加命令仕様記述ファイル４０と、ベースプロセッサ用コンパイラソースコード５０と、拡張プロセッサ用コンパイラソースコード６０と、を備えている。コンパイラ組み込み関数追加手段３０は、ＣＰＵ１０で実行されるプログラムによって処理・機能が実現されため、コンパイラ組み込み関数追加プログラム３０ともいう。 FIG. 1 is a diagram showing a configuration of a compiler built-in function adding device in an exemplary embodiment of the present invention. The compiler built-in function adding device 70 includes a CPU 10, a storage device 20, a compiler built-in function adding means (compiler built-in function adding program) 30, an additional instruction specification description file 40, a compiler source code 50 for a base processor, and an extended processor. Compiler source code 60. The compiler built-in function adding unit 30 is also referred to as a compiler built-in function adding program 30 because processing and functions are realized by a program executed by the CPU 10.

　ＣＰＵ１０がコンパイラ組み込み関数追加プログラム３０を実行することによって、記憶装置２０から追加命令仕様記述ファイル４０とベースプロセッサ用コンパイラソースコード５０とを読み込み、追加命令仕様記述ファイル４０に記述された追加命令に対応する組み込み関数の定義文を生成し、追加命令に対応する組み込み関数の定義文をベースプロセッサ用コンパイラソースコード５０へ追加したものを、拡張プロセッサ用コンパイラソースコード６０とし、拡張プロセッサ用コンパイラソースコード６０を記憶装置２０へ格納する。 When the CPU 10 executes the compiler built-in function addition program 30, the additional instruction specification description file 40 and the base processor compiler source code 50 are read from the storage device 20 and correspond to the additional instructions described in the additional instruction specification description file 40. An extension function compiler source code 60 is generated by generating an embedded function definition statement corresponding to the additional instruction to the base processor compiler source code 50 and generating the extension processor compiler source code 60. Is stored in the storage device 20.

　コンパイラ組み込み関数追加装置７０は、一般的なコンピュータ（例えば、パーソナルコンピュータ）をもちいて実現可能である。使用者（ユーザ）は、拡張プロセッサ用コンパイラソースコード６０を使って、拡張プロセッサ用コンパイラを構築することも可能である。さらに、その構築機能を、コンパイラ組み込み関数追加装置７０に付加することも可能である。 The compiler built-in function adding device 70 can be realized using a general computer (for example, a personal computer). The user (user) can also build an extended processor compiler using the extended processor compiler source code 60. Furthermore, the construction function can be added to the compiler built-in function adding device 70.

　図２は、図１のコンパイラ組み込み関数追加プログラム３０の入出力関係の詳細を示す図である。コンパイラ組み込み関数追加プログラム３０の入力ファイルは、追加命令仕様記述ファイル４０と、ベースプロセッサ用コンパイラソースコード５０である。 FIG. 2 is a diagram showing details of the input / output relationship of the compiler built-in function addition program 30 of FIG. The input files of the compiler built-in function adding program 30 are an additional instruction specification description file 40 and a base processor compiler source code 50.

　コンパイラ組み込み関数追加プログラム３０の最終的な出力ファイルは、拡張プロセッサ用コンパイラソースコード６０である。 The final output file of the compiler built-in function addition program 30 is the compiler source code 60 for the extended processor.

　拡張プロセッサ用コンパイラソースコード６０は、ベースプロセッサ用コンパイラソースコード５０と、追加命令の構成定義文６１と、追加命令と組み込み関数との関係定義文６４と、を含む。 The extended processor compiler source code 60 includes a base processor compiler source code 50, a configuration definition statement 61 of an additional instruction, and a relationship definition statement 64 of the additional instruction and an embedded function.

　コンパイラ組み込み関数追加プログラム３０を実行することによって、ＣＰＵ１０は、追加命令仕様記述ファイル４０に基づいて、追加命令の構成定義文６１と、追加命令と組み込み関数との関係定義文６４とを生成する。 By executing the compiler built-in function addition program 30, the CPU 10 generates a configuration definition statement 61 of an additional instruction and a relationship definition statement 64 between the additional instruction and the built-in function based on the additional instruction specification description file 40.

　さらに、ＣＰＵ１０は、追加命令の構成定義文６１と、追加命令と組み込み関数との関係定義文６４とを、ベースプロセッサ用コンパイラソースコード５０へ追加する。そして、ＣＰＵ１０は、追加命令の構成定義文６１と、追加命令と組み込み関数との関係定義文６４と追加したソースコードを、拡張プロセッサ用コンパイラソースコード６０として出力する。 Further, the CPU 10 adds the configuration definition statement 61 of the additional instruction and the relationship definition statement 64 between the additional instruction and the built-in function to the compiler source code 50 for the base processor. Then, the CPU 10 outputs the configuration definition statement 61 of the additional instruction, the relationship definition statement 64 between the additional instruction and the built-in function, and the added source code as the extended processor compiler source code 60.

　追加命令の構成定義文６１は、コンパイラの中間言語で定義された追加命令のテンプレート６２と、追加命令のレイテンシを定義するレイテンシ定義文６３と、を含む。 The additional instruction configuration definition statement 61 includes an additional instruction template 62 defined in an intermediate language of the compiler, and a latency definition sentence 63 that defines the latency of the additional instruction.

　テンプレート６２は、一般的には、「ＲＴＬ（ｒｅｇｉｓｔｅｒ　ｔｒａｎｓｆｅｒ　ｌａｎｇｕａｇｅ）」と呼ばれるコンパイラの中間言語で記述される。以下では、テンプレート６２の具体的な例を「ＲＴＬテンプレート」と呼ぶことにする。 The template 62 is generally described in an intermediate language of a compiler called “RTL (register transfer language)”. Hereinafter, a specific example of the template 62 will be referred to as an “RTL template”.

　レイテンシ定義文６３は、追加命令の実行に必要なサイクル数（レイテンシ）を定義する。 The latency definition statement 63 defines the number of cycles (latency) necessary for executing the additional instruction.

　追加命令と組み込み関数との関係定義文６４は、追加命令に対応する組み込み関数のプロトタイプ宣言６５と、組み込み関数の展開関数６６と、を含む。 The relation definition statement 64 between the additional instruction and the built-in function includes a prototype declaration 65 of the built-in function corresponding to the additional instruction and an expansion function 66 of the built-in function.

　組み込み関数のプロトタイプ宣言６５は、組み込み関数の戻り値や引数の型を定義する。 The prototype declaration 65 of the built-in function defines the return value and argument type of the built-in function.

　組み込み関数のプロトタイプ宣言６５によって、拡張プロセッサ用コンパイラは、組み込み関数の名前、戻り値や引数の型を知ることができる。 The built-in function prototype declaration 65 allows the compiler for the extended processor to know the name of the built-in function, the return value, and the argument type.

　組み込み関数の展開関数６６は、組み込み関数に対応する追加命令を検出するために、拡張プロセッサ用コンパイラが用いる関数である。 The expansion function 66 of the built-in function is a function used by the compiler for the extended processor in order to detect an additional instruction corresponding to the built-in function.

　図３は、コンパイラ組み込み関数追加プログラム３０の処理手順を示すフローチャートである。図３を参照すると、コンパイラ組み込み関数追加プログラム３０は、
　仕様記述の読み込み処理を行うステップ１００と、
　ＲＴＬテンプレートを生成する処理を行うステップ２００と、
　レイテンシ定義文を生成する処理を行うステップ３００と、
　組み込み関数プロトタイプ宣言を生成する処理を行うステップ４００と、
　組み込み関数展開関数を生成する処理を行うステップ５００と、
　コンパイラソースコードの追加処理を行うステップ６００と、
　を含む。 FIG. 3 is a flowchart showing the processing procedure of the compiler built-in function addition program 30. Referring to FIG. 3, the compiler built-in function addition program 30 is
A step 100 for reading the specification description;
Step 200 for performing processing for generating an RTL template;
Step 300 for performing processing for generating a latency definition statement;
A step 400 of performing a process of generating a built-in function prototype declaration;
Step 500 for performing processing for generating an embedded function expansion function;
Step 600 for performing additional processing of compiler source code;
including.

　本実施例において、図３に示したフローチャートの処理手順をコンピュータに実行させるためのコンピュータプログラムを、コンパイラ組み込み関数追加装置７０へ所定の媒体等を介して供給し、該コンピュータプログラムをコンパイラ組み込み関数追加装置７０のＣＰＵ１０が主メモリにロードして実行するようにしてもよい。また、コンパイラ組み込み関数追加装置７０に供給されたコンピュータプログラムは、読み書き可能なメモリまたはハードディスク装置などの記憶媒体に格納すれば良い。本発明は、係るコンピュータプログラムあるいは記憶媒体によって構成される。 In this embodiment, a computer program for causing the computer to execute the processing procedure of the flowchart shown in FIG. 3 is supplied to the compiler built-in function adding device 70 via a predetermined medium or the like, and the computer program is added to the compiler built-in function. The CPU 10 of the device 70 may be loaded into the main memory and executed. The computer program supplied to the compiler built-in function adding device 70 may be stored in a readable / writable memory or a storage medium such as a hard disk device. The present invention is constituted by such a computer program or a storage medium.

　以下では、まず、追加命令に関する仕様記述について説明してから、図３の各ステップを説明する。 In the following, first, the specification description regarding the additional instruction will be described, and then each step of FIG. 3 will be described.

［ベースプロセッサへの追加命令に関する仕様記述］
　前述したように、コンパイラ組み込み関数追加プログラム３０への入力は、ベースプロセッサへの追加命令に関する仕様記述ファイル４０である。特に制限されないが、本実施例において、追加命令の仕様記述は、ＸＭＬ（ｅｘｔｅｎｓｉｂｌｅ　ｍａｒｋｕｐ　ｌａｎｇｕａｇｅ）に基づいている。追加命令の仕様を、ＸＭＬで記述することによって、ＸＭＬに対応した既存の字句解析器や構文解析器を使って追加命令の仕様記述を解析したり変換したりできる。そのため、追加命令の仕様記述の解析や変換が容易になる。なお、ＸＭＬ以外の言語を使ったとしても、追加命令の仕様を記述することは可能である。 [Specification description for additional instructions to base processor]
As described above, the input to the compiler built-in function addition program 30 is the specification description file 40 regarding the additional instruction to the base processor. Although not particularly limited, in the present embodiment, the specification description of the additional instruction is based on XML (extensible markup language). By describing the specification of the additional instruction in XML, it is possible to analyze or convert the specification description of the additional instruction using an existing lexical analyzer or syntax analyzer that supports XML. This facilitates the analysis and conversion of the specification description of the additional instruction. Even if a language other than XML is used, it is possible to describe the specifications of additional instructions.

　以下、ＸＭＬに基づいた追加命令の仕様記述について説明する。 The following describes the specification description of the additional instruction based on XML.

　追加命令の仕様記述のためのＸＭＬのタグは、＜ｎｉｃｋｎａｍｅ＞と＜ｉｎｓｎ＞である。 XML tags for specification description of additional instructions are <nickname> and <insn>.

　＜ｎｉｃｋｎａｍｅ＞タグは、ターゲットとなるプロセッサのニックネームを定義するタグである。ニックネームの文字数は任意であり、ニックネームとして使用可能な文字はアルファベットと数字である。＜ｎｉｃｋｎａｍｅ＞タグで定義されたニックネームは、コンパイラ組み込み関数追加プログラム３０によって生成される変数や関数などの名前の一部に使用される。 The <nickname> tag is a tag that defines a nickname of a target processor. The number of characters in the nickname is arbitrary, and the characters that can be used as the nickname are alphabets and numbers. The nickname defined by the <nickname> tag is used as a part of the name of a variable, function, or the like generated by the compiler built-in function addition program 30.

　＜ｎｉｃｋｎａｍｅ＞タグの記述例を以下に示す。 <Example of <nickname> tag description is shown below.

＜ｎｉｃｋｎａｍｅ＞ｆｏｏ＜／ｎｉｃｋｎａｍｅ＞ <Nickname> foo </ nickname>

＜ｉｎｓｎ＞タグはターゲットプロセッサの命令を定義するタグである。
＜ｉｎｓｎ＞タグの値は命令のシンタックスや語長や入力オペランドを定義するための別のタグである。 The <insn> tag is a tag that defines an instruction of the target processor.
The value of the <insn> tag is another tag for defining instruction syntax, word length, and input operand.

　＜ｉｎｓｎ＞タグの記述例を、図１４に示す。図１４に例示したように、＜ｉｎｓｎ＞タグの値として記述可能なタグは以下の六個である。 FIG. 14 shows a description example of the <insn> tag. As illustrated in FIG. 14, the following six tags can be described as values of the <insn> tag.

　＜ｍｎｅｍｏｎｉｃ＞は、命令のニモニックを定義するタグである。 <Mnemonic> is a tag that defines the mnemonic of the instruction.

　＜ｓｙｎｔａｘ＞は、命令のシンタックスを定義するタグである。 <Syntax> is a tag that defines the syntax of an instruction.

　＜ｌｅｎｇｔｈ＞は、命令の語長を定義するタグである。 <Length> is a tag that defines the word length of the instruction.

　＜ｌａｔｅｎｃｙ＞は、命令のレイテンシを定義するタグである。 <Latency> is a tag that defines the latency of the instruction.

　＜ｏｕｔｐｕｔ＞は、命令が値を書き込む出力オペランドを定義するタグである。 <Output> is a tag that defines an output operand into which an instruction writes a value.

　＜ｉｎｐｕｔ＞は、命令が使用する入力オペランドを定義するタグである。
　以下、＜ｉｎｓｎ＞タグの値として記述可能なタグについて詳しく説明する。 <Input> is a tag that defines an input operand used by the instruction.
Hereinafter, tags that can be described as values of the <insn> tag will be described in detail.

　＜ｍｎｅｍｏｎｉｃ＞タグは、命令のニモニックを定義するタグである。ニモニックの文字数は任意であり、ニモニックとして使用可能な文字はアルファベットと数字である
（図１４ではｍａｃ３２）。 The <mnemonic> tag is a tag that defines a mnemonic of an instruction. The number of mnemonic characters is arbitrary, and the characters that can be used as mnemonics are alphabets and numbers (mac32 in FIG. 14).

　ある命令のニモニックは他のどの命令のニモニックとも同じであってはならない。ニモニックでは大文字と小文字を区別しない。例えば、ニモニックＦｏｏとニモニックｆｏｏを同一であると見なす。命令のニモニックはその命令に対応する組み込み関数の名前の一部に使用される。 The mnemonic of one instruction must not be the same as the mnemonic of any other instruction. Mnemonics are not case sensitive. For example, mnemonic Foo and mnemonic foo are considered to be the same. An instruction mnemonic is used as part of the name of the built-in function corresponding to the instruction.

　＜ｓｙｎｔａｘ＞タグは、命令のシンタックスを定義するタグである。 The <syntax> tag is a tag that defines the syntax of an instruction.

　＜ｓｙｎｔａｘ＞タグの値は、命令のシンタックスを表す任意の文字列（図１４の”ｍａｃ３２　％ｒｄ　％ｒａ　％ｒｂ　％ｒｃ”）である。この文字列には、フォーマット文字列を含めることができる。フォーマット文字列とは命令オペランドの出力位置を指定するための文字列である。フォーマット文字列は％記号で始まる。％記号に続く文字列（例えばｒｄ）は、後述する＜ｏｐｅｒａｎｄ＞タグの値と一致しなければならない。＜ｏｐｅｒａｎｄ＞タグについては後で説明する。 The value of the <syntax> tag is an arbitrary character string (“mac32% rd% ra% rb% rc” in FIG. 14) representing the syntax of the instruction. This character string can include a format character string. The format character string is a character string for designating the output position of the instruction operand. The format string begins with a% symbol. A character string (for example, rd) following the% symbol must match a value of an <operand> tag described later. The <operand> tag will be described later.

　フォーマット文字列は、オペランドを表す適切な文字列で置き換えられる。例えば、レジスタを表す＜ｏｐｅｒａｎｄ＞タグに対応するフォーマット文字列はレジスタの名前に置き換えられ、即値を表す＜ｏｐｅｒａｎｄ＞タグに対応するフォーマット文字列はレジスタの名前に置き換えられる。 The format string is replaced with an appropriate string representing the operand. For example, a format character string corresponding to an <operand> tag representing a register is replaced with a register name, and a format character string corresponding to an <operand> tag representing an immediate value is replaced with a register name.

　＜ｌｅｎｇｔｈ＞タグは、命令の語長をビット単位で定義するタグである。ただし、
命令語長は８の倍数とする（図１４では３２ビット、すなわち４バイト）。 The <length> tag is a tag that defines the word length of an instruction in bits. However,
The instruction word length is a multiple of 8 (in FIG. 14, 32 bits, that is, 4 bytes).

　＜ｌａｔｅｎｃｙ＞タグは、命令のレイテンシを定義するタグである。レイテンシとは命令を実行してからその命令の実行結果が利用可能になるまでにかかるサイクル数である。命令のレイテンシは、プログラムの文脈に応じて変化することも考えられる。そのような場合は、考えられる最長のレイテンシをこのタグで定義する。 The <latency> tag is a tag that defines the latency of an instruction. Latency is the number of cycles it takes for an instruction to execute after the instruction is executed. The instruction latency may vary depending on the program context. In such cases, this tag defines the longest possible latency.

　＜ｏｕｔｐｕｔ＞タグは、命令の出力オペランドを定義するタグである。＜ｏｕｔｐｕｔ＞タグの値はオペランドを表す＜ｏｐｅｒａｎｄ＞タグである。 The <output> tag is a tag that defines an output operand of an instruction. The value of the <output> tag is an <operand> tag that represents an operand.

　＜ｉｎｐｕｔ＞タグは、命令の入力オペランドを定義するタグである。＜ｉｎｐｕｔ＞タグの値はオペランドを表す＜ｏｐｅｒａｎｄ＞タグである。 The <input> tag is a tag that defines an input operand of an instruction. The value of the <input> tag is an <operand> tag that represents an operand.

　＜ｏｐｅｒａｎｄ＞タグは、追加命令のオペランドを定義するタグである。＜ｏｐｅｒａｎｄ＞タグは、＜ｏｕｔｐｕｔ＞タグや＜ｉｎｐｕｔ＞タグの値として使用される。＜ｏｐｅｒａｎｄ＞タグの値は、オペランドの名前である。この名前は＜ｓｙｎｔａｘ＞タグのフォーマット文字列（例えば％ｒｄ）と関連している。 The <operand> tag is a tag that defines an operand of an additional instruction. The <operand> tag is used as a value of an <output> tag or an <input> tag. The value of the <operand> tag is the name of the operand. This name is associated with the format string (eg% rd) of the <syntax> tag.

　＜ｏｐｅｒａｎｄ＞タグで定義されたオペランドの名前だけを＜ｓｙｎｔａｘ＞タグで定義された命令のシンタックスにおけるフォーマット文字列として使用することができる。 Only the name of the operand defined by the <operand> tag can be used as a format character string in the syntax of the instruction defined by the <syntax> tag.

　＜ｏｐｅｒａｎｄ＞タグで定義されていないオペランドの名前をフォーマット文字列として使うことはできない。＜ｏｐｅｒａｎｄ＞タグはｔｙｐｅとｗｉｄｔｈという二つの属性をもつ。 * Operand names not defined in the <operand> tag cannot be used as format strings. The <operand> tag has two attributes, type and width.

　属性ｔｙｐｅは、オペランドの型を表す属性である。この属性の値として使用可能な名前は、ｒｅｇｉｓｔｅｒとｉｍｍｅｄｉａｔｅである。 Attribute type is an attribute that represents the operand type. The names that can be used as the value of this attribute are register and immediate.

　ｒｅｇｉｓｔｅｒは、オペランドがレジスタであることを表す。 Register indicates that the operand is a register.

　ｉｍｍｅｉｄａｔｅは、オペランドが即値であることを表す。 Immediate indicates that the operand is an immediate value.

　属性ｗｉｄｔｈはオペランドのビット幅を表す属性である。 Attribute width is an attribute that represents the bit width of the operand.

［追加命令の仕様記述の読み込み処理］
　コンパイラ組み込み関数追加装置７０（図１）における追加命令の仕様記述の読み込み処理について説明する。この処理は、図３のステップ１００である。この処理において、図１のＣＰＵ１０は、ＸＭＬで記述された追加命令の仕様記述を読み込んで、それらをコンピュータプログラムが理解可能なデータ構造に置き換える。 [Loading specification description of additional instructions]
Processing for reading the specification description of the additional instruction in the compiler built-in function adding device 70 (FIG. 1) will be described. This process is step 100 in FIG. In this process, the CPU 10 in FIG. 1 reads the specification description of the additional instruction described in XML and replaces it with a data structure that can be understood by the computer program.

　追加命令の仕様記述は、ＸＭＬの言語仕様に基づき、＜ｎｉｃｋｎａｍｅ＞タグや＜ｉｎｓｎ＞タグで記述されている。したがって、ＸＭＬのための一般的な字句解析器と構文解析器とを使って、図１のＣＰＵ１０は、追加命令の仕様記述をコンピュータプログラムが理解可能なデータ構造に置き換える。置き換えられたデータ構造は、追加命令仕様記述ファイル４０に記述されたタグに一対一に対応する。したがって、以降は、該データ構造の代わりにタグを参照して、図１の動作を説明する。 The specification description of the additional instruction is described by a <nickname> tag or an <insn> tag based on the XML language specification. Therefore, using a general lexical analyzer and syntax analyzer for XML, the CPU 10 in FIG. 1 replaces the specification description of the additional instruction with a data structure understandable by the computer program. The replaced data structure corresponds to the tags described in the additional instruction specification description file 40 on a one-to-one basis. Therefore, hereinafter, the operation of FIG. 1 will be described with reference to a tag instead of the data structure.

［ＲＴＬテンプレート生成処理］
　コンパイラ組み込み関数追加装置７０における追加命令のためのＲＴＬテンプレート生成処理について説明する。この処理は図３のステップ２００である。 [RTL template generation processing]
An RTL template generation process for an additional instruction in the compiler built-in function adding device 70 will be described. This process is step 200 in FIG.

　ＲＴＬ（ｒｅｇｉｓｔｅｒ　ｔｒａｎｓｆｅｒ　ｌａｎｇｕａｇｅ）とは、コンパイラにおける中間言語である。コンパイラはＣ／Ｃ＋＋などの高級言語で記述されたソースコードをＲＴＬと呼ばれる中間言語に変換し、ＲＴＬに基づいて、プロセッサの命令を出力する。ＲＴＬとプロセッサの命令は一対一に対応している。ＲＴＬテンプレートとはプロセッサの命令をＲＴＬで表現したものである。一般的に、コンパイラがプロセッサの命令の動作を理解できるように、ＲＴＬテンプレートを記述する。 RTL (register transfer language) is an intermediate language in the compiler. The compiler converts source code described in a high-level language such as C / C ++ into an intermediate language called RTL, and outputs a processor instruction based on the RTL. There is a one-to-one correspondence between RTL and processor instructions. An RTL template is a representation of processor instructions in RTL. Generally, RTL templates are written so that the compiler can understand the operation of the processor instructions.

　しかし、本実施例では、追加命令の正確な動作はＲＴＬテンプレートには記述しない。本実施例においては、コンパイラが追加命令の入出力オペランドだけを正しく理解できるようなＲＴＬテンプレートを出力する。このようにすることで、追加命令の動作をＲＴＬテンプレートで正しく表現する、という複雑な問題を回避する。 However, in this embodiment, the exact operation of the additional instruction is not described in the RTL template. In this embodiment, the RTL template is output so that the compiler can correctly understand only the input / output operands of the additional instruction. By doing so, the complicated problem of correctly expressing the operation of the additional instruction with the RTL template is avoided.

　追加命令のためのＲＴＬテンプレートの例を、図１５に示す。図１５は、積和演算命令のためのＲＴＬテンプレートである。 FIG. 15 shows an example of the RTL template for the additional instruction. FIG. 15 is an RTL template for a product-sum operation instruction.

　ＲＴＬテンプレートは、ｄｅｆｉｎｅ＿ｉｎｓｎという式で記述される。ｄｅｆｉｎｅ＿ｉｎｓｎ式は、Ｌｉｓｐ言語の式に似ている。なお、ｄｅｆｉｎｅ＿ｉｎｓｎ式の詳しい説明については、非特許文献２の第２３９ページ、“１４　Ｍａｃｈｉｎｅ　Ｄｅｓｃｒｉｐｔｉｏｎ”，又は非特許文献２の第１７８頁の“機械記述”の記載が参照される。 The RTL template is described by an expression “define_insn”. The define_insn expression is similar to an expression in the Lisp language. For a detailed description of the define_insn formula, refer to the description on page 239 of Non-Patent Document 2, “14 Machine Description”, or “Machine Description” on page 178 of Non-Patent Document 2.

　本実施例における追加命令のためのＲＴＬテンプレートは、
（１）ＲＴＬテンプレートの名前、
（２）出力オペランド、
（３）入力オペランド、
（４）追加命令の動作記述、
（５）追加命令のシンタックス、
（６）追加命令の語長、
（７）追加命令のニモニック、
　の７個の要素を含む。 The RTL template for additional instructions in this example is
(1) RTL template name,
(2) Output operand,
(3) input operands,
(4) Action description of additional instructions,
(5) Additional instruction syntax,
(6) Word length of the additional instruction,
(7) Additional instruction mnemonics,
7 elements are included.

　図４は、本実施例におけるＲＴＬテンプレート生成処理（図３の２００）の手順を示すフローチャートである。上記の７個の要素を生成するために、ＲＴＬテンプレート生成処理は、
　入出力オペランド番号の決定するステップ２１０と、
　ＲＴＬテンプレートの名前を生成するステップ２２０と、
　出力オペランドを生成するステップ２３０と、
　入力オペランドを生成するステップ２４０と、
　追加命令の動作記述を生成するステップ２５０と、
　追加命令のシンタックスを生成するステップ２６０と、
　追加命令の語長定義文を生成するステップ２７０と、
　追加命令のニモニック定義文を生成するステップ２８０と、
を含む。 FIG. 4 is a flowchart showing the procedure of the RTL template generation process (200 in FIG. 3) in the present embodiment. In order to generate the above seven elements, the RTL template generation process
A step 210 of determining input / output operand numbers;
Generating a name of the RTL template 220;
Generating 230 an output operand;
Generating an input operand 240;
Generating an operation description of the additional instruction 250;
Generating 260 additional instruction syntax;
Generating a word length definition sentence of the additional command; 270;
Generating a mnemonic definition statement of the additional instruction, 280;
including.

　まず、入出力オペランド番号の決定（ステップ２１０）について説明する。 First, the determination of the input / output operand number (step 210) will be described.

　非特許文献２の第２４１頁の“１４．４　ＲＴＬ　Ｔｅｍｐｌａｔｅ”又は非特許文献３の第１８０頁の“ＲＴＬテンプレート”によると、ＲＴＬテンプレートにおいて、出力オペランドや入力オペランドなどの各オペランドには０から始まる番号を与えることになっている。 According to “14.4 RTL Template” on page 241 of Non-Patent Document 2 or “RTL Template” on page 180 of Non-Patent Document 3, in the RTL template, each operand such as an output operand or input operand starts from 0. It is to be given a number that begins.

　本実施例において、ＲＴＬテンプレート生成処理（図３のステップ２００）では、図１のＣＰＵ１０は追加命令の仕様記述に基づいて各オペランドへ番号を与える。ステップ２１０は、その番号を決定する処理である。 In this embodiment, in the RTL template generation process (step 200 in FIG. 3), the CPU 10 in FIG. 1 assigns a number to each operand based on the specification description of the additional instruction. Step 210 is a process for determining the number.

　本実施例において、ＲＴＬテンプレート生成処理（図３のステップ２００）では、図１のＣＰＵ１０は、追加命令の仕様記述における＜ｏｐｅｒａｎｄ＞タグに基づいて追加命令のＲＴＬテンプレートにおける出力オペランドや入力オペランドを決定する。 In the present embodiment, in the RTL template generation process (step 200 in FIG. 3), the CPU 10 in FIG. 1 determines the output operand and input operand in the RTL template of the additional instruction based on the <operand> tag in the specification description of the additional instruction. To do.

　出力オペランドは、＜ｏｕｔｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグによって、また入力オペランドは、＜ｉｎｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグによって、それぞれ決定される。 The output operand is determined by the <operand> tag described inside the <output> tag, and the input operand is determined by the <operand> tag described inside the <input> tag.

　コンパイラ組み込み関数追加プログラム３０は、以下の手順で、ＲＴＬテンプレートのオペランドの番号ｏｐ＿ｉｎｄｅｘを決める。 The compiler built-in function addition program 30 determines the operand number op_index of the RTL template according to the following procedure.

　図５は、ＲＴＬテンプレートのオペランドの番号ｏｐ＿ｉｎｄｅｘを決める手順を説明するフローチャートである。 FIG. 5 is a flowchart for explaining the procedure for determining the operand number op_index of the RTL template.

　＜ｏｕｔｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグの数をＰ個とする（ステップ２１１）。 Suppose that the number of <operand> tags described inside the <output> tag is P (step 211).

　次に、＜ｉｎｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグの数をＱ個とする（ステップ２１２）。 Next, let the number of <operand> tags described inside the <input> tag be Q (step 212).

　＜ｏｕｔｐｕｔ＞タグの内側に記述された全ての＜ｏｐｅｒａｎｄ＞タグに０からＰ－１までの番号ｏｐ＿ｉｎｄｅｘをつける（ステップ２１３）。 <A number op_index from 0 to P-1 is assigned to all <operand> tags described inside the <output> tag (step 213).

　＜ｉｎｐｕｔ＞タグの内側に記述された全ての＜ｏｐｅｒａｎｄ＞タグにＰからＰ＋Ｑ－１までの番号ｏｐ＿ｉｎｄｅｘをつける（ステップ２１４）。 <Numbers op_index from P to P + Q-1 are assigned to all <operand> tags described inside the <input> tag (step 214).

　次に、ＲＴＬテンプレートの名前生成（ステップ２２０）について説明する。 Next, name generation (step 220) of the RTL template will be described.

　ステップ２２０は、追加命令に対応するＲＴＬテンプレートの名前を決定する処理である。ＲＴＬテンプレート生成処理では、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートの名前を追加命令の仕様記述における＜ｎｉｃｋｎａｍｅ＞タグと＜ｍｎｅｍｏｎｉｃ＞タグに基づいて決定する。 Step 220 is a process for determining the name of the RTL template corresponding to the additional command. In the RTL template generation process, the compiler built-in function addition program 30 determines the name of the RTL template based on the <nickname> tag and the <mnemonic> tag in the specification description of the additional instruction.

　ＲＴＬテンプレートの名前の形式は、ｂｕｉｌｔｉｎ＿ｎｋｎａｍｅ＿ｍｎｅｍである。ここで、ｎｋｎａｍｅは、＜ｎｉｃｋｎａｍｅ＞タグで定義されたプロセッサのニックネームである。ｍｎｅｍは、＜ｍｎｅｍｏｎｉｃ＞タグで定義された追加命令のニモニックである。 The format of the name of the RTL template is builtin_nkname_mnem. Here, nkname is the nickname of the processor defined by the <nickname> tag. mnem is a mnemonic of an additional instruction defined by the <mnemonic> tag.

　次に、図４の出力オペランドの生成（ステップ２３０）について説明する。図４のステップ２３０は追加命令に対応するＲＴＬテンプレートの出力オペランドを生成する処理である。 Next, the generation of the output operand (step 230) in FIG. 4 will be described. Step 230 in FIG. 4 is processing for generating an output operand of the RTL template corresponding to the additional instruction.

　出力オペランドは追加命令の演算結果を格納するオペランドを表すので、一般的には出力オペランドはレジスタである。 Since the output operand represents an operand that stores the operation result of the additional instruction, the output operand is generally a register.

　追加命令の仕様記述において、＜ｏｕｔｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグの属性ｔｙｐｅの値がレジスタ（ｒｅｇｉｓｔｅｒ）である場合、コンパイラ組み込み関数追加プログラム３０は、当該＜ｏｐｅｒａｎｄ＞タグに対応する出力オペランドの形式を、
　(match_operand:modenameshort op_index “register_operand” “=r”)　　・・・（１）
　とする。 In the specification description of the additional instruction, if the value of the attribute type of the <operand> tag described inside the <output> tag is a register, the compiler built-in function addition program 30 corresponds to the <operand> tag. The format of the output operand to
(match_operand: modenameshort op_index “register_operand” “= r”) (1)
And

　（１）において、ｍｏｄｅｎａｍｅｓｈｏｒｔは、出力オペランドの語長を表す文字である。 (1) “modenameshort” is a character representing the word length of the output operand.

　ｍｏｄｅｎａｍｅｓｈｏｒｔは、＜ｏｐｅｒａｎｄ＞タグの属性ｗｉｄｔｈに基づいて決定される。図７に、属性ｗｉｄｔｈとｍｏｄｅｎａｍｅｓｈｏｒｔの関係を示す。 “Modenameshort” is determined based on the attribute width of the <operand> tag. FIG. 7 shows the relationship between the attribute width and the modemshort.

　ｏｐ＿ｉｎｄｅｘは、出力オペランドの番号を表す数字である。このｏｐ＿ｉｎｄｅｘはステップ２１０において決定された番号である。 Op_index is a number representing the number of the output operand. This op_index is the number determined in step 210.

　次に、図４の入力オペランドの生成（ステップ２４０）について説明する。図４のステップ２４０は、追加命令に対応するＲＴＬテンプレートの入力オペランドを生成する処理である。 Next, the generation of the input operand (step 240) in FIG. 4 will be described. Step 240 in FIG. 4 is processing for generating an input operand of the RTL template corresponding to the additional instruction.

　ステップ２４０は、レジスタと即値の二種類の入力オペランドを扱う。 Step 240 handles two types of input operands: registers and immediate values.

　追加命令の仕様記述において、＜ｉｎｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグの属性ｔｙｐｅがレジスタ（ｒｅｇｉｓｔｅｒ）である場合、コンパイラ組み込み関数追加プログラム３０は、当該＜ｏｐｅｒａｎｄ＞タグに対応する入力オペランドの形式を、
　(match_operand:modenameshort op_index “register_operand” “r”)　　・（２）
　とする。 In the specification description of the additional instruction, when the attribute type of the <operand> tag described inside the <input> tag is a register, the compiler built-in function addition program 30 inputs corresponding to the <operand> tag. Operand format
(match_operand: modenameshort op_index “register_operand” “r”) (2)
And

　この形式は、（１）のレジスタの出力オペランドに似ている。 This format is similar to the output operand of register (1).

　出力オペランドの形式では“＝ｒ”となる部分が、入力オペランドの形式では“ｒ”となる。 The part that becomes “= r” in the format of the output operand becomes “r” in the format of the input operand.

　次に即値の入力オペランドについて説明する。 Next, immediate input operands are explained.

　＜ｉｎｐｕｔ＞タグの内側に記述された＜ｏｐｅｒａｎｄ＞タグの属性ｔｙｐｅが即値（ｉｍｍｅｄｉａｔｅ）の場合、コンパイラ組み込み関数追加プログラム３０は、その＜ｏｐｅｒａｎｄ＞タグに対応する出力オペランドの形式を
　(match_operand:SI op_index “immediate_operand” “n”)　　　　　・・・（３）
　とする。 When the attribute type of the <operand> tag described inside the <input> tag is an immediate value, the compiler built-in function addition program 30 sets the format of the output operand corresponding to the <operand> tag to (match_operand: SI op_index “immediate_operand” “n”) (3)
And

　次に、図４の追加命令の動作記述の生成（ステップ２５０）について説明する。図４のステップ２５０はＲＴＬテンプレートにおける追加命令の動作を表す記述を生成する処理である。 Next, the generation (step 250) of the operation description of the additional instruction in FIG. 4 will be described. Step 250 in FIG. 4 is processing for generating a description representing the operation of the additional instruction in the RTL template.

　ステップ２５０において、コンパイラ組み込み関数追加プログラム３０は、追加命令が不明な演算をするものとして、追加命令の動作記述を生成する。 In step 250, the compiler built-in function addition program 30 generates an operation description of the additional instruction on the assumption that the additional instruction performs an unknown operation.

　ステップ２５０が生成する追加命令の動作記述の形式は、以下の通りである。 The format of the operation description of the additional instruction generated by step 250 is as follows.

　(set output_operand (unspec:VOID [all_input_operands] unspec_insn_id))　　・・・（４） (Set output_operand (unspec: VOID [all_input_operands] unspec_insn_id)) (4)

　この形式は、ａｌｌ＿ｉｎｐｕｔ＿ｏｐｅｒａｎｄｓを入力オペランドとして、番号ｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄで表される不明な演算の演算結果を出力オペランドｏｕｔｐｕｔ＿ｏｐｅｒａｎｄへ代入することを表す。 This format represents that all_input_operands is used as an input operand, and an operation result of an unknown operation represented by the number unspec_insn_id is assigned to the output operand output_operand.

　（４）において、ｕｎｓｐｅｃは、コンパイラにとって不明な演算を表す演算子である。 In (4), unspec is an operator representing an operation unknown to the compiler.

　ｏｕｔｐｕｔ＿ｏｐｅｒａｎｄは、出力オペランドを表す。 Output_operand represents an output operand.

　ａｌｌ＿ｉｎｐｕｔ＿ｏｐｅｒａｎｄｓは、全ての入力オペランドを表す。 All_input_operands represents all input operands.

　ｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄは、追加命令の不明な演算を区別するための番号を表す。各追加命令ごとにｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄの値は異なる。 Unspec_insn_id represents a number for distinguishing an unknown operation of an additional instruction. The value of unspec_insn_id differs for each additional instruction.

　次に、図４の追加命令のシンタックスの生成（ステップ２６０）について説明する。図４のステップ２６０はＲＴＬテンプレートにおける追加命令のシンタックスを生成する処理である。 Next, generation of the syntax of the additional instruction in FIG. 4 (step 260) will be described. Step 260 in FIG. 4 is a process for generating the syntax of the additional instruction in the RTL template.

　追加命令の仕様記述における＜ｓｙｎｔａｘ＞タグで定義されたシンタックスに基づいて、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートにおける追加命令のシンタックスを生成する。 Based on the syntax defined by the <syntax> tag in the specification description of the additional instruction, the compiler built-in function addition program 30 generates the syntax of the additional instruction in the RTL template.

　＜ｓｙｎｔａｘ＞タグで定義されたシンタックスは、フォーマット文字列を含む。フォーマット文字列は、＜ｏｐｅｒａｎｄ＞タグで定義された追加命令のオペランドと一対一に対応する文字列である。 The syntax defined by the <syntax> tag includes a format character string. The format character string is a character string that has a one-to-one correspondence with the operand of the additional instruction defined by the <operand> tag.

　コンパイラ組み込み関数追加プログラム３０は、＜ｓｙｎｔａｘ＞タグで定義されたシンタックスに含まれるフォーマット文字列を、ＲＴＬテンプレートのオペランドを表す番号（前述のｏｐ＿ｉｎｄｅｘ）で置き換えたものを、ＲＴＬテンプレートにおける追加命令のシンタックスとする。番号ｏｐ＿ｉｎｄｅｘは、＜ｏｐｅｒａｎｄ＞タグに基づいてＲＴＬテンプレート生成処理が決定する番号であることから、番号ｏｐ＿ｉｎｄｅｘはフォーマット文字列と一対一に対応する。 The compiler built-in function addition program 30 replaces the format character string included in the syntax defined by the <syntax> tag with a number (op_index described above) representing the operand of the RTL template, and adds an additional instruction in the RTL template. Use syntax. Since the number op_index is a number determined by the RTL template generation process based on the <operand> tag, the number op_index has a one-to-one correspondence with the format character string.

　次に、図４の追加命令の語長定義文の生成（ステップ２７０）について説明する。図４のステップ２７０は、ＲＴＬテンプレートにおける追加命令の語長を定義する記述を生成する処理である。 Next, generation of a word length definition sentence (step 270) of the additional instruction in FIG. 4 will be described. Step 270 in FIG. 4 is processing for generating a description that defines the word length of the additional instruction in the RTL template.

　追加命令の仕様記述における＜ｌｅｎｇｔｈ＞タグで定義された語長に基づいて、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートにおける追加命令の語長を次のような形式で出力する。 Based on the word length defined by the <length> tag in the specification description of the additional instruction, the compiler built-in function addition program 30 outputs the word length of the additional instruction in the RTL template in the following format.

　(set_attr“length”“len”)　　　　　　　　　　・・・（５）
　ここで、ｌｅｎは、＜ｌｅｎｇｔｈ＞タグで定義された語長を表す数字である。ＲＴＬテンプレートにおける追加命令の語長はコンパイラがコードサイズを計算するために使われる。 (set_attr “length” “len”) (5)
Here, len is a number representing the word length defined by the <length> tag. The word length of the additional instruction in the RTL template is used by the compiler to calculate the code size.

　次に、図４の追加命令のニモニック定義文の生成（ステップ２８０）について説明する。図４のステップ２８０はＲＴＬテンプレートにおける追加命令のニモニックを定義する記述を生成する処理である。 Next, generation of a mnemonic definition sentence (step 280) of the additional instruction in FIG. 4 will be described. Step 280 in FIG. 4 is processing for generating a description that defines the mnemonic of the additional instruction in the RTL template.

　追加命令の仕様記述における＜ｍｎｅｍｏｎｉｃ＞タグで定義されたニモニックにもとづいて、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートにおける追加命令のニモニック定義文を以下の形式で出力する。 Based on the mnemonic defined by the <mnemonic> tag in the specification description of the additional instruction, the compiler built-in function addition program 30 outputs the mnemonic definition statement of the additional instruction in the RTL template in the following format.

　(set_attr“mnemonic”“mnem”)　　　　　　　・・・（６）
　ここで、ｍｎｅｍは＜ｍｎｅｍｏｎｉｃ＞タグで定義されたニモニックを表す文字列である。 (set_attr “mnemonic” “mnem”) (6)
Here, mnem is a character string representing a mnemonic defined by the <mnemonic> tag.

　コンパイラ組み込み関数追加プログラム３０は、レイテンシ定義文生成処理は追加命令のレイテンシ定義文を生成するために、ＲＴＬテンプレートにおける追加命令のニモニックを追加命令の識別コードとして使用する。 The compiler built-in function addition program 30 uses the mnemonic of the additional instruction in the RTL template as the identification code of the additional instruction in the latency definition sentence generation process to generate the latency definition sentence of the additional instruction.

［レイテンシ定義文生成処理］
　本実施例のコンパイラ組み込み関数追加装置における追加命令のためのレイテンシ定義文生成処理（図３のステップ３００）について説明する。レイテンシ定義文は、コンパイラの命令スケジューラへ追加命令のレイテンシを知らせるための定義文である。レイテンシ定義文生成処理は以下の３種類の式を使用する。 [Latency definition statement generation processing]
A latency definition statement generation process (step 300 in FIG. 3) for an additional instruction in the compiler built-in function adding device of this embodiment will be described. The latency definition statement is a definition statement for informing the instruction scheduler of the compiler of the latency of the additional instruction. The latency definition sentence generation process uses the following three types of expressions.

　define_automaton 　　　　　　　　　　　　　　　・・・（７）
　define_cpu_unit　　　　　　　　　　　　　　　　・・・（８）
　define_insn_reservation　　　　　　　　　　　　・・・（９） define_automaton (7)
define_cpu_unit (8)
define_insn_reservation (9)

　ｄｅｆｉｎｅ＿ａｕｔｏｍａｔｏｎ式は、コンパイラの命令スケジューラが使用する状態遷移マシンの名前を定義する。 The define_automaton expression defines the name of the state transition machine used by the compiler instruction scheduler.

　ｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式は、状態遷移マシンが管理すべきプロセッサの実行ユニットを定義する。 The define_cpu_unit expression defines an execution unit of a processor to be managed by the state transition machine.

　ｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式は、命令のレインシと命令が使用する実行ユニットを定義する。 The define_insn_reservation expression defines an instruction train and an execution unit used by the instruction.

　つまり、ｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式によってプロセッサがもつ実行ユニットをコンパイラの命令スケジューラへ知らせる。 That is, the execution unit of the processor is notified to the instruction scheduler of the compiler by the define_cpu_unit expression.

　ｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式によってどの命令がどの実行ユニットを使用するかをコンパイラの命令スケジューラへ知らせる。 It tells the instruction scheduler of the compiler which instruction uses which execution unit by the define_insn_reservation expression.

　ｄｅｆｉｎｅ＿ａｕｔｏｍａｔｏｎ式で定義された名前の状態遷移マシンで、実行ユニットの使用状況と命令のレイテンシを管理する。 The state transition machine with the name defined by the define_automaton expression manages the usage status of the execution unit and the latency of the instruction.

　これらの式の詳しい説明については非特許文献２の第３２７頁の“１４．１９．８　Ｓｐｅｃｉｆｉｙｉｎｇ　ｐｒｏｃｅｓｓｏｒ　ｐｉｐｅｌｉｎｅ　ｄｅｓｃｒｉｐｔｉｏｎ”の記載が参照される。 For a detailed description of these equations, refer to the description of “14.19.8 Specifying processor pipeline description” on page 327 of Non-Patent Document 2.

　図６は、図３のレイテンシ定義文生成処理（ステップ３００）の処理手順を示すフローチャートである。レイテンシ定義文生成処理は、
　ｄｅｆｉｎｅ＿ａｕｔｏｍａｔｏｎ式を生成するステップ３１０と、
　ｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式を生成するステップ３２０と、
　ｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式を生成するステップ３３０と、
　を含む。 FIG. 6 is a flowchart showing a processing procedure of the latency definition sentence generation process (step 300) of FIG. Latency definition statement generation processing
generating 310 a define_automaton expression;
generating a define_cpu_unit expression 320;
generating 330 a define_insn_reservation expression;
including.

　ステップ３３０は全ての追加命令に対して実行される。 Step 330 is executed for all additional instructions.

　追加命令のためのレイテンシ定義文生成処理において、コンパイラ組み込み関数追加プログラム３０は、
　一つのｄｅｆｉｎｅ＿ａｕｔｏｍａｔｏｎ式と、
　一つのｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式と、
　追加命令の個数分のｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式と、
　を生成する。 In the latency definition statement generation process for the additional instruction, the compiler built-in function additional program 30
One define_automaton expression,
One define_cpu_unit expression,
Define_insn_reservation expression for the number of additional instructions,
Is generated.

　追加命令の仕様記述には、プロセッサの実行ユニットに関する情報や追加命令がどの実行ユニットをいつ使用するかという情報は含まれない。 The specification description of the additional instruction does not include information on the execution unit of the processor or information on which execution unit the additional instruction uses.

　追加命令のためのレイテンシ定義文生成処理では、コンパイラ組み込み関数追加プログラム３０は、全ての追加命令が同じ実行ユニットを使用すると仮定して、上記の３つの式を生成する。 In the latency definition statement generation process for the additional instruction, the compiler built-in function addition program 30 generates the above three expressions on the assumption that all the additional instructions use the same execution unit.

　ステップ３１０とステップ３２０とをまとめて説明する。追加命令のためのレイテンシ定義文生成処理が生成するｄｅｆｉｎｅ＿ａｕｔｏｍａｔｏｎ式の形式は、以下の（１０）で与えられる。 Step 310 and step 320 will be described together. The format of the define_automaton expression generated by the latency definition sentence generation processing for the additional instruction is given by the following (10).

　(define_automaton“dfa_builtin_insns”)　　　　　　　　　　　・・・（１０） (Define_automaton “dfa_builtin_insns”) (10)

　この式の第一引数は、状態遷移マシンの名前を表す。 The first argument of this expression represents the name of the state transition machine.

　追加命令のためのレイテンシ定義文生成処理が生成するｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式の形式は、以下の（１１）で与えられる。 The format of the define_cpu_unit expression generated by the latency definition sentence generation process for the additional instruction is given by the following (11).

　(define_cpu_unit“unit_builtin”“dfa_builtin_insns”)　　　　・・・（１１） (Define_cpu_unit “unit_builtin” “dfa_builtin_insns”) (11)

　この式の第一引数はプロセッサの実行ユニットの名前を、第二引数は状態遷移マシンの名前を、それぞれ表す。 The first argument of this expression represents the name of the processor execution unit, and the second argument represents the name of the state transition machine.

　追加命令のためのレイテンシ定義文生成処理は、架空の実行ユニット“ｕｎｉｔ＿ｂｕｉｌｔｉｎ”を定義する。 The latency definition statement generation process for the additional instruction defines a fictitious execution unit “unit_builtin”.

　これらの２つの形式は全ての追加命令に共通して使用され、どの追加命令の仕様記述にも依存しない。 These two formats are used in common for all additional instructions and do not depend on the specification description of any additional instructions.

　次に、ステップ３３０について説明する。追加命令のためのレイテンシ定義文生成処理が生成するｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式の形式は、以下の式で与えられる。 Next, step 330 will be described. The format of the define_insn_reservation expression generated by the latency definition sentence generation process for the additional instruction is given by the following expression.

(define_insn_reservation“mnem”ltcy　(eq_attr“mnemonic”“mnem”)“unit_builtin,nothing*ltcy_minus_1”)　　　　　　　　　　　　・・・（１２） (Define_insn_reservation “mnem” ltcy (eq_attr “mnemonic” “mnem”) “unit_builtin, nothing * ltcy_minus_1”)) (12)

　この式の第一引数はこの式全体に与えられた名前、
　第二引数はレイテンシのサイクル数、
　第三引数はこの式が当てはまる追加命令の条件、
　第四引数は追加命令が使用する実行ユニット、
　をそれぞれ表す。 The first argument of this expression is the name given to this whole expression,
The second argument is the number of latency cycles,
The third argument is the condition of the additional command to which this expression applies,
The fourth argument is the execution unit used by the additional instruction,
Respectively.

　ここで、ｍｎｅｍは、追加命令の仕様記述において、＜ｍｎｅｍｏｎｉｃ＞タグで定義された追加命令のニモニックである。 Here, mnem is a mnemonic of the additional instruction defined in the <mnemonic> tag in the specification description of the additional instruction.

　ｌｔｃｙは、＜ｌａｔｅｎｃｙ＞タグで定義された追加命令のレイテンシを表す値である。 Ltcy is a value representing the latency of the additional instruction defined by the <latency> tag.

　ｌｔｃｙ＿ｍｉｎｕｓ＿１は、ｌｔｃｙから１を引いた値である。 Ltcy_minus_1 is a value obtained by subtracting 1 from ltcy.

　この形式では、追加命令のｄｅｆｉｎｅ＿ｉｎｓｎ式で定義された属性ｍｎｅｍｏｎｉｃを使ってこの式に当てはまる追加命令の条件を指定する。 In this format, the condition of the additional instruction that applies to this expression is specified by using the attribute mnemonic defined in the define_insn expression of the additional instruction.

　さらに、追加命令が使用する実行ユニットは、ｄｅｆｉｎｅ＿ｃｐｕ＿ｕｎｉｔ式で定義された実行ユニット“ｕｎｉｔ＿ｂｕｉｌｔｉｎ”だけを最初のサイクルで使用し、その後のサイクルでは何も実行ユニットを使用しないものとする。 Further, the execution unit used by the additional instruction uses only the execution unit “unit_builtin” defined by the define_cpu_unit expression in the first cycle, and does not use any execution unit in the subsequent cycles.

　追加命令のレイテンシｌｔｃｙが１である場合、最初のサイクルで使用される実行ユニットだけを使用することを表すために、コンパイラ組み込み関数追加プログラム３０は、ｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式の第４引数を、“ｕｎｉｔ＿ｂｕｉｌｔｉｎ”とする。 When the latency of the additional instruction is 1, the compiler built-in function adding program 30 sets the fourth argument of the define_insn_reservation expression as “unit_builtin” to indicate that only the execution unit used in the first cycle is used. To do.

　追加命令のためのレイテンシ定義文生成処理では、コンパイラ組み込み関数追加プログラム３０は、全ての追加命令についてこの形式にしたがってｄｅｆｉｎｅ＿ｉｎｓｎ＿ｒｅｓｅｒｖａｔｉｏｎ式を生成する。 In the latency definition statement generation process for the additional instruction, the compiler built-in function addition program 30 generates a define_insn_reservation expression for all the additional instructions according to this format.

［組み込み関数プロトタイプ宣言生成処理］
　コンパイラ組み込み関数追加装置における追加命令のための組み込み関数プロトタイプ宣言生成処理（図３のステップ４００）について説明する。追加命令のための組み込み関数プロトタイプ宣言生成処理は、コンパイラのための組み込み関数のプロトタイプ宣言を生成する処理である。 [Built-in function prototype declaration generation processing]
An embedded function prototype declaration generation process (step 400 in FIG. 3) for an additional instruction in the compiler embedded function adding device will be described. The built-in function prototype declaration generation process for the additional instruction is a process of generating a built-in function prototype declaration for the compiler.

　組み込み関数のプロトタイプ宣言とは、組み込み関数がどんな引数をもつかとか組み込み関数の戻り値は何かなどを表す情報である。 The prototype declaration of the built-in function is information indicating what arguments the built-in function has and what the return value of the built-in function is.

　組み込み関数のプロトタイプ宣言をコンパイラへ組み込むことによって、コンパイラは組み込み関数を組み込み関数として認識することができる。 * By incorporating the prototype declaration of an embedded function into the compiler, the compiler can recognize the embedded function as an embedded function.

　組み込み関数プロトタイプ宣言生成処理において、コンパイラ組み込み関数追加プログラム３０は、組み込み関数のプロトタイプ宣言をコンパイラへ教える関数の名前を定義する定義文と、その関数の内容と、を生成する。 In the built-in function prototype declaration generation process, the compiler built-in function addition program 30 generates a definition statement that defines the name of a function that tells the compiler the prototype declaration of the built-in function and the contents of the function.

　組み込み関数のプロトタイプ宣言をコンパイラへ教える関数の名前の定義文の形式を以下に示す（図８）。
#undef　TARGET_INIT_BUILTINS
#define　TARGET_INIT_BUILTINS　target_init_builtins　　　　　　・・・（１３） The format of a function name definition statement that tells the compiler the prototype declaration of an embedded function is shown below (FIG. 8).
#undef TARGET_INIT_BUILTINS
#define TARGET_INIT_BUILTINS target_init_builtins (13)

　この定義文は、追加命令の仕様記述には依存しない。どんな追加命令に対してもこの定義文が使用される。 This definition statement does not depend on the specification description of the additional instruction. This definition statement is used for any additional command.

　この定義文に対応する関数ｔａｒｇｅｔ＿ｉｎｉｔ＿ｂｕｉｌｔｉｎｓ（）の内容は、図１０のようになる。 The contents of the function target_init_builds () corresponding to this definition statement are as shown in FIG.

　関数ｔａｒｇｅｔ＿ｉｎｉｔ＿ｂｕｉｌｔｉｎｓ（）は引数も戻り値も持たない関数であり、その内部に記述された各追加命令に対応する組み込み関数のプロトタイプ宣言を実行する。 The function target_init_builds () is a function having no argument and no return value, and executes a prototype declaration of a built-in function corresponding to each additional instruction described therein.

　さらに、組み込み関数プロトタイプ宣言生成処理では、コンパイラ組み込み関数追加プログラム３０は、全ての追加命令に対応する組み込み関数のプロトタイプ宣言の定義文を、関数ｔａｒｇｅｔ＿ｉｎｉｔ＿ｂｕｉｌｔｉｎｓ（）の内容として生成する。 Furthermore, in the built-in function prototype declaration generation process, the compiler built-in function addition program 30 generates a definition statement of the prototype declaration of the built-in function corresponding to all the additional instructions as the contents of the function target_init_builds ().

　組み込み関数プロトタイプ宣言の定義文の生成方法について説明する。組み込み関数プロトタイプ宣言生成処理では、コンパイラ組み込み関数追加プログラム３０は、追加命令の出力オペランドの数に基づいて追加命令に対応する組み込み関数の戻り値と引数を決定する。 Describes how to generate definition statements for built-in function prototype declarations. In the built-in function prototype declaration generation process, the compiler built-in function addition program 30 determines a return value and an argument of the built-in function corresponding to the additional instruction based on the number of output operands of the additional instruction.

　追加命令の出力オペランドの数が１個の場合、コンパイラ組み込み関数追加プログラム３０は、組み込み関数プロトタイプ宣言生成処理はその出力オペランドを組み込み関数の戻り値とし、その追加命令の全ての入力オペランドを組み込み関数の引数とする。 When the number of output operands of the additional instruction is one, the compiler built-in function additional program 30 causes the built-in function prototype declaration generation processing to use the output operand as the return value of the built-in function, and to set all input operands of the additional instruction as built-in functions As an argument.

　追加命令の出力オペランドの数が０個か２個以上の場合、コンパイラ組み込み関数追加プログラム３０は、組み込み関数プロトタイプ宣言生成処理は組み込み関数は戻り値をもたないことにし、追加命令の全ての出力オペランドと入力オペランドとを組み込み関数の引数とする。 When the number of output operands of the additional instruction is 0 or 2 or more, the compiler intrinsic function additional program 30 determines that the intrinsic function prototype declaration generation process does not have a return value for the intrinsic function, and outputs all the additional instructions. Operands and input operands are built-in function arguments.

　次に、組み込み関数の引数の順番について説明する。組み込み関数の戻り値と引数の決定方法に基づいて組み込み関数がどんな引数を持つかを決定した後に、コンパイラ組み込み関数追加プログラム３０は、組み込み関数プロトタイプ宣言生成処理（図３のステップ４００）において、前述のｄｅｆｉｎｅ＿ｉｎｓｎ式におけるオペランドの番号ｏｐ＿ｉｎｄｅｘに基づいて、引数に対応するオペランドの番号ｏｐ＿ｉｎｄｅｘが小さいものから順番に引数を並べる。 Next, the order of arguments of built-in functions will be described. After determining what arguments the intrinsic function has based on the return value of the intrinsic function and the method of determining the argument, the compiler intrinsic function addition program 30 performs the above-mentioned in the intrinsic function prototype declaration generation process (step 400 in FIG. 3). Based on the operand number op_index in the define_insn expression, the arguments are arranged in order from the smallest operand number op_index corresponding to the argument.

　つまり、組み込み関数プロトタイプ宣言生成処理では、コンパイラ組み込み関数追加プログラム３０は、最も小さな番号ｏｐ＿ｉｎｄｅｘをもつオペランドに対応する引数を第一引数とし、その次に小さな番号ｏｐ＿ｉｎｄｅｘをもつオペランドに対応する引数を第二引数とし、・・・という具合に、組み込み関数の引数の順番を決定する。 In other words, in the built-in function prototype declaration generation process, the compiler built-in function addition program 30 sets the argument corresponding to the operand having the smallest number op_index as the first argument, and the argument corresponding to the operand having the next smaller number op_index as the first argument. It takes two arguments, and so on, determines the order of the arguments of the built-in function.

　組み込み関数プロトタイプ宣言生成処理（図３のステップ４００）で生成される組み込み関数プロトタイプ宣言の定義文の形式を以下の（１４）に示す（図１１参照）。 The format of the definition statement of the built-in function prototype declaration generated in the built-in function prototype declaration generation process (step 400 in FIG. 3) is shown in (14) below (see FIG. 11).

　ここで、ｍｎｅｍは、追加命令の仕様記述の＜ｍｎｅｍｏｎｉｃ＞タグで定義された追加命令のニモニックである。ｎｋｎａｍｅは、＜ｎｉｃｋｎａｍｅ＞タグで定義されたプロセッサのニックネームである。ｍｏｄｅｎａｍｅｌｏｎｇは組み込み関数の引数の型を表す文字列である。 Here, mnem is a mnemonic of the additional instruction defined by the <mnemonic> tag in the specification description of the additional instruction. nkname is the nickname of the processor defined by the <nickname> tag. “modenamelong” is a character string representing the type of the argument of the built-in function.

{
tree ftype_mnem=build_function_type_list (
                modenamelong_type_node,  /* operand 0 */
                modenamelong_type_node,  /* operand 1 */
                modenamelong_type_node,  /* operand 2 */  ..
.               NULL_TREE);
builtin_function(“__builtin_nkname_mnem”,/* name of intrinsic */
                ftype_mnem, /* prototype */
                CODE_FOR_builtin_nkname_mnem, /* RTL template name */
                BUILT_IN_MD, NULL, NULL_TREE);
}　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　・・・（１４） {
tree ftype_mnem = build_function_type_list (
modenamelong_type_node, / * operand 0 * /
modenamelong_type_node, / * operand 1 * /
modenamelong_type_node, / * operand 2 * / ..
. NULL_TREE);
builtin_function (“__ builtin_nkname_mnem”, / * name of intrinsic * /
ftype_mnem, / * prototype * /
CODE_FOR_builtin_nkname_mnem, / * RTL template name * /
BUILT_IN_MD, NULL, NULL_TREE);
} ・・・ (14)

　この形式は、まず関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）を使って組み込み関数の戻り値と引数の型を表すリストｆｔｙｐｅ＿ｍｎｅｍを作成し、次に、関数ｂｕｉｌｔｉｎ＿ｆｕｎｃｔｉｏｎ（）を使って、組み込み関数の名前やリストｆｔｙｐｅ＿ｍｎｅｍをコンパイラに登録する、ということを意味する。 This form first creates a list ftype_mnem representing the return value and argument type of a built-in function using the function build_function_type_list (), and then uses the function built_function () to give the name of the built-in function and the list ftype_mnem to the compiler. It means to register.

　関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）の引数は組み込み関数の戻り値の型と引数の型を表す型指定子ｍｏｄｅｎａｍｅｌｏｎｇ＿ｔｙｐｅ＿ｎｏｄｅである。 The argument of the function build_function_type_list () is a type specifier “modenamelong_type_node” representing the return value type of the built-in function and the argument type.

　関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）の第一引数は、組み込み関数の戻り値の型指定子であり、第二引数以降は、組み込み関数の引数の型指定子である。 The first argument of the function build_function_type_list () is a type specifier of the return value of the built-in function, and the second and subsequent arguments are type specifiers of the argument of the built-in function.

　もし組み込み関数が戻り値をもたないならば、コンパイラ組み込み関数追加プログラム３０は、ｖｏｉｄ＿ｔｙｐｅ＿ｎｏｄｅを関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）の第一引数とする。 If the built-in function has no return value, the compiler built-in function adding program 30 sets void_type_node as the first argument of the function build_function_type_list ().

　組み込み関数の戻り値の型と引数の型を表す型指定子ｍｏｄｅｎａｍｅｌｏｎｇ＿ｔｙｐｅ＿ｎｏｄｅについて説明する。 The type specifier modelnamelong_type_node representing the return value type and argument type of the built-in function will be described.

　組み込み関数の戻り値や引数は追加命令の仕様記述における＜ｏｐｅｒａｎｄ＞タグに一対一に対応する。 The return values and arguments of built-in functions correspond one-to-one with the <operand> tag in the specification description of additional instructions.

　型指定子ｍｏｄｅｎａｍｅｌｏｎｇ＿ｔｙｐｅ＿ｎｏｄｅにおけるｍｏｄｅｎａｍｅｌｏｎｇは、＜ｏｐｅｒａｎｄ＞タグの属性ｗｉｄｔｈに基づいて決定される。 <Modelnamelong> in the type specifier modelnamelong_type_node is determined based on the attribute width of the <operand> tag.

　図９は、属性ｗｉｄｔｈとｍｏｄｅｎａｍｅｌｏｎｇの関係を示す図である。 FIG. 9 is a diagram showing the relationship between the attribute width and modernalong.

　属性ｗｉｄｔｈが８ならば、ｍｏｄｅｎａｍｅｌｏｎｇはｃｈａｒであり、
　属性ｗｉｄｔｈが１６ならば、ｍｏｄｅｎａｍｅｌｏｎｇはｓｈｏｒｔ＿ｉｎｔｅｇｅｒであり、
　属性ｗｉｄｔｈが３２ならば、ｍｏｄｅｎａｍｅｌｏｎｇはｉｎｔｅｇｅｒである。 If the attribute width is 8, then modemalong is char,
If the attribute width is 16, then modemamelong is short_integer,
If the attribute width is 32, modemalong is an integer.

　関数ｂｕｉｌｔｉｎ＿ｆｕｎｃｔｉｏｎ（）は、組み込み関数をコンパイラへ登録する関数である。 Function “function_function ()” is a function for registering an embedded function in the compiler.

　この登録によって、コンパイラは、組み込み関数の名前と、組み込み関数の戻り値の型や引数の型と、組み込み関数に対応するＲＴＬテンプレートの名前と、を知ることができる。 This registration allows the compiler to know the name of the built-in function, the return value type and argument type of the built-in function, and the name of the RTL template corresponding to the built-in function.

　組み込み関数の名前の形式は
　__builtin_nkname_mnem　　　　　　　　・・・（１５）
である。 The built-in function name format is __builtin_nkname_mnem (15)
It is.

　組み込み関数の名前にはプロセッサのニックネームｎｋｎａｍｅと追加命令のニモニックｍｎｅｍが含まれる。組み込み関数の戻り値の型や引数の型は型指定子のリストｆｔｙｐｅ＿ｍｎｅｍとして表される。 The name of the built-in function includes the nickname nkname of the processor and the mnemonic mnem of the additional instruction. The return value type and argument type of the built-in function are expressed as a list of type specifiers, ftype_mnem.

　このリストｆｔｙｐｅ＿ｍｎｅｍは、関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）によって生成される。 This list ftype_mnem is generated by the function build_function_type_list ().

　組み込み関数に対応するＲＴＬテンプレートの名前の形式は、ＣＯＤＥ＿ＦＯＲ＿ｂｕｉｌｔｉｎ＿ｎｋｎａｍｅ＿ｍｎｅｍである。これは、ｄｅｆｉｎｅ＿ｉｎｓｎ式におけるＲＴＬテンプレートの名前の先頭にＣＯＤＥ＿ＦＯＲ＿を付加したものである。 The format of the name of the RTL template corresponding to the built-in function is CODE_FOR_builtin_nkname_mnem. This is obtained by adding CODE_FOR_ to the head of the name of the RTL template in the define_insn expression.

[組み込み関数の展開関数生成処理]
　コンパイラ組み込み関数追加装置における組み込み関数の展開関数生成処理（図３のステップ５００）について説明する。この処理において、コンパイラ組み込み関数追加プログラム３０は、組み込み関数の展開関数の名前を定義する定義文と、その関数の内容と、を生成する。 [Built-in function expansion function generation processing]
An expansion function generation process (step 500 in FIG. 3) of the embedded function in the compiler embedded function adding device will be described. In this process, the compiler built-in function addition program 30 generates a definition statement that defines the name of the expansion function of the built-in function and the contents of the function.

　組み込み関数の展開関数は、与えられた組み込み関数がどの追加命令に対応するかを分析し、与えられた組み込み関数に対応する追加命令を表すＲＴＬ式を生成する。 The expansion function of the built-in function analyzes which additional instruction corresponds to the given built-in function, and generates an RTL expression representing the additional instruction corresponding to the given built-in function.

　この処理において生成された組み込み関数の展開関数は、コンパイラによって呼び出されることになる。 The expansion function of the built-in function generated in this process is called by the compiler.

　組み込み関数の展開関数の名前の定義文の形式を以下に示す。この定義文によって、組み込み関数の展開関数の名前は、ｔａｒｇｅｔ＿ｅｘｐａｎｄ＿ｂｕｉｌｔｉｎ（）となる。
#undef　TARGET_EXPAND_BUILTIN
#define　TARGET_EXPAND_BUILTIN　target_expand_builtin
　　　　　　　　　　　　　　　　　　　　　　　　　　　　・・・（１６） The format of a definition statement for the name of a built-in function expansion function is shown below. By this definition statement, the name of the expansion function of the built-in function becomes target_expand_buildin ().
#undef TARGET_EXPAND_BUILTIN
#define TARGET_EXPAND_BUILTIN target_expand_builtin
... (16)

　さらに、組み込み関数の展開関数生成処理は、組み込み関数の展開関数の内容を生成する。組み込み関数の展開関数は、ステップ５１０とステップ５２０とステップ５３０の三つのステップから構成される。ステップ５１０は組み込み関数の引数の型を検査するステップである。ステップ５２０は組み込み関数の戻り値の型を取得するステップである。ステップ５３０は組み込み関数に対応するＲＴＬ式を生成するステップである。 Furthermore, the expansion function generation process of the built-in function generates the contents of the expansion function of the built-in function. The expansion function of the built-in function includes three steps of step 510, step 520, and step 530. Step 510 is a step of checking the argument type of the built-in function. Step 520 is a step of acquiring the return type of the built-in function. Step 530 is a step of generating an RTL expression corresponding to the built-in function.

　図１３は、組み込み関数の展開関数の内容の一例を示す図である。追加命令に対応する全ての組み込み関数は、図１３の組み込み関数の展開関数によって処理される。各ステップの詳しい処理内容を以下に示す。 FIG. 13 is a diagram showing an example of the contents of the expansion function of the built-in function. All the built-in functions corresponding to the additional instruction are processed by the expansion function of the built-in function in FIG. The detailed processing contents of each step are shown below.

　引数の型検査（ステップ５１０）において、組み込み関数に対応するｄｅｆｉｎｅ＿ｉｎｓｎ式で定義されたＲＴＬテンプレートに基づいて、組み込み関数の展開関数は、組み込み関数の引数を適切な記憶場所に配置する。 In the argument type check (step 510), based on the RTL template defined by the define_insn expression corresponding to the embedded function, the expansion function of the embedded function arranges the argument of the embedded function in an appropriate storage location.

　もし、引数に対応するＲＴＬテンプレートのオペランドがレジスタである場合、組み込み関数の展開関数は、引数を仮想レジスタへ格納する。 If the operand of the RTL template corresponding to the argument is a register, the built-in function expansion function stores the argument in the virtual register.

　もし、引数に対応するＲＴＬテンプレートのオペランドが即値である場合、組み込み関数の展開関数は引数をそのまま即値として扱う。 If the operand of the RTL template corresponding to the argument is an immediate value, the expansion function of the built-in function treats the argument as an immediate value as it is.

　戻り値の型取得（ステップ５２０）において、組み込み関数に対応するｄｅｆｉｎｅ＿ｉｎｓｎ式で定義されたＲＴＬテンプレートに基づいて、組み込み関数の展開関数は組み込み関数の戻り値の型を取得する。もし組み込み関数が戻り値をもつならば、組み込み関数の展開関数はその戻り値に対応する仮想レジスタを作成する。もし組み込み関数が戻り値をもたないならば、組み込み関数の展開関数は戻り値に関してはなにもしない。 In the return type acquisition (step 520), the expansion function of the built-in function acquires the return value type of the built-in function based on the RTL template defined by the define_insn expression corresponding to the built-in function. If a built-in function has a return value, the expansion function of the built-in function creates a virtual register corresponding to the return value. If the built-in function has no return value, the built-in function expansion function does nothing with the return value.

　組み込み関数に対応するＲＴＬ式の生成（ステップ５３０）において、組み込み関数の名前と戻り値と引数に基づいて、組み込み関数の展開関数は組み込み関数に対応するＲＴＬ式を生成する。ＲＴＬ式の生成にはＧＥＮ＿ＦＣＮ（Ｘ）というマクロ関数が使われる。マクロ関数ＧＥＮ＿ＦＣＮ（Ｘ）はコンパイラが提供している関数であり、組み込み関数Ｘに対応するＲＴＬ式生成関数の関数ポインタを返す関数である。マクロ関数ＧＥＮ＿ＦＣＮ（Ｘ）が返す関数を使用して、組み込み関数の展開関数は組み込み関数に対応するＲＴＬ式を生成する。 In the generation of the RTL expression corresponding to the built-in function (step 530), the expansion function of the built-in function generates the RTL expression corresponding to the built-in function based on the name, return value, and argument of the built-in function. A macro function called GEN_FCN (X) is used to generate the RTL expression. The macro function GEN_FCN (X) is a function provided by the compiler, and is a function that returns a function pointer of an RTL expression generation function corresponding to the built-in function X. Using the function returned by the macro function GEN_FCN (X), the expansion function of the built-in function generates an RTL expression corresponding to the built-in function.

［コンパイラソースコード追加処理］
　コンパイラ組み込み関数追加装置７０におけるコンパイラソースコード追加処理（図３のステップ６００について説明する。この追加処理は、図３のステップ２００、３００、４００、５００が生成したコンパイラのソースコードの断片を、ベースプロセッサためのコンパイラのソースコード（ベースプロセッサ用コンパイラソースコード５０）へ追加する処理である。 [Compiler source code addition processing]
Compiler source code addition processing in the compiler built-in function addition apparatus 70 (step 600 in FIG. 3 will be described. This addition processing is based on the source code fragment of the compiler generated in steps 200, 300, 400, and 500 in FIG. This is processing to be added to the compiler source code for the processor (compiler source code 50 for the base processor).

　ステップ６００において、コンパイラ組み込み関数追加プログラム３０は、追加命令の仕様記述における＜ｎｉｃｋｎａｍｅ＞タグに基づいて、コンパイラのどのソースコードへ断片を追加するかを決める。＜ｎｉｃｋｎａｍｅ＞タグで定義されたニックネームをｎｋｎａｍｅとすると、断片を追加するコンパイラのソースコードはｇｃｃ／ｃｏｎｆｉｇ／ｎｋｎａｍｅというディレクトリにあるｎｋｎａｍｅ．ｃとｎｋｎａｍｅ．ｍｄという二つのファイルである。これら二つのファイルがベースプロセッサ用コンパイラソースコード５０となる。ソースコードの断片とは以下の四つのことである。 In step 600, the compiler built-in function addition program 30 determines which source code of the compiler to add the fragment based on the <nickname> tag in the specification description of the additional instruction. If the nickname defined by the <nickname> tag is nkname, the source code of the compiler to which the fragment is added is nkname.com in the directory gcc / config / nkname. c and nkname. There are two files named md. These two files become the base processor compiler source code 50. Source code fragments are the following four.

　・ＲＴＬテンプレート（図３のステップ２００が生成したもの）、
　・レイテンシ定義文（ステップ３００が生成したもの）、
　・組み込み関数プロトタイプ宣言（ステップ４００が生成したもの）、
　・組み込み関数展開関数（ステップ５００が生成したもの） RTL template (generated by step 200 of FIG. 3),
Latency definition statement (generated by step 300),
Built-in function prototype declaration (generated by step 400),
Built-in function expansion function (generated by step 500)

　ステップ６００において、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートとレイテンシ定義文とをファイルｎｋｎａｍｅ．ｍｄの末尾へ追加する。 In step 600, the compiler built-in function addition program 30 stores the RTL template and the latency definition statement in the file nkname. Append to the end of md.

　さらに、ステップ６００において、コンパイラ組み込み関数追加プログラム３０は、組み込み関数プロトタイプ宣言と組み込み関数展開関数とをファイルｎｋｎａｍｅ．ｃの末尾へ追加する。この追加によってステップ２００からステップ５００までの処理が生成したソースコードの断片がベースプロセッサ用コンパイラソースコード５０へ追加される。断片を追加されたソースコードは図１の拡張プロセッサ用コンパイラソースコード６０となる。 Further, in step 600, the compiler built-in function addition program 30 stores the built-in function prototype declaration and the built-in function expansion function in the file nkname. Append to the end of c. By this addition, a fragment of the source code generated by the processing from step 200 to step 500 is added to the compiler source code 50 for the base processor. The source code to which the fragments are added becomes the compiler source code 60 for the extended processor in FIG.

　拡張プロセッサ用コンパイラソースコード６０を使ってコンパイラをビルドする手順は一般的なコンパイラのビルド手順と全く同じである。コンパイラのビルド手順については、非特許文献４の第７頁の“Ｃｏｎｆｉｇｕｒａｔｉｏｎ”と第２１頁の“Ｂｕｉｌｄｉｎｇ”の記載が参照される。 The procedure for building a compiler using the compiler source code 60 for the extended processor is exactly the same as a general compiler build procedure. Regarding the build procedure of the compiler, refer to the description of “Configuration” on page 7 and “Building” on page 21 of Non-Patent Document 4.

　本実施例の動作を具体的な例に基づいて以下に説明する。追加命令仕様記述ファイル４０の例を図１４に示す。図１４に示したＸＭＬファイルを入力ファイルとして、コンパイラ組み込み関数追加装置がどのように動作するかを説明する。 The operation of this embodiment will be described below based on a specific example. An example of the additional instruction specification description file 40 is shown in FIG. A description will be given of how the compiler built-in function adding apparatus operates using the XML file shown in FIG.

　まず、図３のステップ１００において、コンパイラ組み込み関数追加プログラム３０は、図１４のＸＭＬファイルの内容を読み込む。 First, in step 100 of FIG. 3, the compiler built-in function addition program 30 reads the contents of the XML file of FIG.

　ステップ１００において、コンパイラ組み込み関数追加プログラム３０は、図１４の内容をコンピュータプログラムが理解可能なデータ構造に置き換える。 In step 100, the compiler built-in function addition program 30 replaces the content shown in FIG. 14 with a data structure understandable by the computer program.

　次に、図３のステップ２００において、コンパイラ組み込み関数追加プログラム３０は、図１４に記述された追加命令に対応するＲＴＬテンプレートを生成する。 Next, in step 200 of FIG. 3, the compiler built-in function addition program 30 generates an RTL template corresponding to the additional instruction described in FIG.

　図１４には、＜ｉｎｓｎ＞タグを使って二個の追加命令の仕様が記述されている。図１４において、最初の＜ｉｎｓｎ＞タグはｍａｃ３２という追加命令である。二番目の＜ｉｎｓｎ＞タグはａｖｇ２という追加命令である。 FIG. 14 describes the specifications of two additional instructions using the <insn> tag. In FIG. 14, the first <insn> tag is an additional instruction called mac32. The second <insn> tag is an additional instruction avg2.

　ステップ２００において、コンパイラ組み込み関数追加プログラム３０は、一度に一個の追加命令のＲＴＬテンプレートを生成する。二個の追加命令のＲＴＬテンプレートを生成するために、ステップ２００が二回実行される。 In step 200, the compiler built-in function addition program 30 generates an RTL template for one additional instruction at a time. Step 200 is performed twice to generate an RTL template for two additional instructions.

　ステップ２００は、ステップ２１０からステップ２８０までの８つのステップを含む。各ステップにおいて、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレートの構成部品を生成し、それらを一つのＲＴＬテンプレートとして結合する。 Step 200 includes eight steps from Step 210 to Step 280. In each step, the compiler built-in function addition program 30 generates RTL template components and combines them as one RTL template.

　図１５は、ステップ２００において生成された追加命令ｍａｃ３２のＲＴＬテンプレートを示す。図１６は、追加命令ａｖｇ２のＲＴＬテンプレートを示す。以下、各ステップにおいて、図１のＣＰＵ１０が追加命令ｍａｃ３２と追加命令ａｖｇ２について何を生成するのかについて説明する。 FIG. 15 shows the RTL template of the additional instruction mac32 generated in step 200. FIG. 16 shows an RTL template for the additional instruction avg2. The following describes what the CPU 10 of FIG. 1 generates for the additional instruction mac32 and the additional instruction avg2 in each step.

　図４のステップ２１０の動作について説明する。ステップ２１０は、追加命令の入出力オペランドの番号を決定する処理である。 The operation of step 210 in FIG. 4 will be described. Step 210 is a process of determining the input / output operand number of the additional instruction.

　まず、ステップ２１０において、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２の＜ｏｐｅｒａｎｄ＞タグに基づいて、入出力オペランドの番号を決定する。 First, in step 210, the compiler built-in function addition program 30 determines the number of the input / output operand based on the <operand> tag of the additional instruction mac32.

　追加命令ｍａｃ３２の＜ｏｕｔｐｕｔ＞タグによると、追加命令ｍａｃ３２の出力オペランドの数は１個である。 According to the <output> tag of the additional instruction mac32, the number of output operands of the additional instruction mac32 is one.

　追加命令ｍａｃ３２の＜ｉｎｐｕｔ＞タグによると、追加命令ｍａｃ３２の入力オペランドの数は３個である。 According to the <input> tag of the additional instruction mac32, the number of input operands of the additional instruction mac32 is three.

　ステップ２１０において、コンパイラ組み込み関数追加プログラム３０は、出力オペランドから先に番号を割り当て、図１４で先に記述されたオペランドに先に番号を割り当てる。 In step 210, the compiler built-in function adding program 30 assigns a number first from the output operand, and assigns a number first to the operand described earlier in FIG.

　したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２の入出力オペランドに関して、
　出力オペランドｒｄの番号を０、
　入力オペランドｒａの番号を１、
　入力オペランドｒｂの番号を２、
　入力オペランドｒｃの番号を３、
とする。 Therefore, the compiler built-in function addition program 30 relates to the input / output operand of the additional instruction mac32.
The number of the output operand rd is 0,
The number of the input operand ra is 1,
The number of the input operand rb is 2,
The number of the input operand rc is 3,
And

　さらに、同様にして、追加命令ａｖｇ２についても、＜ｏｐｅｒａｎｄ＞タグに基づいて入出力オペランドの番号が決定される。 Furthermore, in the same manner, for the additional instruction avg2, the number of the input / output operand is determined based on the <operand> tag.

　追加命令ａｖｇ２の入出力オペランドについては、
　出力オペランドｒｄの番号は０、
　入力オペランドｒｍの番号は１、
　入力オペランドｒｎの番号は２、
　と決まる。 For the input / output operands of the additional instruction avg2,
The number of the output operand rd is 0,
The number of the input operand rm is 1,
The number of the input operand rn is 2,
It is decided.

　図４のステップ２２０の動作について説明する。ステップ２２０は、追加命令の＜ｎｉｃｋｎａｍｅ＞タグと＜ｍｎｅｍｏｎｉｃ＞タグに基づいてＲＴＬテンプレートの名前を決定する処理である。 The operation of step 220 in FIG. 4 will be described. Step 220 is a process of determining the name of the RTL template based on the <nickname> tag and the <mnemonic> tag of the additional instruction.

　ＲＴＬテンプレートの名前の形式は、ｂｕｉｌｔｉｎ＿ｎｋｎａｍｅ＿ｍｎｅｍである。したがって、コンパイラ組み込み関数追加プログラム３０は、
　追加命令ｍａｃ３２のＲＴＬテンプレートの名前を、ｂｕｉｌｔｉｎ＿ｍｙｃｐｕ＿ｍａｃ３２とし、
　追加命令ａｖｇ２のＲＴＬテンプレートの名前を、ｂｕｉｌｔｉｎ＿ｍｙｃｐｕ＿ａｖｇ２
　とする。 The format of the name of the RTL template is builtin_nkname_mnem. Therefore, the compiler built-in function addition program 30 is
The name of the RTL template of the additional instruction mac32 is defined as buildin_mycpu_mac32.
The name of the RTL template of the additional instruction avg2 is builtin_mycpu_avg2
And

　次に図４のステップ２３０の動作について説明する。ステップ２３０は、追加命令の＜ｏｐｅｒａｎｄ＞タグに基づいてＲＴＬテンプレートの出力オペランド記述を生成する処理である。 Next, the operation of step 230 in FIG. 4 will be described. Step 230 is a process of generating an output operand description of the RTL template based on the <operand> tag of the additional instruction.

　出力オペランドの形式は、以下の（１７）で与えられる。
　(match_operand:modenameshort　op_index "register_operand" "=r")　　・・・（１７） The format of the output operand is given by (17) below.
(match_operand: modenameshort op_index "register_operand""=r") (17)

　追加命令ｍａｃ３２の出力オペランドはｒｄだけである。ステップ２１０によると、出力オペランドｒｄの番号は０と決められているので、ｒｄのｏｐ＿ｉｎｄｅｘは０である。 The output operand of the additional instruction mac32 is only rd. According to step 210, since the number of the output operand rd is determined to be 0, the op_index of rd is 0.

　そして、出力オペランドｒｄは３２ビット幅のレジスタであることから、ｒｄのｍｏｄｅｎａｍｅｓｈｏｒｔはＳＩである。 And, since the output operand rd is a 32-bit wide register, the mode's modemshort is SI.

　したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のＲＴＬテンプレートの出力オペランドを以下の（１８）とする。 Therefore, the compiler built-in function addition program 30 sets the output operand of the RTL template of the additional instruction mac32 as (18) below.

　(match_operand:SI　0　"register_operand" "=r")　　　　　　・・・（１８） (Match_operand: SI 0 "register_operand" "= r") (18)

　同様にして、追加命令ａｖｇ２についても、ＲＴＬテンプレートの出力オペランドが決定される。 Similarly, the output operand of the RTL template is determined for the additional instruction avg2.

　追加命令ａｖｇ２のＲＴＬテンプレートの出力オペランドは、
　(match_operand:SI　0　"register_operand" "=r")　　　　　　　・・・（１９）
と決まる。 The output operand of the RTL template of the additional instruction avg2 is
(match_operand: SI 0 “register_operand” “= r”) (19)
It is decided.

　次に、図４のステップ２４０の動作について説明する。ステップ２４０は追加命令の＜ｏｐｅｒａｎｄ＞タグに基づいてＲＴＬテンプレートの出力オペランド記述を生成する処理である。レジスタの入力オペランドの形式は、
　(match_operand:modenameshort　op_index　"register_operand" "r")　　・・（２０）
である。 Next, the operation of step 240 in FIG. 4 will be described. Step 240 is a process for generating an output operand description of the RTL template based on the <operand> tag of the additional instruction. The input operand format of the register is
(match_operand: modenameshort op_index "register_operand""r") (20)
It is.

　追加命令ｍａｃ３２の入力オペランドはｒａとｒｂとｒｃの三つで、全て３２ビット幅のレジスタである。 The input instruction mac32 has three input operands, ra, rb, and rc, all of which are 32-bit registers.

　図４のステップ２１０において、既に、入力オペランドｒａとｒｂとｒｃの番号はそれぞれ１，２，３と決められている。したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のＲＴＬテンプレートの三つの入力オペランドｒａ、ｒｂ、ｒｃをそれぞれ以下のようにする。
入力オペランドｒａ： (match_operand:SI　1 "register_operand" "r")　・・・（２１）
入力オペランドｒｂ： (match_operand:SI　2 "register_operand" "r")　・・・（２２）
入力オペランドｒｃ： (match_operand:SI　3 "register_operand" "r")　・・・（２３） In step 210 of FIG. 4, the numbers of the input operands ra, rb, and rc are already determined as 1, 2, and 3, respectively. Therefore, the compiler built-in function addition program 30 sets the three input operands ra, rb, and rc of the RTL template of the additional instruction mac32 as follows.
Input operand ra: (match_operand: SI 1 “register_operand” “r”) (21)
Input operand rb: (match_operand: SI 2 “register_operand” “r”) (22)
Input operand rc: (match_operand: SI 3 “register_operand” “r”) (23)

　さらに、同様にして、追加命令ａｖｇ２についてもＲＴＬテンプレートの入力オペランドが決定される。追加命令ａｖｇ２のＲＴＬテンプレートの二つの入力オペランドｒｍ、ｒｎをそれぞれ以下のようにする。
入力オペランドｒｍ：　(match_operand:SI　1 "register_operand" "r")　・・・（２４）
入力オペランドｒｎ：　(match_operand:SI　2 "register_operand" "r")　・・・（２５） Similarly, the input operand of the RTL template is determined for the additional instruction avg2. The two input operands rm and rn of the RTL template of the additional instruction avg2 are set as follows.
Input operand rm: (match_operand: SI 1 “register_operand” “r”) (24)
Input operand rn: (match_operand: SI 2 “register_operand” “r”) (25)

　次に図４のステップ２５０の動作について説明する。ステップ２５０は追加命令のＲＴＬテンプレートの動作記述を生成する処理である。追加命令のＲＴＬテンプレートの動作記述の形式は、
　(set　output_operand　(unspec:VOID　[all_input_operands]　unspec_insn_id))　　・・・（２６）
　である。 Next, the operation of step 250 in FIG. 4 will be described. Step 250 is a process of generating an operation description of the RTL template of the additional instruction. The format of the behavior description of the RTL template for additional instructions is
(set output_operand (unspec: VOID [all_input_operands] unspec_insn_id)) (26)
It is.

　ｏｕｔｐｕｔ＿ｏｐｅｒａｎｄは出力オペランドを表す。これはステップ２３０において生成されたものである。 Output_operand represents an output operand. This was generated in step 230.

　ａｌｌ＿ｉｎｐｕｔ＿ｏｐｅｒａｎｄは、全ての入力オペランドを表す。これはステップ２４０において生成されたものである。 All_input_operand represents all input operands. This was generated in step 240.

　ｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄは適当な番号であり、各追加命令が異なる値のｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄをもつ。 Unspec_insn_id is an appropriate number, and each additional instruction has a different value of unspec_insn_id.

　ここでは、コンパイラ組み込み関数追加プログラム３０は、
　追加命令ｍａｃ３２のｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄを１００００、
　追加命令ａｖｇ２のｕｎｓｐｅｃ＿ｉｎｓｎ＿ｉｄを１０００１、
　とする。 Here, the compiler built-in function addition program 30 is
10000 unspec_insn_id of the additional instruction mac32
Unspec_insn_id of the additional instruction avg2 is set to 10001,
And

　次に図４のステップ２６０の動作について説明する。ステップ２６０は追加命令の＜ｓｙｎｔａｘ＞タグに基づいてＲＴＬテンプレートのシンタックスを生成する処理である。 Next, the operation of step 260 in FIG. 4 will be described. Step 260 is a process of generating the syntax of the RTL template based on the <syntax> tag of the additional instruction.

　追加命令ｍａｃ３２の＜ｓｙｎｔａｘ＞タグによると、追加命令ｍａｃ３２のシンタックスは“ｍａｃ３２　％ｒｄ　％ｒａ　％ｒｂ　％ｒｃ”である。このシンタックスには、フォーマット文字列（％ｒｄ、％ｒａ、％ｒｂ、％ｒｃ）が含まれている。これらのフォーマット文字列は入出力オペランドに対応している。 According to the <syntax> tag of the additional instruction mac32, the syntax of the additional instruction mac32 is “mac32% rd% ra% rb% rc”. This syntax includes a format character string (% rd,% ra,% rb,% rc). These format character strings correspond to input / output operands.

　ステップ２６０において、コンパイラ組み込み関数追加プログラム３０は、これらのフォーマット文字列をＲＴＬテンプレートのオペランド番号へ置き換える。％ｒｄは％０へ、％ｒａは％１へ、％ｒｂは％２へ、％ｒｃは％３へ、それぞれ置き換えられる。 In step 260, the compiler built-in function addition program 30 replaces these format character strings with the operand numbers of the RTL template. % Rd is replaced with% 0,% ra is replaced with% 1,% rb is replaced with% 2, and% rc is replaced with% 3.

　したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のＲＴＬテンプレートのシンタックスを、
　“ｍａｃ３２　％０　％１　％２　％３”　　　　　　　　　　　・・・（２７）
　とする。 Accordingly, the compiler built-in function addition program 30 uses the syntax of the RTL template of the additional instruction mac32 as follows:
“Mac32% 0% 1% 2% 3” (27)
And

　同様にして、追加命令ａｖｇ２についてもＲＴＬテンプレートのシンタックスが決定される。追加命令ａｖｇ２のＲＴＬテンプレートのシンタックスは、
　“ａｖｇ２　％０　％１　％２”　　　　　　　　　　　　　　・・・（２８）
と決まる。 Similarly, the syntax of the RTL template is determined for the additional instruction avg2. The syntax of the RTL template for the additional instruction avg2 is
“Avg2% 0% 1% 2” (28)
It is decided.

　次に図４のステップ２７０の動作について説明する。ステップ２７０は、追加命令の＜ｌｅｎｇｔｈ＞タグに基づいてＲＴＬテンプレートの語長定義文を生成する処理である。ＲＴＬテンプレートにおける語長定義文の形式は、
　(set_attr "length" "len")　　　　　　　　　　　　　　　・・・（２９）
である。 Next, the operation of step 270 in FIG. 4 will be described. Step 270 is processing for generating a word length definition sentence of the RTL template based on the <length> tag of the additional instruction. The format of the word length definition sentence in the RTL template is
(set_attr “length” “len”) (29)
It is.

　追加命令ｍａｃ３２の＜ｌｅｎｇｔｈ＞タグによると、追加命令ｍａｃ３２の語長は３２ビット（４バイト）である。したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のＲＴＬテンプレートの語長定義文を
　(set_attr "length" "4")　　　　　　　　　　　　　　　　・・・（３０）
とする。 According to the <length> tag of the additional instruction mac32, the word length of the additional instruction mac32 is 32 bits (4 bytes). Therefore, the compiler built-in function addition program 30 changes the word length definition sentence of the RTL template of the additional instruction mac32 to (set_attr “length” “4”) (30)
And

　同様にして、追加命令ａｖｇ２についてもＲＴＬテンプレートの語長定義文が生成される。追加命令ａｖｇ２のＲＴＬテンプレートの語長定義文は、
　(set_attr "length" "4")　　　　　　　　　　　　　　　　　・・・（３１）
と決まる。 Similarly, a word length definition sentence of the RTL template is generated for the additional instruction avg2. The word length definition sentence of the RTL template of the additional instruction avg2 is
(set_attr "length""4") (31)
It is decided.

　次に図４のステップ２８０の動作について説明する。ステップ２８０は追加命令の＜ｍｎｅｍｏｎｉｃ＞タグに基づいてＲＴＬテンプレートのニモニック定義文を生成する処理である。 Next, the operation of step 280 in FIG. 4 will be described. Step 280 is processing for generating a mnemonic definition sentence of the RTL template based on the <mnemonic> tag of the additional instruction.

　ＲＴＬテンプレートにおけるニモニック定義文の形式は、
　(set_attr "mnemonic" "mnem")　　　　　　　　　　　　　　・・・（３２）
である。 The format of the mnemonic definition statement in the RTL template is
(set_attr “mnemonic” “mnem”) (32)
It is.

　追加命令ｍａｃ３２の＜ｍｎｅｍｏｎｉｃ＞タグによると、追加命令ｍａｃ３２のニモニックはｍａｃ３２である。したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のＲＴＬテンプレートのニモニック定義文を、
　(set_attr "mnemonic" "mac32")　　　　　　　　　　　　　・・・（３３）
とする。 According to the <mnemonic> tag of the additional instruction mac32, the mnemonic of the additional instruction mac32 is mac32. Therefore, the compiler built-in function adding program 30 converts the mnemonic definition statement of the RTL template of the additional instruction mac32 into
(set_attr “mnemonic” “mac32”) (33)
And

　同様にして、追加命令ａｖｇ２についてもＲＴＬテンプレートのニモニック定義文が生成される。追加命令ａｖｇ２のＲＴＬテンプレートのニモニック定義文は、
　(set_attr "mnemonic" "avg2")　　　　　　　　　　　　　・・・（３４）
と決まる。 Similarly, a mnemonic definition sentence of the RTL template is generated for the additional instruction avg2. The mnemonic definition statement of the RTL template for the additional instruction avg2 is
(set_attr "mnemonic""avg2") (34)
It is decided.

　最終的に、ステップ２００において、コンパイラ組み込み関数追加プログラム３０は、ステップ２１０からステップ２３０において生成されたものをまとめて、追加命令ｍａｃ３２のＲＴＬテンプレート（図１５）と追加命令ａｖｇ２のＲＴＬテンプレート（図１６）とを生成する。 Finally, in step 200, the compiler built-in function adding program 30 combines the ones generated in steps 210 to 230 into an RTL template (FIG. 15) of the additional instruction mac32 and an RTL template (FIG. 16) of the additional instruction avg2. ) And generate.

　次に、図３のステップ３００において、コンパイラ組み込み関数追加プログラム３０は、図１４に記述された追加命令に対応するレイテンシ定義文を生成する。前述したように、ステップ３００はステップ３１０とステップ３２０とステップ３３０の三つのステップから構成される。ステップ３１０とステップ３２０は全ての追加命令に共通な定義文を生成する処理である。図１７に、ステップ３１０とステップ３２０において生成される共通な定義文の一例を示す。 Next, in step 300 of FIG. 3, the compiler built-in function addition program 30 generates a latency definition statement corresponding to the additional instruction described in FIG. As described above, step 300 is composed of three steps: step 310, step 320, and step 330. Steps 310 and 320 are processes for generating a definition sentence common to all additional instructions. FIG. 17 shows an example of a common definition sentence generated in step 310 and step 320.

　つづいて、図１４に記述された２個の追加命令に対して、コンパイラ組み込み関数追加プログラム３０は、ステップ３３０を実行する。 Subsequently, the compiler built-in function addition program 30 executes Step 330 for the two additional instructions described in FIG.

　ステップ３３０において、コンパイラ組み込み関数追加プログラム３０は、各追加命令のレイテンシ定義文を生成する。 In step 330, the compiler built-in function addition program 30 generates a latency definition statement for each additional instruction.

　レイテンシ定義文の形式は、
(define_insn_reservation "mnem" ltcy　(eq_attr "mnemonic" "mnem" ) "unti_builtin,nothing*ltcy_minus_1") ・・・（３５）
である。 The format of the latency definition statement is
(define_insn_reservation "mnem" ltcy (eq_attr "mnemonic""mnem")"unti_builtin, nothing * ltcy_minus_1")) (35)
It is.

　追加命令ｍａｃ３２の＜ｍｎｅｍｏｎｉｃ＞タグと＜ｌａｔｅｎｃｙ＞タグによると、追加命令ｍａｃ３２のニモニックはｍａｃ３２で、追加命令ｍａｃ３２の語長は３２ビット（４バイト）である。したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のレイテンシ定義文を図１８のようにする。 According to the <mnemonic> tag and the <latency> tag of the additional instruction mac32, the mnemonic of the additional instruction mac32 is mac32, and the word length of the additional instruction mac32 is 32 bits (4 bytes). Therefore, the compiler built-in function addition program 30 sets the latency definition statement of the additional instruction mac32 as shown in FIG.

　同様にして、追加命令ａｖｇ２についてもレイテンシ定義文が生成される。追加命令ａｖｇ２のレイテンシ定義文は、図１９のようになる。 Similarly, a latency definition statement is also generated for the additional instruction avg2. The latency definition statement of the additional instruction avg2 is as shown in FIG.

　次に、図３のステップ４００において、コンパイラ組み込み関数追加プログラム３０は、図１４に記述された追加命令に対応する組み込み関数プロトタイプ宣言を生成する。ステップ４００において、まず、コンパイラ組み込み関数追加プログラム３０は、組み込み関数のプロトタイプ宣言をコンパイラへ教える関数の名前を定義する定義文を生成する。この定義文は、図８のようになる。 Next, in step 400 of FIG. 3, the compiler built-in function addition program 30 generates a built-in function prototype declaration corresponding to the additional instruction described in FIG. In step 400, first, the compiler built-in function addition program 30 generates a definition statement that defines the name of the function that tells the compiler the prototype declaration of the built-in function. This definition sentence is as shown in FIG.

　ステップ４００において、コンパイラ組み込み関数追加プログラム３０は、図１４に記述された追加命令ｍａｃ３２と追加命令ａｖｇ２に対応する組み込み関数のプロトタイプ宣言を生成する。組み込み関数のプロトタイプ宣言の形式は、図１１のようになる。 In step 400, the compiler built-in function addition program 30 generates a prototype declaration of the built-in function corresponding to the additional instruction mac32 and the additional instruction avg2 described in FIG. The format of the prototype declaration of the built-in function is as shown in FIG.

　追加命令ｍａｃ３２のＲＴＬテンプレートのオペランドは四個あり、図４のステップ２１０においてそれらのオペランドの番号ｏｐ＿ｉｎｄｅｘは決定されている。出力オペランドｒｄの番号は０であり、入力オペランドｒａの番号は１であり、入力オペランドｒｂの番号は２であり、入力オペランドｒｃの番号は３である。 There are four operands in the RTL template of the additional instruction mac32, and the number op_index of these operands is determined in step 210 of FIG. The number of the output operand rd is 0, the number of the input operand ra is 1, the number of the input operand rb is 2, and the number of the input operand rc is 3.

　図１４の＜ｏｐｅｒａｎｄ＞タグによると、これらのオペランドのビット幅は全て３２ビットである。 According to the <operand> tag in FIG. 14, the bit widths of these operands are all 32 bits.

　図９によると、３２ビット幅のオペランドのｍｏｄｅｎａｍｅｌｏｎｇはｉｎｔｅｇｅｒである。したがって、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２のための関数ｂｕｉｌｄ＿ｆｕｎｃｔｉｏｎ＿ｔｙｐｅ＿ｌｉｓｔ（）の引数を全てｉｎｔｅｇｅｒ＿ｔｙｐｅ＿ｎｏｄｅとする。 According to FIG. 9, the operand of the 32-bit width operand is an integer. Accordingly, the compiler built-in function addition program 30 sets all arguments of the function build_function_type_list () for the additional instruction mac32 to be integer_type_node.

　図１４の＜ｎｉｃｋｎａｍｅ＞タグによると、プロセッサのニックネームはｍｙｃｐｕである。そして、追加命令ｍａｃ３２の＜ｍｎｅｍｏｎｉｃ＞タグによると、追加命令ｍａｃ３２のニモニックはｍａｃ３２である。最終的に、コンパイラ組み込み関数追加プログラム３０は、追加命令ｍａｃ３２に対応する組み込み関数のプロトタイプ宣言を図２０のようにする。 According to the <nickname> tag in FIG. 14, the nickname of the processor is mycpu. According to the <mnemonic> tag of the additional instruction mac32, the mnemonic of the additional instruction mac32 is mac32. Finally, the compiler built-in function addition program 30 makes the prototype declaration of the built-in function corresponding to the add instruction mac32 as shown in FIG.

　同様にして、追加命令ａｖｇ２についても組み込み関数のプロトタイプ宣言が生成される。追加命令ａｖｇ２の組み込み関数のプロトタイプ宣言は図２１のようになる。最終的に、コンパイラ組み込み関数追加プログラム３０は、図２２に示すように、追加命令ｍａｃ３２と追加命令ａｖｇ２に対応する組み込み関数のプロトタイプ宣言を関数ｔａｒｇｅｔ＿ｉｎｉｔ＿ｂｕｉｌｔｉｎｓ（）の中に配置する。 Similarly, a prototype declaration of an embedded function is generated for the additional instruction avg2. The prototype declaration of the built-in function of the additional instruction avg2 is as shown in FIG. Finally, as shown in FIG. 22, the compiler built-in function addition program 30 arranges the prototype declaration of the built-in function corresponding to the add instruction mac32 and the add instruction avg2 in the function target_init_builds ().

　次に、図３のステップ５００において、コンパイラ組み込み関数追加プログラム３０は、
　組み込み関数の展開関数の名前を定義する定義文と、
　組み込み関数の展開関数の内容と、
　を生成する。これらは追加命令の仕様記述には依存しない。 Next, in step 500 of FIG.
A definition statement that defines the name of a built-in function expansion function,
The contents of the expansion function of the built-in function,
Is generated. These do not depend on the specification description of the additional instruction.

　組み込み関数の展開関数の名前を定義する定義文は、図１２のようになる。そして、組み込み関数の展開関数の内容は、図１３となる。 The definition statement that defines the name of the expansion function of the built-in function is as shown in FIG. The contents of the expansion function of the built-in function are as shown in FIG.

　次に、図３のステップ６００において、コンパイラ組み込み関数追加プログラム３０は、ソースコードの断片をベースプロセッサのためのコンパイラのソースコード（ベースプロセッサ用コンパイラソースコード５０）へ追加する。 Next, in step 600 of FIG. 3, the compiler built-in function addition program 30 adds a source code fragment to the compiler source code for the base processor (base processor compiler source code 50).

　ソースコードの断片とは、ステップ２００からステップ５００において生成された以下の四つのことである。 The source code fragments are the following four items generated from step 200 to step 500.

　・ＲＴＬテンプレート（図３のステップ２００において生成されたもの：図１５と図１６）；
　・レイテンシ定義文（図３のステップ３００において生成されたもの：図１７と図１８と図１９）；
　・組み込み関数プロトタイプ宣言（図３のステップ４００において生成されたもの：図８と図２２）；
　・組み込み関数展開関数（図３のステップ５００において生成されたもの：図１２と図１３） RTL template (generated in step 200 of FIG. 3: FIGS. 15 and 16);
Latency definition statement (generated in step 300 of FIG. 3: FIGS. 17, 18 and 19);
Built-in function prototype declaration (generated in step 400 of FIG. 3: FIGS. 8 and 22);
Built-in function expansion function (generated in step 500 of FIG. 3: FIGS. 12 and 13)

　図１４の＜ｎｉｃｋｎａｍｅ＞タグによると、プロセッサのニックネームはｍｙｃｐｕである。したがって、ステップ６００において、図１のＣＰＵ１０が上記のソースコード断片を追加するコンパイラのソースコードは、ｇｃｃ／ｃｏｎｆｉｇ／ｍｙｃｐｕというディレクトリにあるｍｙｃｐｕ．ｃとｍｙｃｐｕ．ｍｄという二つのファイルである。これら二つのファイルがベースプロセッサ用コンパイラソースコード５０となる。 According to the <nickname> tag in FIG. 14, the nickname of the processor is mycpu. Therefore, in step 600, the source code of the compiler to which the CPU 10 of FIG. 1 adds the above source code fragment is mycpu. c and mycpu. There are two files named md. These two files become the base processor compiler source code 50.

　図３のステップ６００において、コンパイラ組み込み関数追加プログラム３０は、ＲＴＬテンプレート（図１５と図１６）と、レイテンシ定義文（図１７と図１８と図１９）と、をファイルｍｙｃｐｕ．ｍｄの末尾へ追加する。 3 In step 600 of FIG. 3, the compiler built-in function addition program 30 stores the RTL template (FIGS. 15 and 16) and the latency definition statement (FIGS. 17, 18, and 19) in the file mycpu. Append to the end of md.

　さらに、図３のステップ６００において、コンパイラ組み込み関数追加プログラム３０は、組み込み関数プロトタイプ宣言（図８と図２２）と、組み込み関数展開関数（図１２と図１３）と、をファイルｍｙｃｐｕ．ｃの末尾へ追加する。 Further, in step 600 of FIG. 3, the compiler built-in function addition program 30 stores the built-in function prototype declaration (FIGS. 8 and 22) and the built-in function expansion function (FIGS. 12 and 13) in the file mycpu. Append to the end of c.

　この追加によって、図３のステップ２００からステップ５００までの処理において生成されたソースコードの断片がベースプロセッサ用コンパイラソースコード５０へ追加される。断片を追加されたソースコードが、図１の拡張プロセッサ用コンパイラソースコード６０となる。 As a result of this addition, a fragment of the source code generated in the processing from step 200 to step 500 in FIG. 3 is added to the compiler source code 50 for the base processor. The source code to which the fragments are added becomes the compiler source code 60 for the extended processor in FIG.

　拡張プロセッサ用コンパイラソースコード６０を使ってコンパイラをビルドする手順は一般的なコンパイラのビルド手順と全く同じである。コンパイラのビルド手順については非特許文献４（第７頁、”Ｃｏｎｆｉｇｕｒａｔｉｏｎ”と第２１ページの“Building”）の記載が参照される。 The procedure for building a compiler using the compiler source code 60 for the extended processor is exactly the same as a general compiler build procedure. Reference is made to the description of Non-Patent Document 4 (page 7, “Configuration” and “Building” on page 21) for the build procedure of the compiler.

　本発明を使用すれば、追加命令の仕様記述に基づいてその追加命令に対応する組み込み関数をコンパイラへ簡単に追加することに加えて、そのコンパイラは追加命令を含むプログラムの語長を正しく計算したり、追加命令を適切にスケジューリングできるようになる。そのコンパイラを使うことにより、プログラム開発者は組み込み関数を使ったプログラム開発が可能となる。特定用途向けのプロセッサの命令セットを設計する際にプロセッサ設計者が本発明を活用することができる。 According to the present invention, in addition to simply adding an intrinsic function corresponding to the additional instruction to the compiler based on the specification description of the additional instruction, the compiler correctly calculates the word length of the program including the additional instruction. Or additional instructions can be scheduled appropriately. By using the compiler, program developers can develop programs using built-in functions. The processor designer can take advantage of the present invention in designing a processor instruction set for a specific application.

　なお、上記の特許文献１－３、非特許文献１－４の各開示を、本書に引用をもって繰り込むものとする。本発明の全開示（請求の範囲を含む）の枠内において、さらにその基本的技術思想に基づいて、実施形態ないし実施例の変更・調整が可能である。また、本発明の請求の範囲の枠内において種々の開示要素の多様な組み合わせ乃至選択が可能である。すなわち、本発明は、請求の範囲を含む全開示、技術的思想にしたがって当業者であればなし得るであろう各種変形、修正を含むことは勿論である。 The disclosures of Patent Documents 1-3 and Non-Patent Documents 1-4 above are incorporated herein by reference. Within the scope of the entire disclosure (including claims) of the present invention, the embodiments and examples can be changed and adjusted based on the basic technical concept. Various combinations and selections of various disclosed elements are possible within the scope of the claims of the present invention. That is, the present invention of course includes various variations and modifications that could be made by those skilled in the art according to the entire disclosure including the claims and the technical idea.

Claims

Based on the specification description of the additional instruction newly added to the instruction set of the base processor, the source code of the compiler for the extended processor having the additional instruction in the instruction set is compared with the source code of the compiler for the base processor. Generate
A compiler built-in function adding device, comprising: a built-in function adding unit that adds a built-in function corresponding to the additional instruction to the compiler for the extension processor.

The built-in function adding means reads the specification description of the additional instruction and the source code of the compiler for the base processor from the storage device, and defines the definition statement of the built-in function corresponding to the additional instruction described in the specification description of the additional instruction. And the definition statement of the built-in function corresponding to the additional instruction added to the source code of the compiler for the base processor is used as the source code of the compiler for the extended processor. Item 3. The compiler built-in function adding device according to Item 1.

The built-in function adding means is
Read the specification description of the additional instruction,
Generate a configuration definition statement of the additional instruction based on the specification description,
Generating a relationship definition statement between the additional instruction and a built-in function corresponding to the additional instruction based on the specification description;
The configuration definition statement and the relationship definition statement are added to a source code of the compiler for the base processor, and a source code of the compiler for the extension processor is generated. The compiler built-in function addition device described.

The configuration statement is
A template for additional instructions expressed in the intermediate language of the compiler;
A latency definition statement that defines the number of cycles required to execute the additional instruction;
Including
The built-in function adding means generates the configuration definition statement of the additional instruction.
Generating the template based on a name of the additional instruction included in the specification description, an input operand, and an output operand;
4. The compiler built-in function adding device according to claim 3, wherein the latency definition statement is generated based on a number of cycles necessary for executing the additional instruction included in the specification description.

The relationship definition statement is
A prototype declaration of a built-in function corresponding to the additional instruction;
A built-in function expansion function for replacing the built-in function with an additional instruction;
Including
The built-in function adding means generates the relationship definition statement.
Generating a prototype declaration of the built-in function based on the name of the additional instruction, the input operand, and the output operand included in the specification description;
5. The compiler built-in function adding device according to claim 4, wherein the built-in function expansion function is generated based on a predetermined format of the compiler.

When the built-in function adding means generates the template,
By arranging the output operand and the input operand of the additional instruction included in the specification description in order, the number of the operand in the intermediate language definition sentence of the additional instruction is determined;
Generating a name for the template based on the name of the additional instruction included in the specification description;
Generating a description of the output operand of the template based on the type of the output operand of the additional instruction included in the specification description;
Generating a description of the input operand of the template based on the type of the input operand of the additional instruction included in the specification description;
Generating an action description of the additional instruction in the template using an operator indicating that the additional instruction performs an operation unknown to the compiler;
Generating the syntax of the template by replacing an operand included in the syntax of the additional instruction included in the specification description with the operand number;
Generating an instruction word length definition sentence in the template based on an instruction word length of the additional instruction included in the specification description;
Generating a mnemonic definition sentence in the template based on the name of the additional instruction included in the specification description;
The compiler built-in function adding device according to claim 4 or 5, characterized in that:

A compiler built-in function adding device according to any one of claims 1 to 6,
A compiler construction apparatus comprising means for constructing the compiler based on generated source code.

Enter the compiler source code for the base processor
Based on the specification description of the additional instruction newly added to the instruction set of the base processor, the compiler source code for the extended processor having the additional instruction in the instruction set is generated from the source code of the compiler for the base processor And
A program for causing a computer to execute an embedded function addition process for adding an embedded function corresponding to the additional instruction to the compiler for the extension processor.

The built-in function adding process reads the specification description of the additional instruction and the source code of the compiler for the base processor from the storage device, and defines the definition statement of the built-in function corresponding to the additional instruction described in the specification description of the additional instruction. Is generated, and the definition statement of the built-in function corresponding to the additional instruction is added to the source code of the compiler for the base processor, and is output to the storage device as the source code of the compiler for the extended processor. The program according to claim 8.

The built-in function addition process
A process of reading the specification description of the additional instruction;
Processing for generating a configuration definition statement of the additional instruction based on the specification description;
Processing for generating a relationship definition statement between the additional instruction and a built-in function corresponding to the additional instruction based on the specification description;
9. The process of adding the configuration definition statement and the relationship definition statement to a source code of the compiler for the base processor to generate a source code of the compiler for the extension processor. Or the program of 9.

The configuration statement is
A template for additional instructions expressed in the intermediate language of the compiler;
A latency definition statement that defines the number of cycles required to execute the additional instruction;
Including
The built-in function adding process is:
In the process of generating the configuration definition statement,
Generating the template based on a name of the additional instruction included in the specification description, an input operand, and an output operand;
The program according to claim 10, wherein the latency definition sentence is generated based on the number of cycles necessary for executing the additional instruction included in the specification description.

The relationship definition statement is
A prototype declaration of a built-in function corresponding to the additional instruction;
A built-in function expansion function for replacing the built-in function with an additional instruction,
The built-in function adding process is:
In the process of generating the relationship definition statement,
Generating a prototype declaration of the built-in function based on the name of the additional instruction, the input operand, and the output operand included in the specification description;
12. The program according to claim 11, wherein the built-in function expansion function is generated based on a predetermined format of a compiler.

In the built-in function adding process, when generating the template,
Determining the number of the operand in the intermediate language definition statement of the additional instruction by arranging the output operand and the input operand of the additional instruction in the specification description in order;
Generating a name for the template based on the name of the additional instruction included in the specification description;
Generating a description of the output operand of the template based on the type of the output operand of the additional instruction included in the specification description;
Generating a description of the input operand of the template based on the type of the input operand of the additional instruction included in the specification description;
Generate an action description of the additional instruction in the template using an operator that indicates that the additional instruction performs an operation unknown to the compiler;
Generating a template syntax by replacing an operand included in the syntax of the additional instruction included in the specification description with the operand number;
Generating an instruction word length definition sentence in the template based on an instruction word length of the additional instruction included in the specification description;
Generating a mnemonic definition sentence in the template based on the name of the additional instruction included in the specification description;
The program according to claim 11 or 12, characterized in that:

Enter the compiler source code for the base processor
Based on the specification description of the additional instruction newly added to the instruction set of the base processor, the compiler source code for the extended processor having the additional instruction in the instruction set is generated from the source code of the compiler for the base processor And
A method for adding a compiler built-in function, comprising a step of adding a built-in function corresponding to the additional instruction to a compiler for the extension processor.

The built-in function adding step includes
Read the specification description of the additional instruction and the source code of the compiler for the base processor from the storage device,
A definition statement of an embedded function corresponding to the additional instruction described in the specification description of the additional instruction;
The definition statement of the built-in function corresponding to the additional instruction added to the source code of the compiler for the base processor is output to the storage device as the source code of the compiler for the extension processor. 14. A method for adding a compiler built-in function according to 14.

The built-in function adding step includes
Read the specification description of the additional instruction,
Generate a configuration definition statement of the additional instruction based on the specification description,
Generating a relationship definition statement between the additional instruction and a built-in function corresponding to the additional instruction based on the specification description;
16. The source code of the compiler for the extension processor is generated by adding the configuration definition statement and the relationship definition statement to the source code of the compiler for the base processor. To add a built-in compiler function.

The configuration statement is
A template for additional instructions expressed in the intermediate language of the compiler;
A latency definition statement that defines the number of cycles required to execute the additional instruction;
Including
In generating the configuration statement of the additional instruction,
Generating the template based on a name of the additional instruction included in the specification description, an input operand, and an output operand;
17. The method for adding a compiler built-in function according to claim 16, wherein the latency definition statement is generated based on the number of cycles required for executing the additional instruction included in the specification description.

The relationship definition statement is
A prototype declaration of a built-in function corresponding to the additional instruction;
A built-in function expansion function for replacing the built-in function with an additional instruction,
In generating the relationship definition statement,
Generating a prototype declaration of the built-in function based on the name of the additional instruction, the input operand, and the output operand included in the specification description;
18. The method for adding a compiler built-in function according to claim 17, wherein the built-in function expansion function is generated based on a predetermined format of the compiler.

In the built-in function adding step, the template is generated.
Determining the number of the operand in the intermediate language definition statement of the additional instruction by arranging the output operand and the input operand of the additional instruction in the specification description in order;
Generating a name for the template based on the name of the additional instruction included in the specification description;
Generating a description of the output operand of the template based on the type of the output operand of the additional instruction included in the specification description;
Generating a description of the input operand of the template based on the type of the input operand of the additional instruction included in the specification description;
Generate an action description of the additional instruction in the template using an operator that indicates that the additional instruction performs an operation unknown to the compiler;
Generating a template syntax by replacing an operand included in the syntax of the additional instruction included in the specification description with the operand number;
Generating an instruction word length definition sentence in the template based on an instruction word length of the additional instruction included in the specification description;
19. The method for adding a compiler built-in function according to claim 17, wherein a mnemonic definition sentence in the template is generated based on a name of the additional instruction included in the specification description.

The compiler built-in function adding device according to any one of claims 1 to 3, wherein the specification description of the additional instruction includes an instruction word length and latency of the additional instruction.

The program according to any one of claims 8 to 10, wherein the specification description of the additional instruction includes an instruction word length and latency of the additional instruction.

The method for adding a compiler built-in function according to any one of claims 14 to 16, wherein the specification description of the additional instruction includes an instruction word length and latency of the additional instruction.

A new compiler is automatically generated by adding built-in functions corresponding to new instructions added to the instruction set of the base processor (called "additional instructions") from the compiler of the basic processor (called "base processor") The additional instruction mnemonic, the additional instruction syntax, the input instruction and output operands of the additional instruction, the instruction word length information of the additional instruction, and the additional instruction A storage device storing an additional instruction specification description file including latency information, and a source code of the compiler for the base processor;
The additional instruction specification description file and the source code of the base processor compiler are input from the storage device, and the additional instruction configuration definition information, the additional instruction, and the embedded instruction are input from the additional instruction specification description file. Generating the relationship definition information with the function, and adding the configuration definition information of the additional instruction and the relationship definition information of the additional instruction and the built-in function to the source code of the compiler for the base processor, Built-in function adding means for generating source code of the new compiler for a processor having an additional instruction in an instruction set and outputting the generated source code to a storage device;
The configuration definition information is
A template for the additional instruction expressed in an intermediate language of the compiler;
A latency definition that defines the number of cycles required to execute the additional instruction;
Including
The built-in function adding means includes
In generating the configuration definition information of the additional instruction,
Generating the template including the mnemonic of the additional instruction, the input and output operands, the operation of the additional instruction, the word length and syntax of the additional instruction from the specification description of the additional instruction;
Based on the latency information of the additional instruction included in the specification description, the latency definition is generated,
The relationship definition information is
A prototype declaration of an embedded function including information on arguments of the embedded function corresponding to the additional instruction, information on a return value of the embedded function, and causing the compiler to recognize the embedded function as an embedded function;
A built-in function expansion function for replacing the built-in function with an additional instruction;
Including
The built-in function adding means includes
In generating the relationship definition information,
Generating a prototype declaration of the built-in function based on the additional instruction mnemonic, the input operand, and the output operand included in the specification description;
In the expansion function generation of the built-in function, the argument type of the built-in function is checked, the return type of the built-in function is obtained, and an RTL (Register Transfer Level) expression that is an intermediate language expression corresponding to the built-in function. A compiler built-in function adding device characterized by generating.