[go: up one dir, main page]

CN112036562B - Bit cell applied to memory computation and memory computation array device - Google Patents

Bit cell applied to memory computation and memory computation array device Download PDF

Info

Publication number
CN112036562B
CN112036562B CN202011219701.6A CN202011219701A CN112036562B CN 112036562 B CN112036562 B CN 112036562B CN 202011219701 A CN202011219701 A CN 202011219701A CN 112036562 B CN112036562 B CN 112036562B
Authority
CN
China
Prior art keywords
transistor
output
bit
storage unit
line
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011219701.6A
Other languages
Chinese (zh)
Other versions
CN112036562A (en
Inventor
乔树山
黄茂森
尚德龙
周玉梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongke Nanjing Intelligent Technology Research Institute
Original Assignee
Institute of Microelectronics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Microelectronics of CAS filed Critical Institute of Microelectronics of CAS
Priority to CN202011219701.6A priority Critical patent/CN112036562B/en
Publication of CN112036562A publication Critical patent/CN112036562A/en
Application granted granted Critical
Publication of CN112036562B publication Critical patent/CN112036562B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Neurology (AREA)
  • Static Random-Access Memory (AREA)

Abstract

The invention provides a bit cell applied to memory computing. The bit cell comprises: a four-pipe memory cell and a peripheral memory circuit; the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit. The invention has set up four-tube memory cell, apply it to the memory array module instead of six-tube memory cell, has simplified the structure of the array device of the memory calculation, the peripheral memory circuit of the invention is used for carrying on the operation of cumulative addition, utilize analog mixed signal capacitive coupling to calculate and finish the neural network cumulative operation of binary system, realize the input activation of 5 values, has improved the computational accuracy, and there is quiescent current to reduce the power consumption and capacitive coupling mechanism has better stability in the course of calculating. Therefore, the invention simplifies the storage and calculation array structure, reduces the power consumption and improves the storage and calculation efficiency and precision.

Description

Bit cell applied to memory computation and memory computation array device
Technical Field
The present invention relates to the field of memory computing technologies, and in particular, to a bit cell and a memory array device for memory computing.
Background
Deep Convolutional Neural Networks (DCNNs) continue to demonstrate improved inference accuracy, and Deep learning is shifting to edge computation. This development has driven the work of low-resource machine learning algorithms and their accelerated hardware. The most common operation in DCNNs is Multiplication and Accumulation (MAC), which controls power and delay. The MAC operation has high regularity and parallelism, and is therefore very suitable for hardware acceleration. However, the amount of memory access severely limits the energy efficiency of conventional digital accelerators.
The current memory array is basically based on six or more tubes of memory cells, and MAC operation is divided into current domain calculation based on a resistance divider, a discharge rate and the like and charge domain calculation based on charge sharing, a capacitance divider and the like. In contrast, charge domain calculation consumes less power due to the absence of quiescent current. And the six-tube structure has larger area and larger power consumption.
How to simplify the storage and calculation array structure, reduce power consumption and improve storage and calculation efficiency and precision becomes a technical problem to be solved urgently.
Disclosure of Invention
The invention aims to provide a bit cell applied to memory computation and a memory computation array device, so as to simplify the memory computation array structure, reduce the power consumption and improve the memory computation efficiency and precision.
In order to achieve the purpose, the invention provides the following scheme:
a bit cell for application to memory computations, the bit cell comprising:
the four-tube storage unit and a peripheral memory circuit are used for carrying out accumulation addition operation on the activation signal and the weight value output by the four-tube storage unit;
the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit;
the word line control end of the four-tube storage unit is connected with a word line, the bit line control end of the four-tube storage unit is connected with a bit line, and the bit bar control end of the four-tube storage unit is connected with a bit bar;
a first activation signal input end of the peripheral memory circuit is connected with a first activation signal line, and a first deactivation signal input end of the peripheral memory circuit is connected with a first deactivation signal line;
a second activation signal input end of the peripheral memory circuit is connected with a second activation signal line, and a second deactivation signal input end of the peripheral memory circuit is connected with a second deactivation signal line;
and the signal output end of the peripheral memory circuit is connected with a signal output line.
Optionally, the four-transistor memory cell includes a transistor T1, a transistor T2, a transistor T3, and a transistor T4;
the input end of the transistor T1 and the input end of the transistor T2 are both connected with a power supply VDD, the output end of the transistor T1 is connected with the control end of the transistor T2, and the control end of the transistor T1 is connected with the output end of the transistor T2;
an input terminal of the transistor T3 is connected to the bit line BL, an output terminal of the transistor T3 is connected to the output terminal of the transistor T1, and a control terminal of the transistor T3 is connected to the word line WL;
the input terminal of the transistor T4 is connected to the inverted bit line BLB, the output terminal of the transistor T4 is connected to the output terminal of the transistor T2, and the control terminal of the transistor T4 is connected to the word line WL.
Optionally, the peripheral memory circuit includes a capacitor C1, a capacitor C2, a transistor T5, a transistor T6, a transistor T7, and a transistor T8;
an input terminal of the transistor T5 is connected to the first activation signal line MWL _ a, and an input terminal of the transistor T6 is connected to the first deactivation signal line MWLB _ a; the output terminal of the transistor T5, the output terminal of the transistor T6, and one end of the capacitor C1 are connected in common;
an input terminal of the transistor T7 is connected to the second activation signal line MWL _ b, and an input terminal of the transistor T8 is connected to the second deactivation signal line MWLB _ b; the output terminal of the transistor T7, the output terminal of the transistor T8, and one end of the capacitor C2 are connected in common;
the other end of the capacitor C1 is connected with the other end of the capacitor C2 and then is connected with a signal output line;
the control end of the transistor T5 and the control end of the transistor T7 are both connected with the weight output end of the four-tube memory cell;
the control terminal of the transistor T6 and the control terminal of the transistor T8 are both connected to the inverted value output terminal of the four-transistor memory cell.
A computing array device, the computing array device comprising:
the device comprises a storage array module, a column decoding module, a row decoding module, an input activation driving module and an analog-to-digital conversion output module; the memory array module comprises
Figure 100002_DEST_PATH_IMAGE001
Bit cells applied to memory computation;
the output ends of the n bit lines of the row decoding module are respectively connected with the n bit lines, and the output ends of the n bit reversal lines of the row decoding module are respectively connected with the n bit reversal lines;
the m word line output ends of the row decoding module are respectively connected with the m word lines;
the m first activation signal output ends of the storage array module are respectively connected with the m first activation signal lines, and the m first deactivation signal output ends of the storage array module are respectively connected with the m first deactivation signal lines; the m second activation signal output ends of the storage array module are respectively connected with the m second activation signal lines, and the m second deactivation signal output ends of the storage array module are respectively connected with the m second deactivation signal lines;
n analog signal input ends of the analog-to-digital conversion output module are respectively connected with n signal output lines;
word line control ends of bit cells in m rows of the storage array module are respectively connected with m word lines;
first activation signal input ends of m rows of bit units of the storage array module are respectively connected with m first activation signal lines, first flyback signal input ends of m rows of bit units of the storage array module are respectively connected with m first flyback signal lines, second activation signal input ends of m rows of bit units of the storage array module are respectively connected with m second activation signal lines, and second flyback signal input ends of m rows of bit units of the storage array module are respectively connected with m second flyback signal lines;
bit line control ends of the n rows of bit cells of the storage array module are respectively connected with the n bit lines, and inverted bit line control ends of the n rows of bit cells of the storage array module are respectively connected with the n inverted bit lines;
the signal output ends of the bit cells of the n rows of the memory array module are respectively connected with the n signal output lines.
Optionally, the bit cell comprises:
the four-tube storage unit and a peripheral memory circuit are used for carrying out accumulation addition operation on the activation signal and the weight value output by the four-tube storage unit;
the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit;
the word line control end of the four-tube storage unit is connected with a word line, the bit line control end of the four-tube storage unit is connected with a bit line, and the bit bar control end of the four-tube storage unit is connected with a bit bar;
a first activation signal input end of the peripheral memory circuit is connected with a first activation signal line, and a first deactivation signal input end of the peripheral memory circuit is connected with a first deactivation signal line;
a second activation signal input end of the peripheral memory circuit is connected with a second activation signal line, and a second deactivation signal input end of the peripheral memory circuit is connected with a second deactivation signal line;
and the signal output end of the peripheral memory circuit is connected with a signal output line.
Optionally, the four-transistor memory cell includes a transistor T1, a transistor T2, a transistor T3, and a transistor T4;
the input end of the transistor T1 and the input end of the transistor T2 are both connected with a power supply VDD, the output end of the transistor T1 is connected with the control end of the transistor T2, and the control end of the transistor T1 is connected with the output end of the transistor T2;
an input terminal of the transistor T3 is connected to the bit line BL, an output terminal of the transistor T3 is connected to the output terminal of the transistor T1, and a control terminal of the transistor T3 is connected to the word line WL;
the input terminal of the transistor T4 is connected to the inverted bit line BLB, the output terminal of the transistor T4 is connected to the output terminal of the transistor T2, and the control terminal of the transistor T4 is connected to the word line WL.
Optionally, the peripheral memory circuit includes a capacitor C1, a capacitor C2, a transistor T5, a transistor T6, a transistor T7, and a transistor T8;
an input terminal of the transistor T5 is connected to the first activation signal line MWL _ a, and an input terminal of the transistor T6 is connected to the first deactivation signal line MWLB _ a; the output terminal of the transistor T5, the output terminal of the transistor T6, and one end of the capacitor C1 are connected in common;
an input terminal of the transistor T7 is connected to the second activation signal line MWL _ b, and an input terminal of the transistor T8 is connected to the second deactivation signal line MWLB _ b; the output terminal of the transistor T7, the output terminal of the transistor T8, and one end of the capacitor C2 are connected in common;
the other end of the capacitor C1 is connected with the other end of the capacitor C2 and then is connected with a signal output line;
the control end of the transistor T5 and the control end of the transistor T7 are both connected with the weight output end of the four-tube memory cell;
the control terminal of the transistor T6 and the control terminal of the transistor T8 are both connected to the inverted value output terminal of the four-transistor memory cell.
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
the invention provides a bit cell applied to memory calculation, which comprises: the four-tube storage unit and a peripheral memory circuit are used for carrying out accumulation addition operation on the activation signal and the weight value output by the four-tube storage unit; the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit. The invention has set up four-tube memory cell, apply it to the memory array module instead of six-tube memory cell, has simplified the structure of the array device of the memory calculation, the peripheral memory circuit of the invention is used for carrying on the operation of cumulative addition, utilize analog mixed signal capacitive coupling to calculate and finish the neural network cumulative operation of binary system, realize the input activation of 5 values, has improved the computational accuracy, and there is quiescent current to reduce the power consumption and capacitive coupling mechanism has better stability in the course of calculating. Therefore, the invention simplifies the storage and calculation array structure, reduces the power consumption and improves the storage and calculation efficiency and precision.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.
FIG. 1 is a circuit diagram of a bit cell for memory computation according to the present invention;
fig. 2 is a circuit diagram of a storage array apparatus provided in the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention aims to provide a bit cell applied to memory computation and a memory computation array device, so as to simplify the memory computation array structure, reduce the power consumption and improve the memory computation efficiency and precision.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Example 1
As shown in FIG. 1, the present invention provides a bit cell for use in memory computing, the bit cell comprising: the four-tube storage unit and a peripheral memory circuit are used for carrying out accumulation addition operation on the activation signal and the weight value output by the four-tube storage unit; the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit; the word line control terminals (the control terminals of the transistor T3 and the transistor T4) of the four-transistor memory cell are connected with a word line WL, the bit line control terminal (the input terminal of the transistor T3) of the four-transistor memory cell is connected with a bit line BL, and the inverted bit line control terminal (the input terminal of the transistor T4) of the four-transistor memory cell is connected with an inverted bit line BLB; a first activation signal input terminal (input terminal of the transistor T5) of the peripheral saving circuit is connected to the first activation signal line MWL _ a, and a first deactivation signal input terminal (input terminal of the transistor T6) of the peripheral saving circuit is connected to the first deactivation signal line MWLB _ a; a second activation signal input terminal (input terminal of the transistor T7) of the peripheral saving circuit is connected to a second activation signal line MWL _ b, and a second deactivation signal input terminal (input terminal of the transistor T8) of the peripheral saving circuit is connected to a second deactivation signal line MWLB _ b; the signal output terminal of the peripheral memory circuit (the common terminal of the capacitor C1 and the capacitor C2) is connected to a signal output line.
Wherein the four-transistor memory cell comprises a transistor T1, a transistor T2, a transistor T3, and a transistor T4; the input end of the transistor T1 and the input end of the transistor T2 are both connected with a power supply VDD, the output end of the transistor T1 is connected with the control end of the transistor T2, and the control end of the transistor T1 is connected with the output end of the transistor T2; an input terminal of the transistor T3 is connected to the bit line BL, an output terminal of the transistor T3 is connected to the output terminal of the transistor T1, and a control terminal of the transistor T3 is connected to the word line WL; the input terminal of the transistor T4 is connected to the inverted bit line BLB, the output terminal of the transistor T4 is connected to the output terminal of the transistor T2, and the control terminal of the transistor T4 is connected to the word line WL.
The peripheral memory circuit comprises a capacitor C1, a capacitor C2, a transistor T5, a transistor T6, a transistor T7 and a transistor T8; an input terminal of the transistor T5 is connected to the first activation signal line MWL _ a, and an input terminal of the transistor T6 is connected to the first deactivation signal line MWLB _ a; the output terminal of the transistor T5, the output terminal of the transistor T6, and one end of the capacitor C1 are connected in common; an input terminal of the transistor T7 is connected to the second activation signal line MWL _ b, and an input terminal of the transistor T8 is connected to the second deactivation signal line MWLB _ b; the output terminal of the transistor T7, the output terminal of the transistor T8, and one end of the capacitor C2 are connected in common;
the other end of the capacitor C1 is connected with the other end of the capacitor C2 and then is connected with a signal output line; the control end of the transistor T5 and the control end of the transistor T7 are both connected with the weight output end of the four-tube memory cell; the control terminal of the transistor T6 and the control terminal of the transistor T8 are both connected to the inverted value output terminal of the four-transistor memory cell.
Specifically, as shown in fig. 1, the bit cell (Bitcell) is composed of a basic memory structure (four-transistor memory cell) of 4 transistors (T1, T2, T3, T4) with the addition of two capacitors (C1, C2) and peripheral memory circuits of four pass transistors (T5, T6, T7, T8).
In the bit unit, the capacitor C1 is charged and discharged after being gated by the activation signal line MWL _ a/MWLB _ a through the transistors T5 and T6, the capacitor C2 is charged and discharged after being gated by the activation signal line MWL _ b/MWLB _ b through the transistors T7 and T8, and the four transistors are selected to be conducted by the stored weight values (weight value Q and inverse weight value QB). Charge (one row at a time) is placed on the bit lines and shared by columns. bMAC is divided into two steps: the first step of precharging, in which MWL _ a (i), MWLB _ a (i), MWL _ b (i), MWLB _ b (i), MBL (i) are charged to VRST (middle level) at the same time, and there is no voltage potential on both sides of the capacitor; the second step is charging off, the input driver transmits an activation signal to mwl (i)/mwlb (i), and the result of the input activation being the same as or equal to the weight forms a voltage difference with MBL across the capacitor to generate charge accumulation on the bit line MBL. And the MBL outputs a result after analog-to-digital conversion is carried out through the ADC. The RST is a control signal, the gate of the MOS transistor connected to the VRST (intermediate level) is input, and the precharge and the charge shutdown are controlled by controlling the on/off of the MOS transistor connected to the VRST (intermediate level).
Table 1 is a multiply-accumulate operand table, in which the values of the Input (Input) are represented by combinations of high and low levels on four MWL lines (activation signal lines), and two numbers in each table in the bold frame at the lower right of table 1 represent voltages Vc1 and Vc2 applied to two capacitors. The 5-value Input (Input) improves the accuracy of the calculation very well.
Table 15 value input logic table
Figure 253807DEST_PATH_IMAGE002
Example 2
The present invention also provides a storage array apparatus comprising:
the device comprises a storage array module, a column decoding module, a row decoding module, an input activation driving module and an analog-to-digital conversion output module; the memory array module comprises
Figure 158178DEST_PATH_IMAGE001
Bit cells applied to memory computation; the output ends of the n bit lines of the row decoding module are respectively connected with the n bit lines, and the output ends of the n bit reversal lines of the row decoding module are respectively connected with the n bit reversal lines; the m word line output ends of the row decoding module are respectively connected with the m word lines; the m first activation signal output ends of the storage array module are respectively connected with the m first activation signal lines, and the m first deactivation signal output ends of the storage array module are respectively connected with the m first deactivation signal lines; the m second activation signal output ends of the storage array module are respectively connected with the m second activation signal lines, and the m second deactivation signal output ends of the storage array module are respectively connected with the m second deactivation signal lines; n analog signal input ends of the analog-to-digital conversion output module are respectively connected with n signal output lines; word line control ends of bit cells in m rows of the storage array module are respectively connected with m word lines; first activation signal input ends of m rows of bit units of the storage array module are respectively connected with m first activation signal lines, first flyback signal input ends of m rows of bit units of the storage array module are respectively connected with m first flyback signal lines, second activation signal input ends of m rows of bit units of the storage array module are respectively connected with m second activation signal lines, and second flyback signal input ends of m rows of bit units of the storage array module are respectively connected with m second flyback signal lines; bit line control ends of the n rows of bit cells of the storage array module are respectively connected with the n bit lines, and inverted bit line control ends of the n rows of bit cells of the storage array module are respectively connected with the n inverted bit lines; the signal output ends of the bit cells of the n rows of the memory array module are respectively connected with the n signal output lines.
The storage array device comprises a storage array module, a row decoding module, a column decoding module, an input activation driving module and an analog-to-digital conversion output module, wherein the row decoding module is used for row decoding in read-write operation (R/W) of a storage unit, the column decoding module is used for Address Decoder and R/W BL Control, and the input activation driving module and the analog-to-digital conversion output module are used for input activation driving (MWLDecoder/Driver) of a storage structure. The column decoding module acts on bit lines BL (i) and its inverse signals BLB (i), inputs and activates signals MWL _ a [ i ] and MWL _ b [ i ] and its inverse signals MWLB _ b [ i ], and outputs bit lines MBL (i) to analog-to-digital conversion output module (ADC) for output.
The bit line output MBL of a binary multiply accumulate (bMAC) operation for each column in a memory array module is the sum of a column multiply accumulate calculation, with analog signals on the MBL side, and the array contains one ADC per column in order to digitize these values.
The row decoding module decodes the address signal of the access data, and the column decoding module decodes the access data signal to realize the basic read-write operation of the data stored in the memory array.
The input activation driving module is used for transmitting an input activation signal, and the activation signal is operated with the data (namely the weight) stored in the storage array.
Analog-to-digital converter ADC performs analog-to-digital conversion on the multiply-accumulate bit line mbl (i) signal.
The structure and operation principle of the bit cell are the same as those of embodiment 1, and are not described herein again.
As shown in fig. 2, a row decoding module (c) outputs WL [ i ] signal to select a certain row of a storage array module (i) after decoding, a column decoding module (c) outputs BL [ i ] and BLB [ i ] to select a certain column of the storage array module (i), and the column decoding module (c) and the row decoding module (c) realize reading and writing of weight in a bit cell; the input activation driving module (IV) decodes the input activation signal and outputs 128 groups of MWL signals which are connected to each row of the array (IV); the output signal MBL [ i ] in the storage array module is connected to the analog-to-digital conversion output module (V), and the ADC in the corresponding column in the analog-to-digital conversion output module (V) completes the final result output.
The memory computing device is a structure proposed for a neural network, and the working principle of the memory computing device is to multiply and accumulate input activation and storage weight values in a storage unit. The weights are binarized to +1 and-1 in a Binary Neural Network (BNN), so that the multiplication can be represented by a simple exclusive-nor (XNOR) operation. The present invention relates to a memory computing device that performs binary multiply-accumulate (bMAC) using charge sharing.
As shown in fig. 2, the charge of the storage array device of the present invention (one row at a time) is placed on the bit lines and shared by columns. bMAC is divided into two steps: the first step of precharging, in which MWL _ a (i), MWLB _ a (i), MWL _ b (i), MWLB _ b (i), MBL (i) are charged to VRST (middle level) at the same time, and there is no voltage potential on both sides of the capacitor; the second step is charging off, the input driver transmits an activation signal to mwl (i)/mwlb (i), and the result of the input activation being the same as or equal to the weight forms a voltage difference with MBL across the capacitor to generate charge accumulation on the bit line MBL. And the MBL outputs a result after analog-to-digital conversion is carried out through the ADC.
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
the invention provides a bit cell applied to memory calculation, which comprises: a four-pipe memory cell and a peripheral memory circuit; the weight output end of the four-tube storage unit is connected with the weight input end of the four-tube storage unit, and the counter weight output end of the four-tube storage unit is connected with the counter weight input end of the four-tube storage unit. The storage array module in the storage array device adopts 4-tube units, so that the array structure is optimized, and the array area is reduced; 5-value input is activated, so that the calculation precision is improved; the calculation process of memory calculation is completed through a capacitive coupling charge domain, no static current is generated, power consumption is reduced, and a capacitive coupling mechanism has better stability. Therefore, compared with the prior art, the storage array device has smaller area, higher precision and better stability.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.

Claims (5)

1.一种应用于存内计算的位单元,其特征在于,所述位单元包括:1. A bit unit applied to in-memory computing, wherein the bit unit comprises: 四管存储单元和用于将激活信号与四管存储单元输出的权值进行累积加操作的外围存算电路;A four-tube storage unit and a peripheral storage circuit for accumulating and adding the activation signal and the weight output by the four-tube storage unit; 所述四管存储单元的权值输出端与所述四管存储单元的权值输入端连接,所述四管存储单元的反权值输出端与所述四管存储单元的反权值输入端连接;The weight output terminal of the four-tube storage unit is connected to the weight input terminal of the four-tube storage unit, and the inverse weight output terminal of the four-tube storage unit is connected to the inverse weight input terminal of the four-tube storage unit. connect; 所述四管存储单元的字线控制端与字线连接,所述四管存储单元的位线控制端与位线连接,所述四管存储单元的反位线控制端与反位线连接;The word line control terminal of the four-tube storage unit is connected to the word line, the bit line control terminal of the four-tube storage unit is connected to the bit line, and the reverse bit line control terminal of the four-tube storage unit is connected to the reverse bit line; 所述外围存算电路的第一激活信号输入端与第一激活信号线连接,所述外围存算电路的第一反激活信号输入端与第一反激活信号线连接;The first activation signal input end of the peripheral storage circuit is connected to the first activation signal line, and the first inversion signal input end of the peripheral storage circuit is connected to the first deactivation signal line; 所述外围存算电路的第二激活信号输入端与第二激活信号线连接,所述外围存算电路的第二反激活信号输入端与第二反激活信号线连接;The second activation signal input end of the peripheral storage and arithmetic circuit is connected with the second activation signal line, and the second inversion signal input end of the peripheral storage and calculation circuit is connected with the second inversion signal line; 所述外围存算电路的信号输出端与信号输出线连接;The signal output end of the peripheral storage circuit is connected with the signal output line; 所述外围存算电路,包括电容C1、电容C2、晶体管T5、晶体管T6、晶体管T7和晶体管T8;晶体管T5的输入端与第一激活信号线MWL_a连接,晶体管T6的输入端与第一反激活信号线MWLB_a连接;晶体管T5的输出端、晶体管T6的输出端和电容C1的一端共点连接;晶体管T7的输入端与第二激活信号线MWL_b连接,晶体管T8的输入端与第二反激活信号线MWLB_b连接;晶体管T7的输出端、晶体管T8的输出端和电容C2的一端共点连接;电容C1的另一端和电容C2的另一端连接后与信号输出线连接;晶体管T5的控制端和晶体管T7的控制端均与所述四管存储单元的权值输出端连接;晶体管T6的控制端和晶体管T8的控制端均与所述四管存储单元的反权值输出端连接。The peripheral storage circuit includes a capacitor C1, a capacitor C2, a transistor T5, a transistor T6, a transistor T7 and a transistor T8; the input end of the transistor T5 is connected to the first activation signal line MWL_a, and the input end of the transistor T6 is connected to the first inversion The signal line MWLB_a is connected; the output end of the transistor T5, the output end of the transistor T6 and one end of the capacitor C1 are connected in common; the input end of the transistor T7 is connected to the second activation signal line MWL_b, and the input end of the transistor T8 is connected to the second inversion signal The line MWLB_b is connected; the output end of the transistor T7, the output end of the transistor T8 and one end of the capacitor C2 are connected in common; the other end of the capacitor C1 is connected to the other end of the capacitor C2 and then connected to the signal output line; the control end of the transistor T5 and the transistor The control terminal of T7 is connected to the weight output terminal of the four-tube storage unit; the control terminal of the transistor T6 and the control terminal of the transistor T8 are both connected to the inverse weight output terminal of the four-tube storage unit. 2.根据权利要求1所述的应用于存内计算的位单元,其特征在于,所述四管存储单元包括晶体管T1、晶体管T2、晶体管T3和晶体管T4;2. The bit cell applied to in-memory computing according to claim 1, wherein the four-tube storage cell comprises a transistor T1, a transistor T2, a transistor T3 and a transistor T4; 晶体管T1的输入端和晶体管T2的输入端均与电源VDD连接,晶体管T1的输出端与晶体管T2的控制端连接,晶体管T1的控制端与晶体管T2的输出端连接;The input end of the transistor T1 and the input end of the transistor T2 are both connected to the power supply VDD, the output end of the transistor T1 is connected to the control end of the transistor T2, and the control end of the transistor T1 is connected to the output end of the transistor T2; 晶体管T3的输入端与位线BL连接,晶体管T3的输出端与晶体管T1的输出端连接,晶体管T3的控制端与字线WL连接;The input end of the transistor T3 is connected to the bit line BL, the output end of the transistor T3 is connected to the output end of the transistor T1, and the control end of the transistor T3 is connected to the word line WL; 晶体管T4的输入端与反位线BLB连接,晶体管T4的输出端与晶体管T2的输出端连接,晶体管T4的控制端与字线WL连接。The input terminal of the transistor T4 is connected to the inverted bit line BLB, the output terminal of the transistor T4 is connected to the output terminal of the transistor T2, and the control terminal of the transistor T4 is connected to the word line WL. 3.一种存算阵列装置,其特征在于,所述存算阵列装置包括:3. A storage and calculation array device, wherein the storage and calculation array device comprises: 存储阵列模块、列译码模块、行译码模块、输入激活驱动模块和模数转换输出模块;所述存储阵列模块包括成阵列排列的
Figure DEST_PATH_IMAGE001
个应用于存内计算的位单元;
a storage array module, a column decoding module, a row decoding module, an input activation drive module and an analog-to-digital conversion output module; the storage array module includes an array of
Figure DEST_PATH_IMAGE001
a bit unit applied to in-memory computations;
列译码模块的n个位线输出端分别与n个位线连接,列译码模块的n个反位线输出端分别与n个反位线连接;The n bit line output terminals of the column decoding module are respectively connected with the n bit lines, and the n inverted bit line output terminals of the column decoding module are respectively connected with the n inverted bit lines; 行译码模块的m个字线输出端分别与m个字线连接;m word line output terminals of the row decoding module are respectively connected with m word lines; 存储阵列模块的m个第一激活信号输出端分别与m个第一激活信号线连接,存储阵列模块的m个第一反激活信号输出端分别与m个第一反激活信号线连接;存储阵列模块的m个第二激活信号输出端分别与m个第二激活信号线连接,存储阵列模块的m个第二反激活信号输出端分别与m个第二反激活信号线连接;The m first activation signal output terminals of the storage array module are respectively connected with the m first activation signal lines, and the m first deactivation signal output terminals of the storage array module are respectively connected with the m first deactivation signal lines; the storage array The m second activation signal output ends of the module are respectively connected with m second activation signal lines, and the m second inversion signal output ends of the storage array module are respectively connected with m second inversion signal lines; 模数转换输出模块的n个模拟信号输入端分别与n个信号输出线连接;The n analog signal input ends of the analog-to-digital conversion output module are respectively connected with n signal output lines; 存储阵列模块的m行的位单元的字线控制端分别与m个字线连接;word line control terminals of bit cells in m rows of the storage array module are respectively connected with m word lines; 存储阵列模块的m行的位单元的第一激活信号输入端分别与m个第一激活信号线连接,存储阵列模块的m行的位单元的第一反激活信号输入端分别与m个第一反激活信号线连接,存储阵列模块的m行的位单元的第二激活信号输入端分别与m个第二激活信号线连接,存储阵列模块的m行的位单元的第二反激活信号输入端分别与m个第二反激活信号线连接;The first activation signal input terminals of the bit cells in m rows of the storage array module are respectively connected with m first activation signal lines, and the first inversion signal input terminals of the bit cells in m rows of the storage array module are respectively connected with m first activation signal lines. The deactivation signal lines are connected, the second activation signal input ends of the bit cells in m rows of the storage array module are respectively connected with m second activation signal lines, and the second deactivation signal input ends of the bit cells in m rows of the storage array module are respectively connected with m second deactivation signal lines; 存储阵列模块的n列的位单元的位线控制端分别与n个位线连接,存储阵列模块的n列的位单元的反位线控制端分别与n个反位线连接;The bit line control terminals of the bit cells in n columns of the storage array module are respectively connected with n bit lines, and the reverse bit line control terminals of the bit cells in n columns of the storage array module are respectively connected with n reverse bit lines; 存储阵列模块的n列的位单元的信号输出端分别与n个信号输出线连接;The signal output ends of the bit cells of the n columns of the storage array module are respectively connected with the n signal output lines; 位单元的外围存算电路,包括电容C1、电容C2、晶体管T5、晶体管T6、晶体管T7和晶体管T8;晶体管T5的输入端与第一激活信号线MWL_a连接,晶体管T6的输入端与第一反激活信号线MWLB_a连接;晶体管T5的输出端、晶体管T6的输出端和电容C1的一端共点连接;晶体管T7的输入端与第二激活信号线MWL_b连接,晶体管T8的输入端与第二反激活信号线MWLB_b连接;晶体管T7的输出端、晶体管T8的输出端和电容C2的一端共点连接;电容C1的另一端和电容C2的另一端连接后与信号输出线连接;晶体管T5的控制端和晶体管T7的控制端均与位单元的四管存储单元的权值输出端连接;晶体管T6的控制端和晶体管T8的控制端均与位单元的四管存储单元的反权值输出端连接。The peripheral storage circuit of the bit cell includes a capacitor C1, a capacitor C2, a transistor T5, a transistor T6, a transistor T7 and a transistor T8; the input end of the transistor T5 is connected to the first activation signal line MWL_a, and the input end of the transistor T6 is connected to the first inverter. The activation signal line MWLB_a is connected; the output end of the transistor T5, the output end of the transistor T6 and one end of the capacitor C1 are connected in common; the input end of the transistor T7 is connected to the second activation signal line MWL_b, and the input end of the transistor T8 is connected to the second inversion The signal line MWLB_b is connected; the output end of the transistor T7, the output end of the transistor T8 and one end of the capacitor C2 are connected in common; the other end of the capacitor C1 is connected to the other end of the capacitor C2 and then connected to the signal output line; the control end of the transistor T5 and the The control terminal of the transistor T7 is connected to the weight output terminal of the four-tube storage unit of the bit unit; the control terminal of the transistor T6 and the control terminal of the transistor T8 are both connected to the inverse weight output terminal of the four-tube storage unit of the bit unit.
4.根据权利要求3所述的存算阵列装置,其特征在于,所述位单元包括:4. The storage and arithmetic array device according to claim 3, wherein the bit unit comprises: 四管存储单元和用于将激活信号与四管存储单元输出的权值进行累积加操作的外围存算电路;A four-tube storage unit and a peripheral storage circuit for accumulating and adding the activation signal and the weight output by the four-tube storage unit; 所述四管存储单元的权值输出端与所述四管存储单元的权值输入端连接,所述四管存储单元的反权值输出端与所述四管存储单元的反权值输入端连接;The weight output terminal of the four-tube storage unit is connected to the weight input terminal of the four-tube storage unit, and the inverse weight output terminal of the four-tube storage unit is connected to the inverse weight input terminal of the four-tube storage unit. connect; 所述四管存储单元的字线控制端与字线连接,所述四管存储单元的位线控制端与位线连接,所述四管存储单元的反位线控制端与反位线连接;The word line control terminal of the four-tube storage unit is connected to the word line, the bit line control terminal of the four-tube storage unit is connected to the bit line, and the reverse bit line control terminal of the four-tube storage unit is connected to the reverse bit line; 所述外围存算电路的第一激活信号输入端与第一激活信号线连接,所述外围存算电路的第一反激活信号输入端与第一反激活信号线连接;The first activation signal input end of the peripheral storage circuit is connected to the first activation signal line, and the first inversion signal input end of the peripheral storage circuit is connected to the first deactivation signal line; 所述外围存算电路的第二激活信号输入端与第二激活信号线连接,所述外围存算电路的第二反激活信号输入端与第二反激活信号线连接;The second activation signal input end of the peripheral storage and arithmetic circuit is connected with the second activation signal line, and the second inversion signal input end of the peripheral storage and calculation circuit is connected with the second inversion signal line; 所述外围存算电路的信号输出端与信号输出线连接。The signal output end of the peripheral storage circuit is connected with the signal output line. 5.根据权利要求4所述的存算阵列装置,其特征在于,所述四管存储单元包括晶体管T1、晶体管T2、晶体管T3和晶体管T4;5. The storage and arithmetic array device according to claim 4, wherein the four-tube storage unit comprises a transistor T1, a transistor T2, a transistor T3 and a transistor T4; 晶体管T1的输入端和晶体管T2的输入端均与电源VDD连接,晶体管T1的输出端与晶体管T2的控制端连接,晶体管T1的控制端与晶体管T2的输出端连接;The input end of the transistor T1 and the input end of the transistor T2 are both connected to the power supply VDD, the output end of the transistor T1 is connected to the control end of the transistor T2, and the control end of the transistor T1 is connected to the output end of the transistor T2; 晶体管T3的输入端与位线BL连接,晶体管T3的输出端与晶体管T1的输出端连接,晶体管T3的控制端与字线WL连接;The input end of the transistor T3 is connected to the bit line BL, the output end of the transistor T3 is connected to the output end of the transistor T1, and the control end of the transistor T3 is connected to the word line WL; 晶体管T4的输入端与反位线BLB连接,晶体管T4的输出端与晶体管T2的输出端连接,晶体管T4的控制端与字线WL连接。The input terminal of the transistor T4 is connected to the inverted bit line BLB, the output terminal of the transistor T4 is connected to the output terminal of the transistor T2, and the control terminal of the transistor T4 is connected to the word line WL.
CN202011219701.6A 2020-11-05 2020-11-05 Bit cell applied to memory computation and memory computation array device Active CN112036562B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011219701.6A CN112036562B (en) 2020-11-05 2020-11-05 Bit cell applied to memory computation and memory computation array device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011219701.6A CN112036562B (en) 2020-11-05 2020-11-05 Bit cell applied to memory computation and memory computation array device

Publications (2)

Publication Number Publication Date
CN112036562A CN112036562A (en) 2020-12-04
CN112036562B true CN112036562B (en) 2021-02-26

Family

ID=73573266

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011219701.6A Active CN112036562B (en) 2020-11-05 2020-11-05 Bit cell applied to memory computation and memory computation array device

Country Status (1)

Country Link
CN (1) CN112036562B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232501B (en) * 2020-12-11 2021-09-28 中科南京智能技术研究院 Memory computing device
CN112232502B (en) * 2020-12-17 2021-03-23 中科院微电子研究所南京智能技术研究院 A kind of XOR storage unit and storage array device
CN112558922A (en) * 2021-02-20 2021-03-26 中科院微电子研究所南京智能技术研究院 Four-transistor memory computing device based on separated word lines
CN112558919B (en) * 2021-02-22 2021-05-18 中科院微电子研究所南京智能技术研究院 In-memory computing bit unit and in-memory computing device
CN113095496B (en) * 2021-04-25 2022-07-15 山东大学 A circuit and method for improving the calculation accuracy of an integrated storage and calculation array
CN112992232B (en) * 2021-04-28 2021-08-17 中科院微电子研究所南京智能技术研究院 A multi-bit positive and negative single-bit memory computing unit, array and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7240657B2 (en) * 2018-05-15 2023-03-16 Tokyo Artisan Intelligence株式会社 Neural network circuit device, neural network, neural network processing method, and neural network execution program
CN111816233B (en) * 2020-07-30 2023-08-01 中科南京智能技术研究院 In-memory computing unit and array
CN111816232B (en) * 2020-07-30 2023-08-04 中科南京智能技术研究院 In-memory computing array device based on 4-pipe storage structure

Also Published As

Publication number Publication date
CN112036562A (en) 2020-12-04

Similar Documents

Publication Publication Date Title
CN112036562B (en) Bit cell applied to memory computation and memory computation array device
CN112992223B (en) Memory computing unit, memory computing array and memory computing device
CN111816231B (en) Memory computing device with double-6T SRAM structure
CN115048075B (en) SRAM in-memory computing chip based on capacitive coupling
CN112151091B (en) 8T SRAM unit and memory computing device
CN112151092B (en) A storage unit, storage array and in-memory computing device based on 4-tube storage
CN113255904B (en) Voltage margin enhanced capacitive coupling storage and computing integrated unit, sub-array and device
CN111816232B (en) In-memory computing array device based on 4-pipe storage structure
CN112558919B (en) In-memory computing bit unit and in-memory computing device
CN113257306B (en) Storage and calculation integrated array and accelerating device based on static random access memory
CN109979503B (en) Static random access memory circuit structure for realizing Hamming distance calculation in memory
CN112133348B (en) A 6T cell-based storage unit, storage array and in-memory computing device
CN114627930B (en) Single-bit differential SRAM (static random Access memory) storage integrated array and device
CN110058839B (en) Circuit structure based on static random access memory internal subtraction method
CN114300012B (en) Decoupling SRAM memory computing device
CN110176264B (en) A High-Low-Bit Combining Circuit Structure Based on In-Memory Computing
CN110633069B (en) Multiplication circuit structure based on static random access memory
CN116126779B (en) 9T memory operation circuit, multiply-accumulate operation circuit, memory operation circuit and chip
CN113782072B (en) Multi-bit memory computing circuit
US11100979B1 (en) Low-power SRAM memory cell and application structure thereof
CN114895869A (en) Multi-bit memory computing device with symbols
Tyagi et al. A 101 TOPS/W and 1.73 TOPS/mm 2 6T SRAM-Based Digital Compute-in-Memory Macro Featuring a Novel 2T Multiplier
CN118333119B (en) Memory calculating unit, memory calculating method, memory calculating block and neural network circuit assembly
CN112558922A (en) Four-transistor memory computing device based on separated word lines
CN116204490B (en) A 7T storage and calculation circuit and multiplication and accumulation circuit based on low voltage technology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 5 / F, building 1, Qilin artificial intelligence Industrial Park, 266 Chuangyan Road, Jiangning District, Nanjing City, Jiangsu Province

Patentee after: Zhongke Nanjing Intelligent Technology Research Institute

Address before: 5 / F, building 1, Qilin artificial intelligence Industrial Park, 266 Chuangyan Road, Jiangning District, Nanjing City, Jiangsu Province

Patentee before: Nanjing Institute of intelligent technology, Institute of microelectronics, Chinese Academy of Sciences