# A 210-MHz 4.23 fJ Energy/Bit 1-kb Asymmetrical Schmitt-Trigger-Based SRAM Using 40-nm CMOS Process

Ralph Gerard B. Sangalang<sup>o</sup>[,](https://orcid.org/0000-0002-8014-8229) *Member, IEEE*, Shiva Reddy, Lean Karlo S. Tolentino<sup>o</sup>, *Student Member, IEEE*, You-Wei Shen[,](https://orcid.org/0000-0003-4565-9506) Oliver Lexter July A. Jose<sup>®</sup>, and Chua-Chin Wan[g](https://orcid.org/0000-0002-2426-2879)<sup>®\*</sup>, Senior Member, IEEE

*Abstract*—This study introduces a 1-kb SRAM design that operates at low power utilizing asymmetrical Schmitt-trigger based SRAM cells. By adopting an asymmetrical design, the SRAM cell performance in terms of hold, access, and write capabilities were enhanced. A power reduction and speed compensation circuit is presented in the design. The SRAM is implemented and fabricated on-silicon using a typical 40-nm TSMC CMOS technology. Measurements show that the design can operate up to 210 MHz clock frequency with a 4.23 fJ energy per bit for a supply voltage of 0.8 V. It also has an average delay of 5.659 ns, which is reduced to an average of 1.708 ns when the speed compensation circuit is enabled.

*Index Terms*—Schmitt trigger, data retention, memory integrity, low power, static noise margin, SRAM cell

## I. INTRODUCTION AND OVERVIEW

<span id="page-0-0"></span>SRAM is essential for low power applications because it allows for high-speed data processing and access while using very little power. Internet of Things (IoT), mobile, and wearable devices are examples of low power applications that demand efficient power utilization to optimize battery life and reduce energy consumption. The fast speed and low power consumption of SRAMs make it the perfect memory solution for low power applications [\[1\]](#page-4-0). Because of this SRAM feature, the refresh circuitry is not required, which reduces power consumption and simplifies circuit design.

SRAM is intended to be self-sustaining, unlike other forms of memory like dynamic RAM (DRAM), which needs frequent power refresh cycles to keep data state. This makes it perfect for systems like cache memory in microprocessors, graphics processing units (GPUs), and other high-performance systems that need quick access while consuming little power. Overall, SRAM is critical in low power applications due to

Correspondence: Chua-Chin Wang (email: ccwang@ee.nsysu.edu.tw). Dr. Wang is with Dept. of Electrical Engineering and Inst. of Undersea Technology, National Sun Yat-Sen University, Kaohsiung 80424, Taiwan. He is also an adjunct professor at VelTech, Chennai 600062, Tamil Nadu, India.

R. G. B. Sangalang, S. Reddy, and Y.-W. Shen are with Dept. of Electrical Engineering, National Sun Yat-Sen University.

L. K. S. Tolentino is with Dept. of Electrical Engineering, National Sun Yat-Sen University and Dept. of Electronics Engineering, Technological University of the Philippines.

O. L. J. A. Jose is with Dept. of Electrical Engineering, National Sun Yat-Sen University and Dept. of Electronics Engineering, Batangas State University– The Philippines' National Engineering University.

This investigation was partially supported by National Science and Technology Council, Taiwan, under grant MOST 110-2221-E-110-063-MY2 and MOST 109-2221-E-032-001-MY3.

its capacity to deliver high-speed data access and processing with low power consumption. Battery life optimization, device efficiency improvement, and energy consumption reduction are key factors for contemporary electronic gadgets.

1

Different techniques have been used to reduce the energy consumption of SRAM designs for the said applications. Examples are single-ended cell configurations [\[2\]](#page-4-1)–[\[6\]](#page-4-2), voltage scaling [\[4\]](#page-4-3)–[\[6\]](#page-4-2), voltage splitting [\[7\]](#page-4-4), voltage stacking with array swap [\[8\]](#page-4-5), and Schmitt-trigger cells [\[9\]](#page-4-6)–[\[15\]](#page-4-7). Singleended architecture have been proved as an effective way to minimize energy consumption. However, it requires a more complex controller and is somehow more susceptible to noise. Prior Schmitt-trigger SRAM cells proved to have a good noise performance, but it is yet to be tested on silicon [\[9\]](#page-4-6)–[\[15\]](#page-4-7).

# *Contributions of the paper*

In this paper, we proposed a 1-kb SRAM with 8T Schmitttrigger cells that has improved read and write performance having the following attributes: (1) quick access speed, (2) low energy consumption through voltage selection; and (3) high noise immunity. We also presented the implementation and measurement on silicon to support the performance claimed.

This report is sequenced as follows: Section [I](#page-0-0) presents an overview discussion of the study. The architecture, implementation and measurements are presented in Sections [II](#page-0-1) and [III,](#page-2-0) respectively. Finally, a conclusion is given in Section [IV.](#page-4-8)

#### <span id="page-0-1"></span>II. PROPOSED SRAM ARCHITECTURE AND CELLS

## *A. Proposed Architecture*

Fig. [1](#page-1-0) shows the top level view of the proposed SRAM consisting of seven (7) main blocks:

- SRAM array (1-kb)
- Power Reduction and Speed Compensation circuit
- Row and Column Decoders
- Control circuit
- Memory Built-In Self-Test (MBIST) Circuit
- Sense Amplifiers
- Output Column Selector

# *B. Asymmetric Schmitt-trigger based SRAM Cell*

Referring to Fig. [2,](#page-1-1) the proposed cell consists of a traditional inverter, composed of  $M_{201}$  and  $M_{203}$ , and a Schmitt-trigger (ST) inverter, composed of  $M_{202}$ ,  $M_{204}$ , &  $M_{205}$ , acting as the

This article has been accepted for publication in IEEE Transactions on Circuits and Systems--II: Express Briefs. This is the author's version which has not been fully edited and content may change prior to final publication. Citation information: DOI 10.1109/TCSII.2023.3289911



Fig. 1. Block diagram of the proposed 1-kb SRAM

<span id="page-1-0"></span>

<span id="page-1-1"></span>Fig. 2. Asymmetric Schmitt-trigger based SRAM cell with Voltage Mode Select

storage node.  $M_{207}$  &  $M_{208}$  are the access transistors for the read and write operations. Feedback transistor  $M_{206}$  raises the voltage at node 'x' to VDD-Vth when QB is high, requiring a larger voltage at Q to invert QB. Because of this, the noise margin of the SRAM is improved. Fig. [3](#page-1-2) shows the voltage transfer curves (VTC) of the ST inverter and the traditional inverter. It shows a steeper curve, due to the positive feedback transistor in the ST inverter, which can improve the noise rejection performance.

Shown in Fig. [4](#page-1-3) is the timing waveform of the three operations of the proposed SRAM, namely read, write, and standby operations.

*1) Write and Read operations:* During the write operation, the bitlines (BL/BLB) are pre-discharged firstly to refresh the



<span id="page-1-2"></span>Fig. 3. Schmitt-trigger and traditional inverter voltage transfer curves (VTC)



<span id="page-1-3"></span>Fig. 4. SRAM operation timing diagram

state of the cell. Discharging the state of the cell also reduces the power consumption. Both access transistors  $M_{207}$  or  $M_{208}$ are turned on when a corresponding row address is selected. The control circuit then drives BLB to 1 or 0 during "Write 0" or "Write 1", respectively, while BL is released of the control. During a "Write 1" operation,  $M_{201}$ ,  $M_{204}$ , and  $M_{205}$ are turned on, driving node Q high and node QB low. While during "Write 0" operation,  $M_{202}$  and  $M_{203}$  are on, giving nodes Q and QB a low and high value, respectively.

When the read operation is initiated, the control circuit releases control of the bitlines and turns on the sense amplifiers. The corresponding address are selected to access the states of the cells. When Q is high and QB is low, BL will be at a higher potential than BLB. Thus, when it is fed to the sense amplifier, it will generate a high output. The same is true for the inverse operation.

In both operations, VS (in Fig. [2\)](#page-1-1) is set to low to turn on M<sup>209</sup> to give a higher supply voltage to the cells.

*2) Standby operation:* Standby operation is entered when the cells are not being accessed. In this operation, VS is turned off and VSB is on thus giving a lower supply, VDD–Vthp, to the cells. By this way, the standby power of the cells are decreased. The access transistors  $M_{207}$  &  $M_{208}$  are turned off to disconnect the cells from the bitlines, preventing the access to any of the cells. Transistor  $M_{206}$  also prevents the cells to be flipped from 1 to 0, when Q drops due to noise or other external disturbance.

*3) Cell Sizing:* Eqns. [\(1\)](#page-2-1), [\(2\)](#page-2-2), and [\(3\)](#page-2-3) are used to determine the sizes of the transistors in the cells [\[16\]](#page-4-9). The cell and pullup ratios (CR and PR) are assigned a value of 1 to reduce the effects of NBTI (negative-bias temperature instability). The upper trip point (utp) of the ST inverter in Fig. [2](#page-1-1) is set to 0.5 V, slightly higher than half of the supply. This is to have a better hold of the high logic value. The storage transistors in the cells are high-Vth devices which reduces the probability of a voltage flip in standby mode. The access transistors are chosen to be low-Vth devices to shorten the access time.

This article has been accepted for publication in IEEE Transactions on Circuits and Systems--II: Express Briefs. This is the author's version which has not been fully edited and content may change prior to final publication. Citation information: DOI 10.1109/TCSII.2023.3289911

3



<span id="page-2-4"></span>Fig. 5. Adaptive Voltage Select (AVS) circuit

$$
PR = \left(\frac{W}{L}\right)_{M202} \div \left(\frac{W}{L}\right)_{M208} \tag{1}
$$

$$
CR = \left(\frac{W}{L}\right)_{M203} \div \left(\frac{W}{L}\right)_{M201} \tag{2}
$$

<span id="page-2-3"></span>
$$
\frac{\beta_1}{\beta_3} = \left(\frac{W}{L}\right)_{M205} \div \left(\frac{W}{L}\right)_{M206} = \left(\frac{V_{DD} - V_{utp}}{V_{utp} - V_{thn}}\right)^2 \tag{3}
$$

#### *C. Power Reduction and Speed Compensation Circuit*

Power Reduction and Speed Compensation circuit is composed of three components as shown in Fig. [1,](#page-1-0) (1) Voltage Mode Select (VMS), (2) Adaptive Voltage Select (AVS), and Line Voltage Boost (LVB). This is meant to decrease the energy consumption of the SRAM as well as compensate for the speed drop when needed.

*1) Voltage Mode Select (VMS):* The VMS circuit is designed to give the cell a lower supply voltage during standby operation and a normal voltage during read or write operations. Referring to Fig. [2,](#page-1-1) the proposed SRAM cell is powered through the VMS circuit. The VMS selects the supply voltage being used in the column of the SRAM.

*2) Adaptive Voltage Select (AVS):* The adaptive voltage select (AVS) circuit is illustrated in Fig. [5.](#page-2-4) The AVS circuit is triggered when boost select (BS) is enabled driving node VP0 to ground through  $M_{501}$  and  $M_{502}$ . This will then drive node VP1 high and node VP2 low. The latch composed of  $inv_{502}$ ,  $inv<sub>503</sub>$ , and the transmission gate ensures that VP2 is low even when BS is not enabled. This then gives out the boost signal to the Line Voltage Boost (LVB) circuit.

*3) Line Voltage Boost (LVB) Circuit:* The Line Voltage Boost (LVB) circuit is shown in Fig. [6.](#page-2-5) When LVB is activated, the capacitor  $C_{601}$  is charge to a higher voltage equivalent to the voltage ( $\Delta V$ ) from the pull-up circuit of inv<sub>603</sub>. When the boost circuit is disabled,  $M_{601}$  is turned on and the pulldown network (PD $_{inv603}$ ) of inv<sub>603</sub> activates and charges up capacitor  $C_{601}$ . Upon enabling the boost circuit,  $M_{601}$  is then turned off and activating the pull-up network ( $PU<sub>inv603</sub>$ ) of  $inv<sub>603</sub>$  providing a bitline and wordline voltages of VDD' = VDD +  $\Delta V$  providing the speed boost to the circuit.

## *D. Memory Built-in Self-test (MBIST)*

In order to ensure the SRAM reliability, a memory builtin self-test circuit is included in the design. Fig. [7\(](#page-3-0)a) shows



<span id="page-2-5"></span>Fig. 6. Line Voltage Boost (LVB) circuit

TABLE I MARCH C– ALGORITHM NOTATIONS

<span id="page-2-8"></span><span id="page-2-2"></span><span id="page-2-1"></span>

| <b>Description</b>                            | <b>Notation</b> |
|-----------------------------------------------|-----------------|
| March element. Contains the operation needed. |                 |
| Write pattern 0 to SRAM                       | W <sub>0</sub>  |
| Read SRAM, check if equal to pattern 0        | R <sub>0</sub>  |
| Write pattern 1                               | W1              |
| Read SRAM, check if equal to pattern 1        | R1              |
| Increment address after operation             |                 |
| Decrement address after operation             | JL              |

∗ Pattern 0 and 1 are generated by the LFSR, they are inverse of each other

the block diagram of the MBIST circuit. It consists of a controller, pattern generator (PG), and an output response analyzer (ORA). The test patterns are generated by an LFSR circuit (linear feedback shift register), as shown in Fig. [7\(](#page-3-0)b) and the corresponding polynomial is described in Eqn. [\(4\)](#page-2-6). A simplified timing diagram of the MBIST is shown in Fig. [4.](#page-1-3) March C– algorithm is used since it performs better than other March algorithm, requiring 10 access per SRAM bits and detects more faults. The algorithm can detect transition faults (TF), stuck-at faults (SAF), neighborhood pattern sensitive faults (NPSF), coupling faults (CF), bridging faults (BF), and address decoder faults (ADF). The March C– algorithm is described using the notations in Eqn. [\(5\)](#page-2-7) and Table [I.](#page-2-8)

<span id="page-2-6"></span>
$$
f(x) = 1 + x + x3 + x4 + x5
$$
 (4)

<span id="page-2-7"></span>
$$
\{\Uparrow(W0);\Uparrow(R0,W1);\Uparrow(R1,W0);\n\qquad\qquad (B0,W1);\Downarrow(R0,W1);\Downarrow(R0)\}\n\tag{5}
$$

#### III. SIMULATION AND MEASUREMENT ON SILICON

<span id="page-2-0"></span>The proposed SRAM prototype is fabricated on silicon using TSMC 40-nm CMOS technology and packaged in a 32-pin dual-inline package (DIP) chip. Referring to Fig. [9,](#page-3-1) it shows the die photograph of the proposed 1-kb SRAM. The die is 595 $\times$ 595  $\mu$ m<sup>2</sup> and the core is 249 $\times$ 212  $\mu$ m<sup>2</sup>. An all-PVT-corner post-layout simulation is performed for the design. Fig. [10\(](#page-3-2)a) shows the worst corner Static Noise Margin (SNM) simulation at  $VDD = 0.8$  V. The worst-case SNM is 608.11 mV. Fig. [10\(](#page-3-2)b) shows the Dynamic Noise Margin (DNM) of the proposed cell showing that the cells could still operate even when a 0.3 V noise enters the wordline ensuring that the cells would not be easily flipped during hold operation. SNM for the supply variations from 0.3 V to 0.8 V is shown in Fig. [11](#page-3-3) and is compared to recent proposed Schmitt-trigger



<span id="page-3-0"></span>Fig. 7. Memory Built-in Self-test (MBIST)



Fig. 8. MBIST Timing diagram

based SRAM cells. This shows that the cells can still operate at a VDD of 0.3 V. Ref. [\[10\]](#page-4-10) has managed to have their cells work at 0.2 V supply, this could be because they are using an advanced 7-nm FinFET process.

Referring to Fig. [12,](#page-3-4) the measurement setup for the prototype chip is presented. Agilent E3631A is the power supply used. The input signals are generated using the Agilent 81250 function generator, and the outputs are measured using Keysight DSAV134A. The measurements are done for 6 chips measured 100 times each (at a clock rate of 100 MHz to 300 MHz at an increment of 10 MHz). Fig. [13](#page-4-11) shows examples of the measurement result during normal operation and when the boost signal is enabled. It shows that when boost is enabled,



4

<span id="page-3-2"></span>Fig. 10. Noise performance of the proposed SRAM cell, (a) Static Noise Margin (SNM); (b) Dynamic Noise Margin (DNM)



<span id="page-3-3"></span>Fig. 11. SNM comparison with previous Schmitt-trigger SRAM cells

there is a significant improvement of the output delay, i.e., from 2.5 ns to 1.8 ns. It can also be observed that the output glitch during write-0 operation in Fig. [13\(](#page-4-11)a) is removed when boost is enabled. Repeatability test, as shown in Fig. [14,](#page-4-12) presents the delay measurements during normal mode and when boost is enabled. The average delay during normal mode is 5.659 ns and 1.708 when the boost is enabled. This shows the reliability of the proposed design.

Table [II](#page-4-13) presents a performance comparison with several prior SRAM designs that were implemented on silicon. The proposed SRAM achieved the best SNM when referred to the supply voltage due to the use of Schmitt-trigger cells. The proposed design has the best read PDP (power delay product) and energy per bit with values of 1.758 and 4.23 fJ, respectively at the cost of a slightly larger area.



<span id="page-3-1"></span>Fig. 9. Die photograph of the fabricated SRAM



<span id="page-3-4"></span>Fig. 12. Measurement setup for the prototype chip



Fig. 13. Measurement results with (a) Boost disabled, and (b) Boost enabled

<span id="page-4-11"></span>

<span id="page-4-12"></span>Fig. 14. Delay histogram (600 measurements)

# IV. CONCLUSION

<span id="page-4-8"></span>A very low power 1-kb SRAM is realized on silicon using asymmetrical cells. A combination of traditional and Schmitt-trigger inverters is used to improve the noise rejection capability of the SRAM. A power reduction circuit is used to improve energy consumption during standby operation through a voltage mode select circuit and a line voltage boost is added to improve the read and write delays. The proposed SRAM offered the lowest energy consumption operating at more than 100 MHz up to date.

> TABLE II PERFORMANCE COMPARISON WITH PREVIOUS SRAM

<span id="page-4-13"></span>

## **REFERENCES**

5

- <span id="page-4-0"></span>[1] Y. Kim, S. Patel, H. Kim, N. Yadav, and K. K. Choi, "Ultra-low power and high-throughput SRAM design to enhance AI computing ability in autonomous vehicles," *Electronics*, vol. 10, no. 3, pp. 1–22, Jan. 2021, Art. no. 2050095.
- <span id="page-4-1"></span>[2] N. Surana and J. Mekie, "Energy efficient single-ended 6-T SRAM for multimedia applications," *IEEE Tran. Circuits Syst. II- Exp. Briefs*, vol. 66, no. 6, pp. 1023–1027, Jun. 2019.
- [3] D.-S. Wang, Y.-H. Su, and C.-C. Wang, "A readout circuit with cell output slew rate compensation for 5T single-ended 28 nm CMOS SRAM," *Microelectron. J.*, vol. 70, pp. 107–116, Dec. 2017.
- <span id="page-4-3"></span>[4] C.-C. Wang, R. G. B. Sangalang, and I.-T. Tseng, "A single-ended low power 16-nm FinFET 6T SRAM design with PDP reduction circuit," *IEEE Tran. Circuits Syst. II- Exp. Briefs*, vol. 68, no. 12, pp. 3478– 3482, Dec. 2021.
- <span id="page-4-16"></span>[5] C.-C. Wang and C.-P. Kuo, "200-MHz single-ended 6T 1-kb SRAM with 0.2313 pJ energy/access using 40-nm CMOS logic process," *IEEE Tran. Circuits Syst. II- Exp. Briefs*, vol. 68, no. 9, pp. 3163–3166, Sep. 2021.
- <span id="page-4-2"></span>[6] C.-C. Wang, R. G. B. Sangalang, I.-T. Tseng, Y.-J. Chiu, Y.-C. Lin, and O. L. J. A. Jose, "A 1.0 fJ energy/bit single-ended 1 kb 6T SRAM implemented using 40 nm CMOS process," *IET Circ. Devices Syst.*, vol. 17, no. 2, pp. 75–87, Mar. 2023.
- <span id="page-4-4"></span>[7] M. S. M. Siddiqui, Z. C. Lee, and T. T.-H. Kim, "A 16-kb 9T ultralowvoltage SRAM with column-based split cell-VSS, data-aware writeassist, and enhanced read sensing margin in 28-nm FDSOI," *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.*, vol. 29, no. 10, pp. 1707–1719, Oct. 2021.
- <span id="page-4-5"></span>[8] J. Wang, H. An, Q. Zhang, H. S. Kim, D. Blaauw, and D. Sylvester, "A 40-nm ultra-low leakage voltage-stacked SRAM for intelligent IoT sensors," *IEEE Solid-State Circ. Lett.*, vol. 4, pp. 14–17, Dec. 2020.
- <span id="page-4-6"></span>[9] E. Abbasian and M. Gholipour, "Design of a Schmitt-trigger-based 7T SRAM cell for variation resilient low-energy consumption and reliable internet of things applications," *AEU - Int. J. Electron. Commun.*, vol. 138, pp. 1–11, Aug. 2021, Art. no. 153899.
- <span id="page-4-10"></span>[10] E. Abbasian, E. Mani, M. Gholipour, M. Karamimanesh, M. Sahid, and A. Zaidi, "A Schmitt-trigger-based low-voltage 11T SRAM cell for lowleakage in 7-nm FinFET technology," *Circuits Syst. Signal Process.*, vol. 41, no. 6, pp. 3081–3105, Jan. 2022.
- [11] K. Cho, J. Park, T. W. Oh, and S.-O. Jung, "One-sided Schmitt-triggerbased 9T SRAM cell for near-threshold operation," *IEEE Trans. Circuits Syst. I-Regul. Pap.*, vol. 67, no. 5, pp. 1551–1561, May 2020.
- [12] A. S. Rajput, M. Pattanaik, and R. Tiwari, "Design and analysis of Schmitt-trigger based 10T SRAM in 32-nm technology," in *Proc. 2017 IEEE Int. Symp. Nanoelectron. Inf. Syst. (iNIS)*, Bhopal, India, Dec. 2017, pp. 234–237.
- [13] S. Reddy, R. G. B. Sangalang, and C.-C. Wang, "Sub-0.2 pJ/access Schmitt trigger based 1-kb 8T SRAM implemented using 40-nm CMOS process," in *Proc. 2022 Int. Conf. IC Design Technol. (ICICDT)*, Hanoi, Vietnam, Sep. 2022, pp. 24–27.
- [14] K. Sanapala, R. Sakthivel, and S.-S. Yeo, "Schmitt trigger-based singleended 7T SRAM cell for internet of things (IoT) applications," *J. Supercomput.*, vol. 74, no. 9, pp. 4613–4622, May 2018.
- <span id="page-4-7"></span>[15] V. Srinivasan, R. V. Venkatraman, and K. S. Kumar, "Schmitt trigger based SRAM cell for ultralow power operation- a CNFET based approach," *Procedia Eng.*, vol. 64, pp. 115–124, Jul. 2013.
- <span id="page-4-9"></span>[16] R. J. Baker, *CMOS: Circuit Design, Layout, and Simulation*, 3rd ed. John Wiley & Sons, Inc., Aug. 2010.
- <span id="page-4-14"></span>[17] A. T. Do, S. M. A. Zeinolabedin, and T. T.-H. Kim, "Energy-efficient data-aware SRAM design utilizing column-based data encoding," *IEEE Tran. Circuits Syst. II- Exp. Briefs*, vol. 67, no. 10, pp. 2154–2158, Oct. 2020.
- <span id="page-4-15"></span>[18] J. Chen, W. Zhao, Y. Wang, and Y. Ha, "Analysis and optimization strategies toward reliable and high-speed 6T compute SRAM," *IEEE Trans. Circuits Syst. I-Regul. Pap.*, vol. 68, no. 4, pp. 1520–1531, Apr. 2021.