Journal of Circuits, Systems, and Computers Vol. 29, No. 6 (2020) 2050088 (17 pages) © World Scientific Publishing Company DOI: 10.1142/S0218126620500887



# 2-GHz 2×VDD 28-nm CMOS Digital Output Buffer with Slew Rate Auto-Adjustment Against Process and Voltage Variations<sup>\*</sup>

Chua-Chin Wang<sup>†</sup>, Zong-You Hou<sup>‡</sup> and Yu-Lin Deng<sup>§</sup>

 $\begin{array}{c} Department \ of \ Electrical \ Engineering, \ National \ Sun \ Yat-Sen \ University,\\ 70 \ Lian-Hai \ Rd., \ Kaohsiung, \ Taiwan \\ & ^\dagger ccwang@ee.nsysu.edu.tw \\ & ^\$ blazesky@vlsi.ee.nsysu.edu.tw \\ & ^\$x9900@vlsi.ee.nsysu.edu.tw \end{array}$ 

U-Fat Chio<sup>¶</sup> and Wei Wang<sup>||</sup>

College of Electronics Engineering, Chongqing University of Posts and Telecommunications, Chongqing, P. R. China <sup>¶</sup>u.c.mo@ieee.org <sup>¶</sup>wangwei@cqupt.edu.cn

> Received 21 June 2018 Accepted 18 June 2019 Published 5 August 2019

A 2×VDD CMOS output buffer with process, voltage and leakage (PVL) detection mechanism is proposed such that slew rate is auto-adjusted to reduce the variations at different corners. To boost the driving current, low threshold voltage transistors are used instead of devices with typical threshold voltage in the driving transistor of output stage. More importantly, to prevent large leakage of those large low threshold voltage devices, leakage detection resistors are added at the gates of the always-on low threshold voltage transistors to clamp the leakage. The static power consumption is reduced when it is not activated. Another feature of the proposed design is that the gate-oxide leakage is also reduced by lengthening the driving transistors. Besides, all biases in the proposed design are generated from bandgap circuits such that not only is the variation caused by temperature drifting reduced, the area overhead and power dissipation are also minimized. The proposed design is carried out by using 28-nm CMOS process. The data rate proved by physical measurement is proved to be 2.0 GHz given 1.8/1.05 V supply voltage, namely, VDD or 2×VDD, when the proposed PVL detection as well as the compensation circuitry are activated.

Keywords: 2×VDD; CMOS; slew rate adjustment; PV variation and leakage detection; output buffer.

\*This paper was recommended by Regional Editor Piero Malcovati.

<sup>†</sup>Corresponding author.

#### 1. Introduction

By the demand of lower fabrication cost, lower voltage and lower power consumption, the CMOS technology has been developed toward nano-scale nodes for years. Nevertheless, a lot of PCB-based applications and systems are still equipped with chips fabricated by prior CMOS processes using various digital voltage levels, e.g., 5.0 V, 3.3 V or even 1.8 V. Thus, the digital data exchange among these chips becomes a topic. Particularly, the slew rate is one of the most important requirements for digital transmission and data exchange. Voltage level converter chips were then utilized to resolve this problem, which consume extra area and power in the PCB-based systems. To remove the level converters and reduce the PCB size, mixed-voltage buffers have been deemed as a better alternative.

Digital I/O interfaces have strict slew rate demand. For instance, the latest specification of double data rate fourth generation (DDR4) DRAM needs to comply high performance computing system (HPC), where minimum slew rate limitation is 4 V/ns. If the slew rate is too high, simultaneous switching noise (SSN) becomes a threat to jeopardize the signal integrity.<sup>1</sup> If the slew rate is too low, the time margin will violate setup time requirement.<sup>2</sup> Many researches in the past were proposed to resolve these slew rate problems. The stacked-transistor-based output stage and variation detectors are major approaches to resolve the variations of slew rate and increase reliability, respectively.<sup>3–8</sup> Meanwhile, a digital-based process detector has been reported to enhance the detection speed and make easier the detection design as well.<sup>2</sup> Many other PVT (process, voltage, temperature) detection methods were also proposed.<sup>9–12</sup> However, although PVT (process, voltage, temperature) variations are always discussed together as the all-corner variations, they have individual impact on the digital signal quality in reality.

However, an interesting phenomenon should be addressed. When the cost of the buffer design is an important consideration, the impact of (P, V, T) on the slew rate variation must be explored to prioritize these three major variation sources. Table 1 summarizes the result of Monte Carlo simulations (100 times), where the 1 V, TT, 25°C is considered as a baseline for fair comparison. One of the three variations, i.e., P, V, and T, is simulated at a time to see the impact that it generates. The temperature variation can be neglected from buffer designs if the cost of chip area is a primary design concern because the impacts of voltage and process on slew rate of

| VDD (V)               | Process corners | Temp. (°C) | $\Delta SL_{rise}$ (V/ns) | $\Delta SL_{fall}$ (V/ns) | $\frac{\rm Impact\ ratio}{({\rm SL}_{\rm rise}/{\rm SL}_{\rm fall})}$ |
|-----------------------|-----------------|------------|---------------------------|---------------------------|-----------------------------------------------------------------------|
| 0.9–1.1×VDD           | TT              | 25         | 2.08                      | 1.93                      | 4.1/3.64                                                              |
| $1 \times \text{VDD}$ | All             | 25         | 1.78                      | 1.74                      | 3.63/3.28                                                             |
| $1 \times \text{VDD}$ | TT              | 0 - 100    | 0.49                      | 0.53                      | 1/1                                                                   |

Table 1. Comparison of different factors to slew rate variation.

| Reference                               | Topology                                                                                                                                           | Compensation                                                                                                                                 | Publication                                                                                                                |
|-----------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------|
| 13<br>14<br>15<br>16<br>17<br>this work | comparator<br>PMOS threshold voltage detection<br>skewed inverter<br>NMOS threshold voltage detection<br>2-stage threshold voltage, resistor clamp | only leakage<br>only leakage for SRAM<br>PVT detection<br>PT detection<br>PVT detection<br>PVL detection<br>(Temperature is<br>not included) | IEEE TCAS-II (2017)<br>IEEE TVLSI (2016)<br>Microelectronics J. (2015)<br>IEEE TCAS-I (2013)<br>Microelectronics J. (2013) |

Table 2. Review of recent related works.

rising and falling edges are found to be three times larger than temperature on slew rate of rising and falling edges.

# 1.1. Review of prior related researches

The research team of this work has very much focused on mixed-voltage I/O buffer and PVTL detection designs for many years. Table 1 summarizes individual technical contribution of each significant milestone so that the feature of this investigation can be exclusively highlighted.

Notably, all the references in Table 2 were not carried out by 28-nm technologies which have serious leakage issue compared with prior technology nodes. Leakage current results in the degradation of slew rate and power dissipation, which must be resolved in advanced nano-scale CMOS technology nodes. Furthermore, this investigation highlights the priority of three variation sources, where temperature variation is found to be the least significant one.

Therefore, a cost-effective buffer design is demonstrated on silicon in this investigation, where the first process, voltage and leakage (PVL) detection and corresponding slew rate auto-adjustment designs for 28-nm mixed-voltage buffer designs are proved to be very performance competitive compared to all of the existing solutions. In short, the proposed design not only resolves leakage of digital output buffers in advanced nano-scale CMOS technology but also increases slew rate and enhances data rate to GHz level.

#### 2. Nano-Scale 2×VDD Output Buffer

Figure 1 shows the Output Stage, Bandgap circuits, N-Process and P-Process Detectors, Voltage Detector and Digital Logic Circuit. DOUT is the digital signal to be transmitted outward and Clock is the main system clock. EN, Reset1 and Reset2 are three signals to activate PVL detection. The digital output voltage can be selected to be VDD or  $2 \times VDD$  relying on the VDDIO. A total of six large driving NMOS and six driving PMOS in Output Stage are divided into two categories. One category comprises three NMOS and three PMOS driving transistors in response to



Fig. 1.  $2 \times VDD$  digital output buffer.

the process detection outputs, while the other category consists of the rest of driving transistors in charge of the voltage variation. The function of the proposed buffer is briefed as follows:

- (1) Voltage detection is realized by comparing two reference voltages in a PMOS string with a pre-defined bias voltage generated by another bandgap circuit.  $V_{\rm H}$  and  $V_{\rm L}$  will then be generated and coupled to Digital Logic Encoder.
- (2) Clock with 50% duty cycle drives P-corner and N-corner Detectors at the same time. A ramping voltage is generated by each detector to compare with  $V_{\rm ref}$  such that the delay of each detector will be monitored independently.  $V_{\rm NS}$  and  $V_{\rm NF}$  are generated by the N-corner detection, while  $V_{\rm PS}$  and  $V_{\rm PF}$  are generated by the P-corner detection opposite side.
- (3) EN and Reset1 are externally generated signals to either deactivate or activate the proposed process and voltage detection and auto-adjustment circuitry.
- (4) In fact, Digital Logic Encoder is a hard-wired circuit to encode the mentioned signals, including V<sub>PS</sub>, V<sub>PF</sub>, V<sub>NS</sub>, V<sub>NF</sub>, V<sub>H</sub>, and V<sub>L</sub>, into three categories of digital signals, Pstat[3:1], Nstat[3:1], and Vstat[3:1], to switch on or off the driving transistors correspondingly in Output Stage.
- (5) Reset2 is responsible for compensation and leakage detection, where an External circuit is used to monitor the leakages of two always-on transistors in Output Stage and generate Reset2. If it is high, the leakage compensation mechanism is shut off.

#### 2.1. P-corner detector/N-corner detector

The performance of the proposed buffer mainly relies on the process detection, which makes the slew rate auto-adjustment of the corresponding process variation feasible. As shown in Fig. 2, P-corner Detector consists of two comparators (i.e., CMP1 and CMP2), two DFFs and a P-skew cell. The aspect ratio (W/L) of pull-up PMOS device in the P-skew cell is relatively smaller than that of a regular inverter to accurately estimate the process variation impact upon the rise edge timing, while that of pull-down NMOS device therein is oppositely designed. In other words, P-skew cell is a skewed inverter with a long charging time. Because of the long pull-up device and the wide pull-down device,  $V_{\rm Cp}$  at  $C_{\rm P}$  will turn out to be a slow rising but fast dropping signal. Two thresholding voltages, namely,  $V_{\rm P-H}$  and  $V_{\rm P-L}$ , using CMP1 and CMP2, respectively, are then compared with  $V_{\rm Cp}$ . The inverted outputs of these two comparators ( $V_{\rm PF}$  and  $V_{\rm PS}$ ) are stored into DFF1 and DFF2, separately.

If PMOS is "Fast",  $V_{C_P}$  will be charged over  $V_{P\_H}$  and  $V_{P\_L}$  to make  $V_{PF}$  and  $V_{PS}$ both to be logic high. Therefore,  $V_{PF}$  and  $V_{PS}$  will be sampled as low at the rising edge of clock. If PMOS is in a "Typical" device with normal threshold voltage,  $V_{C_P}$  will be pulled up to stay between  $V_{P\_H}$  and  $V_{P\_L}$  such that  $V_{PF}$  and  $V_{PS}$  are separately registered with high (VDD) and low (GND). Last, if PMOS is a "Slow" device with high threshold voltage,  $V_{C_P}$  will be charged slower such that  $V_{C_P}$  cannot be pulled up over  $V_{P\_H}$  and  $V_{P\_L}$ . Figure 3 demonstrates a typical timing waveform for P-corner detection.

Figure 4 shows that the process detection circuit for NMOS device, consisting of two comparators (i.e., CMP3 and CMP4), two DFFs, two inverters and an N-skew cell. N-skew cell is also a skewed inverter. However, it is composed of a long pull-down device, a wide pull-up device, and a capacitor,  $C_N$ . The function of the N-corner Detector is very similar to that of the P-corner Detector. Figure 5 shows that a typical timing waveform of the N-corner Detector. All the above scenarios for N-corner and P-corner detection analysis are summarized in Table 3.



Fig. 2. P-corner detector.

C.-C. Wang et al.



Fig. 3. Illustrative waveform for P-corner detector operation.



Fig. 4. N-corner detector.



Fig. 5. Illustrative waveform for N-corner detector operation.

| N corner | $V_{ m C_N}$                                                                              | $V_{ m NF}$  | $V_{\rm NS}$ |
|----------|-------------------------------------------------------------------------------------------|--------------|--------------|
| Fast     | $V_{\mathrm{C}_{\mathrm{N}}} > V_{\mathrm{N}_{\mathrm{H}}} > V_{\mathrm{N}_{\mathrm{L}}}$ | low          | low          |
| Typical  | $V_{\mathrm{N\_H}} > V_{\mathrm{C_N}} > V_{\mathrm{N\_L}}$                                | high         | low          |
| Slow     | $V_{ m N\_H} > V_{ m N\_L} > V_{ m C_N}$                                                  | high         | high         |
| P corner | $V_{ m C_P}$                                                                              | $V_{\rm PF}$ | $V_{\rm PS}$ |
| Fast     | $V_{C_P} > V_{P_H} > V_{P_L}$                                                             | low          | low          |
| Typical  | $V_{\mathrm{P}_{\mathrm{H}}} > V_{\mathrm{C}_{\mathrm{P}}} > V_{\mathrm{P}_{\mathrm{L}}}$ | high         | low          |
| Slow     | $V_{\rm P\_H} > V_{\rm P\_L} > V_{\rm CP}$                                                | high         | high         |

Table 3. Function table of process corner detectors.

# 2.2. Voltage detector

With reference to Fig. 6, a Voltage Detector consisting of nine diode-connected PMOSs in a string is disclosed. These three categories correspond to three subranges from GND to VDD : GND ~  $V_{\rm L}$ ,  $V_{\rm L} \sim V_{\rm H}$ ,  $V_{\rm H} \sim$  VDD.<sup>19</sup> The variation of VDD between  $\pm 10\%$  VDD can be directly sensed by such a configuration. Notably, all the PMOS transistors are designed with the same aspect ratio such that the influence, which is due to temperature and process variations, is auto-eliminated from each other. In other words, even though the resistance of diode-connected PMOS will drift given voltage variations, the generated reference voltages,  $V_{\rm H}$  and  $V_{\rm L}$ , are equally drifted. By thorough simulations,  $V_{\rm ref}$  will be varied between -1.26% and +1.49% provided that VDD is suffering  $\pm 10\%$  variation. Detailed function of this circuit is tabulated in Table 4.

# 2.3. Digital logic encoder

The Process Detectors and Voltage Variation Detectors generate all the mentioned signals, including  $V_{\rm PF}$ ,  $V_{\rm PS}$ ,  $V_{\rm NF}$ ,  $V_{\rm NS}$ ,  $V_{\rm H}$  and  $V_{\rm L}$  to Digital Logic Encoder. Notably, if



Fig. 6. Voltage variation detector.

Table 4. Function table of voltage variation detector.

| Voltage                   | $V_{\mathrm{H}}$ | $V_{\rm L}$ |
|---------------------------|------------------|-------------|
| $\geq +10\%$ VDD          | high             | high        |
| VDD                       | high             | low         |
| $\leq -10\%~\mathrm{VDD}$ | low              | low         |

EN = high, output stage turns on all output driving current paths, namely, N201, N202, N203, N301, N302 and N303 or P201, P202, P203, P301, P302, and P303. Another issue is that the priority of signal EN is higher than that of Reset1. When Reset1 = low, the compensation mechanism is activated to carry out slew rate auto-adjustment. Then, Pstat[3:1], Nstat[3:1], and Vstat[3:1] are generated to activate different current paths in Output stage. On the contrary, when Reset1 = high, Pstat [3:1], Nstat[3:1], and Vstat[3:1] are all set as (low high high) regardless of whatever the outcomes of Voltage Detector and P/N-corner Detectors are. The truth table of Digital Logic Encoder is summarized in Table 5. Notably, the functionality of Voltage Variation Detector is the same as that of N-corner Detector provided that  $V_{\rm NF}$ ,  $V_{\rm NS}$ , Nstat[3], Nstat[2], and Nstat[1], are replaced with  $V_{\rm H}$ ,  $V_{\rm L}$ , Vstat[3], Vstat[2] and Vstat[1], respectively.

# 2.4. Output stage

Referring to Fig. 7, Output Stage comprises Pre-Driver, Vg1 level shifter, VDDIO detector/Vg2 generator, and Driving Transistors. The input signal Vstat[3:1], Pstat-[3:1], Nstat[3:1], and DOUT are coupled to Pre-Driver, which generates a total of 12 signals by a hard-wired logic circuit.  $V_{p1}, V_{p2}, V_{p3}$  and  $V_{n1}, V_{n2}, V_{n3}$  are coupled to Vg1 level shifter, while  $V_{n201}, V_{n202}, V_{n203}$  and  $V_{n301}, V_{n302}, V_{n303}$  are directly used as gate drives, individually, to N201, N202, N203, N301, N302, and N303. Notably, P201,

| EN   | $\operatorname{Reset1}$ | $V_{\rm PF}$ | $V_{\rm PS}$ | Pstat $[3]$ | Pstat $[2]$ | Pstat $[1]$ |
|------|-------------------------|--------------|--------------|-------------|-------------|-------------|
| high | x                       | x            | x            | low         | low         | low         |
| low  | high                    | x            | x            | low         | low         | high        |
| low  | low                     | high         | high         | low         | low         | low         |
| low  | low                     | high         | low          | low         | low         | high        |
| low  | low                     | low          | low          | low         | high        | high        |
| EN   | Reset1                  | $V_{\rm NF}$ | $V_{\rm NS}$ | Nstat[3]    | Nstat $[2]$ | Nstat[1]    |
| high | x                       | x            | x            | high        | high        | high        |
| low  | high                    | x            | x            | high        | high        | low         |
| low  | low                     | high         | high         | high        | high        | high        |
| low  | low                     | high         | low          | high        | high        | low         |
| low  | low                     | low          | low          | high        | low         | low         |

Table 5. Function table of digital logic encoder.



Fig. 7. The Output stage in Fig. 1.

P202, P203 and N201, N202, N203 generate individual driving currents in response to the process variation detection outcome. By contrast, P301, P302, P303, and N301, N302 and N303 are used to generate driving currents upon the outcome detection of voltage variations. Table 6 summarizes the voltage levels of all the signals in the Output Stage.

| VDDIO $(V)$ | $V_{p20x}, V_{p30x} (V)$ | $V_{g2} \ (V)$ | $V_{n20x}, V_{n30x} (V)$ |
|-------------|--------------------------|----------------|--------------------------|
| 1.05        | 1.05/1.8                 | 1.05           | 1.05/1.8                 |
| 1.8         | 0.0/1.05                 | 0.0            | 0.0/1.05                 |

Table 6. Voltage levels of driving signals (x = a, b, c).

#### 2.4.1. Leakage detection and reduction

It is well recognized that known nano-scale CMOS processes suffer from significant leakage, e.g., 28-nm process. The leakage is particularly severe in an output buffer, where large driving transistors must be used to provide enough driving currents for off-chip loads. Otherwise, small driving current would result in poor slew rate to violate many I/O interfacing requirements. For the sake of providing large driving currents in the proposed buffer design, low threshold voltage devices are used to replace those regular devices in the basic current paths, which are P101, P201, N101, N201 and P301, N301, as shown in Fig. 7, since these two paths are always on.

The price to pay for using low threshold voltage devices is the enlargement of gate oxide leakage. It is even more severe in the low threshold voltage NMOS. Another fact is that most logic CMOS processes might not have a thick oxide layer to prevent such a leakage problem. Because N101 is an always-on low threshold voltage device, the NMOS is one of the major leakage sources. To reduce the unwanted power dissipation caused by this leakage, a three-input OR gate is added at the gate driving path of N101. The inputs of the OR gate are  $\overline{\text{DOUT}}$ , Reset2, and  $V_{g2}$ , such that N101 can be turned off in certain circumstances.

Regarding the generation of Reset2, namely the control signal of leakage compensation, an external circuit shown in the right-hand block of Fig. 7 is used. The voltage drops over R101 and R102, i.e.,  $(Vr_{101b}-Vr_{101a})$  and  $(Vr_{102a}-Vr_{102b})$ , are sampled and coupled to individual OPA-based voltage subtractors. The output voltages of the subtractors are compared with a pre-defined  $V_{\text{leak}_{\text{th}}}$  leakage threshold voltage. If either one is higher than  $V_{\text{leak}_{\text{th}}}$ , the output voltage of this external circuit, Reset2, will be pulled high. Otherwise, Reset2 stays low. Notably, R101 and R102, respectively, at the gates of P101 and N101, must be selected carefully. The resistances of these two monitoring resistors should be large enough to generate measurable voltage drops, but small enough not to attenuate the gate drives therewith.

In short, when Reset2 = high, the leakage compensation circuit is off. If Reset2 and  $V_{g2}$  are low such that VDDIO = VDD, V301 will be the same logic value as DOUT. Therefore, N101 is off to clamp the leakage path when DOUT is low. The leakage power will be reduced at least  $\frac{1}{4}$  dramatically thanks to this extra leakage detection and reduction design.

# 2.4.2. $V_{q1}$ level shifter

 $V_{g1}$  level shifter, as shown in Fig. 6, is an enhanced version of that in Ker's design.<sup>7</sup> The total of six identical voltage level shifters comprises this circuit to generate  $V_{p201}, V_{p202}, V_{p203}$  and  $V_{p301}, V_{p302}, V_{p303}$ . When  $V_{g2}$  is high, all of the six generated outputs are boosted to [1.8 V, 1.05 V] from [1.05 V, 0 V], (namely,  $V_{b201}, V_{b202}, V_{b203}$  and  $V_{b301}, V_{b302}, V_{b303}$ ). Otherwise, the voltage level remains the same.

# 2.4.3. V<sub>g2</sub> generator/VDDIO detector

This circuit is designed to carry out two functions, as shown in Fig. 9, i.e.,  $V_{g2}$  generation and VDDIO detection.

- VDDIO detection: Voltage divider, which is the PMOS string at the left-hand side, generates the gate drives of P507 and P508. Note that P501 and N501 are not G-D-connected devices, which are used to create enough voltage drops and act as active current sources. If VDDIO is high enough to turn on P507 and P508, Vb will be pulled high to turn on N503. Thus,  $V_a$  is grounded to shut off N502 and latch on P508. Meanwhile,  $V_b$  is buffered to be  $V_c$  by two-tapered inverters.
- $V_{g2}$  generation: One of the biggest challenges of nano-scale CMOS output buffer design is the small driving current. However,  $V_{g2}$  is used as an important bias in the entire circuit, it cannot be directly generated by a feature size inverter or buffer.



Fig. 8. V<sub>g1</sub> level shifter.



Fig. 9. VDDIO detector/ $V_{g2}$  generator.

The reason is that the small inverter or buffer will be affected by big load. The right-hand side of Fig. 9 shows that the proposed NMOS-based power inverter is made up of NMOS devices only to boost the output current of  $V_{g2}$  so that the loading problem is prevented.



Fig. 10. Layout of the proposed output buffer.



Fig. 11. Die photo of the proposed output buffer.



Fig. 12. Eye diagram with PV compensation given 1×VDD.

### 3. Implementation and Measurement

The proposed I/O buffer design is realized using TSMC 28 nm CMOS LOGIC Low Power ELK Cu 1P10M 1.05 and 2.5 V. Figures 10 and 11, respectively, show the layout and the die photo of the prototype on silicon, where a single I/O buffer circuit is only  $0.142 \times 0.059 \text{ mm}^2$ . The prototype chips were measured in Tainan Branch of CIC (Chip Implementation Center), Taiwan, using Analog Measurement System consisting of digital OSC (Keysight DSAV134A), logic analyzer (Agilent 16902B), etc. (*Note*: CIC is now renamed as TSRI, Taiwan Semiconductor Research Institute.) With reference to Figs. 12 and 13, which are screendumps generated by DSAV134A Infiniium V-Series Oscilloscope with Infiniium Oscilloscope Software,<sup>18</sup> the measurement of maximum data rate is 2.0 GHz when VDDIO = 1.8/1.05 V, respectively, with the activated PVL compensation. The performance comparison of the proposed design and several recent works is tabulated in Table 7. The proposed design attains the highest data rate with the least chip area among all solutions for 2×VDD buffer designs on silicon.



Fig. 13. Eye diagram with PV compensation given 2×VDD.

|                    | Ref. 6<br>ISCAS | Ref. 7<br>TCAS-I | Ref. 8<br>EDSSC  | Ref. 19<br>ISCAS | Ref. 20<br>TCAS-II | This work   |
|--------------------|-----------------|------------------|------------------|------------------|--------------------|-------------|
| Year               | 2013            | 2013             | 2014             | 2016             | 2018               | 2019        |
| Process (nm)       | 40              | 90               | 90               | 90               | 65                 | 28          |
| Implementation     | simulation      | measurement      | simulation       | simulation       | measurement        | measurement |
| VDD (V)            | 0.9             | 1.2              | 1.0              | 1.0              | 1.2                | 1.05        |
| VDDIO (V)          | 1.8/0.9         | 2.5              | 1.8/1.0          | 2.0/1.0          | 1.2                | 1.8/1.05    |
| Lock Time          | Tens            | One cycle        | $\geq$ One cycle | One cycle        | One cycle          | One cycle   |
|                    | of cycle        |                  |                  |                  |                    |             |
| Maximum Date       | 460             | 200              | 330/500          | 800/500          | 1                  | 2000/2000   |
| Rate (MHz)         |                 |                  |                  |                  |                    |             |
| Slew Rate          | 6               | 37.5             | N/A              | 33.9             | 33.3               | 55/58       |
| Vriation           |                 |                  |                  |                  |                    | (rise/fall) |
| Improvement (%)    |                 |                  |                  |                  |                    |             |
| Leakage            | NO              | NO               | YES              | YES              | NO                 | YES         |
| compensation       |                 |                  |                  |                  |                    |             |
| Process Corner     | All             | Only TT          | All              | All              | none               | All         |
|                    |                 | FF SS            |                  |                  |                    |             |
| Detected           |                 |                  |                  |                  |                    |             |
| Core area $(mm^2)$ | 0.013           | N/A              | 0.024            | 0.020            | 0.0000075          | 0.0084      |
| (IIIII-)<br>EOMa   | 1069            | NT / A           | 1049 (           | 67900            | 44400              | 65 4769     |
| FOM                | 1002            | N/A              | Slow Pate        | 07800            | 44400              | 034702      |
|                    |                 |                  | Variation        |                  |                    |             |
|                    |                 |                  | variation        |                  |                    |             |
|                    |                 |                  | mprovement)      |                  |                    |             |

Table 7. Performance comparison of output buffers.

Note:  ${}^{\mathrm{a}}\mathrm{FOM} = \frac{\mathrm{Max. Date Rate \times Slew Rate Variation Improvement \times Corners}}{\mathrm{Core area \times Lock time}}$ .

# 4. Conclusion

In this paper, a 28-nm 2×VDD output buffer is proposed. First of all, the impact of P, V and T, on slew rate is analytically demonstrated to highlight that the temperature detection and compensation is dispensable, since the impact of temperature on slew rate variation is around 10%. By contrast, the leakage poses significant impact on slew rate and power dissipation when nano-scale CMOS processes are used. Several critical designs have been proposed and verified on silicon to resolve the serious loss of performance, including low threshold voltage devices used in the driving transistors, the addition of leakage detection resistor at the gate of these low threshold voltage devices, and optimizing the length of the devices in the string. Therefore, not only is the slew rate variation reduced, but also the data rate is drastically enhanced to GHz level. When VDDIO = 1.8/1.05 V, respectively, the measurement of the proposed design on silicon is 2.0 GHz, which is fastest solution so far for mixed-voltage digital output buffer designs. The slew rate improvement is demonstrated to be at least over 50% regardless of VDD or  $2 \times \text{VDD}$  data transmission mode, which is the best by far. To the best of our knowledge, this investigation is the first 28-nm CMOS mixed-voltage buffer design to demonstrate the GHz level data rate.

### C.-C. Wang et al.

# Acknowledgments

This research was partially supported by the Ministry of Science and Technology under grant MOST 106-2221-E-110-058-, 107-2218-E-110-016-, and 107-2218-E-110-004-. The authors would like to express their deepest gratitude to the TSRI (Taiwan Semiconductor Research Institute, which was called CIC before) of the NARL (National Applied Research Laboratories), Taiwan, for the thoughtful chip fabrication service.

# References

- K. Leung, Controlled slew rate output buffer, Proc. IEEE Custom Integrated Circuits Conf. (Rochester, NY, USA, 1988), pp. 5.5/1–5.5/4.
- C.-C. Wang, C.-L. Chen, H.-Y. Tseng, H.-H. Hou and C.-Y. Juan, A 800 Mbps and 12.37 ps jitter bidirectional mixed-voltage I/O buffer with dual-path gate-tracking circuit, *IEEE Trans. Circuits Syst. I: Reg. Papers* 60 (2013) 116–124.
- Y.-H. Kwak, I. Jung, H.-D. Lee, Y.-J. Choi, Y. Kumar and C. Kim, A one-cycle lock time slew-rate-controlled output driver, *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)* (San Francisco, CA, USA, 2007), pp. 408–611.
- Y.-H. Kwak, I. Jung and C. Kim, A Gb/s+ slew-rate/impedance-controlled output driver with single-cycle compensation time, *IEEE Trans. Circuits Syst. II: Exp. Briefs* 57 (2010) 120–125.
- C.-L. Chen, H.-Y. Tseng, R.-C. Kuo and C.-C. Wang, On-chip MOS PVT variation monitor for slew rate self-adjusting 2×VDD output buffers, *Proc. IEEE Int. Conf. IC Design and Technology (ICICDT)* (Austin, TX, USA, 2012), pp. 1–4.
- C.-C. Wang, W.-J. Lu and H.-Y. Tseng, A high-speed 2×VDD output buffer with PVT detection using 40-nm CMOS technology, *Proc. IEEE Int. Symp. on Circuits and Syst.* (*ISCAS*) (Beijing, China, 2013), pp. 2079–2082.
- M.-D. Ker and P.-Y. Chiu, Design of 2×VDD-tolerant I/O buffer with PVT compensation realized by only 1×VDD thin-oxide devices, *IEEE Trans. Circuits Syst. I: Reg. Papers* 60 (2013) 2549–2560.
- T.-J. Lee, W. Lin and C.-C. Wang, Slew rate improved 2×VDD output buffer using leakage and delay compensation, *Proc. IEEE Int. Conf. on Electron Devices and Solid-State Circuits (EDSSC)* (Chengdu, China, 2014), pp. 1–2.
- S.-K. Shin, S.-M. Jung, J.-H. Seo, M.-L. Ko and J.-W. Kim, A slew-rate controlled output driver using PLL as compensation circuit, *IEEE J. Solid-State Circuits* 38 (2003) 1227– 1233.
- S.-K. Shin, W. Yu, Y.-H. Jun, J.-W. Kim, B.-S. Kong and C.-G. Lee, Slew-rate-controlled output driver having constant transition time over process, voltage, temperature, and output load variations, *IEEE Trans. Circuits Syst. II: Exp. Briefs* 54 (2007) 601–605.
- S.-W. Choi and H.-J. Park, A PVT-insensitive CMOS output driver with constant slew rate, *IEEE Asia-Pacific Conf. Advanced System Integrated Circuits (AP-ASIC)* (Fukuoka, Japan, 2004), pp. 116–119.
- M. Bazes, Output buffer impedance control and noise reduction using a speed-locked loop, IEEE Int. Solid-State Circuits Conf. (ISSCC), Vol. 1 (2004), pp. 486–541.
- C.-C. Wang, Z.-Y. Hou and K.-W. Ruan, 2xVDD 40 nm CMOS output buffer with slew rate self-adjustment using leakage compensation, *IEEE Trans. Circuits Syst. II: Exp. Briefs* 64 (2017) 812–816.

- C.-C. Wang, D.-S. Wang, C.-H. Liao and S.-Y. Chen, A leakage compensation design for low supply voltage SRAM, *IEEE Trans. Very Large Scale Integration Systems* 24 (2016) 1761–1769.
- C.-C. Wang, W.-J. Lu, K.-W. Juan, W. Lin, H.-Y. Tseng and C.-Y. Juan, Process corner detection by skew inverters for 500MHZ 2 x VDD output buffer using 40-nm CMOS technology, *Microelectron. J.* 46 (2015) 1–11.
- C.-C. Wang, C.-L. Chen, R.-C. Kuo, H.-Y. Tseng, J.-W. Liu and C.-Y. Juan, On-chip process and temperature monitor for self-adjusting slew rate control of 2xVDD output buffers, *IEEE Trans. Circuits Syst. I: Reg. Papers* 60 (2013) 1432–1440.
- C.-C. Wang, W.-J. Lu, C.-L. Chen, H.-Y. Tseng, R.-C. Kuo and C.-Y. Juan, A 2xVDD output buffer with PVT detector for slew rate compensation, *Microelectron. J.* 44 (2013) 393–399.
- Keysight DSAV134A on-line information available: https://literature.cdn.keysight.com/ litweb/pdf/5992-0483EN.pdf?id=2570359
- T.-Y. Tsai, Y.-L. Teng and C.-C. Wang, A nano-scale 2xVDD I/O buffer with encoded PV compensation technique, *Proc. IEEE Int. Symp. on Circuits and Syst. (ISCAS)* (Montreal, QC, Canada, 2016), pp. 598–601.
- V. L. Le and T. T.-H. Kim, An area and energy efficient ultra-low voltage level shifter with pass transistor and reduced-swing output buffer in 65-nm CMOS, *IEEE Trans. Circuits Syst. II: Exp. Briefs* 65 (2018) 607–611.