|
Introduction
The microelectronics industry owes a great deal of its success to the existence of the thermal oxide of silicon, i.e., silicon dioxide (SiO2). A thin layer of SiO2 forms the insulating layer between the control gate and the conducting channel of the transistors used in most modern integrated circuits. As circuits are made more dense, all of the dimensions of the transistors are reduced (scaled) correspondingly [1], so that nowadays the SiO2 layer thickness (tox) is 2 nm or less. Figure 1 shows the historical trend in oxide thickness for high-performance logic applications over the past decade.1 The oxide thickness has been decreasing quasi-exponentially, but clearly this trend must saturate at some point, since there are physical and practical limits on how thin an oxide film can be made. Where will this point be? What is the ultimate oxide thickness limit, and when will it be reached? This paper reviews the limitations to SiO2 film thickness, with particular emphasis on reliability. We describe the physics and statistics of oxide breakdown, illustrated by recent long-term stress data. Finally, we explore the implications of oxide breakdown for the reliable functioning of circuits, and ways in which this might be improved.
Figure 1
Limitations to oxide thickness reduction
The essential physical limitations on gate insulator thickness, ignoring extrinsic concerns related to manufacturability and yield (e.g., pinhole formation and film uniformity) are related to the exponentially increasing gate current as tox is reduced, and the effect of this current on both the functionality and reliability of devices and circuits. Although no insulator is ideal, amorphous SiO2, with a bulk resistivity of ~1015 -cm and a dielectric breakdown strength of ~107 V/cm, is one of the best nature has provided. Its usefulness in silicon technology is due to the fact that films of this material can be grown by thermal oxidation of clean silicon surfaces, and that the resulting films can be made with sufficiently low densities of charge traps. In addition, the energy barrier (conduction-band discontinuity) between Si and SiO2 is 3.1 eV, only about 1 eV lower than the Si/vacuum barrier. The leakage current is controlled by quantum-mechanical tunneling, either FowlerNordheim (FN) tunneling [2] at high fields, or, for applied bias less than about 3 V, by direct tunneling. At a typical operating field of 5 MV/cm, the current density for FN tunneling is <109 A/cm2 [3]. However, for tox 4 nm, the direct tunneling current increases exponentially by about one order of magnitude for every 0.20.3-nm reduction in oxide thickness [4].
Performance and power consumption
The gate leakage current causes increased power consumption and may affect device and circuit functionality. Thus, the leakage current imposes a practical limit on oxide thickness. It is instructive to look at the history of this concept. In 1972, some claimed [5] that the lower limit would be 5 nm. Since this was about 100 times thinner than the devices in use at that time, it must have seemed a rather futuristic limit; nonetheless, the researchers projected that this limit would be reached by the mid-1980s on the basis of the rate of technology improvement at that time. The data of Figure 1 suggest that the rate of oxide thickness reduction was somewhat lower than was projected, but certainly the improvement in MOS technology did not cease in 1980, nor has 5-nm SiO2 turned out to be a fundamental limit. As early as 1988, researchers in Japan [6] fabricated MOSFETs with 2.3-nm gate oxide, noting that the device characteristics were not significantly perturbed as long as the gate leakage was significantly less than the channel current, which is achieved by decreasing the gate length. The same consideration was later shown to permit a 1.5-nm oxide [7]. The lesson from this is that it is very difficult to make accurate predictions about the limitations of technology, possibly because engineers and scientists regard such predictions as a challenge.
A thickness of ~1.5 nm, corresponding to a leakage current of ~10 A/cm2, is now viewed as a sort of practical limit for SiO2 [8], although some researchers have recently claimed that leakage current as high as 100 A/cm2, corresponding to ~1.01.3 nm SiO2, is feasible for both static and dynamic logic [9]. (Clearly, such high leakage currents are acceptable only for high-performance server systems, not for low-power portable electronics.) A recent analysis claims that 0.8-nm SiO2 is a lower limit in MOSFET applications, based on consideration of threshold voltage fluctuations when the resistance of the oxide becomes comparable to that of the gate metal [10]. These limits are not fundamental, but depend on circuit and system design, as well as materials properties such as film uniformity and interface roughness. However, below tox 1 nm, overlap of the electronic wave functions of the silicon conduction bands may become large enough to effectively reduce the tunneling barrier [11]. Other performance limitations, e.g., the decreasing n-FET drive current observed for oxide thickness below 1.3 nm, may also represent limitations of fundamental physics [12, 13].
Wearout and breakdown
The current flowing through an ultrathin gate oxide is not merely a nuisance parasitic, but also causes reliability problems by leading to long-term parameter shifts (wearout) and eventually to oxide breakdown. The stress-induced parameter shifts are gradual, and the degradation is predictable on the basis of experimental data and physical models. The impact on device design therefore involves an engineering tradeoff between short-term and long-term performance. Oxide breakdown, on the other hand, is a sudden discontinuous increase in conductance, often accompanied by increased current noise. Breakdown is generally understood to be the result of a gradual and predictable buildup of defects such as electron traps in the oxide, but the precise point at which breakdown occurs is statistically distributed so that only statistical averages can be predicted.
The rate of defect generation in the oxide is proportional to the current density flowing through it, and therefore the reliability margin for gate-oxide breakdown has been drastically reduced as a consequence of device scaling. At present, predicted reliability limits for the gate-oxide thickness range from less than 1.5 nm [14] to 2.8 nm [15]. Even if the smallest estimates were correct, the latest international roadmap for the industry [16] anticipates that tox ~ 1 nm will be needed by 2005 for 6070-nm gate lengths (90-nm lithography node) operating at ~1 V in order to meet the desired performance targets. One way this might be achieved is by replacing the SiO2 with another dielectric with a higher dielectric constant, such as metal oxides, metal silicates, and epitaxial perovskites. This avenue is being aggressively pursued but faces large hurdles, since neither the materials science nor the electrical properties of these materials are understood nearly as well as for SiO2 [17, 18]. A more conservative approach is to use oxynitrides, which have been reported to show somewhat improved reliability characteristics compared to pure SiO2 [1921].
Oxide lifetime and product lifetime
The reliability of SiO2 in microelectronics, i.e., the ability of a thin film of this material to retain its insulating properties while subjected to high electric fields for many years, has always been a concern and has been the subject of numerous publications over the last 3540 years, ever since the realization that SiO2 could be used as an insulating and passivating layer in silicon-based transistors [22, 23]. (For a recent review, see [24].) In a recent survey of nine manufacturers asked to rate seventeen different reliability-related technology constraints in order of importance, gate-dielectric reliability was the top concern overall and was ranked among the top five concerns by every manufacturer surveyed [25].
In retrospect, it is not clear that this concern is justified. First, for the thicker oxides used in the past with tox 34 nm, recently published models [4, 14, 15, 26] have indicated that intrinsic gate-oxide reliability has probably not been a real issue at the operating voltages employed. The difficulty has always been to extrapolate oxide lifetime data from accelerated stress conditions (high voltage), under which data can be collected within a reasonable time, to the lower operating voltage, at which the failure rate should be very low. The oxide wearout and breakdown mechanisms change as a function of voltage and oxide thickness [27], so that earlier extrapolations, before this was understood, were easily in error. A controversy still exists over the proper extrapolation from ~2 V to 1 V [14, 26, 28, 29], and this has major implications for reliability projection. This is the subject of a large part of the present paper. Second, for the ultrathin oxides at lower voltages now being employed, a potentially nondestructive soft breakdown mode is frequently observed [30]. This is discussed in the penultimate section of this paper. Thus, the true implications of oxide breakdown for the reliability of integrated circuits are still unclear. Indeed, microprocessors have already been successfully manufactured with sub-2-nm gate oxide [31, 32], with no immediate catastrophic result. The next section of this paper deals with the phenomenon of oxide breakdown from a fundamental physical viewpoint, and describes in detail the uncertainties surrounding oxide reliability projection. The possible implications for product reliability are revisited later.
The physics and statistics of oxide breakdown
Defect generation and breakdown at low voltage
The essential elements of our present understanding of oxide wearout and breakdown are illustrated in Figure 2. Electrons flowing across the oxide trigger several processes depending on their energy, which is determined by the gate voltage for thin oxides where electron transport is ballistic or quasi-ballistic [33, 34]. At least three defect-generation mechanisms have been identified: The first two, impact ionization and anode hole injection, occur at higher voltages and lead to hole trapping and hole-related defect generation [27, 35]. The lowest-energy process so far identified, which dominates at the voltages at which present MOSFETs operate, is the so-called trap-creation process attributed to hydrogen release from the anode [33], illustrated in Figure 3. This process has a threshold gate voltage of about 5 V, but continues in the subthreshold region even at operating voltages down to 1.2 V or lower [34, 36]. Eventually, damage builds up to the point at which the oxide breaks down destructively.
Figure 2
Figure 3
The evidence for hydrogen involvement in defect generation and breakdown is circumstantial but strong [24, 33, 37]notably the observation of substrate dopant passivation [38] and hydrogen redistribution [3941] during hot-electron stress, the enhanced degradation rate of hydrogen-soaked films [42], and experiments showing that exposure of bare SiO2 films to atomic hydrogen radicals, in the absence of any electric field, produces electrically active defects essentially identical to those produced by electrical stress or radiation [37, 4355]. The desorption rate of hydrogen from Si surfaces was measured as a function of incident electron energy [56] and showed a dependence remarkably similar to the voltage dependence of the trap-generation process [33, 36]. However, a major perceived stumbling block to the general acceptance of the hydrogen model for breakdown has been the apparent lack of any isotope effect for the breakdown process [57] compared to the large effect observed for hydrogen/deuterium desorption and for hot-electron-induced channel interface degradation [58]. This may recently have been resolved by the observation [59, 60] of a significant isotope effect on the stress-induced flat-band voltage shift and stress-induced leakage current (SILC), which is a measure [50, 61, 62] of the bulk traps that ultimately relate to breakdown. A significant isotope effect on trap generation and oxide breakdown in deuterated oxide was also reported earlier [63].
Two other physical models for breakdown have been widely discussed in the literature. The first is the anode-hole injection (AHI) model, which asserts that breakdown is caused by holes that are injected from the anode contact [3, 64]. One of the main attributes of this model is a constant value of the hole fluence to breakdown independent of oxide field [3], Qp 0.1 C/cm2, decreasing for tox less than about 6 nm [64]. According to earlier calculations [65], the gate-voltage threshold for positive charge generation by hole trapping due to AHI is 78 V for FETs with n+-poly gates, and this was confirmed experimentally [35]. Thus, this mechanism probably cannot account for defect generation and breakdown at lower voltage. A recent modification of the AHI model [14, 66] proposes that a weaker minority-carrier ionization process [67] is responsible for hole injection and defect generation at low voltages. This mechanism will be operative for electron injection into a p-type material or hole inversion layer, including n-FETs with low gate doping when the n-poly gate is inverted [68]. The modified model can successfully fit the measured slope of TBD vs. voltage at high fields [14, 69] but cannot account for the absolute magnitude of the defect-generation rate. Since the hole current at low voltage (e.g., 23 V) is at least 12 orders of magnitude lower than the primary electron current [66], the defect-generation rate per hole is very much greater than the rate per injected electron. However, direct measurement of the rate of defect generation by transport of hot holes through an SiO2 layer [70, 71] gives values comparable to the generation rate due to electrons and many orders of magnitude less than what is required by a minority-carrier mechanism. Charge-to-breakdown measurements using substrate hot-hole injection [72, 73] and on p-FETs at low bias where hole tunneling dominates the total gate current (Figure 4) [71] have also shown that the hole fluence to breakdown, Qp, is as large as eight orders of magnitude greater than the value used in the AHI model, consistent with the measured defect-generation rate. The data in Figure 4 indicate that p-FET breakdown may be the limiting factor at low voltage, contrary to the usual idea [74] that n-FETs should represent the worst case.
Figure 4
The other widely cited breakdown model is the thermochemical model, or E-model, in which E refers to the electric field across the oxide. This model proposes that defect generation is a field-driven process, and that the current flowing through the oxide plays at most a secondary role. The development of this model from its origin in the mid-1980s through the late 1990s has been reviewed by McPherson et al. [75, 76]. The E-model has attained widespread acceptance, largely on the basis that, empirically, the time-to-breakdown (TBD) data appear to follow an exponential dependence on oxide field [7780], including an experiment of three years' duration at oxide fields down to 5.3 MV/cm on 9-nm films [81]. However, it must be pointed out that the exponential dependence on field is not proof of the validity of the particular physical model. Probably the strongest evidence against the thermochemical model comes from substrate-hot-electron (SHE) injection experiments [82, 83]. In this experiment, using a specially designed n-FET structure [84], a forward-biased n+ diffusion in the p-type substrate is used to inject electrons toward the inversion layer at the Si/SiO2 interface. The current through the oxide can be controlled independently of both the substrate and gate bias by varying the magnitude of the forward bias applied across the injector junction. The substrate voltage controls the maximum electron energy incident on the Si/SiO2 interface, and for substrate bias high enough that some electrons are injected over the energy barrier into the oxide, the gate voltage additionally controls the energy of the electrons incident on the gate/oxide interface. In these experiments it was found that the charge to breakdown (QBD) is strongly dependent on the substrate bias, even though the oxide field is held fixed [83]. Therefore, QBD correlates with the electron energy, not the oxide field. This was also demonstrated for conventional FowlerNordheim stress by varying the doping of the anode [68, 85]. The SHE experiments also showed that TBD is inversely related to the current density [83], again showing that breakdown is dominated by the effect of the energetic electrons and not the field in the oxide.
Independently of the assumed physical mechanism of defect creation, it is an experimental observation that in a large variety of oxide thicknesses stressed over a wide range of voltages, the charge to breakdown (QBD) is inversely related to the initial rate of defect generation for most stress conditions [27, 79, 8688]. Defining the defect generation probability per injected electron density as Pg qdN/dQ, DiMaria proposed [4, 27, 34, 89] the simple relationship
where q is the magnitude of the electron charge and NBD is a proportionality constant, which we later see is a function of tox. More generally,
|
NBD =
|
1
|
|
QBD
|
Pg dQ.
|
|
|
q
|
0
|
|
(2)
|
Equation (1) holds when the defect generation is first-order in the electron fluence. (It is commonly observed that the defect density N is a sublinear power-law function of the injected charge Q. However, observed over a sufficiently wide range of fluence, the full dependence is typically sigmoidal, with a linear region bracketed by sublinear portions at low and high fluence [4, 71]. If the low-fluence background is subtracted, a linear behavior is found independent of stress conditions.)
Voltage dependence of defect generation
Extensive work in the 1980s revealed the existence of the so-called 2-eV trap-creation threshold (electron energy measured with respect to the bottom of the SiO2 conduction band) for defect creation by hot electrons in SiO2 [33]. For thick oxides, electrons in the conduction band obtain this average energy for oxide fields greater than about 24 MV/cm. In the thin oxides of current interest, for which electron transport is ballistic or quasi-ballistic [33], the threshold corresponds to a gate voltage of about 5 V for FowlerNordheim tunneling through the 3-eV potential barrier at the Si/SiO2 interface. For voltages below this threshold, it was shown [34] that the defect-generation rate depends only on the absolute value of the gate voltage (Vg), independent of substrate or gate doping or polarity. Recent work has given the somewhat surprising result that 2 eV is not a hard threshold [36, 89]. Instead, there is a subthreshold trap-generation process that decreases exponentially below 5 V, shown in Figure 5. The data are capacitance voltage (CV) measurements of the interface state density and stress-induced leakage-current (SILC) data. In SILC measurements the relative increase in current in the direct tunneling range below 3 V is expressed as J/J0, where J0 is the initial current density in the as-fabricated device. This quantity is proportional to the density of generated neutral electron traps [50]. The left- and right-hand vertical scales in Figure 5 have been adjusted to match the values obtained using the two measurement techniques on the 5-nm sample, from which we obtain [90] the relation Ns (cm2) 3 × 108 J/J0. This is consistent with the value found [50] for the bulk neutral traps, where Nn (cm2) 1.25 × 108 J/J0.
Figure 5
Figure 5 includes data from a variety of oxide thicknesses and processes, including thermal SiO2 grown in O2, oxides grown in N2O, and oxides grown on nitrogen-ion-implanted (N2I/I) substrates, indicating the relative insensitivity of Pg to the oxidation process. Pg is also observed to be independent of stress voltage polarity. For each oxide thickness, it is possible to measure Pg over only a limited range of Vg while keeping the measurements within an experimentally accessible time scale. Plotting overlapping ranges of Vg obtained using oxides of different thicknesses shows a universal exponential behavior of Pg as a function of Vg, independent of tox. Using substrate hot-electron (SHE) and channel hot-electron (CHE) stress, it is possible to obtain much greater hot-electron flux at the interface, permitting measurements of Pg at low electron energy for even thick samples. Using these techniques, it has been shown [36, 91, 92] that defect generation by hot electrons impinging on the substrate/oxide interface follows the same dependence on energy as that from FowlerNordheim injection through the oxide, and that the exponential Vg dependence extends at least as low as 1.2 V [92].
A remarkable and unexpected feature of the data of Figure 5 is that there is no further threshold voltage below which the generation rate drops faster than the exponential trend. In particular, there is no large discontinuity or change in slope at the transition from FowlerNordheim to direct tunneling at about 3 V. This is contrary to earlier suggestions [93, 94] that breakdown would be strongly suppressed in the direct-tunneling regime, and strongly suggests that the relevant energy scale is the electron energy in the anode, which is the polysilicon gate or the silicon substrate depending on injection polarity. In fact, detailed measurements [26, 92] have found a sigmoidal inflection in the voltage dependence of Pg between 2 and 3 V. This is shown in Figure 6.
Figure 6
Critical defect density for breakdown
The idea of damage building up to a critical level [95, 96] has been a key insight in leading to a predictive model of oxide reliability. The concept does not depend in any way on the physics of defect generation: Like the hydrogen-induced-defect-creation model, the field-driven model [75, 76, 97] (E-model) also assumes a critical density of broken bonds in order to induce electric breakdown and thermal runaway, and the most recent version of the AHI model [14, 98] likewise adopts the viewpoint that holes create some (unspecified) form of defect which eventually leads to a critical conduction path. Two important papers in the 1990s have led to quantitative application of this concept. Suñé et al. [99] showed that the assumption of a critical defect density (treating NBD as a fitting parameter) leads to the correct statistical behavior describing breakdown distributions. Degraeve et al. [100] formulated the percolation model, in which breakdown is envisioned as the formation of a connecting path of defects, as a result of random defect generation throughout the insulating film.
The percolation concept (and the origin of the thickness dependence of NBD) is schematically illustrated by the computer simulation [90] in Figure 7. According to this model, breakdown can occur only when a connecting path of traps is formed across the gate oxide, forming a conducting path from the substrate to the gate. The probability of forming such a connecting path (percolation path) with randomly generated defects throughout the oxide bulk is computed as a function of defect density and oxide thickness. For a given defect density, the formation of a percolation path is more likely for thinner oxides. Conversely, as the oxide is made thinner, a percolation path can form with some probability at a lower average defect density than is necessary in a thick oxide. In the simulation, we begin by placing defects (black squares) randomly throughout the sample [Figure 7(a)], and then discard all defects which are not part of a cluster that is attached (via nearest neighbors) to one face of the sample [Figure 7(b)].
Figure 7
The face at which we begin when the percolation cluster is computed does not matter, because if a site is part of a cluster that connects the two faces of the sample, it will be counted no matter where we begin. It would be computationally redundant to separately examine the opposite cluster extending from the other face of the sample, because this cluster cannot touch the first face unless the first cluster also spans the sample. For the particular defect density illustrated in Figure 7, there is no percolation path connecting the top to the bottom of the sample, and therefore we have not reached breakdown. However, breakdown would have occurred at this density if the sample were thinner. As the defect density is increased, the percolating cluster extends deeper into the sample, until at some critical density it touches the opposite face. In addition, even for the same defect density and thickness, if the sample were made wider (corresponding to a larger device area), there is a finite probability of locally finding a cluster that does span the thickness. Thus, this rather simple model accounts nicely for all of the statistical features of oxide breakdown, including thickness and area dependence.
One important prediction of the percolation model is that the breakdown distributions become broader as tox is decreased [34, 90, 100, 101]. This is illustrated schematically in Figure 8. The shape of the failure distribution is an important parameter used in reliability estimations, as is discussed in the next section of this paper.
Figure 8
In Figure 9 we plot the measured NBD value (averaged over a range of stress voltage) as a function of tox. NBD drops by a factor of ~105 from 6 to 3 nm, and then reaches a plateau below 3 nm. These features are fully explained by the percolation model calculations [90] shown by the filled symbols. The fit is described by two parameters, which are adjusted to give a consistent fit to both the data above 3 nm and the plateau below this thickness. The two parameters are the defect diameter (hopping distance) a0 and the probability (f) that a percolation path triggers destructive breakdown. For tox < a0, the percolation model predicts that the thickness dependence will vanish, because only one defect is required to form a connecting path across the oxide. Therefore, the plateau region is tox < a0, and NBD in the plateau corresponds to one active defect in the area of the sample. In terms of absolute defect density, normalizing the SILC to interface state density as discussed earlier, the plateau value below 3 nm corresponds to a constant number equal to 2000 defects in the 5 × 104-cm2 area of each sample, suggesting that each percolation path has a probability of about 103 of initiating a destructive breakdown event. In other words, only a fraction f ~ 103 of the defects are active or effective in causing breakdown. This might result from the different energy levels of defects, or from their different physical or chemical nature. The idea that there exists some special subset of defects which trigger breakdown was independently suggested by the electron microscopy observation of breakdown patterns [102], where it was found that the density of weak spots is ~1% of the defect density at breakdown. The value of the defect size obtained from the NBD data is a0 3 nm. This is believed to correspond physically to the electrical sphere of influence of a point defect, e.g., a tunneling distance or trapping cross section. Other authors obtained smaller values for the defect diameter (1.61.8 nm) based on fitting to the distribution of breakdown times [100] or resistances [103].
Figure 9
One component of the percolation model is that NBD is relatively independent of the way in which the oxide is stressed, since it is a zero-field model in the formulation described above. This has been the assumption whenever NBD measurements at elevated voltage have been used to project oxide reliability down to operating conditions [4, 15, 104106]. Several experiments have supported this assumption, using various measurements of the defect density including interface states, trapped charge, and stress-induced leakage [34, 83, 101, 107111]. On the other hand, several groups have reported measurements showing a decrease in NBD as the stress voltage is reduced using 35-nm oxides [89, 112114]. This observation could have a significant effect on the reliability projections for such oxides.
In order to measure NBD at lower voltage, i.e., closer to actual operating conditions, it has been necessary to perform long-term reliability experiments on bonded chips. Figure 10 shows SILC measurements performed for two years at 2.4 V using n-FETs with tox = 2.2 nm. The horizontal dotted lines in this figure indicate roughly the 10%, median, and 90% values of J/J0 at breakdown for the higher stress voltages. The 2.4-V stress shows breakdown events occurring at statistically significant higher values of NBD.
Figure 10
Figure 11 plots the characteristic (i.e., the 63rd percentile) values of NBD vs. the gate voltage during stress (Vg), for the same sample as in Figure 10 and other samples with thinner oxide. The oxide thickness is estimated from the accumulation capacitance, including quantum-mechanical corrections, but it must be admitted that the quoted values are uncertain to at least ±0.1 nm. Error bars in NBD reflect the statistical uncertainty; in some cases, at the lowest voltages only ~10% of the samples had failed, and therefore NBD is estimated using the initial fail data and the statistical distributions from higher voltages. In this figure, the NBD values have been normalized to a reference area of 5 × 104 cm2 using Weibull statistics. It can be observed that NBD is only weakly dependent on tox in this range, and tends to increase with decreasing Vg. The observed increase in NBD correlates with an increase in charge to breakdown at low voltage [26], which is shown in the next section.
Figure 11
Although the data in Figure 11 show a trend for NBD to increase as Vg is lowered, there does not appear to be a universal behavior as a function of Vg. For example, for the 2.23-nm oxide, NBD starts to increase below 2.8 V, while for the 1.44-nm oxide the increase in NBD is not seen until Vg is reduced below ~2.2 V. In Figure 12 the same data have been replotted as a function of the breakdown times [TBD(63%)] instead of the stress voltage. TBD is a function of Vg, tox, and device area. In this figure TBD is the projected value to 63% failure for cases in which not all samples have reached breakdown. Again, the error bars reflect the statistical uncertainty. Note that in this figure, NBD is again normalized to a constant reference area, but TBD is not normalized; i.e., the x-axis values reflect the actual time under stress, projected to 63% failure but not normalized by area. Here the data show a universal trend, wherein NBD begins to increase for stress times longer than about 106 seconds (~10 days). The long stress times necessary to see this effect would preclude its observation in most experiments. For the 1.28-nm oxide at 1.8 V (open squares), devices with two different areas were tested, which when plotted against Vg result in different values of NBD even after area normalization (Figure 11). However, smaller-area devices have a longer lifetime, which accounts for the higher value of NBD as shown in Figure 12.
Figure 12
A voltage-dependent NBD could arise from several sources. The percolation path, which has been modeled in zero field [90, 100] to obtain the fit shown in Figure 9, could be weakly field-dependent. The formation of the percolation path, i.e., the generation of new defects, could depend on the local field produced by the other defects [115, 116]. This would lead to more directed paths at higher voltage, so that the average defect density to form a connecting path across the sample would be reduced. However, according to the idea that the stress time, not the voltage, is responsible for the increase in NBD, the data may be interpreted in terms of a reduction in the defect effectiveness (the fraction f of active defects), which was introduced [90, 102, 113] to describe the probability that a defect, or percolation path, can trigger breakdown. For long stress times it requires a greater defect density to trigger breakdown, contrary to the assumption that NBD should be independent of the stress condition. The observation of reduced defect effectiveness for stress experiments of very long duration may imply that defects undergo a slow relaxation that reduces their ability to participate in breakdown. It must be emphasized, however, that the final runaway stage of destructive breakdown, whereby a percolation path leads to catastrophic failure, is still not fully understood.
Charge to breakdown (QBD) and oxide lifetime
QBD in Equation (1), defined as the time-integrated current density that flows through the oxide until breakdown occurs, is a physically meaningful quantity, but the quantity of practical interest for an electronic component is the failure rate, which can be derived from the lifetime or time to breakdown, TBD. For constant-voltage stress this is related to QBD by the relation
|
TBD
|
J dt = QBD,
|
|
0
|
|
(3)
|
where J is the instantaneous value of the current density. For thin oxides the current is nearly constant until breakdown (in marked contrast to thicker oxides, in which electron trapping and/or hole trapping cause large changes in the current during stress); therefore,
The complete measurements of Pg and NBD described in the previous section can explain in detail the voltage and thickness dependence of QBD in ultrathin gate oxides, shown in Figure 13(a). Because of the exponentially increasing tunnel-current density with decreasing tox [4], TBD decreases exponentially with decreasing tox 3 nm, even though QBD is only slightly thickness-dependent in this range because of the weak thickness dependence of NBD. The measured TBD data are shown in Figure 13(b). This implies a rapidly diminishing margin for reliability as device dimensions are scaled. Figure 13(b) also shows that data from different laboratories [26, 117119] are in reasonable agreement, taking into account the differences in oxide thickness, indicating little dependence on processing for state-of-the-art facilities.
Figure 13
It is commonplace in the microelectronics industry to specify an operating life of ten years, i.e., to guarantee a specified (usually small) failure rate over a ten-year period. First, the TBD data such as those in Figure 13(b) must be projected from the high percentiles where experimental data are collected to the low percentiles desired for the product failure rate. Second, a lifetime correction must be made from the small-area test structures to the total gate-oxide area of a chip. Both of these projections depend on the shape of the breakdown distribution, which must be known for reliability estimates to be accurate.
The statistics of gate-oxide breakdown are described using the Weibull distribution [100, 102124],
F(x) = 1 exp[ (x/ )ß],
|
(5)
|
which is an extreme-value distribution in ln(x) and is appropriate for a weakest-link type of problem. Here F is the cumulative failure probability, i.e., the population fraction failed by age x, where x can be either charge or time. The characteristic life is percentile 63.2, and ß is called the slope parameter or Weibull slope. Plotting
W ln [ ln(1 F)]
|
(6)
|
against ln(x) yields a straight line with slope ß. Equation (6) has the useful property that if the area of the samples tested is increased by a factor N, the curve shifts vertically by ln(N) [90]. Figure 8 illustrates this effect schematically. If the desired low failure rate is Fchip over the product lifetime Tlife for the total gate area Aox on the chip (point indicated by the asterisk in Figure 8), this is equivalent to a higher failure rate Ftest in time Ttest on the test structures with area Atest, where from Figure 8 we can obtain
|
Tlife
|
|
|
Atest
|
|
Fchip
|
|
1/ß
|
.
|
|
|
|
|
Ttest
|
Aox
|
Ftest
|
|
(7)
|
This equation is used to scale measured breakdown times to the expected product lifetime, or equivalently to estimate the chip failure rate from test-structure measurements. Since Fchip < Ftest and typically Atest < Aox, then Ttest > Tlife by this equation; therefore, it is always necessary to measure the test structure under accelerated stress conditions (voltage and temperature). Understanding the voltage dependence is the major reason for investing the time to obtain long-term stress data, as shown in Figures 10 and 13, and it is the reason why so much attention is paid to the physical models of trap generation and breakdown, as discussed above [125, 126]. The uncertainty in the extrapolation to operating voltage is the major contributor to our present uncertainty in the reliability predictions. We now see that neither TBD nor QBD will obey any simple law such as exponential dependence on E, 1/E, or Vg, as has been commonly assumed in reliability extrapolations. Without the simplifying assumptions of a voltage-independent NBD and a purely exponential voltage dependence of Pg, it becomes more difficult to extrapolate reliability to operating voltage. The steeper Vg dependence of Pg between 2 and 3 V can easily lead to an overly optimistic TBD projection if data are limited to this range. Similarly, projection of data from higher voltages without knowing the complete Vg dependence may lead to more pessimistic or otherwise erroneous projections. Moreover, according to the idea that NBD increases for long stress duration, breakdown is a time-dependent as well as voltage-dependent phenomenon. Care must be taken to separate the voltage- and time-dependent effects when interpreting QBD or TBD data.
There is no guarantee in practice that actual failure distributions will be perfectly linear on a Weibull plot, and other distributions could in principle be more appropriate [123]. Nonetheless, a study of 900 samples [125] clearly supported the Weibull distribution down to a failure rate of 103. (See also [127].) In the simplest case, the presence of multiple failure modes such as extrinsic failure caused by processing defects will lead to bi- or multi-modal distributions with varying slope. A subtler problem can arise from nonuniformity of tox, which will lead to variations in QBD because of the thickness dependence of the critical defect density, and will cause further variation in TBD because of the thickness dependence of the tunnel current. This will lead to distortion and curvature of the Weibull plot [119, 128130]. Such issues can seriously complicate the otherwise rather straightforward projections. Curvature in the Weibull plot usually indicates a problem such as extrinsic failure modes or nonuniformities that must be dealt with before the data can be properly interpreted.
From the previous discussion, it can be seen that an important parameter for reliability projections is the Weibull slope ß. A key advance was the realization that ß is a function of tox, becoming smaller as tox decreases and approaching ß = 1 for tox 2 nm [34, 90, 100]. The thickness variation of the Weibull slope of the QBD distribution stems from the properties of NBD and is a statistical property of the percolation model for NBD. It is an intrinsic property of the breakdown of ultrathin oxides, and does not imply that the breakdown mechanism is changing. A smaller ß means greater sensitivity to the area and failure-rate extrapolations and gives more pessimistic projections via Equation (7), so this is a crucial issue for predicting breakdown of ultrathin oxides. It is often difficult to obtain an accurate value of ß from a direct measurement of the failure distribution because of thickness variations and statistical uncertainty [131, 132]. In order to circumvent these difficulties, Wu et al. [125, 133] adopted the area-scaling relation of Equation (7) in order to obtain accurate values.
Another implication of the reduction in ß for ultrathin oxides is a change in the character of the statistical failure rate. For larger values of ß, the failure rate is initially small, and most of the fails occur near the end of life (see Figure 14). However, for ß approaching unity, Equation (5) reduces to a simple exponential, and the failure rate becomes constant. This means that a certain number of early fails are to be expected from good parts that pass all screening tests. In this context, it should be noted that intrinsic breakdown, as discussed in this paper, is a purely statistical phenomenon, so that it is not possible to pre-select good samples on the basis of any prior measurement, as far as is currently known. Similarly, it is not possible to screen out the early fails using burn-in. Burn-in, where chips are operated for a short time at elevated voltage and temperature, is designed to eliminate bad parts with an early failure distribution (modeled by the dashed curve with ß = 0.5 in Figure 14). This will also remove those chips in the early part of the intrinsic distribution, but will not reduce (in fact will slightly increase) the intrinsic fail rate during operation.
Figure 14
Reliability projections
The common notion that oxide breakdown is controlled by the magnitude of the electric field E across the insulator is in accord with traditional scaling laws [1], even if in actual practice the voltage has not been reduced as quickly as the oxide thickness. The popular anode hole-injection (AHI) model [3, 134] predicted a lifetime exponentially dependent on 1/E, and it was claimed, on the basis of extrapolations of data from 2.4-nm and thicker oxides [135], that an oxide field of up to 8 MV/cm is compatible with a 20-year lifetime. Thus, the projected maximum operating voltage for a 2-nm oxide should be 1.6 V. The E-model, as implied by the name, predicts that TBD scales as exp( E), where is a constant [75, 76] and the observation that wearout and breakdown for ultrathin oxides is controlled by gate voltage rather than oxide field suggests a similar relationship of the form exp( Vg), where is a constant known as the acceleration parameter [4, 28, 34, 105]. As we have seen from the previous section, none of these simple extrapolation laws are exact. Figure 15 shows various oxide-reliability projections from different research groups [4, 21, 127, 136]. Plotted in this figure is the maximum allowable voltage that can be applied to the total gate area on a chip, such that no more than a specified failure rate will result. The failure rate in this case is defined as the fraction of chips which will experience one or more oxide breakdown events. As can be ascertained from this figure, there remains considerable disagreement among laboratories regarding this reliability limit for SiO2.
Figure 15
The various projections in this figure are compared to the latest international roadmap for the operating voltage and physical oxide thickness [16]. Note that various laboratories may use different methods to determine the oxide thickness, which can produce results varying by as much as 0.4 nm [4]. As a rule of thumb, a factor of 10 change in any reliability criterion (e.g., lifetime, failure rate, device area) results in ~0.1 nm change in the minimum tox for a given supply voltage.
The three most conservative projections date from several years ago, and were based on fitting the average Pg and NBD behavior using breakdown data from relatively short-term experiments (TBD 105 s). The earliest curve in this figure is from data on 2.5-nm and thicker oxides measured over a range of voltages down to about 2.5 V. These data were then fitted to the AHI model [135, 136]. Since the data fitted were the breakdown times of individual small structures, this curve is a voltage projection only, not a full projection to low failure rate and chip area, since the appropriate ß values were not known at that time. It is probably fortuitous, therefore, that the extrapolation of this line appears in good agreement with some of the other projections in this figure.
The projections from the more recent version of the AHI model [14] are shown by the open and closed circles in Figure 15 [21]. In this case the model was fitted to data on oxides ranging in thickness from 1.4 nm to 2.4 nm and measured between 3.2 and 4.9 V [137]. An example of such data is shown in Figure 13(b). In this voltage range, the field in the oxide is ~2028 MV/cm, which is in the range of the intrinsic breakdown strength of SiO2. Such high field data may therefore be influenced by other bond-breaking mechanisms, in addition to wearout, and probably should not be used to extrapolate to long-term-use conditions. These are the most optimistic of any published oxide-reliability projections of which we are aware.
Using the recent long-term stress data [26], a range of possible oxide reliability projections results depending on how the data are extrapolated. The lower range results from taking into account the complete behavior of the defect-generation rate (Pg) and the critical defect density (NBD) as described previously, using the NBD values projected to approximately ten-year lifetimes and the sigmoidal voltage dependence of Pg to extrapolate from the lowest-voltage data points. In this case, the roadmap runs into problems very quickly below 2 nm, which corresponds to the technology being manufactured today. The upper range of projected oxide reliability comes from a straight-line [exp( Vg)] projection of the TBD data [Figure 13(b)] in the region between 2 and 3 V. In this voltage range TBD shows the strongest voltage dependence, because of the behavior of Pg (decreasing relative to the average exponential trend) and NBD (increasing with decreasing voltage and increasing stress time) for these measurement conditions. In this case the results suggest a concern regarding the ITRS roadmap below about 1.6 nm for 1.2-V operation. If the voltage acceleration parameter should continue to increase at lower Vg, as some have suggested [28, 29], a more favorable projection will result.
It is clear that there is no agreement at the present time regarding the correct oxide-reliability extrapolation. While all of the estimates for the minimum tox are rather closely clustered in the 1.52.5-nm range, this is precisely the range of interest for the industry roadmap, and it represents a large variation in device performance and circuit density. Thus, the situation is complicated by the fact that we cannot safely adopt either a worst-case or a best-case outlook: The worst-case scenario appears to be unacceptable from a technological and economic perspective, and the best-case scenario is risky because even in this case we are left with no margin for error. Thus, it appears that the traditional method of assessing oxide reliability, by observing the breakdown time during an accelerated stress experiment and extrapolating this to operation conditions using physical or empirical models, may no longer be adequate to convincingly ensure the long-term reliability of products based on CMOS circuits. It has therefore become necessary to look in more detail at the nature of the breakdown event and the behavior of devices and circuits after oxide failure.
Device and circuit reliability
The signature of oxide breakdown, which we have been discussing up to now, is the first discrete jump in leakage current, and it has been implicitly assumed that such a breakdown event, affecting even a single transistor in a circuit, can potentially cause a circuit to fail. Some researchers have pointed out, however, that the magnitude of the current increase after breakdown is not always large enough to completely destroy the functionality of a transistor, much less that of an entire circuit [30, 138, 139]. On the other hand, some have argued that breakdown will cause a significant increase in the transistor off current if the breakdown spot is in the drain region [119, 125, 137, 140, 141], which is increasingly likely in short-channel devices [142]. It is also claimed that gate-oxide breakdown can significantly affect rf performance [143]. Current research is therefore aimed at better understanding the nature of the conduction after breakdown [144155] and the impact of this conduction on device and circuit functionality.
Oxide breakdown events are typically classified as soft or hard depending on the magnitude of the post-breakdown conduction. There is probably some confusion in the literature over the precise characterization of breakdown modes because of the lack of a precise definition of the terms and because for some experimental conditions the detection of one or the other breakdown mode may be difficult. For example, when testing a large-area structure or a very thin oxide where the initial current is larger than the breakdown current, a soft event could be missed, or a hard event could be interpreted as soft.
Soft breakdown (SBD) occurs in a localized spot, and therefore the current after breakdown is independent of device area [148, 153, 156158]. The size of this spot is estimated to be 1014 to 1012 cm2 [153]. Several soft-breakdown events, occurring in different spots, may sometimes be seen in large-area devices prior to a hard breakdown [148, 150, 153, 156, 159, 160]. In addition, the SBD currentvoltage (IV) characteristic is independent of the oxide thickness, at least down to 3 nm, within a band of observed curves [148, 150, 153, 161]. In contrast to the hard breakdown (HBD) which shows a roughly linear (ohmic) IV characteristic of resistance of about 10 k if the damage region remains localized and does not propagate [102, 153], the SBD IV characteristic is a power law with an exponent of 36 [144, 145, 153, 162], although it may be better described by an exponential voltage dependence [154]. The SBD voltage dependence can be explained by a quantum point-contact model [146, 148151, 153, 154]. The point-contact model can account for both SBD and HBD within a single framework as two limiting cases, depending only on the lateral size of the breakdown spot, which determines the energies of the subbands in the conduction path [153].
Simultaneous with the jump in channel-to-gate current after breakdown is a large increase in the substrate-to-gate current [157, 163166] and in gate and substrate noise [147, 157, 163, 164, 167170]. These may be important concerns in specific applications, e.g., analog circuits, but their impact has not been investigated in sufficient detail.
Besides understanding the nature of the conduction through a breakdown spot, a complete assessment of circuit reliability will require that we understand the factors that control the hardness of the breakdown event, i.e., the magnitude of the post-breakdown conductivity. HBD is probably a result of thermal damage when sufficient energy is deposited during the breakdown transient [102, 171181]. This is influenced by the impedance of the circuitry driving the gate in a particular application. For example, in a CMOS circuit the stress conditions on a transistor gate are not the same as the typical experimental conditions used to study reliability. A circuit does not usually subject a gate to either a constant-voltage or a constant-current stress, which are the two types of stress typically employed, but rather to a current-limited stress in which the current through a breakdown spot is limited by the saturation on-current (Idsat) of a complementary transistor in series [180]. Figure 16 shows the situation for an SRAM cell consisting of two cross-coupled inverters. Not all circuits will fall into this category (for example, some gates will be driven by a low-impedance clock), but SRAM (cache) occupies a large fraction of the area of many chips, and therefore it is a useful place to begin an investigation of circuit reliability. In the SRAM cell, the gate oxides are not connected directly to the power supply. The n-FET gates are connected via p-FETs, and the p-FET gates via n-FETs. Compared to a typical constant-voltage source, the small transistor in series has a much higher impedance, a much lower current capability, and a lower capacitive loading. It has recently been shown [180, 182] that limiting the drive current to low values (corresponding to very small p-FET drivers) will halt the breakdown event such that the transistors may still be operational after breakdown. Similarly, increasing the circuit time constant by inserting an inductive impedance in series with the device will reduce the severity of the breakdown [181]. Breakdown also becomes softer at low inversion-layer density [155]. In all of these examples, the time to breakdown is not affected, only the post-breakdown conduction. This shows that the nature of the breakdown spot is influenced by the circuit environment of the device, which in turn may affect the overall projected reliability of a circuit.
Figure 16
Besides considering the magnitude of the perturbation of the FET electrical properties after oxide breakdown, we must also consider possible voltage droop caused by the increased current after breakdown [24, 182]. The increased gate current will cause a voltage drop across the transistor which is driving the gate, and this voltage drop will increase as transistors become smaller [182]. Moreover, because there will be a distribution of post-breakdown leakage levels [103, 151, 153, 155, 162, 178, 179], it is likely that some circuit failures will occur as a result of oxide breakdown, but it is difficult to make quantitative predictions. The overall effect of SBD on circuit performance is still an open question, since many different circuit elements are used in practice and some may be more sensitive than others to noise and voltage margins.
After a hard breakdown, a MOSFET device is clearly nonfunctional by any ordinary criterion, exhibiting a negative drain current when the gate-to-drain leakage exceeds the normal transistor-on current [140]. However, even a hard breakdown may not completely destroy circuit functionality: It was recently reported [183] that in some cases a circuit may be able to survive an oxide breakdown that previously would have been assumed to be catastrophic. In this experiment it was observed that a ring oscillator continued to function even after multiple transistors suffered oxide breakdown, with breakdown resistances of about 3 k . This means that a ring oscillator/inverter is not highly susceptible to breakdown, and is a poor test vehicle for oxide reliability. Indeed, simple circuit analysis shows that shorting one transistor in an inverter with a resistance of this magnitude produces a circuit which still functions as an inverter, albeit one with poorer performance and higher power consumption.
Even if devices survive after an initial breakdown event, the subsequent stress on the damaged oxide can lead to erratic behavior and a progressive degradation of the device characteristics [125]. Thus, it would be premature to disregard oxide breakdown as a factor in circuit reliability. Much more research will be needed in order to formulate a complete methodology for the quantitative prediction of device and circuit reliability.
Future outlook
Since different circuits will have various degrees of sensitivity to the erosion of noise and voltage margins resulting from oxide breakdown, more research is needed in order to develop a quantitative methodology for predicting the reliability of circuits. The present oxide-reliability methodology will have to evolve from characterizing oxide degradation and breakdown, to the more complex problem of characterizing the response of circuits to oxide breakdown [24].
In the meantime, what avenues are open for avoiding the potential scaling limit imposed by gate-oxide reliability? Although the intrinsic breakdown does imply a fundamental physical limit to the oxide thickness, oxide breakdown is a statistical process, so this limit is dependent on what is deemed to be an acceptable failure rate. Laptop and palmtop reliability of three to five years may be practical. However, because of the uncertainty in prediction, most manufacturers have a minimum lifetime goal of ten years [25], and some key technologies (e.g., telecommunications and networking infrastructure) may require even greater longevity, e.g., 25 to 30 years. However, adjusting the reliability target does not have a great effect on the oxide thickness limit. As pointed out earlier, a factor of 10 change in the reliability target reduces the minimum tox by only about 0.1 nm. This allows some leeway, but it is not a real solution.
Improved control over the physical properties of the Si/SiO2 interface has been argued by some to be key for improved performance and reliability [119, 184189], and to some extent a reduction in the strain at the Si/SiO2 interface may account for the excellent reliability of present state-of-the-art oxides [188, 189]. Continued development of pre-oxidation cleaning and oxide growth technologies are likely to provide incremental improvements in gate leakage, device performance, and yield, but will not alter the fundamental physics limiting the long-term reliability of ultrathin SiO2 films [129, 130, 190].
Another avenue is to more aggressively reduce the power-supply voltage. For example, 0.5-V operation of a microprocessor built with 1.5-nm oxide has been proposed [191]. As can be seen from Figure 15, this would nearly satisfy even the most pessimistic reliability projection based on our current data. Of course, this approach has its own problems, including tight control of threshold voltage.
Some of the power-management schemes used for low-power applications can also have a side benefit of improving reliability if sections of a chip are turned off when not in use. It is important to understand, however, that standby power, not just active power, is responsible for the degradation that leads to oxide breakdown [71, 92]. It is therefore not sufficient to use clock gating, which reduces active power but not standby power. Instead, active-power-supply management may be required.
Finally, any failure mechanism, provided that it is not too severe, can be fixed by using error correction and redundancyfeatures already present in many systems.
Acknowledgments
This paper owes a great debt to D. J. DiMaria, who took many of the key data and provided key insights into the physics of defect generation in SiO2. A. Vayshenker took the long-term stress data, and I thank B. P. Linder for extremely helpful discussions about soft breakdown.
Footnote
1 The sources for these data are the technical digests of the International Electron Devices Meeting and the Symposium on VLSI Technology. Data prior to 1995 compiled with the help of Ping Yang, Texas Instruments Corporation.
Received September 16, 2001; accepted for publication December 28, 2001
|