IBM Skip to main content
  Home     Products & services     Support & downloads     My account  
  Select a country  
Journals Home  
  Systems Journal  
Journal of Research
and Development
  ·  Current Issue  
  ·  Recent Issues  
  ·  Papers in Progress  
  ·  Search/Index  
  ·  Orders  
  ·  Description  
  ·  Patents  
  ·  Recent publications  
  ·  Author's Guide  
  Staff  
  Contact Us  
  Related links:  
     IBM Microelectronics  
     ITRS  
IBM Journal of Research and Development  
Volume 46, Numbers 2/3, 2002
Scaling CMOS to the Limits
 Table of contents: arrowHTML arrowPDF arrowASCII   This article: HTML arrowPDF arrowASCII   DOI: 10.1147/rd.462.0133 arrowCopyright info
   

Beyond the conventional transistor

by H.-S. P. Wong
This paper focuses on approaches to continuing CMOS scaling by introducing new device structures and new materials. Starting from an analysis of the sources of improvements in device performance, we present technology options for achieving these performance enhancements. These options include high-dielectric-constant
(high-k) gate dielectric, metal gate electrode, double-gate FET, and strained-silicon FET. Nanotechnology is examined in the context of continuing the progress in electronic systems enabled by silicon microelectronics technology. The carbon nanotube field-effect transistor is examined as an example of the evaluation process required to identify suitable nanotechnologies for such purposes.

1. Introduction

The semiconductor industry has been so successful in providing continued system performance improvement year after year that the Semiconductor Industry Association (SIA) has been publishing roadmaps for semiconductor technology since 1992. These roadmaps represent a consensus outlook of industry trends, taking history as a guide. The recent roadmaps [1] incorporate participation from the global semiconductor industry, including the United States, Europe, Japan, Korea, and Taiwan. They basically affirm the desire of the industry to continue with Moore's law [2], which is often stated as the doubling of transistor performance and quadrupling of the number of devices on a chip every three years. The phenomenal progress signified by Moore's law has been achieved through scaling of the metal-oxide–semiconductor field-effect transistor (MOSFET) [3, 4] from larger physical dimensions to smaller physical dimensions, thereby gaining speed and density.

Shrinking the conventional MOSFET beyond the 50-nm-technology node requires innovations to circumvent barriers due to the fundamental physics that constrains the conventional MOSFET. The limits most often cited [4–12] are 1) quantum-mechanical tunneling of carriers through the thin gate oxide; 2) quantum-mechanical tunneling of carriers from source to drain, and from drain to the body of the MOSFET; 3) control of the density and location of dopant atoms in the MOSFET channel and source/drain region to provide a high on-off current ratio; and 4) the finite subthreshold slope. These fundamental limits have led to pessimistic predictions of the imminent end of technological progress for the semiconductor industry [4]. On the other hand, the push to scale the conventional MOSFET continues to show remarkable progress [13, 14].

Instead of reiterating the considerations of device scaling limits here, we refer the reader to our previous analyses [8–10] as well as analyses by others in the literature [4–7, 11, 12]. We focus this paper instead on approaches to circumvent or surmount the barriers to device scaling. The organization of this paper is as follows. We first address opportunities for the silicon MOSFET, focusing primarily on approaches that depart from conventional scaling techniques (for example, doping profile control, thin silicon dioxide gate dielectrics, SOI). Topics covered include high-dielectric-constant (high-k) gate dielectric, metal gate electrode, double-gate FET, and strained-silicon FET. The second part of this paper examines the space between conventional microelectronics technology and the more exploratory nanotechnology. Such a wide spectrum of nanotechnologies are being explored today that it is impossible to make even a modest attempt to cover the field. The approach adopted in this paper is to select an example, the carbon nanotube field-effect transistor, to illustrate both the opportunities offered by nanotechnologies and the most important questions that must be answered before such technologies can find practical use. The example is therefore chosen for illustrative purposes rather than an implied suggestion of eventual technological utility.

2. Silicon MOSFET

For digital circuits, a figure of merit for MOSFETs for unloaded circuits is CV/I, where C is the gate capacitance, V is the voltage swing, and I is the current drive of the MOSFET. For loaded circuits, the current drive of the MOSFET is of paramount importance. Historical data indicate that scaling the MOSFET channel length improves circuit speed, as suggested by scaling theory [3]. Reference [15] illustrates data on the CV/I metric from recent literature. The off-current specification for CMOS has been rising rapidly to keep the speed performance high. While 1 nA/µm was the maximum off-current allowed in the late 1990s [8], off-currents in excess of 100 nA/µm are proposed today [13]. This trend obviously cannot continue, since the on-current increases only linearly as off-current increases exponentially in a typical device design tradeoff. Means to mitigate the standby power increase must be found.

Keeping in mind both the CV/I metric and the benefits of a large current drive, we note that device performance may be improved by 1) inducing a larger charge density for a given gate voltage drive; 2) enhancing the carrier transport by improving the mobility, saturation velocity, or ballistic transport; 3) ensuring device scalability to achieve a shorter channel length; and 4) reducing parasitic capacitances and parasitic resistances. Table 1 summarizes these opportunities and proposed technology options for capitalizing on them. These options generally fall into two categories: new materials and new device structures. In many cases, the introduction of a new material requires the use of a new device structure, or vice versa. Throughout the discussion, we direct attention to areas of device physics and materials science that must be better understood in order to advance the technology.


Table 1   Device performance improvement opportunities.
Source of improvement Parameters affected Method

Charge density
  1. S (inverse subthreshold slope)
  2. Qinv at a fixed off-current
  1. Double-gate FET.
  2. Lowered operating temperature.
Carrier transport
  1. Mobility (µeff)
  2. Carrier velocity
  3. Ballistic transport
  1. Strained silicon.
  2. High-mobility and -saturation-velocity materials (e.g., Ge, InGaAs, InP).
  3. Reduced mobility degradation factors (e.g., reduced transverse electric field, reduced Coulomb scattering due to dopants, reduced phonon scattering).
  4. Shorter channel length.
  5. Lowered operating temperature.
Ensuring device scalability to a shorter channel length
  1. Generalized scale length
    (lambda).
  2. Channel length (Lg)
  1. Maintaining good electrostatic control of channel potential (e.g., double-gate FET, ground-plane FET, and ultrathin-body SOI) by controlling the device physical geometry and providing means to terminate drain electric fields.
  2. Sharp doping profiles, halo/pocket implants.
  3. High gate capacitance (thin gate dielectrics, metal gate electrode) to provide strong gate control of channel potential.
Parasitic resistance
  1. Rext
  1. Extended/raised source/drain.
  2. Low-barrier Schottky contact.
Parasitic capacitance
  1. Cjn
  2. CGD, CGS, CGB
  1. SOI.
  2. Double-gate FET.

MOSFET gate stack
Continued device scaling requires the continued reduction of the gate dielectric thickness. This requirement arises from two different considerations: controlling the short-channel effect and achieving a high current drive by keeping the amount of charge induced in the channel large as the power-supply voltage decreases. In both cases, to a first approximation, it is the electrical thickness that is important. The electrical thickness at inversion is determined by the series combination of three capacitances in the gate stack: the depletion capacitance of the gate electrode, the capacitance of the gate dielectric, and the capacitance of the inversion layer in the silicon [Figure 1, part (a)].

Figure 1Figure 1

On the other hand, the direct tunneling current through the gate dielectric grows exponentially with decreasing physical thickness of the gate dielectric [16]. This tunneling current has a direct impact on the standby power of the chip and puts a lower limit on unabated reduction of the physical thickness of the gate dielectric. It is likely that tunneling currents arising from silicon dioxides (SiO2) thinner than 0.8 nm cannot be tolerated, even for high-performance systems [10].

Solutions that reduce the gate tunneling current and gate capacitance degradation due to polysilicon depletion are explored through introduction of new materials: high-dielectric-constant gate dielectrics and metal gate electrodes.

High-k gate dielectric
A gate dielectric with a dielectric constant (k) substantially higher than that of SiO2 (kox) will achieve a smaller equivalent electrical thickness (teq) than the SiO2, even with a physical thickness (tphys) larger than that of the SiO2 (tox):

teq= open parenthesis kox close parenthesis  tphys.

k

Replacing the SiO2 with a material having a different dielectric constant is not as simple as it may seem. The material bulk and interface properties must be comparable to those of SiO2, which are remarkably good. Basic material properties such as thermodynamic stability with respect to silicon, stability under thermal conditions relevant to microelectronic fabrication, low diffusion coefficients, and thermal expansion match are some critical examples. In addition, interface traps of the order of a few 1010 cm–2eV–1 and bulk traps of the order of a few 1010 cm–2 are common among SiO2 and the closely related oxynitrides [17, 18]. Charge trapping and reliability for the gate dielectrics are particularly important considerations.

Thermal stability with respect to silicon is an important consideration, since high-temperature anneals are generally employed to activate dopants in the source/drain as well as the polysilicon gate. Although many binary and ternary oxides are predicted to be thermally stable with respect to silicon [19], recent research on high-dielectric-constant gate insulators have focused primarily on binary metal oxides such as Ta2O5, TiO2, ZrO2, HfO2, Y2O3, La2O3, Al2O3, and Gd2O3 and their silicates [20]. Table 2 compares the properties of the common high-k gate dielectrics reported in the literature. The dielectric constant of these materials generally ranges from 10 to 40, which is about a factor of 3 to 10 higher than SiO2. Leakage current reduction from 103× to 106×, in comparison with SiO2 of the same electrical thickness, is generally achieved experimentally for high-k gate dielectrics [21]. The benefits of using a very-high-dielectric-constant material to simply replace SiO2 for the same electrical thickness are limited because of the presence of two-dimensional electric fringing fields from the drain through the physically thicker gate dielectric [10, 22]. The drain fringing field lowers the source-to-channel potential barrier and lowers the threshold voltage in a way similar to the well-known drain-induced barrier lowering (DIBL), in which the drain field modulates the source-to-channel potential barrier via coupling through the silicon substrate. The use of higher-k materials must therefore be combined with a concurrent reduction of the electrical thickness.


Table 2   Selected material and electrical properties of high-k gate dielectrics. Data compiled from Robertson [25], Gusev et al. [20], Hubbard and Schlom [19], and other sources.
Dielectric Dielectric constant (bulk) Bandgap (eV) Conduction band offset (eV) Leakage current reduction w.r.t. SiO2 Thermal stability w.r.t. silicon
(MEIS data)

Silicon dioxide (SiO2) 3.9 9 3.5 N/A >1050°C
Silicon nitride (Si3N4) 7 5.3 2.4 >1050°C
Aluminum oxide (Al2O3) ~10 8.8 2.8 102–103× ~1000°C, RTA
Tantulum pentoxide (Ta2O5) 25 4.4 0.36 Not thermodynamically stable with silicon
Lanthanum oxide (La2O3) ~21 6* 2.3
Gadolinium oxide (Gd2O3) ~12
Yttrium oxide (Y2O3) ~15 6 2.3 104–105× Silicate formation
Hafnium oxide (HfO2) ~20 6 1.5 104–105× ~950°C
Zirconium oxide (ZrO2) ~23 5.8 1.4 104–105× ~900°C
Strontium titanate (SrTiO3) 3.3 –0.1
Zirconium silicate (ZrSiO4) 6* 1.5
Hafnium silicate (HfSiO4) 6* 1.5

*Estimated value.

A large silicon-to-insulator energy barrier height is desirable because the gate direct-tunneling current is exponentially dependent on the (square root of the) barrier height [23]. In addition, hot-carrier emission into the gate insulator is also related to the same barrier height [24]. The high-k material should therefore not only have a large bandgap, but also have a band alignment which results in a large barrier height. Figure 2 illustrates the bandgap and band alignment for several high-k gate dielectrics calculated by Robertson [25]. Most high-k materials that have other desirable properties do have relatively low band offsets and small bandgaps. Aluminum oxide (Al2O3) is probably the only material that has a bandgap and band alignment similar to those of SiO2.

Figure 2Figure 2

Figure 1 illustrates examples of thin gate dielectrics: SiO2, Al2O3, and ZrO2 with an interfacial SiO2 layer. These dielectrics are only a few atoms thick. The thin dielectric films can be deposited by sputtering, sol–gel, physical vapor deposition (PVD), metallo-organic chemical vapor deposition (MOCVD), or atomic-layer deposition (ALD). Deposition uniformity does not appear to be a significant issue. However, integration of the deposited dielectric with the rest of the device fabrication process requires further research and development in several areas. If a conventional self-aligned polysilicon gate is used, the dielectric film must be able to withstand rapid thermal anneals (RTAs) up to at least 950°C for dopant activation in the polysilicon gate. The typical thermal treatments during a polysilicon gate CMOS process pose potential problems such as formation of silicates and interfacial SiO2. In addition, diffusion (for example, boron, oxygen) through the gate dielectric is a serious concern. If a metal gate electrode is employed (using a low-temperature process), many of the thermal stability concerns can be relieved.

Figure 3(a) shows the electrical characteristics of an 80-nm polysilicon gate n-FET using Al2O3 as the gate dielectric, as reported by Buchanan et al. [26]. This work and that of others (for example, [21]) illustrates some of the obstacles for high-k gate dielectrics: 1) There are a significant number of traps and fixed charges in the film (or at the interfaces), leading to flat-band voltage shifts (up to 450 mV) and voltage bias instability; 2) the traps raise questions of reliability as channel hot carriers and carriers from gate tunneling traverse the gate dielectric, resulting in trap generation; and 3) the mobility of carriers in the FET channel is severely degraded (up to a factor of 2) for high-k gate dielectrics [Figure 3(b)].

Figure 3Figure 3

The cause of the mobility degradation is not clear at present. Presumably, some of the differences can be attributed to the difficulty of obtaining accurate estimates of the effective electric field due to the charge trapping. Coulomb scattering due to the trapped charge alone cannot explain entirely the mobility degradation observed. Another source of mobility lowering may be found in remote phonon scattering [27]. The static dielectric constant of a high-bandgap high-k material derives its high dielectric constant primarily from ionic polarizability, since the large bandgap results in a small electronic polarizability. The ionic polarizability is associated with the “soft” metal–oxygen bonds with low-energy phonons. Fischetti et al. [27] studied the scattering of electrons in the inversion layer by surface optical phonons and suggested that there is generally an inverse relation between surface-optical-phonon-limited mobility and the static dielectric constant: the higher the dielectric constant, the lower the surface-optical-phonon-limited mobility.

Metal gate electrode
A metal gate electrode has several advantages compared to the doped polysilicon gate used almost exclusively today. Gate capacitance degradation due to the depletion of the doped polysilicon gate typically accounts for 0.4–0.5 nm of the equivalent-oxide thickness of the total gate capacitance at inversion. This is a substantial amount, considering that a gate equivalent oxide of less than 1.5 nm (at inversion) is required for sub-50-nm CMOS. The thermal instability of most high-k gate dielectrics may require the use of a low thermal budget process after the gate dielectric deposition. While junction activation may be performed prior to gate dielectric deposition, the high-temperature gate polysilicon activation step necessarily occurs after the gate dielectric formation. A further potential benefit of metal gate electrodes is the elimination of carrier mobility degradation due to plasmon scattering from the gate electrode. The plasmon frequency of a highly conductive metal electrode is too high to interact with the carriers in the inversion layer [28].

From a device design point of view, the most important consideration for the gate electrode is the work function of the material. While the polysilicon gate technology has somewhat locked in the gate work functions to values close to the conduction band and the valence band of silicon,1 the use of a metal gate material opens up the opportunity to choose the work function of the gate and redesign the device to achieve the best combination of work function and channel doping. For bulk or partially depleted SOI, because of the requirements on the threshold voltages and the need to use heavy dopants to control short-channel effects, the most suitable gate work-function values are still close to the conduction and valence bands of silicon. A mid-gap work function results in either a threshold voltage that is too high for high-performance applications, or compromised short-channel effects, since the channel must be counterdoped to bring the threshold voltage down. For double-gate FETs (see the section on double-gate FET electrostatics), because the short-channel effects are controlled by the device geometry, the threshold voltage is determined mainly by the gate work function. Therefore, the choice of the gate electrode is particularly important for the double-gate FET. For example, for symmetric double-gate FETs (SDG), a gate-electrode work function ±250 mV from mid-gap is suitable. The section on double-gate FET electrostatics expands on this discussion.

While there are plenty of metal choices that may satisfy the work-function requirements [30–32], other device and integration considerations narrow down the choices significantly. The requirements of a low gate-dielectric/silicon interface state density and low gate-dielectric fixed charges imply that a damage-free metal deposition process (e.g., CVD instead of sputtering) is required. At the same time, the deposition process must not introduce impurities (e.g., traces of the CVD precursor materials) into the gate stack. The thermal stability of the metal electrode must at least withstand the thermal anneals required to passivate the silicon/gate-dielectric interface (e.g., forming-gas anneal) after the metal deposition, as well as the thermal processing of the back-end metallization processes. Furthermore, it is desirable to have a low resistivity (at least similar to conventional silicides such as CoSi2 and TiSi2), although this requirement may be relaxed by strapping the gate electrode of the proper work function with a lower- resistivity material on top.

In principle, a single-metal electrode is advantageous, since it avoids many potential problems of alloyed metals such as composition uniformity control and phase separation. On the other hand, alloying provides flexibility in choosing the desired material properties. The gate-electrode work-function issue is further complicated by the fact that the work function measured in vacuum (values reported in most materials data books) is different from the work-function value when the metal is in contact with a dielectric. In general, a dipole forms at the metal/dielectric interface which alters the effective work function of the metal/dielectric combination [33, 34]. The choice of appropriate metal electrode is then also dependent upon the choice of the gate dielectric: SiO2 or high-k material.

One of the promising process integration schemes for metal gate is the replacement-gate technology [35]. In this process, a dummy gate material (e.g., polysilicon) is used for forming the self-aligned gate-to-source/drain structure. Subsequently, the dummy gate material is removed and replaced with the desired gate dielectric and electrode [35]. Alternatively, the metal gate electrode may be etched in a way similar to the polysilicon gate technology. However, the learning curve is long and steep for developing the same (or a better) level of etch selectivity and profile control for the metal gate compared to the polysilicon gate. In addition, thermal stability issues (from the source/drain dopant activation anneal) must be addressed. In either case, if metals with two different work functions are employed for n-FET and p-FET, respectively, the integration of n-FET and p-FET in a CMOS process remains a challenge, since 1) the deposition of the metals for n-FET and p-FET must be done separately, and 2) one must find a way to strap the two different metals in a compact way to connect the n-FET and p-FET gates. It is desirable to circumvent these two requirements and find a way to alter the work function of the metal by some simple means (for example, one that requires only a block mask). Two interesting approaches, yet to be proven through more rigorous examination, have been reported. In the first approach, a single metal (molybdenum) is deposited, and the work function is altered using ion implantation of nitrogen into the metal [36]. It is not clear how the nitrogen influences the work function, and how thermally stable this material is. On the other hand, ion implantation is an attractive process, since it requires only a single metal deposition and a photoresist block mask. In another approach, metals are intermixed to obtain the desired work function [37]. Two metals (Ti and Ni) are sequentially deposited on the gate dielectric, followed by selective etching of the top metal, leaving the bottom metal at desired locations. After thermal annealing, the metal on the top migrates to the metal/gate-dielectric interface and alters the work function locally.

Ultimately scalable FET—the double-gate FET

Device concepts
The double-gate FET (DG FET) shown in Figure 4, part (a) was proposed in the early 1980s [38]. The concept has been gradually explored both experimentally and theoretically by many groups [39–46]. The Monte Carlo and drift-diffusion modeling work by Fiegna et al. [41] and Frank et al. [42] clearly showed that a DG FET can be scaled to a very short channel length (25 to 30 nm) while achieving the expected performance derived from scaling. While the early work focused on the better scalability of DG FET, recent work suggests that the scalability advantage may not be as large as previously envisioned [10, 47], although the carrier transport benefits may be substantial. In this section, we first discuss the advantages of DG FET, followed by device design requirements, and conclude with a summary of latest hardware results.

Figure 4Figure 4

The salient features of the DG FET (Figure 4) are [48] 1) control of short-channel effects by device geometry, as compared to bulk FET, where the short-channel effects are controlled by doping (channel doping and/or halo doping); and 2) a thin silicon channel leading to tight coupling of the gate potential with the channel potential. These features provide potential DG FET advantages that include 1) reduced 2D short-channel effects leading to a shorter allowable channel length compared to bulk FET; 2) a sharper subthreshold slope (60 mV/dec compared to >80 mV/dec for bulk FET) which allows for a larger gate overdrive for the same power supply and the same off-current; and 3) better carrier transport as the channel doping is reduced (in principle, the channel can be undoped). Reduction of channel doping also relieves a key scaling limitation due to the drain-to-body band-to-band tunneling leakage current. A further potential advantage is more current drive (or gate capacitance) per device area; however, this density improvement depends critically on the specific fabrication methods employed and is not intrinsic to the device structure.

The most common mode of operation of the DG FET is to switch the two gates simultaneously. Another use of the two gates is to switch only one gate and apply a bias to the second gate to dynamically alter the threshold voltage of the FET2 [49, 50]. In this mode of operation, called “ground plane” (GP) or back-gate (BG), the subthreshold slope is determined by the ratio of the switching gate capacitance and the series combination of the channel capacitance and the nonswitching gate capacitance, and is generally worse than the DG FET. A thin gate dielectric at the nonswitching gate reduces the voltage required to adjust the threshold voltage and preserves the drain-field-shielding advantage of the double-gate device structure. However, a thinner gate dielectric also means extra capacitance that does not contribute to channel charge for switching. Since the back-gate FET is very similar to a single-gated SOI FET with an adjustable threshold voltage [49], we focus our discussion here on the DG FET configurations in which both gates are switched.

Double-gate FET electrostatics
The double-gate device structure allows for termination of the drain electric field at the gates and leads to a more scalable FET. To evaluate the scalability of FETs, the concept of the “scale length” for a MOSFET is useful [10, 43, 44, 51]. The electrostatic potential of the MOSFET channel can be approximated by analytically solving the 2D LaPlace equation using the superposition principle (with suitable boundary conditions), and the short-channel behavior can be described by a characteristic “scale length,” lambda. Table 3.3 of [52] lists the generalized scale length derived by Frank et al. [10, 51] and the simpler, but less accurate, scale length derived by Suzuki et al. [44]. The minimum gate length is jointly determined by the scale length and by the amount of 2D short-channel effects one can tolerate in an application. The 2D short-channel effects can range from increased off-current due to threshold-voltage roll-off, drain-induced barrier lowering (DIBL), and degraded subthreshold slope, to degraded output resistance. Figure 5 of [10] illustrates the trend of these 2D effects as the channel length is decreased with respect to the scale length of the MOSFET. Manufacturing tolerances put a premium on the minimum channel length. With typical tolerances of 20–30% gate-length variation, an L/lambda of 1.5 is required.3 Conventional short-channel-effect theory [23] correlates the junction depth to the short-channel effects. In the case of the DG FET, consideration of junction depth is moot, since the 2D electrostatic behavior is controlled by the thickness of the silicon channel instead of the junction depth. However, the steepness of the source/drain junction is still an important consideration, as in the case of bulk FETs [47]. Figure 5 illustrates the threshold-voltage roll-off characteristics of the DG FET with lateral junction profile gradients of 2, 4, and 6 nm (Gaussian analytical profile). It is clear that a steep junction gradient commensurate with the channel length is required.

Figure 5Figure 5

Comparing scale lengths for the DG FET, ultrathin silicon SOI FET, and bulk devices, as well as considering other leakage mechanisms (such as tunneling leakages), leads to the conclusion that the DG FET can be scaled up to 50% further than the bulk FET for some applications [10]. Illustrations of the threshold-voltage roll-off behavior (an example of 2D short-channel effects) comparing DG FET, ultrathin-silicon FET, and ground-plane FET (in which the bottom gate of a DG FET is tied to a fixed bias) can be found in [10, 50] and many other references in the literature and are not repeated here. Similar analyses based on on-current and off-state subthreshold leakage current can be found in [53, 54]. Simply put, the better scalability of DG FET can be used to achieve a shorter channel length using the same gate-oxide thickness, or the same channel length using a thicker gate oxide.

We now turn our discussion to the relationship of the channel inversion charge and the gate voltage. The analytical model of Taur [55] and the numerical modeling of Ieong et al.4 [56] form the basis for much of this discussion. The gate work function of the two gates can be the same (the SDG, with a symmetric energy-band diagram in the direction normal to the gate electrode) or different (the ADG, with an asymmetric energy-band diagram in the direction normal to the gate electrode), as illustrated in Figure 4. In the subthreshold region, where there is negligible inversion charge, the silicon channel is fully depleted, and the energy bands closely follow the gate bias in a one-to-one relationship. For the SDG, the bands remain flat throughout the subthreshold region, since there is little or no depletion charge. Once inversion charge begins to build up, the mobile charges screen the gate field and the gate voltage is dropped primarily in the inversion layer. The threshold voltages for the SDG and the ADG with various gate work functions are plotted in Figure 6 as a function of the silicon channel thickness. The threshold voltage of the SDG has a small channel thickness dependence (approximately equal to 6 mV/nm), while the threshold voltages of the ADG and BG have a larger dependence on channel thickness (approximately equal to 28–32 mV/nm) because of the asymmetric band diagram.4 At the same off-current (same integrated mobile charge in the subthreshold region), the surface potential of the ADG is higher than that of the SDG. In other words, the surface of the ADG is inverted more than the surfaces of the SDG at the same off-current condition. This is because the SDG has a fairly constant charge density throughout the silicon film, while the ADG charge density peaks toward one of the surfaces. This surface-potential difference is carried forward throughout the entire gate-voltage range well into the fully inverted condition in which both surfaces of the ADG are in inversion [55].

Figure 6Figure 6

For the SDG with a channel thickness greater than 5 nm, there are two distinct charge-density peaks near the two surfaces, and the two inversion layers are basically independent of each other (see the inset of Figure 6) [56]. For a channel thickness less than 5 nm, the two inversion charge peaks begin to merge. For the ADG, the inversion charge forms first at the surface, where the gate work function is lower. Although the ADG has only one predominant channel, the total integrated charge is more than half that for the SDG. The SDG-to-ADG charge ratio is about 2× for a thick silicon channel and approaches 1× for very thin channels. The physics behind this observation is best explained by using the capacitive coupling model of Taur [55]. The ADG gate with a higher gate work function (say, back-gate) induces inversion charge at the opposite surface through the capacitive coupling of the series combination of the back-gate dielectric (Coxb = epsilonox/toxb) and the channel capacitance (CSi = epsilonSi/tSi). The total gate capacitance of the ADG is therefore more than the front-gate dielectric capacitance (Coxf = epsilonox/toxf) alone. The amount of back-gate coupling obviously depends on the back-gate dielectric thickness and the silicon channel thickness. The thinner the silicon channel with respect to the gate dielectric, the more effective the coupling, and the closer the SDG-to-ADG charge ratio is to unity.

The quantization in the channel introduces several interesting effects, which are discussed here. The threshold-voltage increase due to quantum effects is illustrated in Figure 6 [50] using a simple particle-in-a-box approximation (more accurate in SDG than BG or ADG): DeltaVt =
–(h2/4qm*t
Si2)(DeltatSi/tSi) [57], where Vt is the threshold voltage, h is Planck's constant, q is the electronic charge, m* is the carrier effective mass, and tSi is the channel thickness. The quadratic increase of the threshold voltage with decreasing channel thickness (steep rise of dVt/dtSi) means that channel thickness much below 5 nm will be almost inpractical to manufacture unless an atomically precise method of defining the channel thickness is found. In a more realistic approximation, one solves the one-dimensional Schrödinger equation and incorporates the solution self-consistently in a coupled 2D Poisson and continuity equation solution (with appropriate boundary conditions) [56]. Figure 6 compares the threshold voltages computed from a classical and quantum-mechanical description of the channel for several cases of gate-work-function values. For gates with a symmetric work function, the work function of the gates should be about 250 to 350 mV above/below mid-gap for n/p channels, respectively, for high-performance applications. Another strategy for setting the threshold voltage is to employ a set of asymmetric work-function gates: a) a front gate with a work function close to the conduction band (“n+ gate”) and a back gate with a work function close to the valence band (“p+ gate”); and b) a front gate with a work function close to the conduction band (valence band) for n-FET (p-FET) and a back gate with a common mid-gap work function (for both n-FET and p-FET). Both of these approaches provide a symmetric threshold voltage for both n-channel and p-channel DG FETs. Obviously, the work-function requirement for the ADG case also depends heavily on the gate dielectric and silicon channel thicknesses. A typical gate dielectric (1 nm) and silicon channel (5–10 nm) thickness required for a sub-50-nm DG FET gives a threshold voltage that is too high if the n+/p+ ADG (case a) is used, while case b above has a more appropriate threshold voltage for high-performance applications.

Figure 74 [56] shows two noteworthy effects from quantum behavior: 1) threshold-voltage shifts due to quantum effects are larger in DG FETs with asymmetric work-function gates (BG FET and ADG) compared to SDG (see also the values shown in Figure 6); and 2) threshold-voltage roll-off is worse when quantum effects are included. The first observation can easily be understood, since the band diagrams of the BG FET and the ADG are asymmetric (with a high normal electric field), which forces the charge carriers toward one of the surfaces. The second observation is more subtle: As the channel length is shortened, classical (Poisson equation) short-channel effects reduce the normal electric field, thereby reducing the quantization effects (“opening up” the channel potential). This effect is, in fact, more apparent in the BG FET. A similar effect can also be observed in the ADG [56].

Figure 7Figure 7

Taking into account the short-channel design considerations in [50, 53], the design space for a sub-20-nm DG FET is summarized as follows (Figure 8): a channel thickness of 5–10 nm, a gate dielectric of less than 1 nm (equivalent electrical thickness), a source/drain doping profile with less than 4 nm/decade lateral gradient, a highly doped source/drain fan-out structure to reduce series resistance, and a set of self-aligned front and back gates to minimize gate to source/drain overlap capacitances. The “alignment” of the gates refers to both front and back gates aligned with respect to each other as well as to the source/drain doping.

Figure 8Figure 8

The requirement for a set of self-aligned gates is underscored by the study in [46], which we summarize here. Two scenarios for a non-self-aligned DG FET are 1) misaligned (offset) top and bottom gates of equal size (minimum bottom gate) [45], and 2) an oversized bottom gate to ensure gate overlap of the source/drain [58] [see Figure 9, part (b)]. The minimum bottom-gate approach minimizes overlap capacitance. The energy barrier for carrier injection from the source for the misaligned gate [offset from the source case; see Figure 9, part (b)] is significantly higher than the nominal, aligned case. The higher source-side energy barrier limits the on-current and degrades the gate delay. The gate-delay degradation amounts to more than a full generation of device scaling performance gain [Figure 9, parts (c)–(d)]. A fully self-aligned fabrication process (both gates aligned with each other and with the source/drain doping) is therefore required for the highest performance benefits. If misalignment is unavoidable, the study in [46] indicates that it is more important to have a large enough bottom gate to ensure adequate gate-to-source overlap in order to attain a high on-current than to minimize parasitic capacitance.

Figure 9Figure 9

Double-gate FET carrier transport
So far, most of the discussion has been focused on the electrostatics of the FET. We now turn our attention to device design issues including carrier transport. Because of the extremely small depletion capacitance, the gate-to-channel potential coupling is not de-rated by the capacitor divider between the gate-oxide capacitance and the depletion capacitance. The subthreshold slope is therefore 60 mV/decade in the absence of short-channel effects (Figure 10). For the same off-current, the threshold voltage can be set about 60 mV lower than bulk FETs, thereby providing more gate overdrive—an important advantage as the power supply is reduced.

Figure 10Figure 10

Carrier mobility for the DG FET with a thin silicon channel deserves further discussion. Early work on single-gated SOI [59] demonstrated that carrier mobility in SOI follows a “universal mobility” curve similar to that of bulk FETs [60] when the “effective field” is properly taken into account in a single-gated SOI FET. Recent work using a double-gated FET structure has also indicated that the carrier mobility follows the same “universal mobility” behavior for both front-gated and back-gated channels, provided that the gate dielectrics on both sides are of high quality and the “effective field” is properly accounted for [61] (see Figure 11). However, early reports of carrier mobility measurement for very-thin-silicon-channel FETs (single-gated SOI FET) show a significant degradation of mobility as the silicon channel thickness is decreased below 20 nm [62–64]. Subsequent theoretical calculations suggest a complex behavior of mobility as a function of silicon channel thickness attributed to phonon scattering in the thin confined silicon channel [65–67]. Both findings raise legitimate concern for the DG FET [10], since the thin silicon channel required for control of short-channel effects may result in poor carrier transport. Recent experimental mobility measurements [68] for thin-silicon-channel FETs down to 5 nm as a function of carrier density depict a more complete picture. First, the early experimental results [62–64] may be tempered by a poor back-channel oxide because the thin silicon channel of the single-gated SOI FET was obtained by oxidation thinning from a SIMOX wafer which is known to have a substandard back-channel oxide interface that can degrade carrier transport. In addition, the mobility at low inversion carrier concentration (or, equivalently, low effective electric field) was reported. Figure 12 summarizes the data to date [62–64, 68]. The electron mobility degradation at channel thicknesses below 20 nm is clearly observable at a low inversion carrier density (Ninv = 1012 cm–2). However, at the higher inversion carrier density (Ninv = 1013 cm–2) that is important for nanoscale CMOS, the electron mobility is less sensitive to the silicon channel thickness. While the above results are for single-gated SOI FETs, similar results have been obtained for double-gate operation [69]. The degradation at low inversion carrier density appears to be related to phonon scattering, suggested by the larger degradation seen at lower temperatures [69].

Figure 11Figure 11 Figure 12Figure 12

Carrier transport in the DG FET with an undoped channel is superior to that in conventional bulk FETs for two reasons: reduced Coulomb scattering due to fewer ionized dopants in the undoped/low-doped channel, and reduced surface roughness scattering due to a lower surface electric field, as illustrated in Figure 13. In bulk FETs, channel doping is employed to set the threshold voltage, and halo or pocket dopings are employed to control the short-channel effects. These ionized depletion charges contribute appreciably to the surface electric field. In a DG FET with an undoped channel, there is no ionized depletion charge; therefore, the surface electric field is contributed entirely from inversion carriers (application of Gauss's law). Even though the carrier mobility follows the “universal mobility” curve [Figure 13(a)], at the same gate overdrive the carrier mobility can be significantly higher [Figure 13(b)] because the effective field is lower at the same gate overdrive. Figure 13(a) also shows the mobility and range of effective field for a typical bulk FET. While the bulk FET operates at an effective field above 1 MV/cm, the DG FET with an undoped channel operates at around 0.5 MV/cm, thereby improving the mobility by almost two times. This improved transport potentially provides the DG FET a better CV/I metric because although the capacitance C is doubled in a DG FET, the current I is improved by more than two times because of the better transport.

Figure 13Figure 13

Device fabrication
Fabrication of the DG FET is difficult. Early experimental work began with non-self-aligned DG FET structures for a first demonstration of the device principles [39, 40, 45, 58, 70], the most advanced being the work of Tanaka et al. [45, 58] where non-self-aligned DG FET circuits were demonstrated. In this section, we focus on the more recent development of the self-aligned DG FET because of its better performance, as described in the section on double-gate FET electrostatics.

In general, the DG FET may be fabricated with any of the three orientations depicted in Figure 14 [9, 48, 71]. The planar structure (Type I) has the advantage of better silicon channel thickness uniformity because film thickness in the plane of the wafer has the best uniformity and controllability. However, the fabrication of a back gate and a thin gate dielectric underneath a single-crystal silicon channel is difficult. In addition, accessing the bottom gate from the top surface for device wiring is not straightforward, and may have a negative impact on device density. The nonplanar structures (Types II and III) allow for easier access and formation of both gates (or a wraparound gate) on crystalline channels with thin gate dielectrics. On the other hand, the channel thickness is defined by lithography and patterning techniques (e.g., reactive ion etching), and may therefore have poorer uniformity than planar films. The device electrostatic design (see the section on double-gate FET electrostatics) requires that the channel thickness be about 1/3 to 1/4 of the channel length. Historically, the smallest dimension patterned on a chip is the gate length. For the nonplanar DG FET structures, the smallest dimension patterned must be considerably smaller than the channel length, which is a major departure from conventional processes. Carrier transport along etched surfaces with different crystallographic orientations [e.g., (110) surface for a notch <110> wafer] may degrade performance, although there is little data to date to support these assumptions. While the topography of Type II and III nonplanar structures may raise some fabrication concerns, it should be noted that this topography problem may not be as severe as it appears, because the height of the vertical structure can be made similar to that found in the gate stacks of planar structures.

Figure 14Figure 14

The buried gate of the planar, Type I device structure has been demonstrated using two techniques: selective epitaxial growth [71, 72], and wafer bonding and layer transfer [73, 74]. We illustrate these approaches using the two examples below.

Figure 15 shows the fabrication sequence for the first self-aligned DG FET. The fabrication process utilizes a planar CVD-deposited film in a dummy gate stack as a placeholder for the silicon channel. This dummy gate stack is etched out (forming a tunnel) and then replaced by a single-crystal silicon channel formed by selective epitaxial silicon growth through the tunnel with a seed from one side of the tunnel to prevent grain-boundary formation in the middle of the silicon channel. Excess epitaxial growth is polished away by chemical–mechanical polishing (CMP). Since the selective epitaxial silicon growth is confined to the dielectric tunnel [75–79], the thickness of the silicon channel is determined by the previously deposited CVD film (which has good thickness uniformity). Despite the seemingly difficult task of filling a long, thin tunnel, experimental evidence shows that large aspect ratios of more than 220:1 can be filled [79], well in excess of the requirements for the electrostatic device design for short-channel effects. After the formation of the silicon channel and subsequent CMP planarization to remove excess grown silicon, the source/drain is formed using ion implantation that is self-aligned to the dielectric dummy gate stack. The dummy gate stack is then removed, leaving a silicon channel in the form of a suspended bridge between the source and drain. Finally, the gate dielectric is formed, and the gate material is deposited and patterned. This “double-replacement” process provides a means to form a wraparound gate over the single-crystal silicon channel and self-align the source/drain doping to the channel and the gates. Both the silicon channel and the gate stack are formed by replacing a dummy material/structure with the final material/structure. The device characteristics reported suffer from high series resistance, possibly a result of non-optimized source/drain doping and silicide process. While this process possesses many positive attributes, as outlined above, it does have several shortcomings: 1) lack of an independently adjustable dielectric spacer thickness to separate the gate and the source/drain for parasitic capacitance reduction; 2) difficulty in doping and siliciding the underside of the wraparound gate; 3) the fact that the front and back gates cannot be independently biased; and 4) source/drain silicides that are not self-aligned to the gates. Some of these problems are addressed in the process proposed in [73, 80].

Figure 15Figure 15

Parts (a)–(e) of Figure 16 show the fabrication sequence of a triple self-aligned planar double-gate FET using wafer-bonding and layer-transfer techniques [74]. The starting substrate is an unpatterned bonded wafer with the doped polysilicon back gate and thin gate dielectric in place beneath the silicon channel. The bonded interface is 350 nm below the devices [Figure 16(e)]. Locating the bonding interface below the active device region is advantageous for two reasons: 1) any imperfections of the bonding process (stress fields, minor voids, and embedded particles) will not affect the active device; 2) any subsequent fabrication steps that etch to a level below the silicon channel will not expose the bonding interface, thereby avoiding the possibility of delamination of the bonded substrate. The undoped channel was initially thinned to 20–30 nm using oxidation, achieving uniformity to within 1 nm over much of an 8-inch-diameter wafer. The bottom gate and the top gates are separately patterned and accessible on opposite sides of the device width direction [Figure 16(f)]. CMP is employed to planarize the surface every time any topography is generated by patterning (etch). This eliminates the possibility of “stringers” and provides a robust process. For planarization at the front-end process, planarity of nanometer-scale accuracy and uniformity is required. Using custom-designed chemistry and processes, 2–3-nm topography over an 8-inch wafer is achievable. In order to provide access to the bottom gate and provide self-alignment of the bottom gate to the top gate and the source/drain, the source/drain fan-out regions are implemented as doped silicon sidewalls which are subsequently silicided. The sidewall source/drain is used as a self-aligned etch mask to center the length direction of the bottom gate with respect to the top gate and the sidewall source/drain. The undercut bottom gate is passivated with nitride dielectric. Contact to the sidewall source/drain is made by a tungsten plug to the source/drain well, followed by CMP to replanarize the surface. The difficult requirements of this fabrication approach include the following: 1) a precise and controllable back-gate undercut process, 2) a back-gate dielectric that is in place already during the wafer-bonding process and all subsequent process steps, and 3) a carefully controlled sidewall source/drain silicon deposition/etch and subsequent anneal, recrystallization, and dopant profile control via ion implantation of the sidewall and silicidation. Using a dry-etch technique, the back-gate undercut and sidewall source/drain etch can be better controlled. Device scaling requires that the back-gate dielectric be as thin as the front-gate dielectric. The use of a high-k gate dielectric is also a possibility. Both scenarios demand a low-temperature wafer-bonding process (preferably well below 900°C) for the back-gate dielectric to be viable.

Figure 16Figure 16

Figure 16(g) shows the measured transconductance of a 125-nm-gate-length n-FET fabricated using the aforementioned triple-self-aligned DG FET process. The back gate (BG) is n+-doped polysilicon, and the front gate (FG) is p+-doped polysilicon. The double-gate (DG and DG*) mode of operation provides a transconductance that is slightly more than the sum of the FG and BG modes (single-gated, ground-plane operation). An ideal subthreshold slope of 60–62 mV/decade is measured for DG-mode operation. Mobility measurement of the front-gated channel and back-gated channel shows electron and hole mobilities following the universal mobility curves, indicating good transport properties for these bonded silicon channels with thin front- and back-gate dielectrics [74]. DG FET circuits are also demonstrated in this work. Ratioed inverters are demonstrated using a DG FET as the load device. The back gate of the load DG FET is employed to control the load current and adjust the input bias of the load device. By using the front gate and the back gate of the same DG FET as two separate inputs of the “inverter” circuit (again with another DG FET as a load), a compact two-input NOR is demonstrated [74]. The tungsten-plugged source/drain well can be shared in a circuit layout to achieve high device density.

The most successful vertical structure as a high-performance device is the Type III structure, also known as the FinFET [81], since the silicon channel protrudes from the silicon wafer surface like a fin. Figure 17 illustrates the fabrication sequence and device structures. The FinFET is essentially a scaled-down version of the DELTA device reported by Hisamoto et al. [82]. Two types of FinFET are reported: 1) a gate-first process in which the source and drain are formed after the formation (patterning) of the gate stack [83–86]; and 2) a gate-last (or replacement-gate) process in which the source and drain are formed before the formation of the gate stack [81, 87].

Figure 17Figure 17

The fabrication of the FinFET begins with the patterning and etching of a thin fin on an SOI substrate using a hard mask which is retained throughout the fabrication process. The thickness of the fin will be the silicon channel thickness of the DG FET. As discussed earlier, the fin thickness is smaller than the gate length; thus, either electron-beam lithography [81, 84, 86, 87] or optical lithography with extensive linewidth trimming [85] is used to pattern the thin fin. For the gate-first process, the fabrication steps after the fin formation are analogous to the fabrication steps of the conventional bulk FET: After the gate oxide is grown, the gate polysilicon is deposited, patterned, and etched. A sidewall spacer is formed next to the gate. Source/drain and extension implants can be performed before and/or after the gate spacer, using angled implants [85]. A selective epitaxial growth of silicon or germanium from the fin surfaces forms the source/drain fan-out that reduces series resistance [85, 86]. Self-aligned silicide can be formed on the source/drain and gate, as in conventional bulk FETs. For the gate-last process, the source/drain is formed immediately after the fin patterning. Doped polysilicon or polycrystalline SiGe is deposited on the fin, followed by lithographic patterning of the source/drain fan-out pads with a thin slot between the source and drain. This distance between the source and drain determines the gate length. The slot length is further reduced by a dielectric sidewall spacer. Then the gate oxide is grown, and the gate polysilicon (or polySiGe) is deposited and patterned. A potential device density improvement for FinFET using direct etch and sidewall image transfer to generate fins is illustrated in Figure 18. The sidewall image transfer technique can reduce the fin pitch by a factor of 2 using the same lithography pitch. The fin pitch must be smaller than the fin height to provide more effective device width than a planar single-gated FET.

Figure 18Figure 18

The difficulties of fabricating the FinFET DG FET include 1) variability of the fin thickness (and hence silicon channel thickness); 2) highly selective RIE and long over-etches which are required to pattern the gate polysilicon that wraps around the fin (for the gate-first process), and to create the sidewall spacers (for both the gate-first and the gate-last processes); and 3) a difficult integration scheme for the source/drain of complementary FETs (n-FET and p-FET on the same wafer) for the gate-last process. The apparent difficulty in handling the topography of a “vertical” structure is manifested in the RIE and over-etch issues mentioned above. While tall fins may provide an improvement in density, the fin height must be contained within reasonable limits. The variability of the fin thickness deserves special attention. The variability in fin thickness arises from several sources: linewidth variation of the lithographic process, linewidth variation of the fin etching process, and line-edge roughness of the lithographic process [88], all of which are translated to the final fin dimensions. There is no systematic data yet that correlates fin thickness to processing conditions. The sidewall image transfer technique produces a set of fin surfaces that have correlated roughness; this is a different situation from the direct-etched-fin case, in which the fin surface undulations on either side of the fin are uncorrelated.

Both the gate-first and the gate-last FinFET processes have produced devices with excellent I–V characteristics (Figure 19). The current drive and the CV/I of the FinFET rival those of the best conventional bulk devices [13, 14], albeit with a threshold voltage that has to be centered. Despite the unconventional device structure and topology, the minimum gate length achieved is among the shortest [81, 85, 86], including those of conventional FETs [13, 14]. Since the silicon channel thickness is determined by the patterned fin, experimental data comparing the short-channel effect of different silicon thicknesses is readily obtained. The experimental results of Huang et al. [81] and Kedzierski et al. [85] both corroborate the short-channel electrostatic device design and scale length analyses described above. While the carrier-transport properties along the etched fin surfaces have been a concern, the good current drive exhibited by the FinFET DG FET indicates that there is no significant degradation.

Figure 19Figure 19

Challenges ahead
While there has been tremendous progress both in understanding the device physics of the DG FET and in fabricating it, significant challenges remain to be met before this new device structure is ready for manufacturing. Some of these open issues have no obvious solution at the moment.

First and foremost is the need to set the threshold voltage and to achieve multiple threshold voltages on the same chip. The section on double-gate FET electrostatics outlined the requirements on the work function of the gates to achieve the required threshold voltages for SDG and ADG. At this moment, no material satisfies the requirements for SDG except possibly a midgap-work-function material such as tungsten or CoSi2, with doping of the silicon channel to lower the threshold voltage to the desired level. For ADG, the combination of n+ (p+) polysilicon with tungsten or CoSi2 (materials having a mid-gap work function) for the n-FET (p-FET) gates provides one possible solution, although an integration scheme to achieve self-aligned gates is still lacking. Adjusting the threshold voltage for multiple threshold voltages on the same chip is a different challenge from setting the threshold voltage. For ULSI systems, it is typically necessary to provide a menu of devices with different threshold voltages to allow for the optimization of performance and power consumption. The ability to tune the threshold voltage by about 150 mV is often required. The obvious solution is to dope the channel of the DG FET in analogy with bulk FETs. For devices with geometries in the sub-50-nm gate-length regime, a channel doping of the order of high 1018 cm–3 is required. This tends to reduce the benefit of the DG FET, since the advantages of an undoped channel (higher mobility, reduced discrete dopant fluctuation effects) will diminish or even vanish. Recent work on gate-material work-function tuning [36] by ion implantation offers interesting opportunities, although much still has to be proven.

Because of the thin silicon channel, the series resistance of the DG FET is of particular concern [50]. Some form of raised source/drain process would be required in order to achieve a source/drain fan-out. The results given in [85] illustrated the efficacy of the solution. Growing selective epitaxial silicon on thin (<15-nm) silicon substrates (or fins) is still difficult, because the thin starting silicon tends to break up during or prior to the epitaxial growth. In addition, as the silicon channel thickness is reduced to less than 10 nm, optimization of the series resistance and parasitic capacitance of the “overlap” region may prove difficult.

Uniformity of the silicon channel is an important concern, especially when the silicon channel is reduced to less than 10 nm to satisfy short-channel control requirements. Silicon channel thickness tolerance may be translated to an equivalent gate-length tolerance by considering a constant scale length [10, 46]. A 15% variation in silicon channel thickness approximately translates into a 10% gate-length variation. This tolerance is added quadratically to the overall tolerances. Maintaining a thin, uniform silicon channel thickness remains a major manufacturing obstacle.

Finally, most double-gate FET devices are structurally different from conventional FETs. Parasitic capacitance (gate-to-source/drain capacitance) critically depends on the device structure. There is no fundamental barrier to achieving a back-gate parasitic capacitance that is as small as the front-gate parasitic capacitance, especially for the FinFET DG FET. While most device results to date have focused on the current drive and static I–V characteristics, more work has to be done to characterize and optimize the parasitic capacitance.

Higher mobility
The section on double-gate FET carrier transport has already pointed out the importance of a low-doped channel for carrier transport in DG FET. Another avenue to achieving a higher carrier mobility and saturation velocity is through the choice of material for the FET channel. Fischetti and Laux [89] compared the performance of several semiconductors that have high carrier mobilities and saturation velocities, including Ge, InP, InGaAs, GaAs, and several others. They concluded that materials which provide a significantly higher carrier mobility give only a moderate performance advantage over a lower-mobility material such as silicon. That work succinctly pointed out that under non-equilibrium, high lateral field transport, carrier mobility is not the only determinant of performance (using metrics such as transconductance). The band structure, which determines the density of states (i.e., the inversion capacitance, Cinv) [90] and the carrier scattering rates at high carrier energies are just as important as the carrier mobility: a low-carrier-energy, quasi-equilibrium quantity. Furthermore, in the limit in which carriers are transported ballistically, many other factors combine to determine performance [91]. On the other hand, a higher carrier mobility does provide some moderate performance gains, albeit not in proportion to the mobility values [92].

Strained-silicon FET—device concepts
It has been known for some time that carrier mobility in silicon under biaxial tensile strain is enhanced [93–98]. The theory of mobility enhancement for strained silicon is still evolving [96]. The most commonly cited reason for electron mobility enhancement in strained silicon is that under the biaxial tensile strain, the sixfold degeneracy of the conduction band of silicon is lifted, raising the higher-effective-mass fourfold-degenerate ellipsoids and lowering the lower-effective-mass twofold-degenerate ellipsoids. This has the effect of keeping most of the carriers in the lower-energy, lower-effective-mass valleys and reducing the intervalley scattering. In the valence band, the biaxial tensile strain lifts the heavy-hole/light-hole degeneracy at the Gamma point, resulting in a smaller in-plane transport effective mass due to band deformation and reduced intervalley scattering.

The use of strained silicon provides a plausible tradeoff between moderate levels of performance enhancement over silicon and ease of fabrication and integration with silicon (as compared to other higher-mobility materials such as Ge, InGaAs, GaAs, and InP). Recent work has provided encouraging experimental evidence that introducing the biaxial tensile strained silicon through a layer of relaxed SiGe may provide adequate performance gains for incorporation into conventional CMOS technologies. This section summarizes the recent results, focusing on strained silicon on relaxed SiGe.

Materials and device fabrication
The tensile strain can be introduced in several ways. Local strain caused by the thermal mismatch of silicon and the isolation materials [such as shallow-trench isolation (STI)] has been shown to introduce enough strain to alter device characteristics [97]. A more general approach is to introduce biaxial tensile strain by growing a thin layer of epitaxial silicon on a material with a slightly larger lattice constant, such as relaxed SiGe [95, 98, 99]. The strained silicon must be relatively thin in order to prevent relaxation and strain relief through dislocations. The underlying relaxed SiGe serves as an anchor to constrain the lattice of the strained silicon on top and has no beneficial electrical role otherwise to first order. Compared to a closely related approach of using a pseudomorphic layer of SiGe on silicon, where the carrier transport is in the compressively stressed SiGe layer (see for example [100]), strained silicon on relaxed SiGe has the advantage of having silicon as the top surface. The silicon provides an easy means of forming a good gate dielectric and enables surface channel operation for good short-channel-effect control.

Experimental results for bulk strained silicon on relaxed SiGe FETs and strained silicon on relaxed SiGe on insulator FETs are quite encouraging. For bulk strained silicon on relaxed SiGe, the fabrication begins with the growth of a relaxed SiGe buffer of 1–2 µm by a step-graded approach [101–103]. This step can be performed using rapid thermal chemical vapor deposition (RTCVD) [95, 98] or ultrahigh-vacuum chemical vapor deposition (UHVCVD) [104]. A thin (typically <20 nm) silicon layer is then epitaxially grown on the relaxed SiGe. Because of the larger lattice constant of Si1–xGex (4 × x% larger than that of silicon), the top silicon layer is under biaxial tensile strain. Early work on FET fabrication has limited high-temperature processes and ion implantation in the channel to avoid strain relaxation. Recent results indicate that a conventional CMOS process flow can be adopted while still achieving the mobility and current drive enhancement [104]. Process modules such as shallow-trench isolation (STI), channel ion implantation, source/drain extensions and halos, and associated high-temperature activation anneals are shown to have no adverse impact on device characteristics. Figure 20, part (a) shows that the effective electron-mobility enhancement over the unstrained-silicon control, as well as the enhancement over the universal mobility curve, persists into an effective electric field (Eeff) range (up to 1.5 MV/cm) that is relevant to short-channel devices with high channel/halo dopings (e.g., bulk or PDSOI). A portion of this mobility enhancement is translated into current drive enhancement over the unstrained-silicon control [Figure 20, part (b)]. When the effect of self-heating is accounted for in the measurement, it is expected that the current drive will improve by another 5–10%.

Figure 20Figure 20

It is also desirable to combine the benefits of enhanced transport in strained-silicon materials with the benefits of silicon-on-insulator (SOI) technologies [e.g., reduced junction capacitances, dynamic floating-body effects (PD SOI), scalability to shorter channel length (ultrathin-body SOI)]. Several approaches have been reported, all of which rely on growing strained silicon on a relaxed SiGe layer on insulator (SGOI). These methods include wafer bonding and layer transfer
[105–107], oxygen implantation (SIMOX) [99, 108], and oxidation enrichment of SiGe [109, 110] (see Figure 21). For the wafer-bonding and SIMOX approaches, mobility enhancements similar to that obtained in bulk strained-silicon FET have been demonstrated for n-FETs [99, 106, 107] as well as p-FETs [99, 106, 108]. Device design for SOI devices in the sub-50-nm regime calls for a thin SOI layer of less than 50 nm. Therefore, the combined layer thickness of the strained silicon and the relaxed SiGe should be less than 50 nm. In addition, defect density must be low enough for ULSI applications.

Figure 21Figure 21

The wafer-bonding and layer-transfer technique is the most straightforward and preserves the qualities of the starting relaxed SiGe layer. These include desirable qualities such as a high Ge content and high degree of relaxation of the SiGe buffer layer (e.g., by the step-graded approach). It also retains undesirable qualities such as the defect density of the starting SiGe buffer. A practical limitation of the layer transfer approach is thickness and uniformity control, since the thickness of the SiGe layer on the insulator is dependent on the final polish step. Using the SIMOX approach, the Ge content in the SiGe layer appears to be limited to less than 10% because the melting point of Si1–xGex with x > 0.1 is lower than the SIMOX annealing temperature. This limitation can be partially recovered by oxidation enrichment. When the SiGe is oxidized, the Ge is driven from the oxide layer, and the Ge content in the underlying SiGe layer increases (the “snowplow effect”). Thus, the Ge content of the SiGe layer can be “enriched” by this oxidation process. The enrichment process is used in the SIMOX approach to obtain a Ge content greater than 10% in the SGOI film [99]. This process can also be employed to produce a relaxed SiGe layer on insulator (oxide) by oxidizing a film stack consisting of a starting silicon-on-insulator layer with a SiGe layer pseudomorphically grown on top [109]. In this case, thickness uniformity and control will be very similar to that of the starting SOI film.

Electrical results of strained-silicon-on-insulator (SSOI) FETs fabricated from strained silicon grown on SGOI virtual substrates are quite encouraging. Essentially, the mobility enhancements commensurate with the amount of strain introduced as similarly observed in bulk strained-silicon substrates are reproduced in SSOI FETs. These results verify that the concept of combining strained silicon with SOI will retain the benefits of strained silicon.

Challenges ahead
Strained-silicon CMOS is at present one of the most promising technology options for insertion into CMOS technologies. The materials set is friendly to conventional CMOS technologies, and the device performance gains demonstrated so far have been promising. The main issue for the materials development is defect control and understanding. The defect density of the materials must be comparable to those of exiting bulk and SOI materials for the relaxed SiGe materials to be viable in an ULSI application. The nature of the defect is as important as the defect density. Correlating device fail mechanisms to the physical defect will be an important area of work. In particular, for SSOI materials, the nature of the defects can be quite different from that of the bulk relaxed SiGe material. Obviously, there is also a strong dependence of defects on the method of SSOI materials fabrication. It may also be possible to obtain strained-silicon-on-insulator without the underlying SiGe layer (e.g., using compliant substrate concepts [111]), thereby breaking out of the SiGe materials system altogether. In the device area, the fundamental principles of mobility enhancement must be further elucidated. In addition, relating the mobility enhancement to current drive gains for short-channel devices will continue to add to the understanding of the relative role of mobility in describing carrier transport, since strained silicon provides another means of modifying mobility while keeping other device parameters the same. Finally, we note that while it may appear difficult to combine strained silicon with the double-gate FET device structure, in principle nothing precludes this possibility.

3. Nanotechnology

Nanotechnology is a broad term which may refer to such diverse technical disciplines as chemistry, biology, physics, materials, and electrical engineering. Here we focus on the aspect of nanotechnology that pertains to extending and broadening the impact of the microelectronics and semiconductor industry. Therefore, the discussion here is necessarily narrowly focused. Instead of including a comprehensive discussion of this narrowly focused yet still vast subject, we choose to select one example as an illustration of the approach and thought process we adopt to identify and/or adopt nanotechnology for commercial applications. In this example, we examine a potential device technology: the carbon nanotube field-effect transistor (CNFET). This example is chosen because it appears to be one of the nanotechnologies which allows a meaningful comparison with incumbent technologies in that it uses the same circuit and system architecture. It is also one of the most widely studied in the nanoscale science community.

Carbon nanotube FET

Device concepts
Carbon nanotubes are nanoscale high-aspect-ratio cylinders of carbon atoms with exceptional electrical and mechanical properties. The basic properties and preparation of carbon nanotubes are well documented in the literature [112–116]. Many applications for this material have been proposed: electronic switch [117, 118], field emitter for flat-panel displays [119, 120], electrochemical energy storage for batteries and fuel cells [121], interconnect for electronics, probe tips for scanning probe microscopy and lithography [122–124]. Here we focus our discussion on the use of a carbon nanotube as an electronic switching element which performs basic logic functions.

Single-wall carbon nanotubes (SWNT) are two-dimensional graphene sheets rolled into nanometer-diameter cylinders that can either be 1D metals or semiconductors [125]. With the appropriate chirality [114, 116], SWNT can be semiconducting, with a bandgap inversely proportional to the diameter of the tube: Egap = 2gamma0aC–C/d, where Egap is the bandgap, gamma0 is the carbon-to-carbon tight-binding overlap energy, aC–C is the nearest-neighbor carbon-to-carbon distance (0.142 nm), and d is the diameter of the nanotube [126]. A typical semiconducting SWNT has a diameter of 1.4 nm and a bandgap of about 0.5–0.65 eV [126]. A straightforward application of this semiconducting property of the carbon nanotube is to form a field-effect transistor (FET) analogous to the MOSFET. Tans et al. [117] and Martel et al. [118] have already demonstrated FET operation using gold electrodes as source and drain contacts and a back-gated structure. Both p- and n-channel carbon nanotube transistors (CNFETs), as well as CNFETs with ambipolar behavior, have been reported [127, 128]. Recently, simple circuits such as inverters [127, 129] and ring oscillators [129] have been successfully fabricated.

Device fabrication
The fabrication of an early CNFET is straightforward: First, the gold electrodes are formed on a silicon wafer with a thermally grown oxide (the gate insulator); then carbon nanotubes are dispersed on the wafer. The wafer substrate serves as the gate electrode of the CNFET. While this process is simple and suitable for a first demonstration of FET action, the source/drain contact resistance clearly limited the current drive of the CNFET [117, 118]. A recent embodiment of the CNFET is shown in the inset of Figure 22(a), where ideas borrowed from conventional microelectronics fabrication are employed to improve the contact resistance. In this process, carbon nanotubes are dispersed onto a heavily doped silicon wafer with a thermally grown oxide (serving as the gate insulator). Then a transition metal (titanium or cobalt) is patterned on the wafer (by lift-off) as the source/drain contact metal. A subsequent anneal at
400°C (Co) or at 820°C (Ti) in an inert ambient forms a low-resistivity cobalt or Ti carbide at the source/drain contact [128, 130]. Figure 22 shows the I–V characteristics of a p-type CNFET with a cobalt source/drain contact [131]. The cobalt contact improves the contact resistance noticeably compared to previous results obtained with the same SWNT material [117, 118] as evidenced from Figure 22(b), where the drain current shows no sign of saturation at large gate voltages. A similar device with a SiO2 passivation layer reduces the subthreshold slope [131], but does not completely remove the interface traps at the nanotube surface. While this device structure is easy to fabricate and enables advance learning, a device configuration in which the gates are separately controllable (in contrast to a common gate provided by the substrate) is necessary for a technology.

Figure 22Figure 22

Performance assessment
Table 3 summarizes the key device parameters extracted from the CNFET I–V characteristics. The source/drain series resistance is an upper bound estimated from the measured total resistance from source to drain.5 In order to assess whether the CNFET is a viable technology, it is useful to compare the device performance of CNFET with that of conventional silicon MOSFETs. Table 4 compares several key device parameters. It should be noted that several assumptions must be made in order to make a direct comparison. While some of these assumptions may not be valid for various practical or fundamental reasons, Table 4 does serve as a starting point for discussion.


Table 3   Selected device parameters of p-type carbon nanotube FETs. The source/drain series resistance is an upper bound estimated from the measured total resistance from source to drain.
Contact scheme Cobalt TiC

Channel length (nm) 1030 800
Nanotube diameter 1.4 1.4
Gate-oxide thickness (nm) 150 150
Source/drain resistance (kOmega) <25 *
Inverse subthreshold slope (mV/decade) 2500 730
Interface trap capacitance ratio,** Cit/Ci 40.9 11.2
Transconductance# (µS) 0.342 0.280
(VD = 1.4 V) (VD = 1.0 V)
“Mobility”# (cm2V–1s–1) 60 68

*This value cannot be accurately estimated for this device.
**Ratio of the capacitance due to interface trap relative to the gate capacitance as estimated from the subthreshold slope assuming that the capacitance due to band bending in the s-SWNT is negligible.
#The transconductance and the mobility values are given without a correction for the contact resistance. After Martel et al. [131].


Table 4   Comparison of device characteristics of carbon nanotube FET and conventional silicon MOSFET.
    p-CNFET(a)
1030 nm
100-nm
MOSFET
25-nm
MOSFET

  Transconductance (µS/µm) 122 1000 (n-FET)(b) 1200 (n-FET)(c)
      460 (p-FET)(b) 640 (p-FET)(c)
  External resistance (Omega-cm per side) <70 ~66 (n-FET)(d) ~40 (n-FET)(d)
      ~143 (p-FET)(d) ~86 (p-FET)(d)
  Gate insulator (nm) 150 2.0 0.8

(a)  Device A from Martel et al. [131].
(b)  Ghani et al., IEDM 1999 [136].
(c)  Chau et al., IEDM 2000 [13]; these values approach the 1500 mS/mm predicted for n-FET for an ideal single-gated bulk MOSFET [47].
(d) ITRS 1999 [1].

First, it is necessary to make some assumptions about how the CNFET will be used. Here, we assume that a circuit topology similar to today's silicon-based VLSI circuits will be used; that is, the CNFETs are switches that form logic gates interconnected to one another by wires. In this scenario, the CNFET must deliver a current-driving capability similar to or surpassing that of the MOSFET in the future time when the CNFET will be introduced.6 To provide this large current, it is likely that an array of CNFETs will be needed. If we further assume that the CNFET can be made with equal lines and spaces with dimensions equal to the diameter of the nanotube (1.4 nm), the CNFET characteristics may be compared with those of MOSFETs on a per-unit device-width basis, as in Table 4.

Understanding the carrier transport mechanism of the CNFET is vital to assessing its potential as a device technology. Here, we adopt a phenomenological approach based on a classical description of transport that allows a direct comparison with silicon devices. Although it is clear that the characteristic length for electron–phonon and defect scattering in metallic single-wall carbon nanotubes is very long (of the order of microns) and favors ballistic transport [132, 133], this classical approach takes into account the fact that the semiconducting SWNTs show stronger backscattering at room temperature [118, 125, 134].

In this phenomenological model, the channel conductance is given by G = µQL/L, where µ is the carrier “mobility” and QL is the charge density per unit length. Note that µ is a phenomenological parameter for these quasi-1D systems [118]. If one draws an analogy with the silicon MOSFET in which the channel charge is related to a threshold voltage VT, then QL = CL(VG – VT), where CL is the gate-to-channel capacitance per unit length and VG is the gate voltage [118]. Within this framework, the hole “mobility” of the CNFET is about 60–68 cm–2V–1s–1. This hole “mobility” compares very well with surface mobility of holes in MOSFETs, which range from about
160 cm–2V–1s–1 at low vertical electric fields (0.1 MV/cm) to about 50 cm–2V–1s–1 at high vertical electric fields (1 MV/cm) [60]. The transconductance of the p-type CNFET shown in Figure 22 reaches 122 µS/µm (assuming equal lines and spaces for a CNFET array) for a 1030-nm-long channel and a gate insulator thickness of 150 nm.7 Continuing with the classical transport picture in which the CNFET current scales inversely with the channel length, the CNFET is extrapolated to have a transconductance of 1257 µS/µm at a 100-nm channel length and 5028 µS/µm at a 25-nm channel length. For silicon MOSFETs, respective transconductances of 1000 µS/µm and 460 µS/µm for n-FETs and p-FETs have been achieved at 100-nm gate length [136]. At 25-nm gate length, the transconductances reach 1200 µS/µm for n-FETs and 640 µS/µm for p-FETs [13], which approaches the 1500 µS/µm predicted for n-FETs for an ideal single-gated bulk MOSFET [47]. The transconductance of n-channel double-gate FETs in the 25-nm regime is projected to be of the order of 5000 µS/µm.

The electrostatics of the CNFET is another area that requires detailed analysis, primarily due to the cylindrical geometry, the small size, and the 1D quantized channel (and band structure) of the carbon nanotube. Short-channel effects of the CNFET have not yet been explored. The capacitance for an isolated cylindrical structure (either in a planar gate conductor or coaxial gate conductor configuration [131]) varies inversely and logarithmically with the distance from the center of the conductor [137]. For gate insulator thicknesses that are large compared to the nanotube diameter, the gate capacitance for an isolated CNFET is greater than that for a corresponding parallel-plate capacitor (e.g., in a planar silicon MOSFET) with an equivalent width [131] (due to the fringing field). For example, a 150-nm gate SiO2 for an isolated CNFET is roughly equivalent to a 3–10-nm gate SiO2 for a planar MOSFET. However, this advantage due to the fringing field vanishes for gate insulator thicknesses that are comparable to or smaller than the nanotube diameter. Furthermore, for an array of CNFETs, the fringing field diminishes as the packing density of the array increases. At high packing densities, the shielding from neighboring nanotubes renders the electric field pattern similar to the parallel-plate situation. It is therefore important to explore in future work the use of high-k gate dielectric materials and to study the electrostatics of alternative gate and channel geometries (such as the coaxial geometry) to maximize the gate-to-channel charge coupling.

In principle, the carbon nanotube has a perfect interface with the gate dielectric because all bonds of the carbon atoms are satisfied in a carbon nanotube. In other words, there are no dangling bonds which form interface states. However, the subthreshold turn-off characteristics shown in Figure 22(b) are very gradual (hundreds of mV/decade). This behavior is probably caused by traps in the gate dielectrics that are close to the interface. Reducing the impurities in the gate dielectric will probably lead to improved subthreshold characteristics (see recent results by Wind et al. [135]).

Challenges ahead
Despite the optimism suggested by the above analysis, the challenges facing CNFET technology are daunting. The first and most important is to understand the device scaling and carrier transport properties of the CNFET (as a function of the gate length, gate dielectric thickness), as well as key device physics such as short-channel effects; gate-dielectric-to-carbon-nanotube interface properties (interfa