|
1. Introduction
The semiconductor industry has been so successful in providing
continued system performance improvement year after year that the
Semiconductor Industry Association (SIA) has been publishing roadmaps
for semiconductor technology since 1992. These roadmaps represent a
consensus outlook of industry trends, taking history as a guide. The
recent roadmaps [1] incorporate participation from the global
semiconductor industry, including the United States, Europe, Japan,
Korea, and Taiwan. They basically affirm the desire of the industry to
continue with Moore's law [2], which is often stated as the doubling
of transistor performance and quadrupling of the number of
devices on a chip every three years. The phenomenal progress signified
by Moore's law has been achieved through scaling of the
metal-oxidesemiconductor field-effect transistor (MOSFET)
[3, 4] from larger physical dimensions to smaller physical
dimensions, thereby gaining speed and density.
Shrinking the conventional MOSFET beyond the 50-nm-technology node
requires innovations to circumvent barriers due to the fundamental
physics that constrains the conventional MOSFET. The limits most often
cited [412] are 1) quantum-mechanical tunneling of carriers through
the thin gate oxide; 2) quantum-mechanical tunneling of carriers from
source to drain, and from drain to the body of the MOSFET; 3) control
of the density and location of dopant atoms in the MOSFET channel and
source/drain region to provide a high on-off current ratio; and 4) the
finite subthreshold slope. These fundamental limits have led to
pessimistic predictions of the imminent end of technological progress
for the semiconductor industry [4]. On the other hand, the push to
scale the conventional MOSFET continues to show remarkable progress
[13, 14].
Instead of reiterating the considerations of device scaling limits
here, we refer the reader to our previous analyses [810] as well as
analyses by others in the literature [47, 11, 12]. We focus this
paper instead on approaches to circumvent or surmount the barriers to
device scaling. The organization of this paper is as follows. We first
address opportunities for the silicon MOSFET, focusing primarily on
approaches that depart from conventional scaling techniques (for
example, doping profile control, thin silicon dioxide gate dielectrics,
SOI). Topics covered include high-dielectric-constant
(high-k) gate dielectric, metal gate electrode, double-gate
FET, and strained-silicon FET. The second part of this paper
examines the space between conventional microelectronics technology and
the more exploratory nanotechnology. Such a wide spectrum of
nanotechnologies are being explored today that it is impossible to make
even a modest attempt to cover the field. The approach adopted in this
paper is to select an example, the carbon nanotube field-effect
transistor, to illustrate both the opportunities offered by
nanotechnologies and the most important questions that must be answered
before such technologies can find practical use. The example is
therefore chosen for illustrative purposes rather than an implied
suggestion of eventual technological utility.
2. Silicon MOSFET
For digital circuits, a figure of merit for MOSFETs for unloaded
circuits is CV/I, where C is the gate
capacitance, V is the voltage swing, and I is the
current drive of the MOSFET. For loaded circuits, the current drive of
the MOSFET is of paramount importance. Historical data indicate that
scaling the MOSFET channel length improves circuit speed, as suggested
by scaling theory [3]. Reference [15] illustrates data on the
CV/I metric from recent literature. The off-current
specification for CMOS has been rising rapidly to keep the speed
performance high. While 1 nA/µm was the maximum off-current allowed
in the late 1990s [8], off-currents in excess of 100 nA/µm are
proposed today [13]. This trend obviously cannot continue, since the
on-current increases only linearly as off-current increases
exponentially in a typical device design tradeoff. Means to
mitigate the standby power increase must be found.
Keeping in mind both the CV/I metric and the benefits of a
large current drive, we note that device performance may be improved by
1) inducing a larger charge density for a given gate voltage drive; 2)
enhancing the carrier transport by improving the mobility, saturation
velocity, or ballistic transport; 3) ensuring device scalability
to achieve a shorter channel length; and 4) reducing parasitic
capacitances and parasitic resistances. Table
1 summarizes these opportunities and
proposed technology options for capitalizing on them. These options
generally fall into two categories: new materials and new device
structures. In many cases, the introduction of a new material requires
the use of a new device structure, or vice versa. Throughout the
discussion, we direct attention to areas of device physics and
materials science that must be better understood in order to
advance the technology.
|
|
Table 1
|
Device performance improvement
opportunities.
|
| |
| |
| |
| |
| |
|
Source of improvement
|
Parameters affected
|
Method
|
|
|
Charge density
|
- S (inverse subthreshold slope)
- Qinv at a fixed off-current
|
- Double-gate FET.
- Lowered operating temperature.
|
|
Carrier transport
|
- Mobility (µeff)
- Carrier velocity
- Ballistic transport
|
- Strained silicon.
- High-mobility and -saturation-velocity materials (e.g., Ge, InGaAs, InP).
- Reduced mobility degradation factors (e.g., reduced
transverse electric field, reduced Coulomb scattering due to dopants,
reduced phonon scattering).
- Shorter channel length.
- Lowered operating temperature.
|
|
Ensuring device scalability to a shorter channel length
|
- Generalized scale length
( ).
- Channel length (Lg)
|
- Maintaining good electrostatic
control of channel potential (e.g., double-gate FET, ground-plane FET,
and ultrathin-body SOI) by controlling the device physical geometry and
providing means to terminate drain electric fields.
- Sharp doping profiles, halo/pocket implants.
- High gate capacitance (thin gate dielectrics, metal
gate electrode) to provide strong gate control of channel potential.
|
|
Parasitic resistance
|
- Rext
|
- Extended/raised source/drain.
- Low-barrier Schottky contact.
|
|
Parasitic capacitance
|
- Cjn
- CGD, CGS, CGB
|
- SOI.
- Double-gate FET.
|
|
MOSFET gate stack
Continued device scaling requires the continued reduction of the
gate dielectric thickness. This requirement arises from two different
considerations: controlling the short-channel effect and achieving a
high current drive by keeping the amount of charge induced in the
channel large as the power-supply voltage decreases. In both cases, to
a first approximation, it is the electrical thickness that is
important. The electrical thickness at inversion is determined by
the series combination of three capacitances in the gate stack: the
depletion capacitance of the gate electrode, the capacitance of the
gate dielectric, and the capacitance of the inversion layer in the
silicon [Figure 1, part
(a)].
Figure 1
On the other hand, the direct tunneling current through the gate
dielectric grows exponentially with decreasing physical thickness of
the gate dielectric [16]. This tunneling current has a direct impact
on the standby power of the chip and puts a lower limit on unabated
reduction of the physical thickness of the gate dielectric. It is
likely that tunneling currents arising from silicon dioxides
(SiO2) thinner than 0.8 nm cannot be tolerated, even for
high-performance systems [10].
Solutions that reduce the gate tunneling current and gate capacitance
degradation due to polysilicon depletion are explored through
introduction of new materials: high-dielectric-constant gate
dielectrics and metal gate electrodes.
High-k gate dielectric
A gate dielectric with a dielectric constant (k)
substantially higher than that of SiO2
(kox) will achieve a smaller equivalent
electrical thickness (teq) than the
SiO2, even with a physical thickness
(tphys) larger than that of the SiO2
(tox):
|
teq=
|
|
kox
|
|
tphys.
|
|
| k |
Replacing the SiO2 with a material having a different
dielectric constant is not as simple as it may seem. The material bulk
and interface properties must be comparable to those of
SiO2, which are remarkably good. Basic material
properties such as thermodynamic stability with respect to
silicon, stability under thermal conditions relevant to microelectronic
fabrication, low diffusion coefficients, and thermal expansion match
are some critical examples. In addition, interface traps of the
order of a few 1010 cm2eV1
and bulk traps of the order of a few 1010
cm2 are common among SiO2 and the closely
related oxynitrides [17, 18]. Charge trapping and reliability for the
gate dielectrics are particularly important considerations.
Thermal stability with respect to silicon is an important
consideration, since high-temperature anneals are generally employed to
activate dopants in the source/drain as well as the polysilicon gate.
Although many binary and ternary oxides are predicted to be thermally
stable with respect to silicon [19], recent research on
high-dielectric-constant gate insulators have focused primarily on
binary metal oxides such as Ta2O5,
TiO2, ZrO2, HfO2,
Y2O3, La2O3,
Al2O3, and Gd2O3 and
their silicates [20]. Table
2 compares the properties of the
common high-k gate dielectrics reported in the literature.
The dielectric constant of these materials generally ranges from 10
to 40, which is about a factor of 3 to 10 higher than
SiO2. Leakage current reduction from 103× to
106×, in comparison with SiO2 of the same
electrical thickness, is generally achieved experimentally for
high-k gate dielectrics [21]. The benefits of using a
very-high-dielectric-constant material to simply replace
SiO2 for the same electrical thickness are limited
because of the presence of two-dimensional electric fringing
fields from the drain through the physically thicker gate dielectric
[10, 22]. The drain fringing field lowers the source-to-channel
potential barrier and lowers the threshold voltage in a way similar to
the well-known drain-induced barrier lowering (DIBL), in which the
drain field modulates the source-to-channel potential barrier via
coupling through the silicon substrate. The use of higher-k
materials must therefore be combined with a concurrent reduction of the
electrical thickness.
|
|
Table 2
|
Selected material and electrical properties of high-k
gate dielectrics. Data compiled from Robertson [25], Gusev et al.
[20], Hubbard and Schlom [19], and other
sources.
|
| |
| |
| |
| |
| |
|
Dielectric
|
Dielectric constant
(bulk)
|
Bandgap (eV)
|
Conduction band
offset (eV)
|
Leakage current reduction w.r.t.
SiO2
|
Thermal stability w.r.t. silicon (MEIS data)
|
|
|
Silicon dioxide
(SiO2)
|
3.9
|
9
|
3.5
|
N/A
|
>1050°C
|
|
Silicon nitride
(Si3N4)
|
7
|
5.3
|
2.4
|
|
>1050°C
|
|
Aluminum oxide
(Al2O3)
|
~10
|
8.8
|
2.8
|
102103×
|
~1000°C,
RTA
|
|
Tantulum pentoxide
(Ta2O5)
|
25
|
4.4
|
0.36
|
|
Not
thermodynamically stable with silicon
|
|
Lanthanum oxide
(La2O3)
|
~21
|
6*
|
2.3
|
|
Gadolinium oxide (Gd2O3)
|
~12
|
|
Yttrium oxide
(Y2O3)
|
~15
|
6
|
2.3
|
104105×
|
Silicate
formation
|
|
Hafnium oxide
(HfO2)
|
~20
|
6
|
1.5
|
104105×
|
~950°C
|
|
Zirconium oxide
(ZrO2)
|
~23
|
5.8
|
1.4
|
104105×
|
~900°C
|
|
Strontium titanate (SrTiO3)
|
|
3.3
|
0.1
|
|
Zirconium silicate (ZrSiO4)
|
|
6*
|
1.5
|
|
Hafnium silicate
(HfSiO4)
|
|
6*
|
1.5
|
|
| *Estimated value. |
|
A large silicon-to-insulator energy barrier height is desirable because
the gate direct-tunneling current is exponentially dependent on the
(square root of the) barrier height [23]. In addition, hot-carrier
emission into the gate insulator is also related to the same barrier
height [24]. The high-k material should therefore not only
have a large bandgap, but also have a band alignment which results in a
large barrier height. Figure 2
illustrates the bandgap and band alignment for several
high-k gate dielectrics calculated by Robertson [25]. Most
high-k materials that have other desirable properties do
have relatively low band offsets and small bandgaps. Aluminum oxide
(Al2O3) is probably the only material that has
a bandgap and band alignment similar to those of SiO2.
Figure 2
Figure 1 illustrates examples of thin gate dielectrics:
SiO2, Al2O3, and ZrO2
with an interfacial SiO2 layer. These dielectrics are only
a few atoms thick. The thin dielectric films can be deposited by
sputtering, solgel, physical vapor deposition (PVD), metallo-organic
chemical vapor deposition (MOCVD), or atomic-layer deposition (ALD).
Deposition uniformity does not appear to be a significant issue.
However, integration of the deposited dielectric with the rest of the
device fabrication process requires further research and development in
several areas. If a conventional self-aligned polysilicon gate is used,
the dielectric film must be able to withstand rapid thermal anneals
(RTAs) up to at least 950°C for dopant activation in the polysilicon
gate. The typical thermal treatments during a polysilicon gate CMOS
process pose potential problems such as formation of silicates and
interfacial SiO2. In addition, diffusion (for example,
boron, oxygen) through the gate dielectric is a serious concern. If a
metal gate electrode is employed (using a low-temperature process),
many of the thermal stability concerns can be relieved.
Figure 3(a) shows the
electrical characteristics of an 80-nm polysilicon gate n-FET using
Al2O3 as the gate dielectric, as reported by
Buchanan et al. [26]. This work and that of others (for example,
[21]) illustrates some of the obstacles for high-k gate
dielectrics: 1) There are a significant number of traps and fixed
charges in the film (or at the interfaces), leading to flat-band
voltage shifts (up to 450 mV) and voltage bias instability; 2) the
traps raise questions of reliability as channel hot carriers and
carriers from gate tunneling traverse the gate dielectric, resulting in
trap generation; and 3) the mobility of carriers in the FET channel is
severely degraded (up to a factor of 2) for high-k gate
dielectrics [Figure 3(b)].
Figure 3
The cause of the mobility degradation is not clear at present.
Presumably, some of the differences can be attributed to the difficulty
of obtaining accurate estimates of the effective electric field due to
the charge trapping. Coulomb scattering due to the trapped charge alone
cannot explain entirely the mobility degradation observed. Another
source of mobility lowering may be found in remote phonon scattering
[27]. The static dielectric constant of a high-bandgap
high-k material derives its high dielectric constant
primarily from ionic polarizability, since the large bandgap
results in a small electronic polarizability. The ionic polarizability
is associated with the soft metaloxygen bonds with low-energy
phonons. Fischetti et al. [27] studied the scattering of
electrons in the inversion layer by surface optical phonons and
suggested that there is generally an inverse relation between
surface-optical-phonon-limited mobility and the static dielectric
constant: the higher the dielectric constant, the lower the
surface-optical-phonon-limited mobility.
Metal gate electrode
A metal gate electrode has several advantages compared to the
doped polysilicon gate used almost exclusively today. Gate capacitance
degradation due to the depletion of the doped polysilicon gate
typically accounts for 0.40.5 nm of the equivalent-oxide thickness of
the total gate capacitance at inversion. This is a substantial amount,
considering that a gate equivalent oxide of less than 1.5 nm
(at inversion) is required for sub-50-nm CMOS. The thermal
instability of most high-k gate dielectrics may require the
use of a low thermal budget process after the gate dielectric
deposition. While junction activation may be performed prior to gate
dielectric deposition, the high-temperature gate polysilicon activation
step necessarily occurs after the gate dielectric formation. A further
potential benefit of metal gate electrodes is the elimination of
carrier mobility degradation due to plasmon scattering from the gate
electrode. The plasmon frequency of a highly conductive metal electrode
is too high to interact with the carriers in the inversion layer
[28].
From a device design point of view, the most important consideration
for the gate electrode is the work function of the material. While the
polysilicon gate technology has somewhat locked in the gate work
functions to values close to the conduction band and the valence band
of silicon,1 the use of a metal gate
material opens up the opportunity to choose the work function of the
gate and redesign the device to achieve the best combination of work
function and channel doping. For bulk or partially depleted SOI,
because of the requirements on the threshold voltages and the need to
use heavy dopants to control short-channel effects, the most
suitable gate work-function values are still close to the
conduction and valence bands of silicon. A mid-gap work function
results in either a threshold voltage that is too high for
high-performance applications, or compromised short-channel effects,
since the channel must be counterdoped to bring the threshold voltage
down. For double-gate FETs (see the section on double-gate FET
electrostatics), because the short-channel effects are controlled by
the device geometry, the threshold voltage is determined mainly by the
gate work function. Therefore, the choice of the gate electrode is
particularly important for the double-gate FET. For example, for
symmetric double-gate FETs (SDG), a gate-electrode work function ±250
mV from mid-gap is suitable. The section on double-gate FET
electrostatics expands on this discussion.
While there are plenty of metal choices that may satisfy the
work-function requirements [3032], other device and integration
considerations narrow down the choices significantly. The
requirements of a low gate-dielectric/silicon interface state density
and low gate-dielectric fixed charges imply that a damage-free metal
deposition process (e.g., CVD instead of sputtering) is required. At
the same time, the deposition process must not introduce impurities
(e.g., traces of the CVD precursor materials) into the gate stack. The
thermal stability of the metal electrode must at least withstand the
thermal anneals required to passivate the silicon/gate-dielectric
interface (e.g., forming-gas anneal) after the metal deposition, as
well as the thermal processing of the back-end metallization
processes. Furthermore, it is desirable to have a low resistivity (at
least similar to conventional silicides such as CoSi2 and
TiSi2), although this requirement may be relaxed by
strapping the gate electrode of the proper work function with a lower-
resistivity material on top.
In principle, a single-metal electrode is advantageous, since it avoids
many potential problems of alloyed metals such as composition
uniformity control and phase separation. On the other hand, alloying
provides flexibility in choosing the desired material properties. The
gate-electrode work-function issue is further complicated by the fact
that the work function measured in vacuum (values reported in most
materials data books) is different from the work-function value when
the metal is in contact with a dielectric. In general, a dipole
forms at the metal/dielectric interface which alters the effective work
function of the metal/dielectric combination [33, 34]. The choice
of appropriate metal electrode is then also dependent upon the choice
of the gate dielectric: SiO2 or high-k
material.
One of the promising process integration schemes for metal gate is the
replacement-gate technology [35]. In this process, a dummy gate
material (e.g., polysilicon) is used for forming the self-aligned
gate-to-source/drain structure. Subsequently, the dummy gate material
is removed and replaced with the desired gate dielectric and electrode
[35]. Alternatively, the metal gate electrode may be etched in a way
similar to the polysilicon gate technology. However, the learning curve
is long and steep for developing the same (or a better) level of etch
selectivity and profile control for the metal gate compared to the
polysilicon gate. In addition, thermal stability issues (from the
source/drain dopant activation anneal) must be addressed. In either
case, if metals with two different work functions are employed for
n-FET and p-FET, respectively, the integration of n-FET and p-FET
in a CMOS process remains a challenge, since 1) the deposition of the
metals for n-FET and p-FET must be done separately, and 2) one must
find a way to strap the two different metals in a compact way to
connect the n-FET and p-FET gates. It is desirable to
circumvent these two requirements and find a way to alter the work
function of the metal by some simple means (for example, one that
requires only a block mask). Two interesting approaches, yet to be
proven through more rigorous examination, have been reported. In the
first approach, a single metal (molybdenum) is deposited, and the work
function is altered using ion implantation of nitrogen into the metal
[36]. It is not clear how the nitrogen influences the work function,
and how thermally stable this material is. On the other hand, ion
implantation is an attractive process, since it requires only a single
metal deposition and a photoresist block mask. In another approach,
metals are intermixed to obtain the desired work function [37]. Two
metals (Ti and Ni) are sequentially deposited on the gate dielectric,
followed by selective etching of the top metal, leaving the bottom
metal at desired locations. After thermal annealing, the metal on the
top migrates to the metal/gate-dielectric interface and alters the work
function locally.
Ultimately scalable FETthe double-gate FET
Device concepts
The double-gate FET (DG FET) shown in Figure 4,
part (a) was proposed in the early 1980s
[38]. The concept has been gradually explored both experimentally
and theoretically by many groups [3946]. The Monte Carlo and
drift-diffusion modeling work by Fiegna et al. [41] and Frank et
al. [42] clearly showed that a DG FET can be scaled to a very short
channel length (25 to 30 nm) while achieving the expected performance
derived from scaling. While the early work focused on the better
scalability of DG FET, recent work suggests that the scalability
advantage may not be as large as previously envisioned [10, 47],
although the carrier transport benefits may be substantial. In this
section, we first discuss the advantages of DG FET, followed by device
design requirements, and conclude with a summary of latest hardware
results.
Figure 4
The salient features of the DG FET (Figure 4) are [48] 1) control of
short-channel effects by device geometry, as compared to bulk FET,
where the short-channel effects are controlled by doping (channel
doping and/or halo doping); and 2) a thin silicon channel leading to
tight coupling of the gate potential with the channel potential. These
features provide potential DG FET advantages that include 1) reduced 2D
short-channel effects leading to a shorter allowable channel length
compared to bulk FET; 2) a sharper subthreshold slope (60 mV/dec
compared to >80 mV/dec for bulk FET) which allows for a larger gate
overdrive for the same power supply and the same off-current; and 3)
better carrier transport as the channel doping is reduced (in
principle, the channel can be undoped). Reduction of channel doping
also relieves a key scaling limitation due to the drain-to-body
band-to-band tunneling leakage current. A further potential advantage
is more current drive (or gate capacitance) per device area; however,
this density improvement depends critically on the specific fabrication
methods employed and is not intrinsic to the device structure.
The most common mode of operation of the DG FET is to switch the two
gates simultaneously. Another use of the two gates is to switch only
one gate and apply a bias to the second gate to dynamically alter the
threshold voltage of the FET2 [49, 50]. In
this mode of operation, called ground plane (GP) or back-gate
(BG), the subthreshold slope is determined by the ratio of the
switching gate capacitance and the series combination of the channel
capacitance and the nonswitching gate capacitance, and is generally
worse than the DG FET. A thin gate dielectric at the nonswitching gate
reduces the voltage required to adjust the threshold voltage and
preserves the drain-field-shielding advantage of the double-gate device
structure. However, a thinner gate dielectric also means extra
capacitance that does not contribute to channel charge for switching.
Since the back-gate FET is very similar to a single-gated SOI FET with
an adjustable threshold voltage [49], we focus our discussion here on
the DG FET configurations in which both gates are switched.
Double-gate FET electrostatics
The double-gate device structure allows for termination of the
drain electric field at the gates and leads to a more scalable FET. To
evaluate the scalability of FETs, the concept of the scale length
for a MOSFET is useful [10, 43, 44, 51]. The electrostatic
potential of the MOSFET channel can be approximated by
analytically solving the 2D LaPlace equation using the superposition
principle (with suitable boundary conditions), and the short-channel
behavior can be described by a characteristic scale length, .
Table 3.3 of [52] lists the generalized scale length derived by Frank
et al. [10, 51] and the simpler, but less accurate, scale length
derived by Suzuki et al. [44]. The minimum gate length is jointly
determined by the scale length and by the amount of 2D
short-channel effects one can tolerate in an application. The 2D
short-channel effects can range from increased off-current due to
threshold-voltage roll-off, drain-induced barrier lowering (DIBL), and
degraded subthreshold slope, to degraded output resistance. Figure 5 of
[10] illustrates the trend of these 2D effects as the channel
length is decreased with respect to the scale length of the MOSFET.
Manufacturing tolerances put a premium on the minimum channel length.
With typical tolerances of 2030% gate-length variation, an
L/ of 1.5 is required.3
Conventional short-channel-effect theory [23] correlates the junction
depth to the short-channel effects. In the case of the DG FET,
consideration of junction depth is moot, since the 2D electrostatic
behavior is controlled by the thickness of the silicon channel instead
of the junction depth. However, the steepness of the source/drain
junction is still an important consideration, as in the case of bulk
FETs [47]. Figure 5
illustrates the threshold-voltage roll-off characteristics of the DG
FET with lateral junction profile gradients of 2, 4, and 6 nm (Gaussian
analytical profile). It is clear that a steep junction gradient
commensurate with the channel length is required.
Figure 5
Comparing scale lengths for the DG FET, ultrathin silicon SOI FET, and
bulk devices, as well as considering other leakage mechanisms (such as
tunneling leakages), leads to the conclusion that the DG FET can be
scaled up to 50% further than the bulk FET for some applications
[10]. Illustrations of the threshold-voltage roll-off behavior (an
example of 2D short-channel effects) comparing DG FET,
ultrathin-silicon FET, and ground-plane FET (in which the bottom
gate of a DG FET is tied to a fixed bias) can be found in [10,
50] and many other references in the literature and are not repeated
here. Similar analyses based on on-current and off-state subthreshold
leakage current can be found in [53, 54]. Simply put, the better
scalability of DG FET can be used to achieve a shorter channel length
using the same gate-oxide thickness, or the same channel length
using a thicker gate oxide.
We now turn our discussion to the relationship of the channel
inversion charge and the gate voltage. The analytical model of Taur
[55] and the numerical modeling of Ieong et
al.4 [56] form the basis for much of this
discussion. The gate work function of the two gates can be the same
(the SDG, with a symmetric energy-band diagram in the direction normal
to the gate electrode) or different (the ADG, with an asymmetric
energy-band diagram in the direction normal to the gate electrode), as
illustrated in Figure 4. In the subthreshold region, where there is
negligible inversion charge, the silicon channel is fully depleted, and
the energy bands closely follow the gate bias in a one-to-one
relationship. For the SDG, the bands remain flat throughout the
subthreshold region, since there is little or no depletion charge. Once
inversion charge begins to build up, the mobile charges screen the gate
field and the gate voltage is dropped primarily in the inversion layer.
The threshold voltages for the SDG and the ADG with various gate work
functions are plotted in Figure 6 as a function of the silicon
channel thickness. The threshold voltage of the SDG has a small channel
thickness dependence ( 6 mV/nm), while the threshold voltages of the
ADG and BG have a larger dependence on channel thickness ( 2832
mV/nm) because of the asymmetric band diagram.4 At the same
off-current (same integrated mobile charge in the subthreshold region),
the surface potential of the ADG is higher than that of the SDG. In
other words, the surface of the ADG is inverted more than the surfaces
of the SDG at the same off-current condition. This is because the SDG
has a fairly constant charge density throughout the silicon film, while
the ADG charge density peaks toward one of the surfaces. This
surface-potential difference is carried forward throughout the entire
gate-voltage range well into the fully inverted condition in which
both surfaces of the ADG are in inversion [55].
Figure 6
For the SDG with a channel thickness greater than 5 nm, there
are two distinct charge-density peaks near the two surfaces, and
the two inversion layers are basically independent of each other (see
the inset of Figure 6) [56]. For a channel thickness less than 5 nm,
the two inversion charge peaks begin to merge. For the ADG, the
inversion charge forms first at the surface, where the gate work
function is lower. Although the ADG has only one predominant channel,
the total integrated charge is more than half that for the SDG. The
SDG-to-ADG charge ratio is about 2× for a thick silicon channel
and approaches 1× for very thin channels. The physics behind this
observation is best explained by using the capacitive coupling model of
Taur [55]. The ADG gate with a higher gate work function (say,
back-gate) induces inversion charge at the opposite surface through
the capacitive coupling of the series combination of the
back-gate dielectric (Coxb = ox/toxb) and the channel
capacitance (CSi =
Si/tSi). The total gate
capacitance of the ADG is therefore more than the front-gate dielectric
capacitance (Coxf =
ox/toxf) alone. The amount
of back-gate coupling obviously depends on the back-gate dielectric
thickness and the silicon channel thickness. The thinner the silicon
channel with respect to the gate dielectric, the more effective the
coupling, and the closer the SDG-to-ADG charge ratio is to unity.
The quantization in the channel introduces several interesting effects,
which are discussed here. The threshold-voltage increase due to quantum
effects is illustrated in Figure 6 [50] using a simple
particle-in-a-box approximation (more accurate in SDG than BG or ADG):
Vt = (h2/4qm*tSi2)( tSi/tSi)
[57], where Vt is the threshold voltage,
h is Planck's constant, q is the electronic
charge, m* is the carrier effective mass, and
tSi is the channel thickness. The quadratic
increase of the threshold voltage with decreasing channel thickness
(steep rise of dVt/dtSi)
means that channel thickness much below 5 nm will be almost inpractical
to manufacture unless an atomically precise method of defining the
channel thickness is found. In a more realistic approximation, one
solves the one-dimensional Schrödinger equation and incorporates
the solution self-consistently in a coupled 2D Poisson and continuity
equation solution (with appropriate boundary conditions) [56]. Figure
6 compares the threshold voltages computed from a classical and
quantum-mechanical description of the channel for several cases of
gate-work-function values. For gates with a symmetric work
function, the work function of the gates should be about 250 to 350 mV
above/below mid-gap for n/p channels, respectively, for
high-performance applications. Another strategy for setting the
threshold voltage is to employ a set of asymmetric work-function gates:
a) a front gate with a work function close to the conduction band
(n+ gate) and a back gate with a work function close
to the valence band (p+ gate); and b) a front
gate with a work function close to the conduction band (valence band)
for n-FET (p-FET) and a back gate with a common mid-gap work function
(for both n-FET and p-FET). Both of these approaches provide a
symmetric threshold voltage for both n-channel and p-channel DG
FETs. Obviously, the work-function requirement for the ADG case also
depends heavily on the gate dielectric and silicon channel thicknesses.
A typical gate dielectric (1 nm) and silicon channel (510 nm)
thickness required for a sub-50-nm DG FET gives a threshold voltage
that is too high if the n+/p+ ADG (case a) is
used, while case b above has a more appropriate threshold voltage
for high-performance applications.
Figure 74
[56] shows two noteworthy effects from quantum behavior: 1)
threshold-voltage shifts due to quantum effects are larger in DG FETs
with asymmetric work-function gates (BG FET and ADG) compared to SDG
(see also the values shown in Figure 6); and 2) threshold-voltage
roll-off is worse when quantum effects are included. The first
observation can easily be understood, since the band diagrams of the BG
FET and the ADG are asymmetric (with a high normal electric field),
which forces the charge carriers toward one of the surfaces. The second
observation is more subtle: As the channel length is shortened,
classical (Poisson equation) short-channel effects reduce the normal
electric field, thereby reducing the quantization effects (opening
up the channel potential). This effect is, in fact, more apparent in
the BG FET. A similar effect can also be observed in the ADG
[56].
Figure 7
Taking into account the short-channel design considerations in [50,
53], the design space for a sub-20-nm DG FET is summarized as
follows (Figure 8): a
channel thickness of 510 nm, a gate dielectric of less than 1 nm
(equivalent electrical thickness), a source/drain doping profile with
less than 4 nm/decade lateral gradient, a highly doped source/drain
fan-out structure to reduce series resistance, and a set of
self-aligned front and back gates to minimize gate to source/drain
overlap capacitances. The alignment of the gates refers to
both front and back gates aligned with respect to each other as
well as to the source/drain doping.
Figure 8
The requirement for a set of self-aligned gates is underscored by the
study in [46], which we summarize here. Two scenarios for a
non-self-aligned DG FET are 1) misaligned (offset) top and bottom
gates of equal size (minimum bottom gate) [45], and 2) an oversized
bottom gate to ensure gate overlap of the source/drain [58] [see
Figure 9, part (b)]. The
minimum bottom-gate approach minimizes overlap capacitance. The energy
barrier for carrier injection from the source for the misaligned gate
[offset from the source case; see Figure 9, part (b)] is
significantly higher than the nominal, aligned case. The higher
source-side energy barrier limits the on-current and degrades the
gate delay. The gate-delay degradation amounts to more than a full
generation of device scaling performance gain [Figure 9, parts
(c)(d)]. A fully self-aligned fabrication process (both gates
aligned with each other and with the source/drain doping) is therefore
required for the highest performance benefits. If misalignment is
unavoidable, the study in [46] indicates that it is more important to
have a large enough bottom gate to ensure adequate gate-to-source
overlap in order to attain a high on-current than to minimize
parasitic capacitance.
Figure 9
Double-gate FET carrier transport
So far, most of the discussion has been focused on the
electrostatics of the FET. We now turn our attention to device design
issues including carrier transport. Because of the extremely small
depletion capacitance, the gate-to-channel potential coupling is not
de-rated by the capacitor divider between the gate-oxide capacitance
and the depletion capacitance. The subthreshold slope is therefore 60
mV/decade in the absence of short-channel effects (Figure
10). For the same off-current, the
threshold voltage can be set about 60 mV lower than bulk FETs, thereby
providing more gate overdrivean important advantage as the power
supply is reduced.
Figure 10
Carrier mobility for the DG FET with a thin silicon channel deserves
further discussion. Early work on single-gated SOI [59] demonstrated
that carrier mobility in SOI follows a universal mobility curve
similar to that of bulk FETs [60] when the effective field is
properly taken into account in a single-gated SOI FET. Recent work
using a double-gated FET structure has also indicated that the carrier
mobility follows the same universal mobility behavior for both
front-gated and back-gated channels, provided that the gate dielectrics
on both sides are of high quality and the effective field is
properly accounted for [61] (see Figure
11). However, early reports of
carrier mobility measurement for very-thin-silicon-channel FETs
(single-gated SOI FET) show a significant degradation of mobility
as the silicon channel thickness is decreased below 20 nm [6264].
Subsequent theoretical calculations suggest a complex behavior of
mobility as a function of silicon channel thickness attributed to
phonon scattering in the thin confined silicon channel [6567]. Both
findings raise legitimate concern for the DG FET [10], since the thin
silicon channel required for control of short-channel effects may
result in poor carrier transport. Recent experimental mobility
measurements [68] for thin-silicon-channel FETs down to 5 nm as a
function of carrier density depict a more complete picture. First, the
early experimental results [6264] may be tempered by a poor
back-channel oxide because the thin silicon channel of the
single-gated SOI FET was obtained by oxidation thinning from a
SIMOX wafer which is known to have a substandard back-channel oxide
interface that can degrade carrier transport. In addition, the mobility
at low inversion carrier concentration (or, equivalently, low effective
electric field) was reported. Figure
12 summarizes the data to date
[6264, 68]. The electron mobility degradation at channel
thicknesses below 20 nm is clearly observable at a low inversion
carrier density (Ninv = 1012
cm2). However, at the higher inversion carrier density
(Ninv = 1013 cm2)
that is important for nanoscale CMOS, the electron mobility is
less sensitive to the silicon channel thickness. While the above
results are for single-gated SOI FETs, similar results have been
obtained for double-gate operation [69]. The degradation at low
inversion carrier density appears to be related to phonon scattering,
suggested by the larger degradation seen at lower temperatures [69].
Figure 11
Figure 12
Carrier transport in the DG FET with an undoped channel is superior to
that in conventional bulk FETs for two reasons: reduced Coulomb
scattering due to fewer ionized dopants in the undoped/low-doped
channel, and reduced surface roughness scattering due to a lower
surface electric field, as illustrated in Figure
13. In bulk FETs, channel doping
is employed to set the threshold voltage, and halo or pocket dopings
are employed to control the short-channel effects. These ionized
depletion charges contribute appreciably to the surface electric field.
In a DG FET with an undoped channel, there is no ionized
depletion charge; therefore, the surface electric field is
contributed entirely from inversion carriers (application of
Gauss's law). Even though the carrier mobility follows the
universal mobility curve [Figure 13(a)], at the same gate
overdrive the carrier mobility can be significantly higher [Figure
13(b)] because the effective field is lower at the same gate
overdrive. Figure 13(a) also shows the mobility and range of
effective field for a typical bulk FET. While the bulk FET operates at
an effective field above 1 MV/cm, the DG FET with an undoped channel
operates at around 0.5 MV/cm, thereby improving the mobility by
almost two times. This improved transport potentially provides the DG
FET a better CV/I metric because although the capacitance
C is doubled in a DG FET, the current I is
improved by more than two times because of the better transport.
Figure 13
Device fabrication
Fabrication of the DG FET is difficult. Early experimental work
began with non-self-aligned DG FET structures for a first demonstration
of the device principles [39, 40, 45, 58, 70], the most advanced
being the work of Tanaka et al. [45, 58] where non-self-aligned DG
FET circuits were demonstrated. In this section, we focus on the more
recent development of the self-aligned DG FET because of its better
performance, as described in the section on double-gate FET
electrostatics.
In general, the DG FET may be fabricated with any of the three
orientations depicted in Figure
14 [9, 48, 71]. The planar
structure (Type I) has the advantage of better silicon channel
thickness uniformity because film thickness in the plane of the
wafer has the best uniformity and controllability. However, the
fabrication of a back gate and a thin gate dielectric underneath a
single-crystal silicon channel is difficult. In addition, accessing the
bottom gate from the top surface for device wiring is not
straightforward, and may have a negative impact on device density. The
nonplanar structures (Types II and III) allow for easier access and
formation of both gates (or a wraparound gate) on crystalline channels
with thin gate dielectrics. On the other hand, the channel thickness is
defined by lithography and patterning techniques (e.g., reactive ion
etching), and may therefore have poorer uniformity than planar films.
The device electrostatic design (see the section on double-gate FET
electrostatics) requires that the channel thickness be about 1/3 to 1/4
of the channel length. Historically, the smallest dimension patterned
on a chip is the gate length. For the nonplanar DG FET structures, the
smallest dimension patterned must be considerably smaller than the
channel length, which is a major departure from conventional processes.
Carrier transport along etched surfaces with different crystallographic
orientations [e.g., (110) surface for a notch <110> wafer] may
degrade performance, although there is little data to date to support
these assumptions. While the topography of Type II and III nonplanar
structures may raise some fabrication concerns, it should be noted that
this topography problem may not be as severe as it appears, because the
height of the vertical structure can be made similar to that found in
the gate stacks of planar structures.
Figure 14
The buried gate of the planar, Type I device structure has been
demonstrated using two techniques: selective epitaxial growth [71,
72], and wafer bonding and layer transfer [73, 74]. We illustrate
these approaches using the two examples below.
Figure 15 shows the
fabrication sequence for the first self-aligned DG FET. The fabrication
process utilizes a planar CVD-deposited film in a dummy gate stack as a
placeholder for the silicon channel. This dummy gate stack is etched
out (forming a tunnel) and then replaced by a single-crystal silicon
channel formed by selective epitaxial silicon growth through the tunnel
with a seed from one side of the tunnel to prevent grain-boundary
formation in the middle of the silicon channel. Excess epitaxial growth
is polished away by chemicalmechanical polishing (CMP). Since the
selective epitaxial silicon growth is confined to the dielectric tunnel
[7579], the thickness of the silicon channel is determined by the
previously deposited CVD film (which has good thickness uniformity).
Despite the seemingly difficult task of filling a long, thin tunnel,
experimental evidence shows that large aspect ratios of more
than 220:1 can be filled [79], well in excess of the requirements for
the electrostatic device design for short-channel effects. After the
formation of the silicon channel and subsequent CMP planarization to
remove excess grown silicon, the source/drain is formed using
ion implantation that is self-aligned to the dielectric dummy
gate stack. The dummy gate stack is then removed, leaving a silicon
channel in the form of a suspended bridge between the source
and drain. Finally, the gate dielectric is formed, and the
gate material is deposited and patterned. This
double-replacement process provides a means to form a
wraparound gate over the single-crystal silicon channel and
self-align the source/drain doping to the channel and the gates.
Both the silicon channel and the gate stack are formed by replacing
a dummy material/structure with the final material/structure. The
device characteristics reported suffer from high series resistance,
possibly a result of non-optimized source/drain doping and silicide
process. While this process possesses many positive attributes, as
outlined above, it does have several shortcomings: 1) lack of an
independently adjustable dielectric spacer thickness to separate the
gate and the source/drain for parasitic capacitance reduction; 2)
difficulty in doping and siliciding the underside of the wraparound
gate; 3) the fact that the front and back gates cannot be independently
biased; and 4) source/drain silicides that are not self-aligned to the
gates. Some of these problems are addressed in the process proposed
in [73, 80].
Figure 15
Parts (a)(e) of Figure 16
show the fabrication sequence of a triple self-aligned planar
double-gate FET using wafer-bonding and layer-transfer techniques
[74]. The starting substrate is an unpatterned bonded wafer with
the doped polysilicon back gate and thin gate dielectric in place
beneath the silicon channel. The bonded interface is 350 nm below the
devices [Figure 16(e)]. Locating the bonding interface below the
active device region is advantageous for two reasons: 1) any
imperfections of the bonding process (stress fields, minor voids,
and embedded particles) will not affect the active device; 2)
any subsequent fabrication steps that etch to a level below
the silicon channel will not expose the bonding interface, thereby
avoiding the possibility of delamination of the bonded substrate. The
undoped channel was initially thinned to 2030 nm using oxidation,
achieving uniformity to within 1 nm over much of an 8-inch-diameter
wafer. The bottom gate and the top gates are separately patterned and
accessible on opposite sides of the device width direction [Figure
16(f)]. CMP is employed to planarize the surface every time any
topography is generated by patterning (etch). This eliminates the
possibility of stringers and provides a robust process. For
planarization at the front-end process, planarity of nanometer-scale
accuracy and uniformity is required. Using custom-designed chemistry
and processes, 23-nm topography over an 8-inch wafer is achievable.
In order to provide access to the bottom gate and provide
self-alignment of the bottom gate to the top gate and the source/drain,
the source/drain fan-out regions are implemented as doped silicon
sidewalls which are subsequently silicided. The sidewall source/drain
is used as a self-aligned etch mask to center the length direction
of the bottom gate with respect to the top gate and the sidewall
source/drain. The undercut bottom gate is passivated with nitride
dielectric. Contact to the sidewall source/drain is made by a
tungsten plug to the source/drain well, followed by CMP to replanarize
the surface. The difficult requirements of this fabrication approach
include the following: 1) a precise and controllable back-gate undercut
process, 2) a back-gate dielectric that is in place already during the
wafer-bonding process and all subsequent process steps, and 3)
a carefully controlled sidewall source/drain silicon deposition/etch
and subsequent anneal, recrystallization, and dopant profile control
via ion implantation of the sidewall and silicidation. Using a dry-etch
technique, the back-gate undercut and sidewall source/drain etch can be
better controlled. Device scaling requires that the back-gate
dielectric be as thin as the front-gate dielectric. The use of a
high-k gate dielectric is also a possibility. Both scenarios
demand a low-temperature wafer-bonding process (preferably well below
900°C) for the back-gate dielectric to be viable.
Figure 16
Figure 16(g) shows the measured transconductance of a
125-nm-gate-length n-FET fabricated using the aforementioned
triple-self-aligned DG FET process. The back gate (BG) is
n+-doped polysilicon, and the front gate (FG) is
p+-doped polysilicon. The double-gate (DG and DG*) mode of
operation provides a transconductance that is slightly more than
the sum of the FG and BG modes (single-gated, ground-plane
operation). An ideal subthreshold slope of 6062 mV/decade is measured
for DG-mode operation. Mobility measurement of the front-gated
channel and back-gated channel shows electron and hole mobilities
following the universal mobility curves, indicating good transport
properties for these bonded silicon channels with thin front- and
back-gate dielectrics [74]. DG FET circuits are also demonstrated in
this work. Ratioed inverters are demonstrated using a DG FET as the
load device. The back gate of the load DG FET is employed to control
the load current and adjust the input bias of the load device. By using
the front gate and the back gate of the same DG FET as two separate
inputs of the inverter circuit (again with another DG FET as a
load), a compact two-input NOR is demonstrated [74]. The
tungsten-plugged source/drain well can be shared in a circuit
layout to achieve high device density.
The most successful vertical structure as a high-performance device is
the Type III structure, also known as the FinFET [81], since the
silicon channel protrudes from the silicon wafer surface like a fin.
Figure 17 illustrates the
fabrication sequence and device structures. The FinFET is essentially a
scaled-down version of the DELTA device reported by Hisamoto et al.
[82]. Two types of FinFET are reported: 1) a gate-first process in
which the source and drain are formed after the formation (patterning)
of the gate stack [8386]; and 2) a gate-last (or replacement-gate)
process in which the source and drain are formed before the formation
of the gate stack [81, 87].
Figure 17
The fabrication of the FinFET begins with the patterning and etching of
a thin fin on an SOI substrate using a hard mask which is retained
throughout the fabrication process. The thickness of the fin will be
the silicon channel thickness of the DG FET. As discussed earlier, the
fin thickness is smaller than the gate length; thus, either
electron-beam lithography [81, 84, 86, 87] or optical lithography
with extensive linewidth trimming [85] is used to pattern the thin
fin. For the gate-first process, the fabrication steps after the fin
formation are analogous to the fabrication steps of the conventional
bulk FET: After the gate oxide is grown, the gate polysilicon is
deposited, patterned, and etched. A sidewall spacer is formed next to
the gate. Source/drain and extension implants can be performed before
and/or after the gate spacer, using angled implants [85]. A selective
epitaxial growth of silicon or germanium from the fin surfaces forms
the source/drain fan-out that reduces series resistance [85, 86].
Self-aligned silicide can be formed on the source/drain and gate, as in
conventional bulk FETs. For the gate-last process, the source/drain
is formed immediately after the fin patterning. Doped polysilicon
or polycrystalline SiGe is deposited on the fin, followed by
lithographic patterning of the source/drain fan-out pads with a
thin slot between the source and drain. This distance between the
source and drain determines the gate length. The slot length is further
reduced by a dielectric sidewall spacer. Then the gate oxide is grown,
and the gate polysilicon (or polySiGe) is deposited and patterned. A
potential device density improvement for FinFET using direct etch and
sidewall image transfer to generate fins is illustrated in
Figure 18. The sidewall
image transfer technique can reduce the fin pitch by a factor of 2
using the same lithography pitch. The fin pitch must be smaller than
the fin height to provide more effective device width than a planar
single-gated FET.
Figure 18
The difficulties of fabricating the FinFET DG FET include 1)
variability of the fin thickness (and hence silicon channel thickness);
2) highly selective RIE and long over-etches which are required to
pattern the gate polysilicon that wraps around the fin (for the
gate-first process), and to create the sidewall spacers (for both the
gate-first and the gate-last processes); and 3) a difficult integration
scheme for the source/drain of complementary FETs (n-FET and p-FET on
the same wafer) for the gate-last process. The apparent difficulty in
handling the topography of a vertical structure is manifested in
the RIE and over-etch issues mentioned above. While tall fins may
provide an improvement in density, the fin height must be contained
within reasonable limits. The variability of the fin thickness deserves
special attention. The variability in fin thickness arises from several
sources: linewidth variation of the lithographic process, linewidth
variation of the fin etching process, and line-edge roughness of the
lithographic process [88], all of which are translated to the
final fin dimensions. There is no systematic data yet that correlates
fin thickness to processing conditions. The sidewall image transfer
technique produces a set of fin surfaces that have correlated
roughness; this is a different situation from the direct-etched-fin
case, in which the fin surface undulations on either side of the fin
are uncorrelated.
Both the gate-first and the gate-last FinFET processes have produced
devices with excellent IV characteristics (Figure
19). The current drive and the
CV/I of the FinFET rival those of the best conventional bulk
devices [13, 14], albeit with a threshold voltage that has to be
centered. Despite the unconventional device structure and topology, the
minimum gate length achieved is among the shortest [81, 85, 86],
including those of conventional FETs [13, 14]. Since the silicon
channel thickness is determined by the patterned fin, experimental data
comparing the short-channel effect of different silicon thicknesses is
readily obtained. The experimental results of Huang et al. [81] and
Kedzierski et al. [85] both corroborate the short-channel
electrostatic device design and scale length analyses described above.
While the carrier-transport properties along the etched fin surfaces
have been a concern, the good current drive exhibited by the FinFET DG
FET indicates that there is no significant degradation.
Figure 19
Challenges ahead
While there has been tremendous progress both in understanding the
device physics of the DG FET and in fabricating it, significant
challenges remain to be met before this new device structure is
ready for manufacturing. Some of these open issues have no obvious
solution at the moment.
First and foremost is the need to set the threshold voltage and to
achieve multiple threshold voltages on the same chip. The section on
double-gate FET electrostatics outlined the requirements on the work
function of the gates to achieve the required threshold voltages for
SDG and ADG. At this moment, no material satisfies the requirements for
SDG except possibly a midgap-work-function material such as tungsten or
CoSi2, with doping of the silicon channel to lower the
threshold voltage to the desired level. For ADG, the combination of
n+ (p+) polysilicon with tungsten or
CoSi2 (materials having a mid-gap work function) for the
n-FET (p-FET) gates provides one possible solution, although an
integration scheme to achieve self-aligned gates is still lacking.
Adjusting the threshold voltage for multiple threshold voltages on
the same chip is a different challenge from setting the threshold
voltage. For ULSI systems, it is typically necessary to provide a menu
of devices with different threshold voltages to allow for the
optimization of performance and power consumption. The ability to tune
the threshold voltage by about 150 mV is often required. The obvious
solution is to dope the channel of the DG FET in analogy with bulk
FETs. For devices with geometries in the sub-50-nm gate-length regime,
a channel doping of the order of high 1018
cm3 is required. This tends to reduce the benefit of the
DG FET, since the advantages of an undoped channel (higher mobility,
reduced discrete dopant fluctuation effects) will diminish or even
vanish. Recent work on gate-material work-function tuning [36] by ion
implantation offers interesting opportunities, although much still has
to be proven.
Because of the thin silicon channel, the series resistance of the DG
FET is of particular concern [50]. Some form of raised source/drain
process would be required in order to achieve a source/drain fan-out.
The results given in [85] illustrated the efficacy of the solution.
Growing selective epitaxial silicon on thin (<15-nm) silicon
substrates (or fins) is still difficult, because the thin starting
silicon tends to break up during or prior to the epitaxial growth. In
addition, as the silicon channel thickness is reduced to less than
10 nm, optimization of the series resistance and parasitic capacitance
of the overlap region may prove difficult.
Uniformity of the silicon channel is an important concern, especially
when the silicon channel is reduced to less than 10 nm to satisfy
short-channel control requirements. Silicon channel thickness tolerance
may be translated to an equivalent gate-length tolerance by
considering a constant scale length [10, 46]. A 15% variation in
silicon channel thickness approximately translates into a 10%
gate-length variation. This tolerance is added quadratically to the
overall tolerances. Maintaining a thin, uniform silicon channel
thickness remains a major manufacturing obstacle.
Finally, most double-gate FET devices are structurally different from
conventional FETs. Parasitic capacitance (gate-to-source/drain
capacitance) critically depends on the device structure. There is no
fundamental barrier to achieving a back-gate parasitic capacitance that
is as small as the front-gate parasitic capacitance, especially for the
FinFET DG FET. While most device results to date have focused on the
current drive and static IV characteristics, more work has
to be done to characterize and optimize the parasitic capacitance.
Higher mobility
The section on double-gate FET carrier transport has already
pointed out the importance of a low-doped channel for carrier transport
in DG FET. Another avenue to achieving a higher carrier mobility and
saturation velocity is through the choice of material for the FET
channel. Fischetti and Laux [89] compared the performance of several
semiconductors that have high carrier mobilities and saturation
velocities, including Ge, InP, InGaAs, GaAs, and several others. They
concluded that materials which provide a significantly higher carrier
mobility give only a moderate performance advantage over a
lower-mobility material such as silicon. That work succinctly pointed
out that under non-equilibrium, high lateral field transport, carrier
mobility is not the only determinant of performance (using metrics such
as transconductance). The band structure, which determines the density
of states (i.e., the inversion capacitance,
Cinv) [90] and the carrier scattering rates at
high carrier energies are just as important as the carrier
mobility: a low-carrier-energy, quasi-equilibrium quantity.
Furthermore, in the limit in which carriers are transported
ballistically, many other factors combine to determine
performance [91]. On the other hand, a higher carrier mobility
does provide some moderate performance gains, albeit not in
proportion to the mobility values [92].
Strained-silicon FETdevice concepts
It has been known for some time that carrier mobility in silicon
under biaxial tensile strain is enhanced [9398]. The theory of
mobility enhancement for strained silicon is still evolving [96]. The
most commonly cited reason for electron mobility enhancement in
strained silicon is that under the biaxial tensile strain, the sixfold
degeneracy of the conduction band of silicon is lifted, raising the
higher-effective-mass fourfold-degenerate ellipsoids and lowering the
lower-effective-mass twofold-degenerate ellipsoids. This has the effect
of keeping most of the carriers in the lower-energy,
lower-effective-mass valleys and reducing the intervalley scattering.
In the valence band, the biaxial tensile strain lifts the
heavy-hole/light-hole degeneracy at the point, resulting in a
smaller in-plane transport effective mass due to band deformation and
reduced intervalley scattering.
The use of strained silicon provides a plausible tradeoff between
moderate levels of performance enhancement over silicon and ease of
fabrication and integration with silicon (as compared to other
higher-mobility materials such as Ge, InGaAs, GaAs, and InP). Recent
work has provided encouraging experimental evidence that
introducing the biaxial tensile strained silicon through a layer of
relaxed SiGe may provide adequate performance gains for incorporation
into conventional CMOS technologies. This section summarizes the recent
results, focusing on strained silicon on relaxed SiGe.
Materials and device fabrication
The tensile strain can be introduced in several ways. Local strain
caused by the thermal mismatch of silicon and the isolation materials
[such as shallow-trench isolation (STI)] has been shown to introduce
enough strain to alter device characteristics [97]. A more general
approach is to introduce biaxial tensile strain by growing a thin layer
of epitaxial silicon on a material with a slightly larger lattice
constant, such as relaxed SiGe [95, 98, 99]. The strained silicon
must be relatively thin in order to prevent relaxation and strain
relief through dislocations. The underlying relaxed SiGe serves as an
anchor to constrain the lattice of the strained silicon on top and has
no beneficial electrical role otherwise to first order. Compared to a
closely related approach of using a pseudomorphic layer of SiGe on
silicon, where the carrier transport is in the compressively stressed
SiGe layer (see for example [100]), strained silicon on relaxed SiGe
has the advantage of having silicon as the top surface. The silicon
provides an easy means of forming a good gate dielectric and enables
surface channel operation for good short-channel-effect control.
Experimental results for bulk strained silicon on relaxed SiGe FETs and
strained silicon on relaxed SiGe on insulator FETs are quite
encouraging. For bulk strained silicon on relaxed SiGe, the fabrication
begins with the growth of a relaxed SiGe buffer of 12 µm by a
step-graded approach [101103]. This step can be performed using
rapid thermal chemical vapor deposition (RTCVD) [95, 98] or
ultrahigh-vacuum chemical vapor deposition (UHVCVD) [104]. A thin
(typically <20 nm) silicon layer is then epitaxially grown on the
relaxed SiGe. Because of the larger lattice constant of
Si1xGex (4 × x%
larger than that of silicon), the top silicon layer is under biaxial
tensile strain. Early work on FET fabrication has limited
high-temperature processes and ion implantation in the channel to avoid
strain relaxation. Recent results indicate that a conventional CMOS
process flow can be adopted while still achieving the mobility and
current drive enhancement [104]. Process modules such as
shallow-trench isolation (STI), channel ion implantation, source/drain
extensions and halos, and associated high-temperature activation
anneals are shown to have no adverse impact on device characteristics.
Figure 20, part (a) shows
that the effective electron-mobility enhancement over the
unstrained-silicon control, as well as the enhancement over the
universal mobility curve, persists into an effective electric
field (Eeff) range (up to 1.5 MV/cm) that
is relevant to short-channel devices with high channel/halo dopings
(e.g., bulk or PDSOI). A portion of this mobility enhancement is
translated into current drive enhancement over the
unstrained-silicon control [Figure 20, part (b)]. When the effect of
self-heating is accounted for in the measurement, it is expected
that the current drive will improve by another 510%.
Figure 20
It is also desirable to combine the benefits of enhanced transport in
strained-silicon materials with the benefits of
silicon-on-insulator (SOI) technologies [e.g., reduced junction
capacitances, dynamic floating-body effects (PD SOI), scalability to
shorter channel length (ultrathin-body SOI)]. Several approaches have
been reported, all of which rely on growing strained silicon on a
relaxed SiGe layer on insulator (SGOI). These methods include wafer
bonding and layer transfer [105107], oxygen implantation
(SIMOX) [99, 108], and oxidation enrichment of SiGe [109, 110] (see
Figure 21). For the
wafer-bonding and SIMOX approaches, mobility enhancements
similar to that obtained in bulk strained-silicon FET have been
demonstrated for n-FETs [99, 106, 107] as well as p-FETs [99, 106,
108]. Device design for SOI devices in the sub-50-nm regime calls for
a thin SOI layer of less than 50 nm. Therefore, the combined layer
thickness of the strained silicon and the relaxed SiGe should be less
than 50 nm. In addition, defect density must be low enough for ULSI
applications.
Figure 21
The wafer-bonding and layer-transfer technique is the most
straightforward and preserves the qualities of the starting relaxed
SiGe layer. These include desirable qualities such as a high Ge content
and high degree of relaxation of the SiGe buffer layer (e.g., by the
step-graded approach). It also retains undesirable qualities such as
the defect density of the starting SiGe buffer. A practical
limitation of the layer transfer approach is thickness and uniformity
control, since the thickness of the SiGe layer on the insulator is
dependent on the final polish step. Using the SIMOX approach, the Ge
content in the SiGe layer appears to be limited to less than 10%
because the melting point of
Si1xGex with
x > 0.1 is lower than the SIMOX annealing temperature.
This limitation can be partially recovered by oxidation enrichment.
When the SiGe is oxidized, the Ge is driven from the oxide layer, and
the Ge content in the underlying SiGe layer increases (the snowplow
effect). Thus, the Ge content of the SiGe layer can be
enriched by this oxidation process. The enrichment process is
used in the SIMOX approach to obtain a Ge content greater than 10% in
the SGOI film [99]. This process can also be employed to produce a
relaxed SiGe layer on insulator (oxide) by oxidizing a film stack
consisting of a starting silicon-on-insulator layer with a SiGe
layer pseudomorphically grown on top [109]. In this case, thickness
uniformity and control will be very similar to that of the starting SOI
film.
Electrical results of strained-silicon-on-insulator (SSOI) FETs
fabricated from strained silicon grown on SGOI virtual substrates are
quite encouraging. Essentially, the mobility enhancements commensurate
with the amount of strain introduced as similarly observed in bulk
strained-silicon substrates are reproduced in SSOI FETs. These results
verify that the concept of combining strained silicon with SOI will
retain the benefits of strained silicon.
Challenges ahead
Strained-silicon CMOS is at present one of the most promising
technology options for insertion into CMOS technologies. The materials
set is friendly to conventional CMOS technologies, and the device
performance gains demonstrated so far have been promising. The main
issue for the materials development is defect control and
understanding. The defect density of the materials must be
comparable to those of exiting bulk and SOI materials for the
relaxed SiGe materials to be viable in an ULSI application. The nature
of the defect is as important as the defect density. Correlating device
fail mechanisms to the physical defect will be an important area of
work. In particular, for SSOI materials, the nature of the defects can
be quite different from that of the bulk relaxed SiGe material.
Obviously, there is also a strong dependence of defects on the method
of SSOI materials fabrication. It may also be possible to obtain
strained-silicon-on-insulator without the underlying SiGe layer (e.g.,
using compliant substrate concepts [111]), thereby breaking out of
the SiGe materials system altogether. In the device area, the
fundamental principles of mobility enhancement must be further
elucidated. In addition, relating the mobility enhancement to current
drive gains for short-channel devices will continue to add to the
understanding of the relative role of mobility in describing carrier
transport, since strained silicon provides another means of modifying
mobility while keeping other device parameters the same. Finally,
we note that while it may appear difficult to combine strained silicon
with the double-gate FET device structure, in principle nothing
precludes this possibility.
3. Nanotechnology
Nanotechnology is a broad term which may refer to such diverse
technical disciplines as chemistry, biology, physics, materials, and
electrical engineering. Here we focus on the aspect of nanotechnology
that pertains to extending and broadening the impact of the
microelectronics and semiconductor industry. Therefore, the discussion
here is necessarily narrowly focused. Instead of including a
comprehensive discussion of this narrowly focused yet still vast
subject, we choose to select one example as an illustration of the
approach and thought process we adopt to identify and/or adopt
nanotechnology for commercial applications. In this example, we examine
a potential device technology: the carbon nanotube field-effect
transistor (CNFET). This example is chosen because it appears to be one
of the nanotechnologies which allows a meaningful comparison with
incumbent technologies in that it uses the same circuit and system
architecture. It is also one of the most widely studied in the
nanoscale science community.
Carbon nanotube FET
Device concepts
Carbon nanotubes are nanoscale high-aspect-ratio cylinders of
carbon atoms with exceptional electrical and mechanical properties.
The basic properties and preparation of carbon nanotubes are well
documented in the literature [112116]. Many applications
for this material have been proposed: electronic switch [117,
118], field emitter for flat-panel displays [119, 120],
electrochemical energy storage for batteries and fuel cells [121],
interconnect for electronics, probe tips for scanning probe microscopy
and lithography [122124]. Here we focus our discussion on the use
of a carbon nanotube as an electronic switching element which performs
basic logic functions.
Single-wall carbon nanotubes (SWNT) are two-dimensional graphene sheets
rolled into nanometer-diameter cylinders that can either be 1D metals
or semiconductors [125]. With the appropriate chirality [114,
116], SWNT can be semiconducting, with a bandgap inversely
proportional to the diameter of the tube:
Egap =
2 0aCC/d, where
Egap is the bandgap, 0 is the
carbon-to-carbon tight-binding overlap energy,
aCC is the nearest-neighbor carbon-to-carbon
distance (0.142 nm), and d is the diameter of the nanotube
[126]. A typical semiconducting SWNT has a diameter of 1.4 nm and a
bandgap of about 0.50.65 eV [126]. A straightforward application of
this semiconducting property of the carbon nanotube is to form a
field-effect transistor (FET) analogous to the MOSFET. Tans et al.
[117] and Martel et al. [118] have already demonstrated FET
operation using gold electrodes as source and drain contacts and a
back-gated structure. Both p- and n-channel carbon nanotube transistors
(CNFETs), as well as CNFETs with ambipolar behavior, have been
reported [127, 128]. Recently, simple circuits such as inverters
[127, 129] and ring oscillators [129] have been successfully
fabricated.
Device fabrication
The fabrication of an early CNFET is straightforward: First, the
gold electrodes are formed on a silicon wafer with a thermally grown
oxide (the gate insulator); then carbon nanotubes are dispersed on the
wafer. The wafer substrate serves as the gate electrode of the CNFET.
While this process is simple and suitable for a first demonstration of
FET action, the source/drain contact resistance clearly limited the
current drive of the CNFET [117, 118]. A recent embodiment of the
CNFET is shown in the inset of Figure
22(a), where ideas borrowed from
conventional microelectronics fabrication are employed to improve the
contact resistance. In this process, carbon nanotubes are dispersed
onto a heavily doped silicon wafer with a thermally grown oxide
(serving as the gate insulator). Then a transition metal (titanium or
cobalt) is patterned on the wafer (by lift-off) as the source/drain
contact metal. A subsequent anneal at 400°C (Co) or at 820°C (Ti)
in an inert ambient forms a low-resistivity cobalt or Ti carbide at the
source/drain contact [128, 130]. Figure 22 shows the IV
characteristics of a p-type CNFET with a cobalt source/drain contact
[131]. The cobalt contact improves the contact resistance noticeably
compared to previous results obtained with the same SWNT material
[117, 118] as evidenced from Figure 22(b), where the
drain current shows no sign of saturation at large gate voltages.
A similar device with a SiO2 passivation layer reduces the
subthreshold slope [131], but does not completely remove the
interface traps at the nanotube surface. While this device
structure is easy to fabricate and enables advance learning, a
device configuration in which the gates are separately controllable (in
contrast to a common gate provided by the substrate) is necessary
for a technology.
Figure 22
Performance assessment
Table 3 summarizes the
key device parameters extracted from the CNFET IV
characteristics. The source/drain series resistance is an upper bound
estimated from the measured total resistance from source to
drain.5 In order to assess whether the CNFET
is a viable technology, it is useful to compare the device performance
of CNFET with that of conventional silicon MOSFETs. Table
4 compares several key device
parameters. It should be noted that several assumptions must be made in
order to make a direct comparison. While some of these assumptions may
not be valid for various practical or fundamental reasons, Table 4 does
serve as a starting point for discussion.
|
|
Table 3
|
Selected device parameters of p-type carbon nanotube FETs. The
source/drain series resistance is an upper bound estimated from the
measured total resistance from source to
drain.
|
| |
| |
| |
| |
| |
|
Contact scheme
|
Cobalt
|
TiC
|
|
|
Channel length
(nm)
|
1030
|
800
|
|
Nanotube
diameter
|
1.4
|
1.4
|
|
Gate-oxide thickness
(nm)
|
150
|
150
|
Source/drain resistance
(k )
|
<25
|
*
|
|
Inverse
subthreshold slope (mV/decade)
|
2500
|
730
|
|
Interface
trap capacitance ratio,**
Cit/Ci
|
40.9
|
11.2
|
|
Transconductance# (µS)
|
0.342
|
0.280
|
|
|
(VD
=
1.4
V)
|
(VD
=
1.0
V)
|
|
Mobility#
(cm2V1s1)
|
60
|
68
|
|
*This value cannot be accurately estimated for this device.
**Ratio of the capacitance due to interface trap relative to the gate
capacitance as estimated from the subthreshold slope assuming that the
capacitance due to band bending in the s-SWNT is negligible.
#The transconductance and the mobility values are given
without a correction for the contact resistance. After Martel et al.
[131].
|
|
|
|
Table 4
|
Comparison of device characteristics of carbon nanotube FET and
conventional silicon MOSFET.
|
| |
| |
| |
| |
| |
|
|
|
p-CNFET(a) 1030 nm
|
100-nm MOSFET
|
25-nm MOSFET
|
|
|
|
Transconductance
(µS/µm)
|
122
|
1000 (n-FET)(b)
|
1200
(n-FET)(c)
|
|
|
|
|
460
(p-FET)(b)
|
640
(p-FET)(c)
|
|
|
External resistance ( -cm per
side)
|
<70
|
~66
(n-FET)(d)
|
~40
(n-FET)(d)
|
|
|
|
|
~143
(p-FET)(d)
|
~86
(p-FET)(d)
|
|
|
Gate insulator
(nm)
|
150
|
2.0
|
0.8
|
|
|
(a) |
Device A from Martel et al. [131].
|
|
(b) |
Ghani et al., IEDM 1999 [136].
|
|
(c) |
Chau et al., IEDM 2000 [13]; these values approach the 1500 mS/mm
predicted for n-FET for an ideal single-gated bulk MOSFET [47].
|
|
(d) | ITRS 1999 [1].
|
|
First, it is necessary to make some assumptions about how the CNFET
will be used. Here, we assume that a circuit topology similar to
today's silicon-based VLSI circuits will be used; that is, the CNFETs
are switches that form logic gates interconnected to one another by
wires. In this scenario, the CNFET must deliver a current-driving
capability similar to or surpassing that of the MOSFET in the future
time when the CNFET will be introduced.6 To
provide this large current, it is likely that an array of CNFETs will
be needed. If we further assume that the CNFET can be made with equal
lines and spaces with dimensions equal to the diameter of the
nanotube (1.4 nm), the CNFET characteristics may be compared
with those of MOSFETs on a per-unit device-width basis, as in Table
4.
Understanding the carrier transport mechanism of the CNFET is vital
to assessing its potential as a device technology. Here, we adopt a
phenomenological approach based on a classical description of transport
that allows a direct comparison with silicon devices. Although it is
clear that the characteristic length for electronphonon and defect
scattering in metallic single-wall carbon nanotubes is very long (of
the order of microns) and favors ballistic transport [132, 133], this
classical approach takes into account the fact that the semiconducting
SWNTs show stronger backscattering at room temperature [118,
125, 134].
In this phenomenological model, the channel conductance is given by
G = µQL/L, where µ is the
carrier mobility and QL is the charge
density per unit length. Note that µ is a phenomenological parameter
for these quasi-1D systems [118]. If one draws an analogy with
the silicon MOSFET in which the channel charge is related
to a threshold voltage VT, then
QL =
CL(VG
VT), where CL is the
gate-to-channel capacitance per unit length and
VG is the gate voltage [118]. Within this
framework, the hole mobility of the CNFET is about 6068
cm2V1s1. This hole
mobility compares very well with surface mobility of holes in
MOSFETs, which range from about 160
cm2V1s1 at low vertical
electric fields (0.1 MV/cm) to about 50
cm2V1s1 at high vertical
electric fields (1 MV/cm) [60]. The transconductance of the
p-type CNFET shown in Figure 22 reaches 122 µS/µm (assuming
equal lines and spaces for a CNFET array) for a 1030-nm-long
channel and a gate insulator thickness of 150
nm.7 Continuing with the classical transport
picture in which the CNFET current scales inversely with the channel
length, the CNFET is extrapolated to have a transconductance of 1257
µS/µm at a 100-nm channel length and 5028 µS/µm at a 25-nm
channel length. For silicon MOSFETs, respective transconductances of
1000 µS/µm and 460 µS/µm for n-FETs and p-FETs have been
achieved at 100-nm gate length [136]. At 25-nm gate length, the
transconductances reach 1200 µS/µm for n-FETs and 640 µS/µm for
p-FETs [13], which approaches the 1500 µS/µm predicted for
n-FETs for an ideal single-gated bulk MOSFET [47]. The
transconductance of n-channel double-gate FETs in the 25-nm regime
is projected to be of the order of 5000 µS/µm.
The electrostatics of the CNFET is another area that requires detailed
analysis, primarily due to the cylindrical geometry, the small size,
and the 1D quantized channel (and band structure) of the carbon
nanotube. Short-channel effects of the CNFET have not yet been
explored. The capacitance for an isolated cylindrical structure (either
in a planar gate conductor or coaxial gate conductor configuration
[131]) varies inversely and logarithmically with the distance from
the center of the conductor [137]. For gate insulator thicknesses
that are large compared to the nanotube diameter, the gate capacitance
for an isolated CNFET is greater than that for a corresponding
parallel-plate capacitor (e.g., in a planar silicon MOSFET) with an
equivalent width [131] (due to the fringing field). For example, a
150-nm gate SiO2 for an isolated CNFET is roughly
equivalent to a 310-nm gate SiO2 for a planar MOSFET.
However, this advantage due to the fringing field vanishes for gate
insulator thicknesses that are comparable to or smaller than the
nanotube diameter. Furthermore, for an array of CNFETs, the fringing
field diminishes as the packing density of the array increases. At high
packing densities, the shielding from neighboring nanotubes renders the
electric field pattern similar to the parallel-plate situation. It is
therefore important to explore in future work the use of
high-k gate dielectric materials and to study the
electrostatics of alternative gate and channel geometries (such as the
coaxial geometry) to maximize the gate-to-channel charge coupling.
In principle, the carbon nanotube has a perfect interface with the gate
dielectric because all bonds of the carbon atoms are satisfied in a
carbon nanotube. In other words, there are no dangling bonds which form
interface states. However, the subthreshold turn-off characteristics
shown in Figure 22(b) are very gradual (hundreds of mV/decade). This
behavior is probably caused by traps in the gate dielectrics that
are close to the interface. Reducing the impurities in the gate
dielectric will probably lead to improved subthreshold characteristics
(see recent results by Wind et al. [135]).
Challenges ahead
Despite the optimism suggested by the above analysis, the
challenges facing CNFET technology are daunting. The first and most
important is to understand the device scaling and carrier transport
properties of the CNFET (as a function of the gate length, gate
dielectric thickness), as well as key device physics such as
short-channel effects; gate-dielectric-to-carbon-nanotube interface
properties (interfa |