Introduction
The IBM Advanced Semiconductor Technology Center (ASTC) is a pilot
line created in 1989 for the development and early manufacture of all
advanced IBM semiconductor products. It has also been the site of
recent development alliances among IBM, Siemens, and Toshiba, and is
currently in the process of transferring products
to manufacturing sites throughout the world. ASTC
acts as the common foundry for future generations of DRAM (64Mb, 256Mb,
and 1Gb), associated CMOS logic, and various experimental chip designs
and processes. The need to process a wide spectrum of products,
spanning all stages from early exploration to fully qualified
manufacturing, poses a unique challenge to the efficient operation of a
lithography sector at ASTC.
Deep-ultraviolet (DUV) lithography has enabled ASTC to pursue
aggressive circuit ground-rule migration in the early 1990s; it has
been the primary means of reducing the minimum ground rule
(G  ),
as illustrated in Figure
1. The actual minimum ground rules achieved during
the first half of the decade are displayed in two ways: 1) the lower,
dashed curve shows the minimum ground rule processed successfully on
integrated hardware in a given year, and 2) the upper, solid curve
shows the weighted average ground rule across all hardware programs and
DUV exposure levels processed successfully in a given year. The
weighted average minimum ground rule processed in the ASTC DUV sector
is projected to fall below 0.2 µm by the year 2000.
Also indicated in Figure 1 are key DUV lithography milestones
associated with the decrease in the minimum ground rule. These are
expressed as the increase in numerical aperture realized on
production-worthy DUV optical imaging systems over the first half of
the decade, and the decrease in wavelength from 248 nm to 193 nm
anticipated for the latter half
[1]. While essential to the
achievement of this ground-rule migration, the evolution of imaging
systems is usually accompanied by a shrinking process window: namely, a
decrease in the allowed depth of focus and exposure latitude to achieve
the target dimensions and tolerances, and a corresponding decrease in
overlay tolerance.
Figure 1
The scaling of the process window with exposure equipment and
photoresist improvements undertaken or anticipated at ASTC over the
current decade is represented in Figure 2,
where the simulated depth of focus (DOF) is plotted as a function of
feature size for different exposure-tool and photoresist combinations.
The solid curves A, B, C indicate existing 248-nm tool and resist
options. The dashed curves D, E are extrapolations to future 193-nm
tools and resists. Curve B in
Figure 2 corresponds to the current
operating condition, a numerical aperture of 0.5 (0.5 NA) and imaging
at a wavelength of 248 nm in Apex-E photoresist
[2], for nearly
all of the hardware being processed through the DUV sector in ASTC.
At the 0.25-µm ground rules required by 256Mb manufacturing, the
theoretical DOF is approximately 1.0 µm. In practice, the usable DOF
is in the range of 0.6 to 0.8 µm. Curve A shows the resolution
achievable with an earlier generation of tooling and resist (0.35 NA,
Apex-M [3]) at ASTC.
Resolution improvements are expected from the
introduction of new exposure tools and resists to ASTC manufacturing.
The migration of DOF as ground rules decrease with DRAM generation is
shown by the labels (16Mb, 64Mb, ...) in Figure 2, given a
plausible transition among DUV tooling and resist options. While it is
possible to increase the DOF at a specific ground rule by the
appropriate selection of NA/wavelength and resist combination (e.g., at
0.25 µm in Figure 2),
the DOF continues to shrink from one generation
to the next. Resist improvements and the employment of
resolution-enhancement techniques, such as off-axis illumination and/or
phase-shift masks, may counteract this effect to some degree, but the
trend toward a smaller process window with a smaller ground rule is
inexorable.
Figure 2
The challenge posed by the shrinking process window is exacerbated in
ASTC by the need to work with current-generation tooling to pilot the
next-generation products. This implies low k-factor
operation, where k is the coefficient of the Rayleigh
resolution criterion, defined in terms of the NA, wavelength
, and
minimum ground rule
G  ,
by
k = NA · G  / .
The value of k provides an indicator of the
degree of difficulty of the lithography. Higher values of k
imply greater aerial image acuity due to the inclusion of more
diffracted orders
[4], so that lower requirements can be placed on
resist contrast, pattern-dependent mask biasing, or
resolution-enhancement techniques. To provide an adequate process
window, historical k-factors have been greater than 0.5
during product development, and greater than 0.7 for effective
manufacturing. As is evident in Table 1, ASTC
has already broken the development "rule" by operating at or
below k = 0.5. As the limits of photolithography are
extended, the manufacturing rule of k greater than 0.7 will
be challenged as well. In particular, the exponential increase in
tooling cost by generation [5]
will drive the need to prolong the
useful life of each generation in manufacturing. To date, the principal
penalty for a low k-factor operation is the increased effort
required for process optimization [6],
and proximity correction [7,8].
One important role of proximity correction is compensation for
image shortening, as is described in the process development section
below.
Table 1 DRAM development k-factors, actual and projected.
| DRAM | Date | G   (µm) | Wavelength | NA | k |
| 64Mb | 1990-1993 | 0.35 | 248 | 0.35 | 0.49 |
| 256Mb | 1993-1996 | 0.25 | 248 | 0.5 | 0.50 |
| 1Gb | 1996-1999 | 0.18 | 248 | 0.6 | 0.44 |
| 4Gb | 1999-2002 | 0.13 | 193 | 0.7 | 0.47 |
Against this backdrop of shrinking linewidths and latitudes, the
objective of the DUV lithography photosector at ASTC is to maximize the
production of high-quality product. Toward that end, many disparate
functions are involved. Lithography must satisfy both its own
technology requirements and the need to integrate its performance
with other process sectors in the manufacturing line. Lithography can
be viewed as a support for the various process sectors which define the
overall manufacturing process. It is the structure of the lithography
support, in a pilot line environment, which concerns us here.
The lithography constituents are materials (such as photoresist,
developer, and antireflection layers), tools (stepper, track,
metrology) and masks (product, test). ASTC specifies, characterizes,
and melds these elements into a process. The development of a viable
baseline process and its product-specific variations, the
implementation of appropriate means of process control, and the
integration from lithography to upstream and downstream process steps,
constitute the principal missions of the ASTC.
Process constituents
The generic photolithographic process allows for a mask or reticle
pattern to be transferred via spatially modulated light (the aerial
image) to a photoresist film on a wafer. In cases where the wafer is
already patterned, this exposure process must be preceded by an
alignment step to ensure precise positioning of the new pattern level
with respect to prior pattern levels. As practiced at ASTC, DUV
lithography has two distinguishing attributes: 1) the delayed image
formation process intrinsic to chemically amplified photoresist, and 2)
the implementation of step-and-scan alignment and exposure equipment.
Chemically amplified photoresist
In any photo-imaging process, those portions of the absorbed
aerial image whose energy exceeds a threshold energy of chemical bonds
in the photo-active component of the resist material act to form a
latent image in the photoresist. The latent image marks the volume of
photoresist material that either is removed during the development
process (in the case of positive photoresist) or remains after
development (in the case of negative photoresist) to create a
three-dimensional pattern in the photoresist film.
The need for chemical amplification in the DUV imaging process is
driven by the relatively low DUV light intensity available at the wafer
plane from exposure tools [9].
The spectral intensity of mercury
lamps, used in early generations of DUV tools, is roughly an
order of magnitude lower at 248 nm than at 365 nm. Excimer sources can
produce much higher intensities at 248 nm, but their pulsed operation,
and the line narrowing required by chromatic correction in
high-numerical-aperture imaging systems, constrains the light that can
be delivered to the wafer. It is likely, therefore, that excimer-based
systems will continue to require chemical amplification.
In chemically amplified DUV resists, the photo-active component is an
acid generator. Each absorbed photon generates an acid molecule. The
latent image is formed during a post-exposure bake (PEB). In the
presence of heat, the photo-generated acid acts as a catalyst for a
thermally activated reaction that cleaves the blocking group from the
bulk polymer. The separation of the blocking group makes the exposed
region of the resist soluble in the developer. Since the acid is not
consumed in the reaction, each acid molecule causes many reactions
as it diffuses through the resist, resulting in a photosensitivity gain
of 10²-10³ [10]. Hence, DUV resists belong
to a class of materials known as chemically amplified, acid-catalyzed
photoresist.
Because of the delay in image formation between exposure and PEB, the
DUV resist is vulnerable to airborne and/or substrate contaminants
which alter the acid concentration. The contaminants are typically
basic compounds that can neutralize the photo-generated acid, thereby
inhibiting image formation. An increase in the exposure dose is then
required to produce an image, and deformations occur in the developed
image profile. In the case of airborne contamination, a lip forms at
the top of the developed resist profile, popularly known as
"T-topping" [11]. Extreme
cases of T-topping cause bridging
between adjacent patterns. Even at low contamination levels, the slight
T-topping evident by comparing the profiles in Figures
3(a) and 3(b) can cause significant
loss of process capability, as is discussed further in the section
on process integration. In the case of substrate contamination, an
undeveloped foot extends from the base of the profile. As with
T-topping, the size of the foot and its effect on the process are
dependent on the concentration of contaminant in the substrate
contacting the resist and its ability to diffuse into the resist.
Figure 3
Various strategies are used to eliminate the undesired contamination.
Given the need to process product with available resists, regardless of
their contamination sensitivity, the approach at ASTC has been a
combination of chemical air filtration in both the stepper and track
tools, use of chemical barrier layers, and tight control of the
exposure to PEB delay. As a result, observable effects of contamination
have been limited to rare incidents when basic chemicals were
inadvertently introduced to the photocluster enclosure.
The ASTC process has been improved and simplified over time. In the
1991-1992 time frame, when our DUV process was Apex-M resist on the
Micrascan® I expose tool, a topcoat layer was required to protect
the resist from airborne contaminants. The higher exposure intensity of
the Micrascan II tool in late 1992, combined with the availability of
Apex-E (which traded photosensitivity for reduced contamination
sensitivity) and improved filtration, enabled us to eliminate the need
for a topcoat. Where required by the substrate materials, our bottom
antireflection layer serves a dual purpose as a chemical barrier. In
the meantime, newly available resists such as the IBM/Shipley UV2HS
show greatly reduced susceptibility to contamination
[12]. This is
achieved by the appropriate selection of resist materials to
enable a post-apply bake near the glass transition temperature of the
bulk polymer. The resulting film densification prior to exposure
inhibits the diffusion of contaminants into the resist film
[13].
An important consequence of resist contamination was the early
resolution of a longstanding debate regarding stepper/track clustering.
Our initial experience in 1990 with unclustered DUV tools provided
convincing evidence that automated wafer transport between exposure and
PEB was required for adequate process control even at 0.5-µm ground
rules. The Micrascan I tools, brought on line in 1991, were directly
interfaced with a PEB station. The Micrascan II tools and associated
resist process modules, brought on line in late 1992 and early 1993,
were configured as fully integrated (antireflection-layer coating
through resist development) stepper/track photoclusters.
The stepper/track clustering poses the logistics problem of cascading
lots, i.e., initiating a second lot while the first is still in
process. (A lot is a set of wafers that receive similar processing.)
Without cascading, significant stepper production time is lost while
the last wafer of a lot is completed on the track (baked and developed)
and the first wafer of the next lot is initiated (coated and baked).
Fundamental to cascading is compatibility of process and track
configuration from one lot to the next. The successful implementation
of cascading at ASTC, where a large and diverse set of products is in
process at any given time, has depended on the establishment of a
baseline process, with a limited number of product-specific variations.
The baseline process must accommodate the requirements of the most
advanced product for which there is a significant number of wafer lots.
At present, this is the 256Mb DRAM. All products not requiring minimum
ground rules are fabricated with the same nominal process, and,
consequently, higher process capability. Thanks to this commonality
across programs, cascading has improved productivity. Our ability to
sustain cascading as new product generations start up will depend on
our ability to migrate existing products to a more advanced process.
Step-and-scan exposure tool
Coincident with the need to decrease optical wavelength to keep
pace with resolution requirements is a need to decouple resolution and
overlay capability from increasing field-size requirements
[14]. The
chip size growth from one DRAM generation to the next is currently
about 50%. For estimates of field size, it is instructive to consider
the area required by two chips, a so-called "twin-chip," both to
realize productivity advantages in manufacturing and to accommodate the
typically larger area of associated logic chips. The 256Mb DRAM
generation twin-chip requires a resolution of 0.25 µm and overlay
error less than 100 nm over an estimated field size of close to 600
mm² [15].
Given the constraints on optical aberrations
implied by such capability, conventional refractive, full-field,
step-and-repeat exposure systems become intractable. In
anticipation of this field-size requirement, DUV lithography at ASTC
migrated from excimer steppers to Hg-lamp-illuminated step-and-scan
tools in early 1991. It appears that all tooling manufacturers will
adopt step-and-scan technology for the DUV generation of production
lithography equipment. To achieve the necessary intensity, future
tooling will contain excimer illuminators, using KrF emission to
produce a wavelength of 248 nm and ArF to produce a wavelength of 193
nm.
The step-and-scan system circumvents the field-size versus
image-control limitations of the step-and-repeat system by imaging only
a section of the field at any given instant of time--much as the
step-and-repeat system circumvented the limitations of the earlier
scanning systems by imaging only a part of the wafer at a time. From
the wafer viewpoint, the differences between the two systems are
illustrated in Figure 4.
Figure 4
The step-and-repeat system patterns the wafer by moving it in
increments of the overall field dimensions, exposing the full
field (which must fit within the circular area of the lens) at each
stepping location. A shutter between the light source and illumination
system controls exposure time, and thus the light energy incident on
the wafer, or "dose." The local wafer plane is matched to the image
plane of the lens by leveling each field. The distance between the
wafer surface and the lens is measured at three or more points, a
best-fit plane is interpolated, and the wafer stage is moved (average
distance to lens, tilt along two orthogonal axes) to match.
The step-and-scan system must also step the wafer in increments of the
overall field dimensions. At each field location, however, the full
mask image is "painted" onto the wafer by scanning mask and wafer
simultaneously in opposite directions, through a slit that illuminates
and images only a narrow segment of the mask field at any given instant
of time. For a reduction system, typically 4:1, the mask must scan four
times faster than the wafer. The dose delivered to the wafer is
controlled by a combination of illumination intensity and scan speed.
The scanned image area is naturally rectangular, resulting in a more
efficient match to chip designs than is possible with the circular
lenses of step-and-repeat systems. The height of the field is limited
by the height of the illuminated slit, defined by the optical system
design. In principle, the width of the field is limited only by the
length of the scan, defined by allowed reticle and wafer stage travel.
In practice, the field width is limited by the size of the mask and the
reduction ratio of the optical system. Nonetheless, the step-and-scan
system has the intrinsic advantage that larger field sizes can be
achieved: to date, field dimensions of 26 mm × 33 mm = 858
mm², on 4:1 reduction systems using standard 150-mm-square
masks. This should be sufficient area to print a 1Gb twin-chip at
0.18-µm ground rules.
Scanning has other major advantages: the minimization of optical
aberrations possible over a smaller image area, the inherent averaging
of aberrations in the scan direction, and increasing the degrees of
freedom for correcting both overlay and image quality over the field.
In the latter case, for example, "on-the-fly" wafer leveling during
the exposure scan results in a closer match of the wafer plane to the
image plane across the field. This is a significant factor in our
ability to stay within the smaller DOF that accompanies the more
advanced product ground rules.
Of course, the step-and-scan system brings with it the added complexity
of synchronized reticle and wafer stage motion during exposure. This is
offset to some degree in the SVG Lithography Systems (SVGL) Micrascan
design by a simplified, partially reflective optical system allowed by
scanning in contrast to the all-refractive lenses in step-and-repeat
systems. Nonetheless, on the manufacturing floor more errors can occur
with a step-and-scan tool than with the mechanically simpler
step-and-repeat tool. Learning to take advantage of the improved
capability offered by step-and-scan's greater degrees of freedom,
while eliminating its concomitant opportunities for error, has been one
of the DUV lithography challenges at ASTC.
Masks
The products currently fabricated in the DUV sector employ
conventional pellicle-protected chrome masks. Various combinations of
phase-shift mask and off-axis illumination techniques are being
explored [16], but
present-generation exposure tools are not well
suited to the introduction of these techniques in manufacturing. It is
unlikely that we will see their full implementation until the next
generation of exposure tools, equipped with flexible,
computer-controlled illuminators, is available in the 1996-1997 time
frame.
With more advanced ground rules, even conventional chrome masks must be
considered an integral part of the lithography nonlinearities invoked
by extending the resolution limits of exposure tools. Optimization of
mask bias across the chip becomes a painstaking task, subject to
subtleties in both the mask-making and imaging processes.
To date, the mask role of ASTC DUV lithography is principally one of
qualification and management. The qualification task scales with the
maturity of the product. For early development test sites, a simple
visual check for pattern integrity often suffices. For shippable
product, of course, stepper lithography brings with it the unenviable
characteristic that a single mask defect can destroy every chip on the
wafer. The avoidance and early detection of such "repeaters" is a
critical responsibility that requires periodic, in-line inspection of
the active area pattern on the wafer as well as bit-fail analysis after
electrical testing.
Equally formidable is the need to manage the multiple mask sets that
accompany multiple products processed in the same line, and possible
multiple mask revisions at any one product level, in a way that ensures
the use of the correct mask on each exposure level of each lot being
processed in the line. Toward that end, a system that coordinates the
assignment of masks, photocluster recipes, and dispositioning
specifications to each exposure level has been developed and
implemented within the ASTC lithography sector.
Process development
The process development role of the ASTC DUV lithography sector is
largely one of extending the limits of the critical dimension and
overlay capability, provided by available exposure tool and resist
combinations, in the context of early chip development. As noted above,
this usually entails working within a small process window for initial
product runs, and attempting to enlarge that window as the product
matures. A notable recent example has been the implementation of a
0.25-µm process on the Micrascan II photocluster.
Migration to 0.25 µm
As late as November of 1992, our working assumption was that some
combination of phase-shift mask and off-axis illumination technology
would be required to realize 0.25-µm lithography capability for early
learning on the 256Mb program. Given the embryonic state of the
resolution-enhancement techniques, both in mask-making capability and
illumination control, this did not bode well for the required
conversion to 0.25-µm ground rules. Consequently, with the
installation at ASTC of the first Micrascan II (a preproduction model)
in December 1992, our first priority was to explore the limits of
conventional chrome mask imagery.
The SEM photographs in Figure 5 are of
historical interest in that they show the results from the first
integrated lot processed to 0.25-µm ground rules at ASTC, providing
early evidence (January 1993) of our ability to produce
high-aspect-ratio 0.25-µm images (the space between the resist
islands) in Apex-E resist. The corresponding process window is
characterized by the critical dimension (CD) variation with dose and
focus shown in Figure 6. At a single point in
the field, we estimated a DOF of 0.8 µm at 8% exposure latitude,
assuming our ability to bias the length of the structure to achieve the
correct spacing, as shown at the top of Figure 6.
Figure 5
Figure 6
The 0.25-µm process window is often limited by the
"image-shortening" behavior of the pattern length evident in Figure
6. Image shortening belongs to a class of effects which cause pattern
deformations as a function of pattern design and density
[17]. For a
line or space whose nominal length is greater than its nominal width,
shortening describes a decrease in the imaged length-to-width aspect
ratio. The shortening increases in magnitude as the resolution limit of
the lithography tool is approached.
A top-down SEM photograph of the image-shortening effect is shown in
Figure 7(a). As the printed linewidth
decreases from top to bottom of the photograph, the line ends deviate
further from the parallel display cursors. Data showing the dependence
of image shortening on the nominal width of both a line and a space are
presented in Figure 7(b). The distance between adjacent
patterns was measured rather than the actual length, where the nominal
distance is 600 nm. As the nominal width decreases, approaching the
resolution limit, the shortening magnitude increases. Several factors
contribute to image shortening:
- The corners that must form the ends of the line/space
contribute higher-spatial-frequency components than the middle of the
line/space. The consequent comer rounding leads to shortening in the
aerial image as the linewidth decreases.
- The image-formation components of the resist can diffuse during
exposure, PEB, and development steps. This diffusion enhances the
shortening as the width of the line/space approaches the diffusion
length.
- The mask contributes to shortening due to the
resolution/process limitations (similar to those listed above) inherent
in mask fabrication. These effects are most severe for 1× masks, but
can also be significant on 4× and 5× masks in the form of
corner rounding on the mask pattern, which enhances the shortening
present in the aerial image of the lithography tool.
Figure 7
The characterization of mask-to-wafer pattern deformation, under
product-specific conditions, including the subsequent etch as well as
lithography, is the basis for mask-biasing algorithms to optimize the
process window [18]. With
correctly biased chrome masks, the 256Mb
test site was converted from 0.375-µm to 0.25-µm ground rules by
early August 1993. A snapshot of that conversion, as reflected in
critical dimension measurements on a level being processed through the
DUV lithography sector, is shown in Figure 8.
The downward spikes in dimensions prior to the target conversion are
precursor experiments at the smaller ground rule. The first
significant 0.25-µm electrical test structure yield was obtained in
September 1993.
Figure 8
Alignment improvement
Improvements in level-to-level overlay must be commensurate with
reduced ground rules. Overlay performance is usually dominated by the
alignment capability of the exposure tool. Consequently, much of
the overlay-related development work at ASTC (such as alignment
strategy, mark design, and kerf layout) centers on alignment
optimization by product level. In the case of DUV lithography, a key
component of this activity has been the evaluation and implementation
of the AXIOM alignment system on the Micrascan II, as discussed below.
The sole alignment system on early Micrascan II tools was a
dual-wavelength (488-nm and 514-nm lines of the Ar-ion laser),
direct-reticle-reference, through-the-lens (TTL) system
[19]. While
direct referencing of the wafer mark to the reticle precludes the
possibility of baseline drift, the narrowband illumination
characteristic of TTL laser systems makes them susceptible to
process-induced alignment errors. One underlying source of error is
optical interference in the neighborhood of the wafer alignment mark.
Process variations that are insignificant with respect to product
quality can change the optical signature of marks at the alignment
wavelength, inducing significant variation in alignment. This process
sensitivity on the part of the alignment system results in degradation
of both overlay capability and sector productivity, in that it drives a
greater use of send-ahead wafers (precursor wafers to establish
settings unique to a given lot) to compensate for lot-to-lot
alignment variation. Furthermore, it is a particularly insidious
problem in that TTL alignment may work well on some product levels but
not others.
The AXIOM alignment system was introduced to the field by SVGL in early
1994, as a retrofit to one of our Micrascan II tools at ASTC. In
contrast to TTL, AXIOM is a broadband (400- to 700-nm lamp
illumination), indirect-reticle-reference, off-axis (outside-the-lens)
alignment system [20]. The
broadband illumination makes the alignment
less sensitive to process variations. Interference effects are averaged
out, and alignment signal strength and shape show less variation over a
wide range of process structures. On the other hand, the off-axis
nature of AXIOM requires that the wafer be referenced to the reticle
via a separate calibration mark on the wafer stage. This indirect
reference allows the possibility of baseline drift, which can be
compensated only by frequent calibration. Thus, the improved process
invulnerability of AXIOM comes at the added cost of more frequent
reticle-to-wafer reference updates on the Micrascan.
The AXIOM versus TTL trade-offs can be fully evaluated only in an
environment such as ASTC, where impact on real product levels can be
assessed. Data accumulated across the levels of a common product on two
separate Micrascan II tools, nominally identical except that one used
TTL alignment and the other AXIOM, are presented in Figure
9. We show the "corrected" alignment offsets of
each lot. These are the x, y alignment offsets that, on
the basis of modeling of the overlay measurements, should have been
used to zero each lot. From the distribution of offsets by level and
even within one level, the reduced process sensitivity of AXIOM is
apparent. An approximately 2× improvement in level-to-level mean
offset control is apparent. The consequent reduction in send-ahead
wafers more than makes up for the time expended in the more frequent
reticle-to-wafer referencing.
Figure 9
Process control
Process window optimization, dependent as it is on timely advances
in materials and tooling, cannot always deliver acceptable exposure and
DOF latitudes for pilot line production. To ensure our ability to work
within very limited latitudes at ASTC, we have invested heavily in
process control automation. In particular, sustaining overlay and
critical dimension performance requires the frequent updating of both
tool baseline settings and those dependent on product, product level,
and statistically significant tool/process variation
[21]. The
operation of the photoclusters must be controlled in four ways:
- Tool baseline control specifies the settings of
the tools in the photocluster (stepper and track) independent of
product. The settings are based on both in situ calibration
and the processing of monitor wafers.
- Product logistics control specifies the lots to be
run by a given photocluster, and the mask(s), exposure tool product
file, and track recipe to be used on a particular lot. Of particular
importance to productivity is the identification of lots that can be
cascaded. The lot-specific information is typically provided at the
time the lot is released to the line.
- Product parameter control specifies the settings for
exposure (dose, focus) and the alignment offsets. These settings and
offsets are typically tool-, product-, and product-level-dependent.
They may be determined by prior lot data or send-ahead wafer data, on
the basis of feedback from the metrology sector.
- Product statistical process control (SPC) flags
out-of-control conditions using control charts of measurements on
product--most notably, critical dimension and overlay data and
associated modeled parameters.
For the implementation of tool control, we rely heavily
on the autocalibration feature (Autocal) of the Micrascan II tool
[22]. The Autocal
system uses a patterned artifact mounted on the
wafer stage to enable the automated determination of alignment and
focus baseline settings. In the alignment case, the alignment system
used on product is applied to a set of "ideal" marks on the
artifact to zero the baseline offsets and intrafield parameters. As
noted above, this becomes an integral part of the AXIOM alignment
system because of the need for frequent reticle-to-wafer baseline
updates. While the need is less frequent for the
direct-reticle-referenced TTL system, Autocal is critical to
reticle alignment and overall system maintenance. For focus setup, the
Autocal system provides a reference between the absolute focal plane of
the artifact, determined by sensing the peak actinic image acuity, and
the plane sensed by the capacitive gauges used to maintain
lens-to-wafer distance on product.
In both alignment and focus cases, the Autocal system is an in
situ means of establishing product-independent baseline settings.
Thus, it enables the rapid diagnosis of tool changes or instability. A
specific focus example is shown in Figure
10, in which both the optically determined focus
position and tilt-x and the electrically (capacitive gauge)
determined tilt-x,y are plotted from one of our tools.
The observed changes to the focus are directly traceable to tool
changes: A subtle deviation of 0.5 µm occurred in the case where
exhaust work was performed, and a dramatic shift of 1.5 µm when the
laser stage was upgraded. The Autocal system enabled us to continue to
run product with minimal interruption. For the most part, Autocal has
eliminated the need to run regular monitor wafers.
Figure 10
On the other hand, product control at ASTC relies on
bidirectional SECS-II interfaces [23]
to both exposure and metrology
tools for upload/download of tool settings, measurement, and logistics
data. Exposure-tool and process-specific models are employed to predict
dose, and alignment settings for each lot in queue at the exposure
tool, on the basis of prior lot or send-ahead wafer metrology data. The
alignment and exposure of each lot are then executed under host
control, following the automated download of the appropriate settings.
Thus, the control of photoclusters while running product requires
communication among (a) databases containing the logistics information,
(b) the various metrology and exposure tools, and (c) an analysis and
database system that stores and transforms measurement data into forms
relevant to tool correction and lot dispositioning. The resultant ASTC
lithography data system is diagrammed in Figure
11 and described in the sections that follow.
Figure 11
The physical flow of lots through the photocluster and metrology
sectors is shown in the shaded center of Figure 11.
Where send-ahead
wafers are required, they are split off as single-wafer lots. After
processing through the photocluster, the patterned wafers proceed to
metrology, where the lot is sampled (a subset of exposure fields on a
subset of wafers) for critical dimension and overlay performance
relative to predetermined specifications. If the analyzed results are
within specification, the lot is shipped to the next sector. If
specifications are not met, the lot is reworked.
The surrounding flow of information among various systems and databases
is designed to sustain and control the movement of lots, such that the
quality of the output is ensured and the rework rate, use of send-ahead
wafers, and overall transit time of the lots are minimized. In the
photocluster, logistics data, from an in-house IBM software system
called Floorworks, are merged with parametric data fed back from the
metrology sector, via an IBM data analysis system called ASTEC. The
complementary logistics and parametric data for a given lot come
together at the stepper's Tool Automation Program (TAP), an IBM
tool-specific communication and supervisory control program, which
downloads the settings to the tool and initiates the processing of the
lot.
At the start of tool processing, Floorworks transmits the lot, mask,
and process data to the tool's controller, and logs the lot onto the
tool. In its final step, Floorworks initiates the TAP communications
channel between the tool's hardware controller and Floorworks. The TAP
handles requests for data and/or function from either the tool or
Floorworks, and transmits the sequence of messages to the tool that
achieves the desired result. In the case of the lithography sector, it
also provides an interface between the data analysis system and the
tool. For the metrology tools, it enables measurement data transfer to
ASTEC for lot and/or wafer dispositioning (a pass/fail decision on
lithographic quality). Part of the dispositioning process is to
determine settings for future lots and wafers. The TAP then enables the
feedback of overlay- and critical-dimension-performance-related tool
settings on a per-lot basis.
ASTEC was built at ASTC within the environment of a commercially
available statistical package (SAS®). ASTEC contains our modeling
and lot-dispositioning algorithms, specification tables, and a summary
database. In addition, it provides a rich environment for data review
and engineering diagnostics. ASTEC resides both in the LAN OS/2®
environment, for real-time interaction with lots as they move through
the lithography sector, and in the mainframe MVS environment, for
archival data analysis. This commonality across different platforms
provides advantages in the speed with which problems can be resolved
and data analysis improvements can be implemented on the production
floor.
Lot and wafer dispositioning is the principal real-time function of
ASTEC in the lithography sector. A flowchart of the dispositioning
process is shown in Figure 12. After the
CD and/or overlay data are imported via the metrology tool TAPs, the
system selects specifications and process models specific to the
product, level, and operation. The data analysis then computes the
appropriate statistics and modeled parameters, which are compared to
the specifications. Analysis reports are prepared by sampled wafer and
by lot, photocluster-specific SPC charts are updated to establish
whether or not the lithography process at that photocluster is in
control, and the dispositioning actions pertinent to the particular lot
are recommended to the operator.
Figure 12
Regardless of the ship/rework decision, an additional function is
performed by ASTEC at the time of dispositioning, as shown on the left
side of Figure 12. Via the stepper TAP, ASTEC
obtains the stepper dose
and the alignment settings that were used at the time the lot was
exposed. The models applied to the data during dispositioning determine
the dose and alignment corrections that must be applied to minimize CD
deviation from target and overlay error. Summing the dispositioning
corrections with the stepper settings gives the so-called corrected
settings that should have been used. In the case where a send-ahead
wafer is used, or a lot is reworked, these corrected settings become
the settings for exposing the lot. In the case where send-ahead wafers
are not used, the trends apparent in the corrected settings become the
basis for forecasting the tool settings for the next lot at that
product and level. In this manner, ASTEC is able to feed back tool
settings for lots that are at the tool or feed forward settings for
lots to be processed by the tool in the future.
Process integration
The overall process has a profound impact on lithography
requirements. For a given product, the lithographic process
window must be consistent with the variations introduced in the rest of
the process. For example, a key factor in extending the life of current
photolithography generations has been the improvement of overall
process planarity through the use of chemical-mechanical polish (CMP),
relieving pressure on the shrinking DOF apparent in Figure 2.
Establishment of consistency throughout a product routing is the role
of process integration. From the lithography viewpoint, process
integration bridges lithography to upstream and downstream sectors in a
product routing. While in-line measurements usually ensure
self-consistent optimization within the lithography sector as described
in the previous section, optimization between sectors cannot always be
ensured.
One example of the unforeseen effect of an upstream process on
lithography over an 11-month period is shown in Figure
13. A change to an anneal process caused an
observable increase in the uncorrectable or "residual" overlay
errors at subsequent lithography steps. A significant decrease in
overlay capability resulted, even though lithography continued to
optimize overlay by all available means. Ultimately, the cause of the
problem was traced to wafer deformation introduced by the anneal step
[24]. Owing to the terrain-following
focus system of the Micrascan,
the effect on imagery performance was not readily apparent. The overlay
residuals, however, proved a sensitive indicator of local wafer
planarity. Using them as a diagnostic tool, the anneal process was then
readjusted to meet both the product and lithography requirements.
Figure 13
Similarly, lithography process deviations can cause problems in
downstream process sectors. Subtle variations in the resist profile
shown in Figure 3, below the detection
limit of in-line metrology, may
be amplified by subsequent etch processes
[25]. Figure
14 illustrates the effect of resist poisoning on
two critical levels. In both cases in-line SPC charts indicated that
the resist image sizes were in control, while there was an overall
reduction in the process capability of the final etched image size. An
unbalanced ANOVA (analysis of variation, where "unbalanced"
refers to the fact that unequal numbers of lots were processed through
the various tools in question) indicated that the lithography tools
were a significant contributor to the overall variability in the
process. It was found that a low level of alkaline material had been
introduced during maintenance activity on one of the steppers,
resulting in a slow poisoning of the resist over time.
Figure 14(a) shows the relationship between the gate length of a
typical product measured in resist and measured after etch over a
12-month period. During the "poisoned" time frame, the etched image
was consistently larger than the resist image. Providing a slightly
different view of the same problem, Figure 14(b)
shows the difference
between the etched image size and the resist image size (bias) for a
deep-trench process. The gradual degradation in profile is reflected in
the downward trend in bias in the "poisoned" time frame. In both
cases, the bias returned to nominal values after tool cleanup and
filter replacement. The resist image size, as measured by top-down SEM,
remained relatively constant throughout.
The above examples illustrate our approach to integration within ASTC.
In both cases, the application of SPC to in-line measurements
automatically flagged an out-of-control condition. Each flag initiated
an investigation to determine and fix the basic cause of the problem.
Capability
Process capability is the performance of key process parameters
against specifications established to ensure product quality, as
embodied in the capability indices
C ,
C
[26]. The capability index
C
is defined as the ratio of a wafer lot's
specification width, |USL - LSL|, to the
distribution width, 6
( = standard deviation),
where
USL and LSL are the wafer lot's upper and lower
specification limits, respectively.
C
includes
the effect of the difference between the mean and a design value. For
lithography, the primary parameters are critical dimension and overlay.
Representative control charts and capability indices of integrated lots
outgated from the DUV lithography sector for a 0.25-µm ground-rule
level are shown in Figure 15. The mean
+ 3
variation of critical dimension is less than 45 nm, and that of
the X-overlay is less than 60 nm over the 44 lots shown.
Figure 15
This performance is not possible without the dynamic control of tool
settings described in the process control section, as is shown by the
trends of associated tool parameters for the same level over the same
period of time. Among the three photoclusters in use, the forecasted
values of dose, the principal driver of the critical dimension,
and X-offset, the principal driver of the X-overlay, are
presented in Figure 16. In other words,
Figure 16 shows how the tool parameters
were adjusted, on the basis
of lot-dispositioning data, to achieve the capability illustrated in
Figure 15. Significant differences among
the tools as well as variation
over time for a given tool were tracked. Our ability to sustain the
in-control operation shown in Figure 15
demonstrates the efficacy of
our feedback control system.
Figure 16
Given the diverse product mix at ASTC, key indicators of this form of
lithography capability (and detractors from productivity) are rework
and send-ahead wafers. When product variation exceeds lithography
capability, send-ahead wafers are required to preclude rework.
Our objective has been to keep rework at an acceptably low level while
reducing the need for send-ahead wafers. Our ability
to improve is apparent in Figure
17. However, the complete elimination of
send-ahead wafers is unlikely in a pilotline environment. Their use
is economically justified when lot-to-lot variation in tool
settings dominates. In these situations, the cost of added rework
exceeds the cost of running send-ahead wafers to preclude rework.
Figure 17
In running without send-ahead wafers, the data from prior lots must
predict the alignment and exposure conditions for future lots. The
automated forecast algorithm employed in ASTEC is based on a
vintage-weighted average of the last n lots. The analysis is
conducted, and the posted settings for a particular product level are
updated as each lot is dispositioned. The success of forecasting is
dependent on the stability of the tools, the maturity of the process,
the number of lots processed and continuity of work for a given
product, and, to a lesser degree, the specifications that must be
achieved. Adiabatic trends in tool settings can be tracked
successfully, whereas lot-to-lot fluctuations cannot. The magnitude of
lot-to-lot fluctuations relative to the specifications will determine
the level of rework incurred.
Productivity
DUV productivity must be viewed in the context of the ASTC pilot
production mission and the leading-edge capability of DUV
lithography. Only a small fraction of the ASTC capacity is devoted to
what could be called manufacturing--namely, integrated lots free of
process splits. Most of the ASTC capacity is consumed by development
lots: approximately half integrated (a complete process routing through
to electrical test) and half engineering experiments (deviated
routings, short loop test runs, etc.). Since nearly all of the
so-called critical levels, those having the most advanced ground rules,
are patterned by DUV, the DUV lithography sector sees the preponderance
of process splits, early development lots, deviated routings, and
engineering experiments.
For an ASTC process sector, such as DUV lithography, productivity
encompasses a set of specific measurements. Most notable among these
are number of lots processed successfully (lot outgates) per
photocluster, which we want to maximize, and time to process a lot
(cycle time or turnaround time) through the sector, which we want
to minimize. The two must be considered simultaneously to obtain a
true picture of productivity. For a fixed photocluster throughput,
outgates are maximized by ensuring sufficient lots in the queue that
the tools are never idle; however, by definition, queued lots cannot
achieve the minimum cycle time. Thus, as the throughput is increased,
productivity improvement requires a balance between effective
throughput and the amount of product at the tool. Our progress in
productivity improvement over an approximately three-year period is
summarized in Figure 18, where we show the
lot outgates and turnaround time (TAT) trends, normalized to their
values in the first month. Over the period shown, lot outgates have
increased by 3.5 times as TAT has decreased by 2.5 times. The
fluctuations in the two curves, most evident in TAT, signify instances
of imbalance between capacity and loading in the DUV lithography
sector. That balance is especially difficult to maintain in a pilot
line environment. Nonetheless, the overriding trends are encouraging,
particularly when the coincident decrease of ground rules shown in
Figure 1 is considered.
Figure 18
The productivity gains of Figure 18 took
place during a period of
relative stability in DUV lithography. Over that two-year period, the
photoclusters at ASTC had a common configuration and similar
capability. Productivity improvement could become the focus of
engineering activity once the prerequisite phases of process
development, control, and integration had coalesced into a capable
DUV process.
Summary
We have described the diverse activities of a lithography
operation in a pilot line production environment. The overview and
examples reveal the underlying structure of the ASTC DUV
lithography sector. At present, step-and-scan 248-nm exposure tools and
chemically amplified resists are the basic constituents of our
process. Attention to integration and process control has enabled
us to increase the productivity of our 248-nm process, even as ground
rules shrink. Projections of chip density indicate that DUV
photolithography at ASTC must migrate from 248-nm to 193-nm exposure
wavelengths in the near future. Thus, our efforts to improve DUV
lithography remain critical to our continued success in developing
semiconductor products.
Acknowledgment
The broad scope of work described in this paper, undertaken over a
period of more than five years, has had many participants. We
especially wish to recognize our current and former colleagues in the
ASTC and SRDC lithography projects. While we have attempted to present
the ASTC lithography perspective, it is often impossible to separate
lithography activities from those of related process sectors, systems
support, and product integration teams--all of whom have been partners
in the progress demonstrated. Furthermore, many valued colleagues have
been employees of Siemens and Toshiba during this period. We
acknowledge our indebtedness to them for their contributions.
Micrascan is a registered trademark of SVG Lithography Systems,
Inc.
SAS is a registered trademark of the SAS Institute.
OS/2 is a registered trademark of International Business Machines
Corporation.
References
Received February 9, 1996; accepted for publication November 9,
1996
|