0018-8646/99/$5.00 (C) 1999 IBM Sputter deposition for semiconductor manufacturing S. M. Rossnagel Sputter deposition, also known as physical vapor deposition, or PVD, is a widely used technique for depositing thin metal layers on semiconductor wafers. These layers are used as diffusion barriers, adhesion or seed layers, primary conductors, antireflection coatings, and etch stops. With the progression toward finer topographical dimensions on wafers and increasing aspect ratios, the broad angular distribution of depositing, sputtered atoms leads to poor or nonexistent coverage in deep features. This has been partially addressed using directional sputtering techniques such as collimated sputtering or long-throw sputtering. More recently, work originating in IBM has moved toward the deposition of films from metal-rich plasmas fed by sputtering, a technique known as I-PVD (for ionized PVD). This technique, based on fairly minor modifications of existing PVD systems, solves many of the intrinsic problems of PVD and appears headed for widespread manufacturing applications. Introduction Sputter deposition is one of the most widely used techniques for the fabrication of thin-film structures on semiconductor wafers. It is used primarily for the deposition of metal thin films used to form vias and lines, as well as the various related thin films which function as diffusion barriers, adhesion or orientation layers, or seed layers. Sputter deposition is usually carried out in diode plasma systems known as magnetrons, in which the cathode is sputtered by ion bombardment and emits the atoms, which are then deposited on the wafer in the form of a thin film. Depending on the lithography scheme, these films are then etched by means of reactive ion etching (RIE) or polished using chemical-mechanical polishing (CMP) to help delineate circuit features. This paper explores the plasma technology relevant to sputter deposition as applied to semiconductor technology, the sputtering process itself, and then semiconductor applications. In this latter area, recent (less than ten years) developments in sputtering, such as collimated sputtering, reflow, and ionized deposition, are examined. Some of these are in wide use in manufacturing; others, just becoming available, are expected to constitute the sputter-deposition processes of the future. Plasma technology The most widely used technology for sputter deposition is based on the magnetron cathode. Originally, physical sputter deposition utilized dc diodes, which were simply parallel plates powered by a power supply of several kilovolts in a working pressure of several tens to several hundreds of mTorr. The negative plate, also known as the cathode, was bombarded by ions from the plasma set up between these two plates, and cathode atoms were dislodged from the metal surface. These atoms could then deposit on other surfaces inside the vacuum system, forming films. The dc diodes were characterized by slow deposition rates, high voltages, and low currents, and hence are no longer of interest. They were also inadequate for the deposition of dielectric films because of charging, arcing, and very low deposition rates. The second evolution of sputtering technology was to replace the dc power supply with an rf supply, generally operating at a frequency of 13.56 MHz. This change eliminated the charging and arcing problems with dielectrics and also led to slightly higher deposition rates. The oscillating rf potential applied to anode and cathode resulted in a modification to the electron motions, which produced better energy coupling to the electrons as well as higher plasma densities [1]. Because of the high electron mobility in a plasma (rf or dc), an rf diode system tends to develop large electron currents during the positive portion of the applied rf cycle. Usually a large capacitor (500-2000 pF) is placed in series between the rf power supply and the powered electrode, or electrodes if the anode is not grounded. The large series capacitor allows a significant negative bias to develop on the cathode, typically half of the value of the applied peak-to-peak rf voltage. This bias is then the acceleration voltage for ions from the plasma, which move much too slowly to respond to the applied rf potentials. In addition to the series capacitor, it is common to use two other tuning components to help match the impedance of the plasma to the output impedance of the rf power supply. These components, usually a shunt capacitor to ground and a series 3-4-turn inductor, are located along with the series capacitor in the "matchbox," which is physically located adjacent to the cathode position. The inductor is fixed, and both of the capacitors (shunt and series) are variable. A control circuit within the matchbox controller senses the reflected power (from the matchbox and plasma back to the power supply) and adjusts the variable capacitors to minimize the reflected power. Usually this is done automatically by means of reversible motor drives on the capacitors, but occasionally laboratory-based systems will have manual controls for the tuning network. In the 1980s, rf diode sputter-deposition systems were widely used for the deposition of silicon dioxide dielectric films. By splitting some of the applied rf power between the cathode (typically fabricated from silicon dioxide) and the sample pedestal (where the wafer or wafers were mounted), it was possible to deposit films with a moderate degree of ion bombardment of the growing films. In effect, the samples functioned as partial cathodes in the circuit and were bombarded by ions from the plasma, albeit at a lower rate than the primary cathode. This is currently known as bias sputtering and can be useful in helping to densify as well as planarize the depositing film. Planarization results in a flatter film with smoother coverage over steps and gaps in the underlying substrate. The principal type of system currently used for high-rate deposition of metals, alloys, and compounds is known as the magnetron cathode system. This type of tool uses magnetic confinement of electrons in the plasma, which results in a higher plasma density than in either the rf or dc diode systems. The higher plasma density reduces the discharge impedance and results in a much higher-current, lower-voltage discharge. As a rough example, an rf diode tool operating at 2 kW might have a peak-to-peak rf voltage of over 2000 V. A conventional magnetron system operated at 2 kW might have a dc discharge voltage of 400 V and an ion current of 5 A to the cathode. The electron confinement on a magnetron is due to the presence of orthogonal E and B fields at the cathode surface. This results in a classic E x B drift for electrons (the Hall effect), which gives rise to a sequence of cycloidal hopping steps parallel to the cathode face (Figure 1). As a result, the secondary electrons which are emitted from the cathode because of ion bombardment are confined to the near vicinity of the cathode. In a magnetron, the electric field is always oriented normal to the surface of the cathode. The transverse magnetic field is configured so that the E x B drift paths form closed loops, in which the trapped, drifting electrons are constrained to circulate many times around the cathode face. The most common design for a magnetron cathode is the circular planar cathode, in which the cathode is simply a flat, circular disk, and the E x B drift currents form circles centered on the disk axis, around the face of the cathode (Figure 2). The magnitude of the total drift current can be measured by means of its induced magnetic field, and results (Figure 3) indicate a ratio of about 3-7 for the circulating current compared to the discharge, or net current [2]. On average, this ratio is a measure of the trapping and the number of times a secondary electron traverses the E x B drift loop prior to leaving the discharge and arriving at the anode. This ratio, though, is accurate only for the specific size of cathode used in that study (150-mm diameter), and more accurately, perhaps, indicates the length of the drift path for the electrons. This allows the data to be scaled to larger cathodes or ones with differing E x B paths. The magnetron effect is generic: The requirement is that the E x B drift paths be closed. Several geometric perturbations have been developed over the years for specific applications, and examples are shown in Figure 4. The intrinsic deposition uniformity of a magnetron-type cathode is not good, though, and this has implications for semiconductor processing. The conventional, circular planar cathode is characterized by high levels of erosion under the E x B drift path (also known as the etch path), which is in the form of a ring. At short cathode-to-sample, or throw, distances, this ring source shows up as a ring-shaped deposition on a planar sample. At moderate throw distances (of the order of the ring diameter), the profile is flatter, since the center region fills in somewhat; at large throw distances, the source functions much as a point source. Unfortunately, there is no throw distance at which the deposition uniformity approaches that required for semiconductor processing, which is of the order of 1-2% (1[sigma]). In many magnetron deposition configurations, it is acceptable to physically move the sample during deposition to effectively average out the nonuniform deposition flux in order to create a uniform film. In semiconductor processing, however, this is not an acceptable solution, since it introduces additional motion, feedthroughs, complexity, and (most significantly) increased particulate formation to the deposition chamber. An acceptable solution, though, has been to physically translate the E x B drift path across the surface of the cathode to average out the intrinsic nonuniformity. This is achieved by configuring the magnets behind the cathode (which form the B field at the cathode surface) to move by means of a motor drive (Figure 5). Since the magnets are outside the vacuum system, their physical motion does not introduce additional particulate formation inside the chamber. In addition, the more uniform erosion of the cathode surface results in better utilization of the (high-purity) cathode materials, increasing the time before required maintenance. The rotating-magnet, heart-shaped E x B drift-path magnetron is the primary type of cathode currently used in semiconductor processing. Small variations in the magnet configuration have also been developed to tailor the deposition uniformity under various deposition conditions (such as changes in throw distance or pressure) or with different cathode materials. As such, a class of these magnetrons might have 3-10 different magnet configurations, which are changed depending on the application and tool configuration. There are also variations of this general design which allow the E x B drift path to spill slightly over the edge of the cathode. This reduces the intrinsic discharge efficiency and may result in modest increases in the operating voltage. However, the design reduces the presence of areas near the cathode edge which might not be heavily eroded. In a magnetron system, the net erosion and deposition rates can easily exceed the several-micron-per-minute range. While the deposition is mostly forward, there is a small component which can deposit back onto the cathode surface because of in-flight gas scattering, reflection of the depositing atoms at other surfaces, less-than-unity sticking of the depositing atoms, or even resputtering from other surfaces in the chamber [4]. Redeposition onto the cathode is generally ignored because the erosion rate is usually so large. However, in the edge regions (and also at the very center in some designs), the deposition rate can exceed the erosion rate, resulting in film buildup and eventual flaking or nodule formation. Sweeping the E x B drift paths off the edge of the cathode results in the elimination of the edge regions as a possible contamination source. However, it must be done carefully so as to limit the possibility of sputtering of (and sputter deposition from) the side areas of the cathode. Magnetron sputter-deposition systems have evolved into a near-UHV environment to reduce the influence of residual gas atoms on the structure of the thin, deposited films. While it may seem surprising that there are contamination concerns with films deposited at 1 [muon]m/min, the impurity levels even in a moderate vacuum system can be significant. A deposition rate of 1 [muon]m/min of Al corresponds to an atomic deposition rate of about fifty monolayers of Al per second. In a base pressure of 10**-6 Torr, the arrival rate of background gas atoms is 1 monolayer/s, and if all of these atoms were incorporated into the film, it would constitute a 2% impurity. These numbers scale down linearly with base pressure, so that at a base pressure of 10**-9 Torr, the impurity flux can still be two parts in 10**5. When this is compared to the purity of a typical cathode (99.999%), the effect of the vacuum system can become very significant. Current manufacturing-scale magnetron systems are constructed from stainless steel with a minimum of o-rings. They are typically configured with cryopumps connected directly to their deposition chamber by means of large-diameter valves, and the resultant base pressure is generally in the low 10**-8-Torr range for most cathodes, and in the 10**-9-Torr range for Ti, for which the chemically active nature of the deposited films can contribute appreciably to the net pumping speed of the system. The working pressure during sputtering is typically 0.5 to 5 mTorr, which requires a gas flow of many tens of standard cubic centimeters per minute (sccm). Because of base-pressure considerations, manufacturing-level systems are not baffled and therefore retain approximately the true base pressure of the chamber during deposition. Their large gas flows, however, can lead to the need for frequent regeneration of their cryopumps, which is usually automated and accomplished during routine maintenance. The throw distance for commercial sputter-deposition systems varies depending on the material to be deposited, the application, and the supplier. Since in most tools the substrate position is movable vertically for the purpose of picking up or clamping the wafer, the throw distance is usually a controllable variable, and varies from 3 cm at a minimum to 10 cm. In a variant on sputtering to be discussed below, known as long-throw deposition, the throw distance is increased to 25-30 cm. The magnetron chambers used for large-scale semiconductor applications are configured as ports on an integrated-process, load-locked tool, and wafers are introduced to the deposition chamber via a load lock (Figure 6). In general, this type of system has its worst vacuum in the load-lock region, and each successive chamber from the load lock has a better (i.e., lower) base pressure. Such systems are also configured with degassing stations for the wafers, which are necessary to remove water vapor that the wafers absorb from air exposure prior to deposition. Generally, the degassing temperature is 50-100[degree]C higher than the deposition temperature. Commercial magnetron cathodes are almost always configured for dc rather than rf power. The cathode diameter is roughly 50% larger than the wafer diameter, with diameters of about 30 cm (12 in.) for 200-mm wafers, and 45 cm (18 in.) for 300-mm wafers. There is no intrinsic limit to the cathode dimensions other than that due to structural considerations, and cathodes as long as several meters in length have been used. However, for semiconductor applications, the dimensions approaching the 300-mm generation are somewhat problematic, because the cathode plate can be somewhat deflected by the large span and the pressure of the back-side cooling water. The cathodes are water-cooled at a rate of many gallons per minute, and are typically rated at powers of 20-30 kW. There is no fundamental plasma limit to this power rating: It is limited primarily by the ability to cool the cathode by means of flowing water. The power supplies are switching supplies, typically ganged together in 10-kW increments. The supplies have fairly sophisticated arc-detection and -suppression circuitry to reduce unipolar arc formation, which can result in spitting of microscopic droplets onto the wafer substrate. The cathode materials currently used are generally Al(0.5% Cu), Ti, Ta, and Cu. The purity is generally 99.99% for the Ti and Ta cathodes, and 99.999% or higher for the Al(Cu) and Cu cathodes. The high-purity disks, typically about 1 cm thick, are diffusion-bonded to Al or Cu backing plates which form the vacuum seal as well as the cavity for both the magnets and the water cooling. The cathode plate must be insulated from the grounded vacuum system, and this is achieved by means of a 1-cm-thick ceramic ring and o-rings. These large-diameter o-rings (typically 14 in. diameter x 1/8 in. thick) limit the base vacuum pressure of the deposition chamber to the 10**-8-Torr range. Sputtering background Physical sputtering has been known for more than a hundred years and has been in common usage for many decades. Several good reviews of the topic are available [5, 6], and it is covered only briefly here. Physical sputtering is a relatively violent, atomic-scale process in which an energetic particle strikes a solid, resulting in the emission of one or more substrate atoms from the solid. The dynamics of the collision process depend strongly on the incident energy and mass of the bombarding particle. At relatively low energies, the incident particles do not have adequate energy to break atomic bonds of the surface atoms, and the bombardment process could result in simply desorbing a few lightly bound gas atoms, perhaps inducing a chemical reaction at the sample surface, or nothing at all. At relatively high incident energies, the bombarding particles travel deeply into the bulk of the substrate and may cause deep-level disruptions in the physical structure, but few if any surface atoms are released. At the moderate energies, typically in the range from several hundred eV through several keV, the incident particle can cause substantial numbers of near-surface broken bonds, atomic dislocations, and ejection or sputtering of atoms. The parameter most used to characterize sputtering is the yield S, which is simply the ratio of the number of emitted particles to the number of incident ones. The yield is an average number, including not only the emission rates from a number of different crystalline orientations, but, on the atomic scale, a number of different impact points for the incident particles. In this moderate-energy range of interest, the yields for Ar+ bombardment of most materials range from 0.1 to about 5, with the majority of materials in the 0.5-2 range. The yields are energy-dependent in a roughly linear fashion (Figure 7), such that an increase in the incident ion energy of 2x results in an increase in the yield of slightly less than 2x. It is this near-linear dependence of the yield that makes sputter deposition roughly power-dependent. Since the rate of sputtering (and sputter deposition) scales directly with the bombarding flux (i.e, current), the near-linearity of the sputtering yield with voltage leads to the essentially linear relation between the deposition rate and the discharge power. The emission profile for the sputtered atoms is characterized by a cosine distribution for most materials. This means that the emission rate at some angle other than normal (perpendicular) is equal to the normal incidence emission rate times the cosine of the angle from the normal. This is usually drawn as a circle touching the impact point, in which the circle is the envelope of the magnitudes of the emission at other angles. Various departures from the cosine distribution are seen, depending on the ion energy and the sample structure. As a function of ion energy, the cosine distribution is seen to be flattened at lower energies ("under-cosine"), in which there is more emission at lower angles than at the surface normal, and "over-cosine," or more peaked in the normal direction at higher energies (Figure 8). There are also unusual cases of directed emission for specific crystalline orientations of the target. The angular emission profiles for a number of orientations are shown in Figure 9. The cosinelike angular emission profile of the sputtered atoms, coupled with the extended area of the magnetron cathode and perhaps some in-flight gas scattering, implies that the depositing flux of atoms has a very broad, almost isotropic nature. This can be extremely useful in depositing films on unusual, nonplanar surfaces, over steps or ledges, and on the sides of features. As a result, sputter-deposited films are smoother than the underlying surface. However, if the surface is composed of high-aspect-ratio (aspect ratio = depth/width) holes or trenches, it is unlikely that the deposition will cover or fill the structures entirely. This latter aspect forms the basis for much of the rest of this paper. Sputter-deposition rates can, in principle, be obtained from sputtering yields and some consideration of tool geometry and gas scattering. This is almost never done, since the details of tool configuration, shielding, shutters, etc. are very tool-dependent, and the experimental measurements are easy to perform. Sputtering rates are often described in terms of the incident power, using o units of A/s/kW. A typical number for Al deposition would be about 1.0, and for Ti about 0.2. Perturbations of the system configuration, such as collimation or long throw (discussed below), can reduce these rates significantly. To first order, though, the deposition rate can be predicted at moderate powers from this sort of power-rate indicator, and the rates should also scale roughly with the sputter yields described above. Sputter deposition is managed by deposition time, and there are rarely any in situ diagnostics of deposition rate used in manufacturing-scale tools. The rate is calibrated against time, and then films are deposited for a fixed time period. It should be noted that the linearity of deposition rate with time or power fails at very short deposition times (very thin films) or very high levels of discharge power. Patterning techniques for semiconductor processing There are two families of circuit-patterning techniques currently used in semiconductor manufacturing. The choice of which is used is determined by deposition capabilities, required feature dimensions and aspect ratio, and material requirements. The first of these two classes is based on the deposition of planar, metallic films, which are then patterned using a light-sensitive photoresist and etched using reactive ion etching (RIE) to form circuit elements. These features are then encapsulated with a dielectric (for example, silicon dioxide), which is subsequently polished flat. Vias can then be opened into the dielectric, filled with W by means of chemical vapor deposition (CVD), and polished flat prior to the deposition of the next, planar metal film (Figure 10). This technique is known by several designations, such as RIE-metal patterning, or cloisonne patterning. Because of the need to reactively etch the deposited metal, this scheme is limited to Al-based conductor systems because of the relatively low vapor pressure of the resultant CuCl product molecules from the RIE process. Metal deposition for the RIE-metal process is virtually always done with sputter deposition from magnetron sources. The broad angular distribution of the sputtered atoms leads to good, continuous coverage over small bumps and ledges on the surface, densities of the deposited films are close to bulk levels, and with a small degree of annealing, the films display near-bulk electrical conductivity. The second class of patterning techniques currently used is known as damascene processing, after its similarity to ancient jewelry-inlay processes [11]. In damascene processing (Figure 11), a thick, planar dielectric layer is deposited. Then, using photolithography and RIE, holes and trenches are etched into the dielectric. The next step is to fill the cavities by means of a metal-deposition process, after which the excess metal is polished off, back to the original dielectric surface. This results in an embedded line or via in a planar surface. Another layer of dielectric could then be deposited to form the basis of the next metallization layer. In many ways, RIE-metal and damascene processes are simply the opposite of each other. In the first, the metal is patterned and the dielectric is used to fill in the gaps; in the second the dielectric is patterned and the metal fills in the features. However, there are practical issues which come into play in the applications of these techniques, such as the previously mentioned difficulty in reactively etching Cu films. From the point of view of topography, the RIE-metal processing techniques, which leave behind a freestanding metal feature prior to dielectric encapsulation, are limited to features with modest aspect ratios, perhaps of the order of 1:1, whereas the damascene techniques allow the use of much deeper features. However, from the point of view of metal deposition, there are significant difficulties in the development of viable metal-deposition processes for high-aspect-ratio damascene features. Sputtering, which is routinely used for RIE-metal depositions, fails at filling features with aspect ratios greater than about 0.5 because of the wide angular spread of the depositing atoms. The feature that makes PVD attractive for planar films inhibits its application to damascene processing. An additional feature for both RIE-metal and damascene processing is the need for film layers in addition to the principal layers. Examples: Diffusion barriers are often required on the sidewalls or bottoms of vias or interconnections to prevent the interaction between two materials; a barrier of some kind is needed at the interface between a W stud and an Al metal line to reduce spiking or void formation at the interface; and a continuous, hermetic film is needed on the sidewalls of a silicon dioxide via during CVD deposition of W to prevent the reaction between the oxide and the WF6 working gas. Other thin layers may be required to function as adhesion layers, seed layers for electroplating, or orientation layers to facilitate a desired orientation in a subsequently deposited conducting layer. In each case, the general requirement for these layers is that they be thin and yet conformal within deep features. These requirements tend to preclude conventional sputter deposition as a viable technique because of the wide divergence of the depositing atoms. As a result, significant effort has been expended in the exploration of CVD-based deposition methods, which tend to be more conformal in deep features because of the relatively low sticking or incorporation coefficients of the reactants. Sputter deposition, however, can still be a viable, less expensive solution to many of these interconnection applications with subtle modifications to the sputtering, transport, or surface processes. Since sputtering is a widely used manufacturing-scale technology, it is desirable to extend it further into future generations. Directional sputter deposition Sputter deposition is, on a macroscopic scale, a nearly isotropic deposition process when used at short-throw distances with a wide-area cathode. However, on an atomic scale, the sputtered atoms tend to travel in straight lines without in-flight collisions from the cathode to the sample at the pressures most commonly used. Since the sputtered atoms are virtually all neutrals, it is not possible to redirect their trajectories in flight. However, two techniques have evolved which can filter the angular distribution of the sputtered atoms, resulting in a more normal-incidence deposition process. The laterally moving sputtered atoms contribute most to the pinching-off deposition effect in high-aspect-ratio features. If these atoms are filtered, the more vertically oriented (i.e., vertical to the sample plane) sputtered atoms can travel more readily down into high-aspect-ratio features. The two filtering techniques are known as long-throw deposition and collimated sputter deposition. o Long-throw deposition Long-throw deposition is simply an increase in the cathode-to-sample or throw distance over conventional systems. It was first used to mimic evaporation for the deposition into lift-off mask features [12]. In a long-throw system, the throw distance is increased from perhaps 5 cm to 25-30 cm (for 200-mm wafers), which is roughly equivalent to the cathode diameter. A requirement of long- throw deposition is that the operating gas pressure be low enough that in-flight gas scattering is minimal. At 25 cm, this means a working pressure in the low 0.1-mTorr range, which is just about the limit for conventional magnetron operation. In the original work, hollow-cathode electron sources were used to allow even lower-pressure operation, but hollow-cathode enhancement is generally impractical for manufacturing-scale operation using rotating-magnet magnetrons. With a throw distance of 25 cm, and assuming a purely ballistic, unscattered deposition, the angular divergence of the arriving atoms is limited to about +-45[degree], or an effective aspect ratio of about 1:1. Atoms which travel at higher angles are unable to reach the sample, and deposit on the chamber walls. This reduces the net deposition rate by 70% or so and by implication increases the frequency at which the tooling or shielding inside the chamber must be cleaned to reduce flaking and particulate formation. Long-throw deposition can be used to fill low-aspect-ratio features (1:1) or to form reasonably conformal liners at aspect ratios of about 2:1. There is also the possibility of using a significantly increased sample temperature to facilitate enough surface mobility to fill higher-aspect-ratio features. High-temperature processes are discussed below. Long-throw deposition has an intrinsic directional flaw as well as a difficulty in scaling to larger wafer sizes [13, 14]. The directional flaw is based on the asymmetry of deposition near the wafer edge. At the centerline of the system, deposition onto the wafer arrives uniformly from the entire cathode in a symmetric manner. However, at the wafer edge, there is a greater arrival flux from the center regions of the cathode than from the outer edge. This results in greater deposition on the inner-facing sidewalls of a trench or via at the wafer edge, compared to the outer-facing sidewall (Figure 12). The situation can also be modeled numerically [15]. The result is that at the wafer edge (i.e., the outermost chip), the asymmetry can be as high as 2x at the top of modest-aspect-ratio features, and greater than 5x for the bottom corner. The scaling difficulty is based on the inability to operate a magnetron discharge at pressures below the 0.1-mTorr range. For the 300-mm wafer generation, the cathode diameter is expected to increase to 45 cm, requiring a 45-cm throw distance to achieve similar directionality. While the assumption of no gas scattering was somewhat questionable at a 25-cm throw distance, it is clearly very questionable at 45 cm. This results in a significant level of in-flight gas scattering and loss of directionality of the depositing atoms. o Collimated sputter deposition Directional filtering similar to long-throw deposition can be obtained by interposing an array of collimating tubes between the magnetron cathode and the sample (Figure 13) [16]. The collimator functions much like the sidewalls of the chamber. It collects atoms which are traveling laterally to the cathode/sample plane. These atoms are deposited on the sidewalls of the collimator cells and hence not deposited onto the sample. Geometrically, the use of a collimator need not require any significant increase in throw distance over conventional sputter deposition, thus removing the very low-pressure constraint of long-throw sputtering. The aspect ratio of the collimator can be readily changed without significantly altering its overall dimensions. For practical manufacturing applications, collimator cell diameters are of the order of 1 cm, and therefore the height of a 2:1 collimator is only 2 cm. The overall throw distance increase is then of the order of 3-4 cm; manufacturing-scale tools outfitted with collimators typically use a spacer which is about 4-5 cm high. Collimators were originally fabricated from solid plates of Al or Cu into which were milled arrays of closely packed round holes. Since the collimator is located quite close to the plasma discharge, it can be subjected to significant heating as a result of bombardment by electrons and sputtered atoms, as well as the heat of condensation released by those sputtered atoms. These early designs allowed water cooling of the collimator from the outer edge, which limited its edge temperature to about 100[degree]C and its mid-region temperature to about 150[degree]C. Eventually, use was made of hexagonal holes to increase its effective transparency. In the past several years, there has been a trend toward the use of thin, lightweight, uncooled collimators constructed from sheet metal. These collimators are assembled from either stainless steel or Ti sheet stock and spot-welded into hexagonal arrays. The collimators drop into uncooled housings with a minimum of fixtures and are held in place by gravity. Unfortunately, during high-power operation, this approach can result in significant collimator heating, as high as 500[degree]C at the center, which in some cases can result in modifications to the films on the substrate. Use is made of Ti collimators for Ti deposition to reduce film stress and peeling during thermal cycling. The filtering of a collimator is directly related to its aspect ratio; Figure 14 shows schematically the reduction in angular divergence of the flux as a function of aspect ratio. The deposition rate, though, is also significantly reduced, and Figure 15 shows the pressure dependence of the deposition rate as a function of collimator aspect ratio. The general result is that the deposition rate is reduced by a factor of about 35% for each unit increase in the collimator aspect ratio. The converse of this strong reduction in deposition rate is a significant deposition rate onto the collimator itself. Eventually this leads to either closure of the collimator holes, or peeling and flaking of thick deposits; neither is desirable from a manufacturing point of view. At a practical level, it is necessary to change the collimator about two to three times as often as the cathode, which can add significantly to overall expense. Collimated sputtering was originally intended to be used for filling trenches and vias, but its slow deposition rates and the increased effective cost due to collimator changes, cleaning, etc. have limited its use to liner or contact layers. The liner application was first described in 1992, when thin TiN layers sputtered through a collimator were found to have adequate conformality to function as diffusion barrier films [17]. Also widely used is the contact-layer o application, where a Ti layer ~200 A thick is deposited on the bottom of a deep contact hole to make a less resistive electrical contact to an underlying layer. Heating and pressurization techniques Since physical sputtering produces neutral atoms which cannot be easily controlled aside from subtractive filtering, an alternate approach to using sputter deposition with high-aspect-ratio features is to increase the mobility of the sputtered atoms once they reach the sample. If the sample temperature is increased to a level of more than half the melting-point temperature of the depositing film, there is significant surface diffusion of the depositing atoms as well as recrystallization of the deposited film [18]. Since these are processes limited by an activation energy, higher temperatures (more than half the melting-point temperature) result in faster processes. Such use of higher substrate temperatures is known generically as "reflow," although it is not completely clear whether the predominant motion on the sample surface is atomic surface diffusion or the actual microscopic flow of surface layers. One of the intrinsic advantages of this approach is that small, high-aspect-ratio features can act as sinks for the mobile material. The bottom of a via or trench has a concave surface which is ideal as a trap for migrating atoms. Therefore, the smallest high-aspect-ratio features fill quite rapidly, which is the opposite of what occurs in a directional process such as collimated sputtering. Reflow techniques are quite strongly dependent on the underlying surface structure and species. Simply sputtering onto a hot surface is more likely to result in the formation of spherical droplets on the wafer surface than filled vias. Also, since surface diffusion generally has a much lower activation energy than bulk diffusion, it is critical to keep the top of a deep via from closing off during sputtering (due to the wide-angle deposition). If this occurs, any additional movement of atoms into the via occurs by means of bulk diffusion, which is a much slower process than surface diffusion. Reflow deposition has been applied mostly to the Al(Cu) interconnection system. For moderate levels of surface mobility, it is necessary to increase the wafer temperature to at least 400[degree]C and as high as 550[degree]C in some cases. Several schemes have been developed for reliable filling. One scheme, known generically as a two-step process, uses two sequential depositions of Al(Cu) [3]. The first is short deposition with a very low wafer temperature, below about 100[degree]C. The function of the film thus deposited is to provide a seed layer which adheres well to the substrate, is fine-grained, and is continuous down into deep features. For modest-aspect-ratio samples (>2:1), this first step often uses collimated sputter deposition. The second part of the deposition is carried out at a wafer temperature of about 500[degree]C, and uses conventional, noncollimated sputtering at a modest rate. If the rate is slow enough, the mobility of the surface atoms is sufficient to move the atoms down into the deep features before the upper portions of these features can close off. There is obviously a tradeoff here between sample temperature and deposition rate: Higher sample temperatures permit use of a faster deposition rate, but at the potential expense of other problems related to the use of the higher temperatures. Another process uses the deposition of an Al seed layer by means of a CVD process [19]. This results in a clean, conformal film which has slightly advantageous microstructural properties compared to the PVD film. The advantage gained by using such a seed layer is seen primarily on the deposits on the sidewalls of trenches and vias, which, when deposited by sputtering, are often columnar and of sub-bulk density. The CVD Al seed layer then allows somewhat faster surface diffusion of the subsequently deposited PVD Al(Cu), thus permitting a reduction in the sample temperature to about 400[degree]C. The PVD Al(Cu) also contributes small levels of Cu to the CVD Al film, which can only be deposited in a pure, undoped manner. This incompatibility with doping precludes the use of CVD Al for primary conductor applications. Another means for improving the filling of high-aspect-ratio features is the use of relatively high pressures after deposition [20]. The technique requires that the sputter-deposited film close off a deep feature, such as a via, forming a void. This occurs naturally, and more readily at high aspect ratios. Next, the wafer is heated to 400[degree]C and moved into a high-pressure chamber, where Ar gas is introduced to a pressure of 600 atm. The force of this pressure on the warm film causes it to move downward into the vias, eliminating the voids formed by sputter deposition. Once the Al metal reaches the bottom of the via, the sample is cooled and the pressure carefully decreased. There are some constraints with a mobility-based deposition process. First, the system purity must be quite high, since the arrival of impurities from the working gas, the cathode, or the sample will strongly limit the magnitude of the surface diffusion, in effect increasing the activation energy. To minimize this, the base pressure of the system must approach the UHV range, and the wafers must be well degassed prior to introduction into the chamber. The second concern, primarily for the thermal reflow techniques, is the issue of the closing off of the vias or trenches due to isotropic deposition at a faster rate than the reflow can rearrange the surface topography. This is primarily a time issue: Sputtering too slowly reduces the wafer-per-hour throughput. A third concern is with the differences in physical dimensions or the density of features on a surface. A small-diameter, high-aspect-ratio feature will fill much more rapidly than a very large, low-aspect-ratio feature, simply because it takes many more atoms to fill the larger cavity. As a result, features in the middle of a dense field of features will fill much more slowly than the ones on the edge of the field, where there is more relative surface area to accumulate the depositing and diffusing atoms. Finally, there is simply the concern over wafer temperature. Raising the wafer temperature to 400[degree]C [for the CVD-seeded Al(Cu)] or to 500[degree]C (for the two-step filling) can lead to significant thermal stress problems in the interconnection structure. The thermal expansion coefficient of silicon dioxide is 20x smaller than that of Al, and this results in the potential for extrusion and other stress-relief problems in multiple-layer interconnection stacks. The high temperatures also preclude the use of polymer-based low-k dielectrics, which tend to have a much lower temperature limitation, typically 100[degree]C or so; such a limitation would eliminate any possibility of the use of mobility-enhancement-based filling techniques. Ionized sputter deposition of metal and compound films: I-PVD An alternative to the filtering of sputtered metal atoms to enhance the net directionality of a metallic deposit is to ionize the majority of the sputtered atoms and form the film from metal ions. If the acceleration potential for the ions is significantly greater than their thermal energy, the ions will arrive at the wafer surface at almost exactly 90[degree] with a controllable energy. If the percent of the metal flux that is ionized is made high, the deposition will be primarily directional and the utilization of the sputtered atoms from the cathode will be high. This last aspect relates to the intrinsic inefficiency of collimated or long-throw sputtering, which results in low deposition rates and requires more frequent tool cleanings. Deposition by means of I-PVD has been practiced in a number of ways for many decades. Originally, evaporative deposition of metals could be enhanced or the resultant films modified by partially ionizing the evaporated flux, by passing it through either an electron beam or a weak plasma. Other plasma-based deposition processes, such as electron cyclotron resonance (ECR) or even magnetron sputter deposition, can have a small ionization fraction for metal atoms sputtered into the discharge. In the late 1980s and early 1990s, systems were specifically fabricated to optimize the relative ionization of metal particles prior to deposition. Early systems used ECR [21, 22] and were based on either sputtering or evaporation. Another primary direction has been the use of dense, inductively coupled rf plasmas in conjunction with a metal-sputtering source [23-25]. This latter technique has proven to be the most robust for semiconductor manufacturing applications. An I-PVD system, based on magnetron sputtering and in-flight ionization of the sputtered atoms by means of a dense, inductively coupled rf plasma, is shown in Figure 16. Use is made of a conventional 200-mm magnetron source (diameter = 300 mm) having a rotating magnet array. A dense, inert gas plasma is configured in the region between the cathode and the sample by means of a 1-3-turn coil, about 20% larger than the wafer and located a few centimeters from both the cathode and the sample. The coil is powered at a frequency of 1.9 or 13.5 MHz and is matched such that each end of the coil is 180[degree] out of phase from the other. The coil may or may not be water-cooled: One commercially available tool uses an uncooled coil, but most other applications use cooling. The coils are powered at 1-3 kW. The operating scheme of this type of tool is as follows. There are two somewhat separate plasmas set up within the chamber, both using the same inert background gas. The magnetron plasma is a conventional dc plasma, located close to the magnetron cathode, such that a significant flux of ions can strike the cathode, causing sputter emission of the metal cathode atoms. A second plasma, this one driven by the rf inductive coil, is configured in the region between the cathode and the sample. As the sputtered atoms enter this plasma, some fraction of them are ionized by electron bombardment as they pass through. The ionization fraction can be high because the ionization potential for the metal species is typically 5-7 eV, whereas the ionization potential for the inert gas species is 15.7 eV (Ar). The relative flux of metal atoms is small compared to the density of the inert gas atoms (typically 1-5%). Once the metal ions are formed, they drift within the plasma, and if they reach the sample sheath, they are accelerated by the difference between the plasma potential (typically +10 V) and the wafer potential (0 to -50 V). Conversely, if the metal ions drift back toward the cathode sheath, they are accelerated by the magnetron voltage (typically -400 V), and are used to sputter more atoms from the cathode. Ideally, the efficiency of utilization of the sputtered metal atoms can be quite high. The relative ionization of the sputtered atoms at the sample location is a function of several parameters. First, it is sensitive to the density of the rf plasma, which is dependent on the rf power as well as the working pressure [27]. In addition, the metal-ionization level is dependent on the amount of time the metal atoms spend in the rf plasma. At low pressures, the sputtered atoms pass through the plasma region rapidly, and the ionization level is low. As the pressure is increased to several tens of mTorr, the sputtered atoms can be slowed by gas collisions, and as a result they spend more time in the discharge and are more likely to be ionized. The optimum pressure depends on system dimensions, but typically is in the 15-30-mTorr range, which is much higher than would be used for conventional sputtering. The relative ionization at the sample has been measured to be as high as 90% at high working pressures, high levels of rf power to the coil, and relatively low metal-sputtering rates. As the metal-sputtering rate is increased, the relative ionization has been observed to fall. Originally this was thought to be primarily a cooling of the electron temperature of the rf plasma used for ionization due to the presence of large numbers of easily ionized metal atoms [27]. Recent work has also suggested a secondary effect caused by the metal flux: the rarefaction of the working gas within the plasma [28]. This was first observed for conventional magnetron sputtering in the late 1980s [29]. In I-PVD, the rarefaction results in fewer in-flight collisions for the sputtered atoms, and hence less time in the ionization region. It is as though at high magnetron power the pressure is reduced, and it is this effect which is most significant in reducing the relative ionization level. Deposition by means of I-PVD has been used in semiconductor applications in three primary areas. The most straightforward application has been the deposition of Ti at the bottom of contact holes. Originally, this was done with collimated sputtering, but I-PVD is more efficient at step coverage, even at high aspect ratios (Figure 17). Commercial suppliers have begun building tools for this application, and it is rapidly supplanting collimated sputtering. Another semiconductor application of I-PVD has been to form diffusion barriers in vias and trenches, again as a replacement for collimated sputtering. At first glance, it would not be expected that a directional process such as I-PVD would be applicable to the conformal films needed for liners and seed layers. However, two aspects of the deposition make it applicable for modest-aspect-ratio features (<5:1 AR). The first is simply that the relative ionization is less than 100%, which means that there is a small nondirectional component to the deposition. This component tends to coat the upper sidewalls of the vias. The second aspect is the ability for the depositing atoms to resputter the already deposited film. If the sample potential is made sufficiently negative, the kinetic energy of the depositing ions will be sufficient to sputter the film and redistribute it within the features. This leads to a thickening of the film in the bottom corners of vias, which is desirable because that location is the most prone to diffusion barrier failure. A schematic of this is shown in Figure 18. A third semiconductor application of I-PVD has been for the filling of features. This application is strongly dependent on the relative ionization of the depositing flux: Higher levels of ionization correlate with the filling of higher-aspect-ratio features. I-PVD filling can easily be applied to either single- or dual-damascene structures (Figure 19). From a practical point of view, ionization levels of about 70% can be measured on large tools. This leads to filling at an aspect ratio of about 2:1 for 0.35-[muon]m-wide features. Higher aspect ratios or smaller feature sizes will require higher levels of ionization. It turns out that at a finer feature size, resputtering of the deposited film is not desirable. This is seen in Figure 20, which is a simulation of the effect of resputtering and redeposition of the sputtered atoms. At a high aspect ratio, or for features less than about 0.5 [muon]m wide, cross-deposition from one side of the trench to the other results in void formation. The simulation applies at low temperatures, and may not be applicable if there is adequate sample temperature to allow some degree of reflow during deposition. Conclusions Sputter deposition is likely to continue to be used for semiconductor applications because of the large, already installed base of PVD systems, the simplicity of the process, and the wealth of associated understanding. Variations to sputter deposition, such as collimated sputtering or long-throw deposition, are in widespread use, but are likely to be made obsolete in the next several years by the introduction of ionized deposition. It is likely that the near-term applications of I-PVD will be primarily for contact layers and diffusion- barrier or seed layers. Filing applications are more likely to be addressed by use of elevated-temperature reflow techniques. The use of I-PVD deposition for filling deep features will require some degree of surface mobility to help fill in the less dense structures on the sidewalls, as well as reducing the overburden and the required CMP. References 1. B. Chapman, Glow Discharge Processes, John Wiley & Sons, Inc., New York, 1980, p. 150. 2. S. M. Rossnagel and H. R. Kaufman, "Induced Drift Currents in Circular Planar Magnetrons," J. Vac. Sci. Technol. A 5, 88-91 (1987). 3. R. A. Powell and S. M. Rossnagel, PVD for Microelectronics, Academic Press, Inc., Boston, 1998. 4. S. M. Rossnagel, "Deposition and Redeposition in Magnetrons," J. Vac. Sci. Technol. A 6, 3049-3054 (1988). 5. P. C. Zalm, "Quantitative Sputtering," Surf. & Interface Anal. 11, 1 (1988). 6. W. O. Hofer, "Angular, Energy, and Mass Distribution of Sputtered Particles," Sputtering by Particle Bombardment, Vol. III, R. Behrisch and K. Wittmaack, Eds., Springer-Verlag, Berlin, 1991. 7. S. M. Rossnagel, "High-Vacuum-Based Processes: Sputtering," Handbook of Vacuum Science and Technology, Ch. 5.2, D. Hoffman, B. Singh, and J. Thomas, Eds., Academic Press, Inc., Boston, 1997. 8. W. F. Callaway, C. E. Young, M. Pellin, and D. M. Gruen, "Laser-Induced Fluorescence as a Tool for the Study of Ion Beam Sputtering," Handbook of Ion Beam Sputtering Technology, Ch. 7, J. J. Cuomo, S. M. Rossnagel, and H. R. Kaufman, Eds., Noyes Publications, Park Ridge, NJ, 1989. 9. J. S. Fan, R. S. Bailey, and C. E. Wickersham, Jr., "New Developments and Applications for Sputtering Targets at Tosoh SMD," presented at Semicon-China, November 1997 (published in Reference 3, Ch. 11). 10. T. J. Licata, E. G. Colgan, J. M. E. Harper, and S. E. Luce, "Interconnect Fabrication Processes and the Development of Low-Cost Wiring for CMOS Products," IBM J. Res. Develop. 39, 419-435 (1995). 11. C. W. Kaanta, S. Bombardier, W. Cote, W. Hill, G. Korszykowski, H. Landis, D. Poindexter, C. Pollard, G. Ross, J. Ryan, J. Wolff, and J. Gonin, "Dual Damascene: A ULSI Wiring Technology," Proceedings of the 8th International VLSI Multilevel Interconnection Conference, Santa Clara, CA, 1991, pp. 144-152. 12. J. J. Cuomo and S. M. Rossnagel, "Hollow Cathode Enhanced Magnetron Sputtering," J. Vac. Sci. Technol. A 4, 393-396 (1986). 13. I. Wagner, "Sputter Deposition of Ti and TiN Films with Variable Target-to-Substrate Distance," Proceedings of the 12th International VLSI Multilevel Interconnection Conference, Santa Clara, CA, 1995, p. 226. 14. J. N. Broughton, C. J. Backhouse, M. J. Brett, S. K. Dew, and G. Este, "Long Throw Sputter Deposition of Ti at Low Pressure," Proceedings of the 12th International VLSI Multilevel Interconnection Conference, Santa Clara, CA, 1995, p. 201. 15. A. A. Mayo, S. Hamaguchi, J. H. Joo, and S. M. Rossnagel, "Across-Wafer Nonuniformity of Long Throw Sputter Deposition," J. Vac. Sci. Technol. B 15, 1788 (1997). 16. S. M. Rossnagel, D. Mikalsen, H. Kinoshita, and J. J. Cuomo, "Collimated Magnetron Sputter Deposition," J. Vac. Sci. Technol. A 9, 261 (1991). 17. R. V. Joshi and S. Brodsky, "Collimated Sputtering for Ti/TiN Liners into High Aspect Ratio Contacts/Lines," Proceedings of the 9th International VLSI Multilevel Interconnection Conference, Santa Clara, CA, 1992, pp. 253-259. 18. B. Zhao, M. A. Bilberger, V. Hoffman, S.-Q. Wang, P. K. Vasudev, and T. E. Seidel, "A Novel Low Temperature PVD Planarized AlCu Process for High Aspect Ratio, Sub-Micron Interconnect," Proceedings of the International Electron Devices Meeting, IEEE, New York, 1996, pp. 353-356. 19. G. Dixit, W. Y. Hsu, K. H. Hamamoto, M. K. Jain, L. M. Ting, R. H. Havemann, C. D. Dobson, A. I. Jeffreys, P. L. Holverson, P. Rich, D. C. Butler, and J. Hems, "Application of High Pressure Extruded Aluminum to ULSI Metallization," Semicond. Int. 18, 79-85 (August 1995). 20. P. Kidd, "A Magnetically Confined and Electron Cyclotron Resonance Heated Plasma for Coating and Ion Surface Modification Use," J. Vac. Sci. Technol. A 9, 466 (1991). 21. G. Wyborn and T. Guo, "Cool Al, the 0.18 Micron Technology," Appl. Mater. HP PVD Update 3, 2 (1996). 22. W. M. Holber, J. S. Logan, H. Grabarz, J. T. C. Yeh, J. B. O. Caughman, A. Sugarman, and F. Turene, "Copper Deposition by Electron Cyclotron Resonance Plasma," J. Vac. Sci. Technol. A 11, 1903 (1993). 23. M. Yamashita, "Fundamental Characteristics of a Built-In High Frequency Coil-Type Sputtering Apparatus," J. Vac. Sci. Technol. A 7, 151-158 (1989). 24. M. S. Barnes, J. C. Forster, and J. H. Keller, "Apparatus for Depositing Material into High Aspect Ratio Holes," U.S. Patent 5,178,739, January 12, 1993. 25. S. M. Rossnagel and J. Hopwood, "Magnetron Sputter Deposition with Levels of Metal Ionization," Appl. Phys. Lett. 63, 3285-3287 (1993). 26. S. M. Rossnagel, "Ionized Magnetron Sputtering for Lining and Filling Trenches and Vias," Semicond. Int. 19, 99 (February 1996). 27. S. M. Rossnagel and J. Hopwood, "Metal Ion Deposition from Ionized Magnetron Sputtering Discharge," J. Vac. Sci. Technol. B 12, 449 (1994). 28. M. Dickson, F. Qian, and J. Hopwood, "Quenching of Electron Temperature and Electron Density Using Ionized Physical Vapor Deposition," J. Vac. Sci. Technol. A 15, 340-344 (1997). 29. S. M. Rossnagel, "Gas Density Reduction Effects in Magnetrons," J. Vac. Sci. Technol. A 6, 19-24 (1988). 30. S. M. Rossnagel, "Filling Dual Damascene Interconnect Structures with AlCu and Cu Using Ionized Magnetron Sputter Deposition," J. Vac. Sci. Technol. B 13, 125 (1995). 31. S. Hamaguchi and S. M. Rossnagel, "Liner Conformality in Ionized Magnetron Sputter Metal Deposition Process," J. Vac. Sci. Technol. B 14, 2603 (1996). Received November 13, 1997; accepted for publication March 20, 1998 Stephen M. Rossnagel IBM Research Division, Thomas J. Watson Research Center, P.O. Box 218, Yorktown Heights, New York 10598 (rossnag@us.ibm.com). Dr. Rossnagel is a Research Staff Member at the Thomas J. Watson Research Center, working in an interconnection metallurgy and processing group. He received B.S. and M.S. degrees from Penn State University in 1975 and 1977, respectively, and a Ph.D. from Colorado State University in 1982 following positions on the staff of the Princeton University Plasma Physics Laboratory and the Max Planck Institute for Plasma Physics in Munich, Germany. He joined IBM in 1983 at the Thomas J. Watson Research Center, where he has worked primarily on sputtering and plasma processing. Dr. Rossnagel has authored more than 100 papers, five books, and nine patents. He received the Peter Mark Award from the American Vacuum Society (AVS) in 1990 and was made a Fellow of the Society in 1994. He chaired the AVS Annual Conference in 1996, and was recently elected President-Elect of the AVS for 1998, to serve as President in 1999.