IBM®
Skip to main content
    Country/region [change]    Terms of use
 
 
 
    Home    Products    Services & solutions    Support & downloads    My account    

IBM Journal of Research and Development

POWER5 and Packaging   Volume 49, Number 4/5, 2005
Table of contents: HTMLPDF This article: HTML PDFDOI: 10.1147/rd.494.0777Copyright info

Microminiature packaging and integrated circuitry: The work of E. F. Rent, with an application to on-chip interconnection requirements

by M. Y. Lanzerotti,
G. Fiorenza,
and R. A. Rand

Research on Rent's rule in electrical engineering, the applied sciences, and technology has been based on the publication of a 1971 interpretation of Rent's memoranda by B. S. Landman and R. L. Russo. Because of the wide impact of Rent's work and requests from researchers, we present his original memoranda in this paper. We review the impact of Rent's work and present the memoranda in the context of IBM computer hardware development since the 1950s. Furthermore, because computer hardware components have changed significantly since the memoranda were written in 1960, a new interpretation is needed for today's ultra-large-scale integrated circuitry. On the basis of our analysis of the memoranda, one of the authors' personal knowledge of the 1401 and 1410 computers, and our experience in the design of high-performance circuitry for microprocessor chips, we have derived an historically equivalent interpretation of Rent's memoranda that is suitable for today's computer components. We describe an application of our historically equivalent interpretation to the problem of assessing on-chip interconnection requirements of control logic circuitry in the IBM POWER4™ microprocessor.

Introduction

The introduction and presentation of empirical rules that describe the interconnectivity of computer hardware components first appeared with the publication of a paper by Landman and Russo in 1971 [1]. This publication presented an overview of an empirical relationship understood by members of the IBM computer development groups prior to 1960, when E. F. Rent wrote two internal memoranda that describe his methods and graphical techniques to illustrate the relationship between properties of computer hardware components. The 1971 paper has been referenced extensively in the literature of several scientific fields, including semiconductor circuitry, computer systems, applied mathematics, the applied sciences, and semiconductor manufacturing technology. Specifically, as semiconductor circuitry continues to increase in complexity to ultra-large-scale integrated (ULSI) circuitry, Rent's work has provided guidance for the design and evaluation of ULSI chips and the technologies with which they are manufactured [23]. As an example, researchers have derived theoretical models based on the 1971 interpretation of Rent's work [14] to anticipate properties of future computer systems. In this work, the inputs to the models are empirical parameters that are referred to as Rent parameters.

There remains a need in the literature for the publication of the contents of the two memoranda. We have obtained copies of Rent's memoranda, and one goal of this paper is to present them for the first time within the context of the development of computer hardware components at IBM since the 1950s. To achieve this goal, we present an overview of the two IBM computers that Rent analyzed in the memoranda and a review of computer systems development at IBM from the 1950s to the present. A second goal of this paper is to discuss an interpretation of these memoranda that is suitable for today's computer hardware components, since the components in today's systems differ significantly from those discussed in the original memoranda. This paper also describes an application of this interpretation to the problem of assessing the extent to which estimates provided by existing wire-length distribution models agree with actual wire-length measurements. Here, the desired goal is to obtain improved estimates of wire-length requirements for circuitry on ULSI chips. For this application, we obtained model estimates for wire-length requirements and compared the estimates with measured wire-length requirements in manufactured chips.

This paper is organized as follows. We review the impact of Rent's work, followed by a review of computer systems hardware, including IBM computer systems built since the 1950s and then, more specifically, two systems that Rent discusses in his memoranda. Rent's memoranda on microminiature packaging are then presented. We describe an interpretation of the memoranda that is suitable for today's computer hardware components and then describe an application of this interpretation to existing wire-length distribution models and interconnection requirements, followed by a discussion and presentation of our conclusions.

Impact of Rent's work

Rent's work has had a significant impact in a range of scientific fields, and the literature that references the 1971 paper [1] can be broadly arranged into the following scientific areas: interconnections, semiconductor circuitry, computer systems design, applied mathematics, applied sciences, and semiconductor manufacturing technology. The purpose of this section is to provide an overview of the impact of Rent's work and to provide examples of research in each of these areas.

Several interpretations of the 1971 paper exist, including the external interpretation [48] and the topological interpretation [89]. According to the external interpretation, the external Rent parameters {kp} are obtained from the slope and y-intercept of linear fits to log–log plots of the number of input/output pins TIO as a function of the number of gates Ngates without further partitioning [14810]. In this interpretation, k represents the Rent coefficient and p represents the Rent exponent. According to the topological interpretation, the topological Rent parameters {k*, p*} are obtained by first partitioning each design hypergraph hierarchically and then extracting the slope and y-intercept from linear fits to log–log plots of the number of terminals as a function of the number of gates, as described in detail in [812]. In this interpretation, k* represents the Rent coefficient and p* represents the Rent exponent. Additional information about the external interpretation is described in [45]. The literature that discusses Rent's work prior to the interpretation [13] described in this paper, including references presented in this section, is based on the 1971 interpretation [1].

Table 1 [14581014] shows values of the Rent parameters—obtained with different interpretations of Rent's work—for several design types. In this table, the values of the Rent coefficient and Rent exponent are shown for block graphs, square arrays, static random access memory (SRAM), gate arrays, chip and module, board and system, microprocessors, application-specific integrated circuit (ASIC)-like control logic, and benchmark circuits. The date of each analysis and design names are also shown. The data in the table shows that the value of the Rent exponent tends to be less than unity for these interpretations. The data also shows that the Rent exponent can take on a range of values from 0.12 to 0.99, and that the Rent coefficient can take on a larger range of values from 0.8 to 82.


Table 1 Rent coefficient and exponent for different chip design types. The design type, date of the analysis, name of the design, and Rent parameters are shown. NA indicates that the information is not available. The ranges of the Rent parameters for benchmark circuits are shown. The Rent coefficient and exponent for individual benchmark circuits are given in [8]. ©2004 IEEE. Reprinted from [13] with permission.
Design typeDateDesign nameRent coefficientRent exponent

Block graphs1971L1 in [1]3.520.57
L2 in [1]4.730.75
L3 in [1]29.660.69
L4 in [1]20.120.66
Square arrays1979A in [14]NA0.67
B in [14]NA0.59
C in [14]NA0.75
D in [14]NA0.57
E in [14]NA0.47
F in [14]NA0.75
Square arrays1981A in [15]NA0.7
B in [15]NA0.7
C in [15]NA0.5
SRAM1990Table 9.5 in [4]60.12
Gate arrays1990Table 9.5 in [4]1.90.50
Chip and module1990Table 9.5 in [4]1.40.63
Board and system1990Table 9.5 in [4]820.25
Microprocessors1990Table 9.5 in [4]0.820.45
Microprocessors1995Figure 3 in [5]2.090.36
Figure 8 in [5]3.80.75
Figures 9–11 in [5]5.00.8
ASIC-like control logic designs in a microprocessor2004IFU in [10]0.80.69
FPU in [10]2.20.66
FXU in [10]4.40.61
IDU in [10]20.5 0.30
ISU in [10]23.30.31
LSU in [10]7.30.46
Benchmark circuits2003Table I in [8][2.9, 9.3][0.60, 0.99]
Table III in [8][2.7, 7.8][0.26, 0.82]
Table IV in [8] for circuits {i1, …, i8}[4.9, 11.7][0.55, 0.82]
Table IV in [8] for circuits {ibm01, …, ibm07}[4.6, 9.2][0.57, 0.79]

Interconnections

To describe the increasing complexity [2] of interconnections in semiconductor circuitry and to estimate the physical and electrical characteristics of chip wiring, researchers have developed models based on the 1971 interpretation of Rent's work. The purpose of this section is to discuss these characteristics, which include on-chip interconnection requirements, performance metrics for interconnections, power dissipation in interconnections, signal fan-out distributions, and interconnection congestion in a design layout.

On-chip interconnection requirements
The earliest application of the 1971 work has been the derivation of interconnection models by Donath [1415], Davis et al. [571619], and Christie and Stroobandt [9]. These interconnection models provide estimates for the wire-length distributions, average wire length, and total interconnection length required to wire chip circuitry correctly. These models assume that the designs are square, are completely tiled with square logic gates, have signals with unity fan-out, contain circuitry that is associated only with the logical function of the design, and allocate all of the available space for wire routing. The topological Rent parameters are input into the expressions provided by the Donath models [1415] and the Christie model [9], and the external Rent parameters are input into the expressions provided by the Davis model [57].

Additional work that describes interconnection requirements in large-scale integration (LSI) and very large-scale integration (VLSI) chips is provided in [102022]. Methods to extract Rent parameters are discussed in [45892325]. Interconnection requirements have been assessed in microprocessors [4], ASICs [4], memory elements [4], and control logic circuitry in high-performance microprocessors [10]. Interconnection models have also been developed for designs that are rectangular [26], designs that contain signals with multiple fan-out [2728], designs that contain one or more blockages [29], field-programmable gate arrays (FPGAs) [3035], three-dimensional integrated circuits [323640], and three-dimensional anisotropic systems [41].

Performance metrics
The wiring distributions predicted by the existing wire-length distribution models in the previous section, combined with technology information (such as wire pitch and the number of available metal layers), have been used to obtain interconnection performance metrics such as the one that combines a wire distribution model with a 50% delay model for interconnections from the input of an inverter driver to the output of a inverter receiver, as discussed in [42]. These metrics have helped researchers decide how to optimize wire and inter-layer dielectric characteristics, such as metal thickness and dielectric thickness.

Power dissipation
Models for power dissipation and power estimation that take into account the impact of interconnections have been developed and are described in [43].

Fan-out distributions
Methods to estimate the number of signals with each fan-out in chip designs are described in [284445].

Congestion
Methods to estimate interconnection congestion in chip designs are provided in [4647].

Semiconductor circuitry

Researchers have also developed models that are based on Rent's work to partition circuitry and to estimate additional circuitry requirements, such as switching requirements, gate count, and gate area. Bakoglu [4] and Stroobandt [4849] provide overviews of research in interconnection requirements in ULSI circuitry. This section provides additional references for these models. Methods to partition circuitry are discussed in [5051], switching requirements in circuitry are discussed in [52], and methods to estimate the number of logic gates and the gate area in chips are presented in [5354].

Computer systems design

The design of computer systems has changed significantly in the past 50 years [2]. Bakoglu [4], Sai-Halasz [55], and Dooijes [56] provide three discussions of trends in computer technology. Researchers in computer systems design have referenced Rent's work in several applications, such as digital information processing, estimation of system-level interconnection requirements, computer architecture, packaging, testing, interconnection requirements in systems-on-chips (SoCs), and system-cost models for evaluating the tradeoff between using off-chip interconnections and increasing chip area and the number of on-chip interconnections. In addition, a research group at the University of California at Los Angeles is collaborating with Hewlett-Packard Company to develop a chemically assembled electronic nanocomputer (CAEN) [57]. The purpose of this section is to present a discussion of some of these applications.

Digital information processing
In the field of digital information processing, Keyes [5860] references the papers of Landman and Russo [1] in his discussions of the physics of VLSI components.

System-level interconnection requirements
Methods to estimate system-level interconnection requirements are discussed in [22526166]. A series of workshops to present material on the subject of system-level interconnect prediction (SLIP) was established in 1999, and additional information is available online [67].

Computer architecture
Methods to extract Rent parameters from different types of computer architectures are discussed in [2425]. The impact of Rent's work on strategies to design massively parallel systems is discussed in [25]. Estimates of the number of memory bits that are required to replace a random logic circuit are discussed in [68]. Additional applications in computer architecture for nanotechnology are discussed in [69].

Packaging
The impact of interconnections on system and packaging requirements is discussed in [42255567072]. Interconnection requirements at the interfaces between chips and modules are discussed in [73].

Testing
Methods for chip testing and fault analysis are presented in [74].

SoCs
Design methodologies for SoCs and the associated interconnections, including global signals, clock distributions, and power grids, are discussed in [40].

System-cost models
A system-cost model has been developed [7576] to evaluate the tradeoff between using off-chip interconnections and increasing the silicon die area, which increases the number of on-chip interconnections. In this model, there are three cost elements: the cost of processing silicon, the cost of wasted silicon that is unusable after processing, and the cost of assembly. This model assumes that the cost of assembly is proportional to the number of pins on the package, which they obtain from references to Rent's work.

Applied mathematics

Rent's work has also been discussed in the field of applied mathematics. Mandelbrot discusses this work in his book entitled Fractals: Form, Chance, and Dimension [77]. Ozaktas discusses additional concepts of connectivity for circuit graphs and provides generalizations for systems with a Rent exponent that takes on different values throughout the interconnection hierarchy [7879].

Applied sciences

Researchers have applied Rent's work to molecular electronics and optics. This section discusses these applications and provides references.

Molecular electronics
Researchers in molecular electronics have developed a cellular FPGA. Inputs to this system include Rent's parameters and information about cellular automata [80]. This work points out that cellular FPGAs differ from traditional FPGAs because they are periodic structures. Cellular FPGAs support only nearest-neighbor connections, rather than interconnections on several metal layers, as is the case for ULSI back-end-of-line chip architectures.

Optics
Rent's work has been applied to an assessment of interconnectivity requirements of free-space optical interconnections, as reported in [8182], where brief comparisons with conventional interconnections are provided.

Semiconductor manufacturing technology

The increasing complexity of today's manufacturing technologies makes it more and more difficult to implement innovations in manufacturing. Although innovations are costly as a result, they do allow us to manufacture chips with desirable characteristics such as higher performance, lower power dissipation, higher circuit density, and higher yield.

Recognizing the need to estimate the impact of these desirable characteristics for each technology, researchers have developed performance models for chips and systems. These models include a cycle-time model [55], SUSPENS [483], AIM [84], GENESYS [85], RIPE [86], BACPAC [8788], and GTX [89]. The purpose of this section is to describe briefly the characteristics of three of the models: SUSPENS, BACPAC, and GTX; information about the other models is available in the references listed.

The Stanford University System Performance Simulator (SUSPENS) [483] is a performance model that calculates the maximum clock frequency, chip area, and chip power dissipation as a function of a list of parameters that includes physical constants, Rent's parameters, number of logic gates on the chip, total capacitance of the interconnection per unit length, interconnect pitch, and board-level characteristics.

The Berkeley Advanced Chip Performance Calculator (BACPAC) [8788] is a model that calculates characteristics of system-level performance as a function of device characteristics, interconnection characteristics, and system characteristics. The device characteristics include the operating voltage, threshold voltage, gate oxide thickness, and drain current. The interconnection characteristics are the number of metal layers, metal resistivity, and interlayer dielectric constant. The system-level inputs include the design size and Rent's parameters. The outputs of the model are listed in [87] and include chip area, maximum clock frequency, wiring requirements, power dissipation, and projected yield.

The Microelectronics Advanced Research Corporation (MARCO) Gigascale Systems Research Center (GSRC) Technology Extrapolation (GTX) [89] is a system model that calculates system-level performance characteristics such as system cycle time, die size, and power dissipation.

Review of computer systems hardware

Significant changes have occurred in the development and manufacturing of computer hardware components since 1960, when Rent wrote his two memoranda. The purpose of this section is to review technical hardware innovations in computer hardware components sold in the past five decades and to describe the two computers discussed in Rent's memoranda [9094].

Computer systems since the 1950s

Throughout much of the 1950s, as indicated in Table 2 [39498] a key component of IBM computers was the cathode-ray vacuum tube for storage, as well as conventional switching vacuum tubes for logic circuitry; as a result, a single central processing unit (CPU) composed of many of these tubes occupied approximately 90 cubic feet of space. Table 2 lists the IBM systems and the associated innovations in these systems since the 1950s. This table shows that by 1959–1960, vacuum tube technology was superseded by the introduction of discrete transistors in such IBM computers as the 1401 and 1410—the computers that Rent discusses in his memoranda.


Table 2 Typical hardware components in IBM computers built since 1952. The SMS card refers to the IBM proprietary standard modular system (SMS) of circuit packaging. The term SLT refers to the IBM solid logic technology (SLT).
Date [94]Technology [94]IBM system [94]

1952Cathode-ray vacuum tubes for storage, magnetic drum and tape storage, card reader–punch701
19545 ft × 3 ft × 6 ft CPU, rotating magnetic drum, card reader–punch, magnetic core memory650
1959SMS circuit packaging technology with discrete transistors and resistors; jumper wires on 2.5-in. × 2.5-in. circuit cards; punched cards, magnetic tape1401
1959SMS circuit packaging technology7090
1960SMS circuit packaging technology1410
1964SLT hybrid solid logic technology with 1.6-in.2 printed circuit boards; cards with semiconductor chips, printed lines and resistors; ferrite-core memoriesSystem/360
1968Monolithic integrated circuits with 1–4 circuits per silicon chip; four chips per ceramic moduleModel 85
1970Monolithic systems technology; all-semiconductor main memorySystem/370* Model 85
1980Thermal conduction module (TCM) with >100 chips3081 [394]
1985First mainframe with one-million-bit memory chips3090* [394]
1990VLSI: four-million-bit memory chip, 800,000 transistors per chipRISC System/6000*
1994First complementary metal oxide semiconductor (CMOS) mainframeSystem/390* G1
1997CMOS mainframe with up to ten microprocessorsSystem/390 G4
2001ULSI CMOS technology with 170 million transistors in a 2-cm × 2-cm silicon chip [95]pSeries* 690 [95]
2004121 million transistors in a 2-cm × 2-cm silicon chip [96]eServer* z990 [96]
2004276 million transistors in a 2-cm × 2-cm silicon chip [9798]eServer i5 550 [9798]

In 1964, discrete transistor technology was replaced by hybrid solid logic technology consisting of semiconductor chips, printed wires, and printed resistors. Computer hardware components constructed in hybrid solid logic technology appear in the IBM System/360* computer.

In 1980, IBM introduced the 3081 system. This system introduced the packaging concept of a thermal conduction module (TCM) to the marketplace. The design of the TCM … was driven by partitioning and wiring considerations influenced by Mr. Rent's work before semiconductor technology was able to contain an entire high-performance CPU on a single chip.1

Figure 6 in the paper by Seraphim and Feinberg [3] shows a planar board of the 3081 system, and it is stated there that the TCMs plug directly into the printed circuit back panel. A review of the evolution of electronic packaging in IBM from the early 1960s through 1981 is presented in [3].

In 1985, IBM introduced its first mainframe with memory chips containing one million bits each, as indicated in Table 2. By 1990, each memory chip in the RISC System/6000 contained four million bits.

In 2001, IBM introduced computer systems based on the POWER4* chip that contained two CPUs, approximately 174 million transistors, and more than one mile of copper wiring on a penny-sized 4-cm2 silicon chip [95], thereby realizing a significant volume reduction compared with the early 1950s CPU.

In 2004, IBM introduced two computer systems. The eServer z990 contains 121 million transistors, and the eServer i5 550 contains 276 million transistors.

IBM 1401 Data Processing System

The 1401 [Figure 1(a)] was introduced in the marketplace on October 5, 1959, and was sold for nearly 12 years, until February 8, 1971. Composed entirely of discrete transistors, the main components of the IBM 1401 were a 1401 processing unit, 1402 card reader–punch, and 1403 printer. It was the first computer to sell more than 10,000 units [9091]. According to the IBM Corporate Archives [90], the system performs functions previously requiring a number of separate machines: card reading and punching, separation of output cards, calculating, and printing. Within the 1401 system itself, the IBM 1401 processing unit controls the entire system by means of its stored program. It performs the arithmetic and logical functions, controls card reading and punching, magnetic tape input and output, and tells the printer what to print and where to print it [90]. Regarding speed, the press datasheet reported that in one minute, the 1401 Processing Unit can perform 193,300 additions (eight-digit numbers) or 25,000 multiplications (six-digit numbers by four-digit numbers) [90].

Figure 1 Figure 1

IBM 1410 Data Processing System

The 1410 [Figure 1(b)] was introduced in the marketplace on September 12, 1960, and was sold for nearly ten years, until March 30, 1970 [92]. It was offered in three basic configurations, with an optional paper tape reader and magnetic character reader [9293]. These configurations are the following:

  • IBM 1410 Data Processing Card System: This configuration was a high-speed flexible system containing large-volume storage facilities and fast input/output units for up-to-the-minute, accurate punched card output and printed reports [92]. It consisted of a 1402 card reader–punch, 1403 printer, 1411 processing unit, 1414 input/output synchronizer, and 1415 console.
  • IBM 1410 Data Processing Tape System: This configuration added to the speed and efficiency of the 1410 card system with high-volume storage capacities and increased data input/output speeds [92]. Magnetic tape units were available with this system.
  • IBM RAMAC 1410 Data Processing System: This configuration provided mammoth disk storage facilities for the IBM 1410 Data Processing Systems, [and] consisted of the 1410 card system, with or without the tape units [92]. The configuration also included disk storage.
Microminiature packaging: The memoranda of E. F. Rent

The previous section has presented a review of the computer hardware systems environment that existed in 1960 when Rent wrote his memoranda. The purpose of this section is to present the two memoranda within the context of this environment. In the memoranda, Rent describes his analysis of the IBM 1401 and 1410 computer systems and describes his method to deduce an empirical relationship between the properties of the hardware components in these systems. The first memorandum is shown in Appendix A on pages 784–788. This memorandum discusses the 1410 and is dated November 28, 1960. The second memorandum is shown in Appendix B on pages 789–791. This memorandum discusses the 1401 and the 1410 and is dated December 12, 1960.

Appendix A: First Rent memorandum—November 28, 1960

Page 784
Page 785
Page 786
Page 787
Page 788

Appendix B: Second Rent memorandum—December 12, 1960

Page 789
Page 790
Page 791

The 1401 and 1410 hardware components that Rent discusses in the memoranda are the computer chassis, card, circuit count, and edge connector count. The computer chassis contains several cards that are connected in the chassis. Each card is composed of one or more circuits; the circuits on the card are either used circuits or unused circuits. The term used circuit refers to a circuit that is connected and performs some function; the term unused circuit refers to a circuit that exists yet is not required. The purpose of edge connectors is to connect used circuits on a card with other used circuits located on other cards within the chassis. A circuit is also referred to as a logic block. Photographs of a card and the wiring on the back of a chassis are shown in Figure 2. In the earlier memo, Rent describes his method to obtain the circuit count and edge connector count of the cards on each chassis of the 1410 computer system as follows: The circuit count was made by card type and then a count of one given for each of the following: 2 way logic block, 3 way logic block, emitter follower, indicator driver, power inverter, line driver etc. No count was made for diode clamp cards or resistor load cards. All unused circuits on cards were removed from the count to give an actual logic circuit count for each chassis. The net edge connector count required a total for all edge connectors on a chassis less, (1) those connectors that were used to feed signals across a chassis and not go to any logic on the chassis and (2) those connectors that were used to distribute to the adjacent chassis signals that were developed on the chassis.

Figure 2 Figure 2

On pages 787 and 791 are Rent's plots of the circuit count per card as a function of the number of edge connectors per card on log–log graph paper. Rent also calculated the average block-to-edge-connector ratio, which is shown in another log–log plot on page 788. He wrote in the memorandum (page 784), Therefore, the average of the three control chassis will be considered as more typical of all chassis and will be used as the reference point in further discussions. On pages 787 and 791, the ordinate is labeled logic blocks/card, and the abscissa is labeled signal pins/card. Each of these plots contains several data points added by hand, where the data points are the number of logic blocks and number of edge connectors obtained according to the descriptions quoted above. A solid line is drawn by hand through several data points in these plots, and Rent pays particular attention to the solid line that connects the point labeled C1 with the point labeled single card unit logic (IO pins) (see page 785 for a discussion and a listing of the points that Rent identifies on the line). An inspection of this line indicates that it may be described by the expression,

y = m × x + b,(1)

where y represents the log of the ordinate and x represents the log of the abscissa, and where we obtain m = 1.49, b = −0.94 for the line drawn on the plot. This expression may be rearranged as

x = m´ × y + b´,(2)

where m´ = 0.67 and b´ = 0.63, and also as the expression,

X = B´Ym´,(3)

where b´ = log(B´), x = log(X), y = log(Y). Y represents the ordinate, and X represents the abscissa.

In the second memo, Rent performs his analysis on the 1401 computer system and considers four separate chassis labeled 01A7, 01A8, 01B1, and 01B2 in the memo. The plot of the data obtained for these four chassis is shown on page 791, which is a log–log plot similar to that on page 787, but with the addition of this new data. Note that a solid line also connects the two datapoints C1 and single card unit logic (IO pins) in the plot of page 791, and that the locations of the datapoints A7, A8, B1, and B2 are identified on the plot. In this later memo, Rent writes (page 789) that It should be noted that these chassis points are quite scattered on the graph but still they fall between the curve for compact cards and the curve derived from 1410 chassis counts, and that These chassis counts verify that the curve drawn from the 1410 counts when used would not pin limit the mother board, which indicates that the results in the first memo are verified by the results in the second memo.

Figure 3(a), prepared by the authors of this paper, is a schematic representation of a computer hardware chassis and the computer components described in the memoranda. The chassis shown in the figure contains six cards; three cards are used (A, B, C), and three cards are unused (D, E, F). Specifically, card A is a representation of the card shown in Figure 2(a), and the chassis is a representation of the chassis shown in Figure 2(b). In Figure 3(a), the dashed curved lines between the two chassis represent that these connections are excluded in Rent's calculations.

Figure 3 Figure 3

An interpretation for ULSI circuitry

Based on the previous section, the personal knowledge of one of us (R. A. Rand) of the 1401 and 1410 computers, and our experience with the design and analysis of ULSI circuitry for high-performance microprocessors [1099100], we have derived an historically equivalent interpretation of Rent's memoranda that is suitable for today's computer components. In this paper, the phrase historically equivalent refers to our mapping of each component described by Rent in the phrases for circuit count and edge connector count, as provided in the previous section, to a functionally equivalent component that exists in today's circuitry [13].

Figure 3(b) shows a schematic representation of today's computer circuitry. In Design A, shown in the figure, all inputs have unity fan-out, the used signals are labeled with notation {s1, …, s9}, and the used gates are labeled with notation {g1, …, g4}.

For today's designs, the historically equivalent terms for chassis and card are functional unit and design, respectively. The historically equivalent term for the building block, or circuit, is logic gate. The historically equivalent term for circuit count is gate count, which can be represented by the term Ngates, a quantity that describes the number of gates in a design. An expression for Ngates can be written according to the expression

Ngates = Nall − Nunconnected − Nspare − Nfiller − Ndecap,(4)

where the term Nall represents the total number of gates in a design, the term Nunconnected represents the number of unconnected gates, the term Nspare represents the number of gates associated with spare logic, the term Nfiller represents the number of filler cells, and the term Ndecap represents the number of decoupling capacitors. To ensure an accurate count of Ngates, it is important to itemize carefully the contributions to the four terms Nunconnected, Nspare, Nfiller, Ndecap in Equation (4). These four terms do not contribute to the number of functional gates in a design, although they may exist for various reasons [13].

The historically equivalent term for edge connector count is used connection, which can be represented with the term Nconn and can be written [13] as

Equation 5(5)

where the sum is taken over the signal nets Nnets, Fi represents the fan-out of each signal net, TIO represents the number of input pins and output pins in the design [5], and the total fan-out, Ftotal, is given by the expression

Equation a

In Equation (5), the term (Fi + 1) represents the total number of connections made by a single signal net in a design; each signal net has a single driver and can have fan-out Fi ≥ 1. Note that Equation (5) implicitly contains information about design complexity through the dependence of the historically equivalent Rent parameters kR and pR on the average fan-out f, which can be written as f = Ftotal/Nnets, since values for the Rent parameters {kRpR} are obtained through least-square linear fits to log–log plots of Nconn as a function of Ngates.

For the case of Design A in Figure 3(b), the value of Ngates = 4. The value for Nconn is obtained with Equations (4) and (5), where Ngates = 4, Ftotal = 10, Nnets = 9, and TIO = 7. Substituting these values into Equation (5), we obtain the number of connections for Design A as Nconn,A = 12, as can be verified by a direct count of the connections in Design A. Additional examples of designs with inputs that have multiple fan-outs and designs that are partitioned into several subdesigns are given in [13].

A second method of obtaining the number of connections Nconn in a design is to count the number of gates Nj of each logic type j, such as the inverter (inv) and nand2. The total number of connections Nconn can also be written according to the expression,

Equation 6(6)

where nconn,j represents the number of connections of gates of type j, and where the sum is taken over all of the gate types G in a design. Note that the total number of gates Ngates can be written as the expression

Equation 7(7)

Examples of the number of connections per gate nconn,j are shown in Figure 4. The number of connections for a gate type j can be calculated by taking the sum of the number of inputs ni and the number of outputs no, such that nconn,j = ni + no.

Figure 4 Figure 4

As an example, we consider the case of Design A shown in Figure 3(b). Design A contains three types of logic gates: inv, nand2, and nand4. From Figure 4, we see that each inv contributes two connections, each nand2 contributes three connections, and each nand4 contributes five connections. With these substitutions, Equation (6) takes the form for the number of connections for Design A, Nconn,A, as the expression

Nconn,A = 2 · Ninv + 3 · Nnand2 + 5 · Nnand4.(8)

Next, Figure 3(b) shows that the number of inv is Ninv = 2, the number of nand2 is Nnand2 = 1, and the number of nand4 is Nnand4 = 1. Substituting these values into Equation (8), we obtain Nconn,A = 12, which is the same value previously obtained with Equation (5), as described above.

Additional examples for Designs F, F1, F2, and F3 in [13] are shown in Table 3 [13]. This table shows the value for Nconn and the allocation of gates in each design for each of the different gate types: inv, nand2, nand4, nor2, and2, and or2. Equation (6) can be evaluated for each design, with Figure 4, to obtain the expressions for each design:

Nconn,F = 2 · Ninv + 3 · Nnand2 + 3 · Nnor2 + 3 · Nand2,(9)
Nconn,F1 = 2 · Ninv,(10)
Nconn,F2 = 3 · Nand2 + 3 · Nor2,(11)
Nconn,F3 = 2 · Ninv + 3 · Nnand2.(12)


Table 3 Number of connections for five designs in [13] with Equation (6), where the coefficients in each equation are given in Figure 4 in this paper.
DesignReferenceNconn valueGate allocationNconn, with Equation (6)

AFigure 3 in [13]12Ninv = 2Nconn,A = 2Ninv + 5Nnand4 + 3Nnand2
Nnand2 = 1
Nnand4 = 1
FFigure 4 in [13]24Ninv = 6Nconn,F = 2Ninv + 3Nnand2 + 3Nnor2 + 3Nand2
Nnand2 = 1
Nnor2 = 1
Nand2 = 2
F1Figure 5 in [13]4Ninv = 2Nconn,F1 = 2Ninv
F2Figure 5 in [13]9Nand2 = 2Nconn,F2 = 3Nand2 + 3Nor2
Nor2 = 1
F3Figure 5 in [13]11Ninv = 4Nconn,F3 = 2Ninv + 3Nnand2
Nnand2 = 1

In this paper, the parameters kR and pR represent the inverse log of the intercept and the slope, respectively, of a linear fit to Nconn as a function of Ngates in a log–log plot, according to the expression

log(Nconn) = log(kR + pR · log(Ngates),(13)

which can also be expressed as

Equation 14(14)

Each datapoint in such a plot is generated from one design within a functional unit, and the series of datapoints considered for a fit according to Equation (14) is generated from the set of all designs within the unit.

An inspection of Equation (6) shows that Nconn tends to increase linearly with Ngates for 1) the case in which each design in a set of designs is composed of the same type of gate, 2) the case in which each design in a set of designs contains the same allocation of gates, and 3) the case in which one gate type is the primary constituent of all of the designs in the group. For example, consider a set of designs {a, b, c} in which all of the designs are composed of a different number of nand2 gates. For these designs, the number of connections can be written in a straightforward fashion as the expressions

Nconn,a = 3 · Nnand2,(15)
Nconn,b = 3 · Nnand2,(16)
Nconn,c = 3 · Nnand2,(17)

where the terms Nconn,a, Nconn,b, and Nconn,c represent the number of connections in designs a, b, and c, respectively, and the term Nnand2 represents the number of nand2. A plot of these three datapoints would produce an exponent with value pR = 1.

As a second example, consider a set of designs {d, e, f} in which all of the designs contain the same proportion of gates, such as 80% inv, 15% nand2, and 5% nand4 gates, or 30% inv, 30% nand2, and 40% nor2. For the first example of these designs, the number of connections can be written as the expressions,

Nconn,d = 2 · Ninv + 3 · Nnand2 + 5 · Nnand4(18)
= {2 · 0.80 + 3 · 0.15 + 5 · 0.05}Ngates,
Nconn,e = 2 · Ninv + 3 · Nnand2 + 5 · Nnand4(19)
= {2 · 0.80 + 3 · 0.15 + 5 · 0.05}Ngates,
Nconn,f = 2 · Ninv + 3 · Nnand2 + 5 · Nnand4(20)
= {2 · 0.80 + 3 · 0.15 + 5 · 0.05}Ngates,

where the term Ngates represents the total number of gates. An inspection of the expressions in Equations (18)–(20) shows that they can be written in the same form as Equations (15)–(17), as shown on the right-hand side of the expressions above. The expressions also show that, for these designs, the number of connections increases linearly with the number of gates; for this case, it is expected that the exponent has value pR = 1 and that the uncertainty in the value of the exponent is zero.

For the cases in which other sets of designs contain similar allocation of gates in each design, it is expected that the exponent will also tend to approximate unity, with pR ~ 1 with a low value of uncertainty in the value of pR. For other cases in which the designs contain disparate allocations of gates in each design, it is expected that the expressions for the number of connections in each design will take on a more complicated relationship for the designs; for these cases, the value of the exponent is obtained from the log–log plot, and it is expected that the uncertainty in the value of the exponent will increase.

Application to wire-length distribution models

This section presents an application of the interpretation of microminiature packaging described in the previous section to the problem of interconnection requirements in ULSI circuitry [13]. The designs selected for this study are all of the control logic designs in the IBM POWER4 core [95]. Here, the goal is to obtain improved estimates of wire-length requirements for circuitry on ULSI chips. A discussion of the limitations of these wiring estimates and a comparison with prior work are presented in [13].

For each design, estimates and ranges for the normalized probability density function pint(L), cumulative distribution function P(L), average wire length Lavg(pR), and total wire length Ltot(pR) are obtained. To obtain these estimates, we first extract values for the parameters {kRpR} for the designs in each functional unit with Equation (13). As discussed in an earlier section, each design provides one datapoint on the log–log plot for each functional unit; for this datapoint, the value for Ngates takes into careful account the values of Nall, Nunconnected, Nspare, Nfiller, and Ndecap, and the value of Nconn is obtained from a count of Ftotal, Nnets, and TIO.

Figure 5 shows a log–log plot of Nconn as a function of Ngates for each of the six units in the POWER4 core. For example, in this figure, the 18 datapoints for the IFU are provided by the 18 individual designs in the IFU. Linear fits of the design data to Equation (13) for each unit provide the appropriate values of {kRpR}, where the value of kR falls roughly in the range of 1 to 4, as shown in Table 4 [10] and pR ~ 1. Complete listings of the estimates and ranges within one standard deviation for kR and pR are given in [13]. For comparison, Table 4 also lists the values of the Rent parameters {kp} that are obtained with the external interpretation of Rent's memoranda [15810]. The results in Table 4 show that the value of pR is close to unity for each of the six functional units in the core, which suggests that the number of connections in all of the designs in the core tends to increase linearly with the number of gates. Since these designs are composed mainly of inv, the choices of gates in all of the designs are generated with the same set of sophisticated computer-aided design (CAD) software programs, and since it has been observed that designs that contain both one type of gate and a similar gate allocation are expected to exhibit a linear dependence of the number of connections as a function of the number of gates, as discussed in a previous section, it is expected that pR ~ 1.

Figure 5 Figure 5


Table 4 Values of the parameter pairs {kp} and {kRpR}for the six POWER4 functional units [10]. Comparisons of the measured wire-length requirements LT with estimates Ltot(p) and Ltot(pR) are also shown for each unit. Wire lengths are shown in gate pitches. The range for the estimates is shown in brackets.
UnitkpkRpRLTLtot(p)
[range]
(Errora)
Ltot(pR)
[range]
(Errorb)

IFU0.790.693.820.95226,826.9141,256.9
[106,962.1, 192,256.1]
(−38)
249,497.0
[234,517.7, 265,301.2]
(10)
FPU2.210.662.41.0321,915.911,886.8
[10,244.9, 13,999.2]
(−46)
19,046.1
[18,685.9, 19,412.3]
(−13)
FXU4.360.612.910.9826,030.910,380.8
[8,991.9, 12,140.4]
(−60)
21,818.4
[20,098.3, 23,659.3]
(−16)
IDU20.50.301.241.12148,700.659,240.1
[52,017.7, 69,503.6]
(−60)
243,123.7
[230,804.0, 255,815.9]
(63)
ISU23.30.312.311.04309,901.892,190.1
[77,970.2, 114,605.0]
(−70)
390,247.9
[356,529.6, 426,134.9]
(26)
LSU7.330.462.281.02553,852.8220,556.9
[183.878.7, 274,265.0]
(−60)
747,767.6
[709,590.4, 787.415.6]
(35)
Core1.29 × 106535,511.6
[440,066.5, 676,769.3]
(−58)
1.67 × 106
[1.57 × 106, 1.78 × 106]
(30)
aE[Ltot(p)] (%) = 100 · [Ltot(p) − LT]/LT
bE[Ltot(pR)] (%) = 100 · [Ltot(pR) − LT]/LT

Table 5 [5] compares the values of the average wire-length estimates Lave(pR) obtained with the Davis model [5] for each design with the measured average wire length La and with the prior estimate Lave(p) for the control logic designs in the POWER4 IFU. Table 4 compares the total wire-length estimate LT(pR) for each design with the measured total wire length LT and with the prior estimate LT(p), where LT(pR) and LT(p) are given by the expressions,

LT(pR) = Nnets × Lavg(pR),(21)
LT(p) = Nnets × Lavg(p),

where the term Nnets represents the number of connected nets in the design. Table 4 compares estimates of the total wire length Ltot(pR) in the entire POWER4 core with the measured requirement and with the prior estimate Ltot(p), where Ltot(pR) and Ltot(p) are given by the expressions,

Equation 22(22)

where the sums are taken over all control logic designs Ndesigns in each unit, Ljavg(pR) represents the average wire length provided for design j by the Davis model [5] as a function of pR, Ljavg(p) represents the average wire-length estimate for design j provided by the Davis model evaluated as a function of p, and Njnets represents the number of connected nets in design j.


Table 5 Comparison of the estimate of the measured average wire length La and LT (in gate pitches) for 18 IFU designs with average wire-length estimates obtained with the Davis model [5]. Wire lengths are given in gate pitches. Ranges for the estimates are shown in brackets.
IFULaLavg(p)
[range]
(Errora)
Lavg(pR)
[range]
(Errorb)
LTLT(p)
[range]
(Errorc)
LT(pR)
[range]
(Errord)

i13.32.4
[2.2, 2.6]
(−28)
2.8
[2.8, 2.9]
(−13)
305.5220.5
[201.3, 243.2]
(−28)
264.7
[259.3, 270.1]
(−13)
i25.53.0
[2.6, 3.5]
(−46)
3.9
[3.8, 4.1]
(−28)
1,745.8944.4
[822.1, 1,098.0]
(−46)
1,249.8
[1,211.5, 1,289.4]
(−28)
i34.23.0
[2.6, 3.5]
(−29)
4.0
[3.8, 4.1]
(−5)
847.6605.6
[526.6, 704.9]
(−29)
803.1
[778.3, 828.8]
(−5)
i44.03.0
[2.6, 3.5]
(−24)
4.0
[3.9, 4.1]
(0)
1,011.2764.7
[664.1, 891.3]
(−24)
1,016.7
[985.0, 1,049.4]
(1)
i53.73.8
[3.1, 4.8]
(2)
5.8
[5.5, 6.0]
(54)
2,999.53,063.3
[2,500.1, 3,826.1]
(2)
4,623.4
[4,418.9, 4,836.7]
(54)
i66.24.0
[3.2, 5.1]
(−36)
6.2
[5.9, 6.5]
(−1)
6,218.33,982.3
[3,211.3, 5,041.1]
(−36)
6,159.2
[5,871.5, 6,459.8]
(−1)
i75.74.0
[3.2, 5.1]
(−29)
6.2
[5.9, 6.5]
(10)
4,731.23,348.7
[2,699.9, 4,239.9]
(−29)
5,181.1
[4,938.9, 5,434.2]
(10)
i84.14.1
[3.3, 5.2]
(0)
6.3
[6.0, 6.7]
(56)
4,261.14,269.2
[3,427.3, 5,431.2]
(0)
6,663.1
[6,345.9, 6,994.9]
(56)
i94.84.1
[3.3, 5.2]
(−16)
6.4
[6.1, 6.7]
(32)
4,472.93,769.5
[3,024.3, 4,798.8]
(−16)
5,890.5
[5,609.3, 6,184.6]
(32)
i103.64.1
[3.3, 5.3]
(14)
6.5
[6.2, 6.8]
(80)
4,140.64,726.3
[3,778.8, 6,040.3]
(14)
7,438.1
[7,077.7, 7,815.1]
(80)
i114.24.2
[3.3, 5.4]
(1)
6.7
[6.4, 7.1]
(61)
4,312.94,341.7
[3,448.7, 5,589.4]
(1)
6,924.2
[6,579.5, 7,285.1]
(61)
i129.34.8
[3.7, 6.4]
(−49)
8.2
[7.8, 8.7]
(−11)
19,794.910,193.4
[7,795.5, 13,691.2]
(−49)
17,556.1
[16,549.2, 18,615.9]
(−11)
i139.54.9
[3.7, 6.6]
(−49)
8.5
[8.0, 9.0]
(−11)
21,765.911,159.9
[8,481.9, 15,094.0]
(−49)
19,464.1
[18,323.9, 20,665.0]
(−11)
i147.04.9
[3.7, 6.7]
(−30)
8.7
[8.1, 9.2]
(23)
16,555.111,623.6
[8,806.3, 15,777.1]
(−30)
20,403.4
[19,195.5, 21,676.2]
(23)
i158.14.9
[3.7, 6.7]
(−39)
8.7
[8.2, 9.3]
(7)
20,318.612,374.6
[9,363.1, 16,821.1]
(−39)
21,779.1
[20,484.2, 23,143.9]
(7)
i1610.45.0
[3.8, 6.8]
(−52)
8.8
[8.3, 9.4]
(−15)
31,544.615,215.2
[11,479.4, 20,749.2]
(−52)
26,934.8
[25,318.2, 28,639.2]
(−15)
i178.75.6
[4.1, 7.9]
(−36)
10.6
[9.9, 11.4]
(22)
38,034.024,362.2
[17,730.2, 34,593.6]
(−36)
46,383.4
[43,276.3, 49,674.5]
(22)
i189.55.7
[4.1, 8.2]
(−40)
11.0
[10.3, 11.8]
(16)
43,767.326,291.6
[19,002.1, 37,625.6]
(−40)
50,762.3
[47,294.7, 54,438.6]
(16)
aE[Lavg(p)] (%) = 100 · {[Lavg(p) − La]/La}
bE[Lavg(pR)] (%) = 100 · {[Lavg(pR) − La]/La}
cE[LT(p)] (%) = 100 · {[LT(p) − LT]/LT}
dE[LT(pR)] (%) = 100 · {[LT(pR) − LT]/LT}

The results presented in Tables 4 and 5 show that the estimates obtained by evaluating the expressions in the models as functions of {kRpR} show improved agreement with the real wire-length requirements compared with the agreement obtained with prior estimates [10]. Note that the estimates are within 10% of LT for the IFU. Wire-length estimates are also improved [1013] for the designs in the other five units; specifically, wire-length estimates are improved for 13 of 18 IFU designs, all 12 FPU designs, 3 of 4 FXU designs, 10 of 18 IDU designs, 14 of 16 ISU designs, and 28 of 32 LSU designs, compared with prior work.

Table 4 shows that when all of the designs in the POWER4 core are considered as a group, the wire-length estimate for the interconnection requirement in the core is within 30% of the measured requirement. This result also compares favorably with the result obtained in prior work in which agreement between the estimate of total wire length is within 58% of the total wire-length measurement [10].

We now discuss the wire-length requirements for one design, i9, in the IFU. The area occupancy [100] of the design is 74%, which is near the median occupancy of all of the IFU designs; here the term area occupancy refers to the ratio of the area occupied by the used gates to the total area allocated to the design in the silicon die, as described in [100]. Figure 6 shows a histogram of the area occupancy of all 18 IFU designs. The area occupancy of the IFU designs ranges from 28% to 85%, with a mean occupancy of 65%.

Figure 6 Figure 6

Figure 7 compares measurements and estimates of the normalized probability density function pint(L) and cumulative distribution function P(L) for design i9. In this figure, the estimates are provided by the Davis model [57] as functions of the parameter pairs {kp} and {kRpR}, where the measured values of pint(L) are shown with solid circles and the measured values of P(L) are shown with hollow squares. The dotted lines indicate distributions that are obtained by evaluating the Davis model as functions of {kp} [10], and where the upper range and lower range are obtained with the values for the ranges of k and p. The solid lines indicate the distributions that are obtained by evaluating the Davis model as functions of the historically equivalent Rent parameters {kRpR}. The figure shows that the curves obtained with {kRpR} exhibit qualitatively better agreement with the measured curves compared with the curves obtained with parameter pair {kp} [10].

Figure 7 Figure 7

Discussion

The approach presented in this paper is intended to provide information for future research and additional applications of Rent's work. To date, Rent's work has been referenced in research and development of a variety of scientific and technical fields, including the following: on-chip interconnections, semiconductor circuitry, computer systems design, applied sciences, and semiconductor manufacturing technology. Examples of applications in each of these fields were provided in an earlier section of this paper. With Rent's work now available in the literature, one is able to study the original sources within the context of the development of computer hardware components. Because prior work was based on the 1971 interpretation of the two Rent memoranda and because significant changes have occurred in the development of computer systems hardware since Rent wrote the two memoranda in 1960, it is necessary to provide an interpretation of these memoranda that is suitable for today's ULSI circuitry.

For the example of on-chip interconnections in ULSI circuitry described in this paper, this approach provides chip designers with techniques to obtain wire-length estimates and distributions that exhibit improved agreement with measured interconnection requirements. These methods help achieve the goal of buildable and functional chip designs in spite of increasingly severe chip-design constraints that include the following: restrictive real-estate specifications, the limited number of metal layers available in the manufacturing technology, aggressive operating frequencies, and power dissipation limits [4424849616370].

In principle, the development of an interpretation of the memoranda that is applicable for today's ULSI circuitry is essentially the development of a technique to describe more generally the connectivity of nanoscale components, of which integrated circuitry is one specific example. This paper has presented a description of the connectivity of nanoscale components in the silicon brain, where the term silicon brain refers to the high-performance central processing unit (CPU) that contains the specific chip designs examined in this study. Note that this interpretation does not concern itself with the two-dimensional nature of chip layouts, since this constraint is imposed by the current manufacturing processes. Thus, the interpretation of connectivity presented in this paper can, in principle, be applied to descriptions of connectivity in other complex systems; these systems include, for example, the connectivity in the human brain [101104].

Considered as a complex circuit, the design of the human brain can be described as a network of neurons in conjunction with chemicals that enable a progression of electrical signals and chemical signals to connect the appropriate set of neurons. One may relate the components of the silicon brain and the human brain with the following mapping: The term circuit is understood to be the logic gate in the silicon brain and the neuron in the human brain; the term interconnection is understood to be the wire in the silicon brain and axons in the human brain; the signal inputs are the logic gate inputs in the silicon brain and dendrites in the human brain; the signal outputs are the logic gate outputs in the silicon brain and axon synaptic knobs in the human brain. Specifically, the interpretation for nanoscale components described in this paper provides a technique to quantify the connectivity in the human brain at the neuron level, for example to quantify differences among different brain regions and to distinguish different stages of development, aging, and disease that occur as the result of neural degradation, such as Alzheimer's disease, for which the underlying cause remains unknown at this time.

In this example, this mapping suggests that values of the Rent parameters to quantify the connectivity of components in each region of the human brain can be obtained with the following steps: First, the neurons in one region of the brain are grouped into subregions of neurons; second, a count is obtained of the number of dendrites and axon synaptic knobs and the number of neurons in each subregion; third, the number of potential connections (sum of the number of dendrites and axon synaptic knobs) is plotted on a log–log plot as a function of the number of neurons for each subregion; fourth, a least-square linear fit is made to the graphed data, and the empirical values of the Rent parameters can be extracted from the fit parameters, where the value of the Rent exponent is the value of the slope of the line, and the value of the Rent coefficient is the value of the inverse log of the intercept. Cox–Golgi preparations may provide one technique to visualize these components (neurons, axons, and dendrites) [101104]. One may use values of the Rent parameters in each region of a normal human brain as a basis for comparison with the corresponding region in other human brains to establish baselines for and quantify the progression of different stages of development, aging, and disease.

Conclusions

To provide background for the extensive body of research based on a 1971 interpretation of the work of E. F. Rent and to satisfy requests from researchers for these memoranda, this paper presents the original work of Rent and an historically equivalent interpretation of these memoranda that is suitable for today's computer hardware components. The interpretation that we introduce for today's computer components is needed because computer components have changed significantly compared with the discrete-transistor-based hardware components that are considered in Rent's original work. This paper also applies this interpretation to existing on-chip wire-length distribution models and presents estimates of interconnection requirements that demonstrate improved agreement to within 30% with actual wire-length requirements compared with prior methods, which obtained agreement with measurements to within 58%.

Acknowledgments

We thank the reviewers, whose comments have helped to improve this paper substantially. We thank Bernard Landman of the IBM Thomas J. Watson Research Center, Hawthorne, NY, for providing us with copies of E. F. Rent's two memoranda, which were given to him by