Transit Note #38

MBTA: Quick Overview

Andre DeHon

Original Issue: January 1991

Last Updated: Mon Nov 8 14:57:22 EST 1993

Abstract:

The Modular Bootstrapping Transit Architecture (MBTA) addresses a couple of current needs which arise as part of our research to develop novel parallel computer architectures. MBTA will serve as a full-scale test of the Transit stack packaging scheme, the Transit fault-tolerant bidelta network, and a METRO routing component. Additionally, MBTA will serve as a modular framework in which to study and experiment with the various components necessary for large scale parallel computing.

Introduction

The modular bootstrapping transit architecture (MBTA) aims to satisfy two major goals:

  1. Provide a full-scale field test for the Transit packaging scheme, network, and routing component ( METRO).
  2. Provide a flexible scaffolding in which to study the design of the components and systems necessary to build efficient parallel computers.
To meet these needs, we plan to construct a small parallel computer (64 processors). The network will be composed of METRO routing components (tn73), organized to provide fault-tolerance [DKM90] and packaged in the Transit stack packaging scheme [Kni89] (tn33). The nodes of this computer are designed to require minimal hardware while servicing the network at its full projected speed. Additionally, the processing nodes are designed to be simple and flexible in order to allow efficient emulation of a broad range of parallel computer architectures.

Transit Network and Stack Packaging

Routing Component

The METRO routing component is the basic VLSI building block for the Transit network. METRO is a custom CMOS routing component (the successor to RN1) currently under construction. It provides simple, high-speed switching for fault-tolerant networks. The target router for use in MBTA will have eight ten-bit wide input channels and eight ten-bit wide output channels. These channels provide byte-wide data transfer with the ninth bit serving as a signal for the beginning and end of transmissions and the tenth bit serving for reverse flow-control. The primary METRO configuration is a crossbar router with a dilation of two. In this configuration, all eight input channels are logically equivalent. Alternately, the component can be configured as an dilation one crossbars.

Component Packaging

METRO is packaged in a 372 pin dual sided pad grid array package (DSPGA372) ( or, perhaps, its successor) (tn33). DSPGA372 is a custom package designed by Fred Drenckhahn and Tom Knight for our high performance, high density, three-dimensional packaging strategy. The package is designed to accommodate high-speed controlled impedance signalling for large pin count VLSI components. DSPGA372 integrates features for robust alignment, cooling, and effective use of printed circuit board space.

DSPGA372 mates with a button board connector (BB372) to provide very low-profile solderless interconnect. BB372 is a custom button board connector designed to facilitate our dense three-dimensional packaging strategies. BB372 provides low-resistance, solderless vertical connection between pad grid arrays and printed circuit boards (tn33) [Kni89].

Network Organization

MBTA is based around an indirect, multistage network built from METRO routing components and organized in a bidelta configuration. Routing components are connected as described in [DKM90] to achieve fault-tolerance.

Stack Packaging

The Transit network is packaged in a three-dimensional stack structure (Figure ). The network stack is composed of alternating layers, or planes, of components and printed circuit boards. Components and printed circuit boards are interconnected using the BB372 button board connectors. The component layers perform switching while the printed circuit boards interconnect routing components to effect the network organization just described. Figure shows a cross section of stack demonstrating how the button board carriers, routing components, and printed circuit boards mate to form a stack.

Packaging Test

MBTA will offer a complete field test of this network structure and packaging scheme. The nodes will be able to keep the network fully utilized at its projected full-speed operation of 100 megabytes/second/port. MBTA will exercise the METRO routing component fully. The performance of METRO's routing and fault-tolerance protocols can be easily evaluated in this representative setting. This field test will allow us to evaluate robustness of the packaging components and stack structure. From MBTA we should gain experience with critical issues such as clocking, powering, and cooling this dense three-dimensional packaging structure.

Parallel Computer Test Bed

Building a large scale parallel computer is a huge task. There is a considerable amount which we do not understand about how to build and efficiently program parallel computers. To successfully construct and evaluate a parallel computer numerous system components must be developed and integrated together. Such components typically include a network, memory, network and cache interfaces, processors, compilers, operating systems, programming paradigms, and application programs. The volume and breadth of components necessary to construct a parallel computer make the task of bootstrapping and evaluating a particular parallel architecture, or even a single component of a parallel architecture, very difficult. Once a system is built, many of the components are set in stone, and it is not possible to easily evaluate changes to single components of the system. As a result, all of the effort expended to realize a particular parallel computer usually result in only a few datapoints in the multidimensional space of potential parallel computer architectures.

By providing parallel emulation of the hardware components in a parallel computer, MBTA attempts to provide a modular framework in which parallel architectures can be evaluated. Architecture and protocol ideas can be soft coded onto the machine. MBTA can then be used to emulate the system under study. Software can be targeted at the emulated system. Efforts at all levels of software, architecture, protocols, and paradigms can feed back on each other to identify the best ensemble of components necessary for efficient parallel architectures.

The modularity provided allows experimenters to develop or experiment with a component of the system independent from others. The MBTA scaffolding allows the system component under study to be examined in the context of an operational machine. Variants of system components can be mixed and matched to study their interaction. As the pieces of the system are better understood, designs can be spawned off which replace the generic MBTA modules with hardwired components. The modular architecture should allow the rapid incorporation of such developments into complete parallel computer systems.

Software Model

In order to take full advantage of MBTA's modular hardware architecture, we aim to utilize a software model which is modular at the compiler and programming level, as well. Figure shows the software modularity we aim to achieve. That is, at a software level, MBTA attempts to exemplify an approach to parallel computing which fully decouples the hardware architecture from the programming paradigm (tn34). This scheme makes the hardware fully modular with respect to the programming model. It allows any architecture to leverage evaluation code from many programming models. The processes of defining the parallel register transfer language allows us to focus on the computational mechanisms necessary to efficiently support the wide range of evolving parallel programming paradigms. With this software model and the ability to emulate a wide range of parallel computer hardware configurations, MBTA will be a powerful tool for evaluating parallel computer architectures.

Basic Architecture

Each MBTA node will be composed of a processor, memory, bus logic, and network interfaces. Figure shows the high level composition and organization of an MBTA node. This node architecture is intended to allow both fast network testing and fair emulation (tn25). Nodes are interconnected using the Transit network described in Section .

During each emulation cycle, the processor on each node emulates the function of the node. Thus, within the emulation cycle, the processor will execute an appropriate time-slice for each piece of hardware which is being modeled in the node. Fair emulation is guaranteed by keeping the relative progression of each node synchronized. When the emulation utilizes the underlying Transit network directly, dummy network cycles are introduced between the transmission of real data in order to match the bandwidth of the network to the bandwidth of the emulating nodes.

Intel's 80960CF [MB88] [Int89c] serves as the node processor. Initially, the node memory will be flat, fast-static RAM for simplicity and maximum emulation flexibility. The basic design will accommodate additional DRAM memory and co-processors if these seem appropriate during the design and use of the experimental prototypes. A custom network interface component of our design connects the node to the METRO based network (tn75). Additionally, a custom node bus controller is responsible for coordinating the interactions of the processor, memory, and network interfaces (tn30).

See Also...

References

DeH90a
Andre DeHon. MBTA: Boot Sequence. Transit Note 28, MIT Artificial Intelligence Laboratory, July 1990. [tn28 HTML link] [tn28 FTP link].

DeH90b
Andre DeHon. MBTA: Message Formats. Transit Note 21, MIT Artificial Intelligence Laboratory, June 1990. [tn21 HTML link] [tn21 FTP link].

DeH90c
Andre DeHon. MBTA: Network Initialization. Transit Note 27, MIT Artificial Intelligence Laboratory, July 1990. [tn27 HTML link] [tn27 FTP link].

DeH90d
Andre DeHon. MBTA: Network Interface. Transit Note 31, MIT Artificial Intelligence Laboratory, August 1990. [tn31 HTML link] [tn31 FTP link].

DeH90e
Andre DeHon. MBTA: Network Level Transactions. Transit Note 19, MIT Artificial Intelligence Laboratory, June 1990. [tn19 HTML link] [tn19 FTP link].

DeH90f
Andre DeHon. MBTA: Software Model. Transit Note 34, MIT Artificial Intelligence Laboratory, December 1990. [tn34 HTML link] [tn34 FTP link].

DeH90g
Andre DeHon. MBTA: Thoughts on Construction. Transit Note 18, MIT Artificial Intelligence Laboratory, June 1990. [tn18 HTML link] [tn18 FTP link].

DeH90h
Andre DeHon. T-Station: The MBTA Host Interface. Transit Note 20, MIT Artificial Intelligence Laboratory, June 1990. [tn20 HTML link] [tn20 FTP link].

DeH91a
Andre DeHon. MBTA: Clocking Strategy. Transit Note 37, MIT Artificial Intelligence Laboratory, January 1991. [tn37 HTML link] [tn37 FTP link].

DeH91b
Andre DeHon. MBTA: Wonderland Packaging. Transit Note 39, MIT Artificial Intelligence Laboratory, February 1991. [tn39 HTML link] [tn39 FTP link].

DeH91c
Andre DeHon. RN2 Proposal. Transit Note 44, MIT Artificial Intelligence Laboratory, April 1991. [tn44 HTML link] [tn44 FTP link].

DeH92
Andre DeHon. METRO LINK -- METRO Network Interface. Transit Note 75, MIT Artificial Intelligence Laboratory, September 1992. [tn75 HTML link] [tn75 FTP link].

DKD90
Fred Drenckhahn, Thomas Knight Jr., and Andre DeHon. Stack Packaging Components. Transit Note 33, MIT Artificial Intelligence Laboratory, December 1990. [tn33 HTML link] [tn33 FTP link].

DKM90
Andre DeHon, Thomas F. Knight Jr., and Henry Minsky. Fault-Tolerant Design for Multistage Routing Networks. AI memo 1225, MIT Artificial Intelligence Laboratory, April 1990. [FTP link].

DKM91
Andre DeHon, Thomas F. Knight Jr., and Henry Minsky. RNP: Fault Tolerant Routing Protocol. Transit Note 41, MIT Artificial Intelligence Laboratory, March 1991. [tn41 HTML link] [tn41 FTP link].

DS90a
Andre DeHon and Thomas Simon. MBTA: Node Architecture. Transit Note 25, MIT Artificial Intelligence Laboratory, July 1990. [tn25 HTML link] [tn25 FTP link].

DS90b
Andre DeHon and Thomas Simon. MBTA: Node Architecture Selection. Transit Note 22, MIT Artificial Intelligence Laboratory, June 1990. [tn22 HTML link] [tn22 FTP link].

DS90c
Andre DeHon and Thomas Simon. MBTA: Node Bus Controller. Transit Note 30, MIT Artificial Intelligence Laboratory, August 1990. [tn30 HTML link] [tn30 FTP link].

EDP +92
Eran Egozy, Andre DeHon, Samuel Peretz, Henry Minsky, and Thomas F. Knight Jr. METRO Architecture. Transit Note 73, MIT Artificial Intelligence Laboratory, August 1992. [tn73 HTML link] [tn73 FTP link].

Ego90
Eran Egozy. Symbolic Wiring Routing Package. Transit Note 29, MIT Artificial Intelligence Laboratory, August 1990. [tn29 HTML link] [tn29 FTP link].

Int89a
Intel Corporation, Literature Sales, P.O. Box 58130, Santa Clara, CA 95052-8130. 80960CA 32-bit High Performance Embedded Processor, September 1989.

Int89b
Intel Corporation, Literature Sales, P.O. Box 58130, Santa Clara, CA 95052-8130. 80960CA Product Overview, August 1989.

Int89c
Intel Corporation, Literature Sales, P.O. Box 58130, Santa Clara, CA 95052-8130. 80960CA User's Manual, 1989.

Kni89
Thomas F. Knight Jr. Technologies for Low-Latency Interconnection Switches. In Symposium on parallel architecures and algorithms. ACM, June 1989.

MB88
Glenford J. Myers and David L. Budde. The 80960 Microprocessor Architecture. Wiley-Interscience, 1988.

MDK91
Henry Minsky, Andre DeHon, and Thomas F. Knight Jr. RN1: Low-Latency, Dilated, Crossbar Router. In Hot Chips Symposium III, 1991.

Min90
Henry Q. Minsky. RN1 Data Router. Transit Note 26, MIT Artificial Intelligence Laboratory, July 1990. [tn26 HTML link] [tn26 FTP link].

Min91a
Henry Q. Minsky. A Parallel Crossbar Routing Chip for a Shared Memory Multiprocessor. Master's thesis, MIT, 545 Technology Sq., Cambridge, MA 02139, January 1991. [FTP link].

Min91b
Henry Q. Minsky. RN1 Data Router -- B revision. Transit Note 45, MIT Artificial Intelligence Laboratory, May 1991. [tn45 HTML link] [tn45 FTP link].

MIT Transit Project