Information about Tandem Computers

Tandem Computers was an early manufacturer of fault tolerant computer systems, marketed to the growing number of transaction processing customers who used them for ATMs, banks, stock exchanges and other similar needs. Tandem systems used a number of redundant processors and storage devices to provide high-speed "failover" in the case of a hardware failure, an architecture that they called NonStop. Over the two decades from the 1970s into the mid-90s, Tandem systems evolved from custom hardware to commodity CPU designs. The company was eventually purchased by Compaq in 1997 in order to provide that company with more robust server offerings. Today it is still known as NonStop, as a separate product line offered by Hewlett-Packard.

History

Tandem Computers was founded in 1974 by a group of engineers from Hewlett-Packard: James Treybig, Mike Green, Jim Katzman, and Jack Loustaunou. Their business plan called for systems that were safe from "single-point failures" that were only slightly more expensive than competing non-fault tolerant systems. Tandem considered this to be very important to their business model. Limiting the additional expense was important since customers often developed procedural solutions to failures when the price of fault tolerant hardware was too high.

The first system was the Tandem/16 or T/16 (later called NonStop I after the introduction of its successor, the NonStop II). The system design was complete in 1975, and the first example was sold to Citibank in 1976. The machine consisted of between 2 and 16 processors, each capable of about 0.7 MIPS with their own memory, I/O buses, and dual connections to their custom inter-CPU computer bus, Dynabus. The modules were constructed with dual paths so that any single failure would always leave at least one bus (both I/O and Dynabus), free for use by the other modules. The CPU was influenced by the HP3000 CPU, a microprogrammed 16-bit stack-based machine with 16-bit user addressing. Like the HP3000, the NonStop CPU added a number of registers for fast access, such as base addresses for global and local variables.

The Tandem NonStop series ran a custom operating system, initially called T/TOS (Tandem Operating System), later Guardian, and finally NonStop Kernel. It supported a "NonStop" programming paradigm that allowed a program to be completely fault tolerant. Several other companies introduced failover technologies but only Guardian supported completely fail-safe transaction processing. A properly constructed Guardian program could fail at any point and resume transaction processing without any loss of data.

While conventional systems of the era, including mainframes, had failure rates on the order of a few days, the NonStop system was designed to fail 100 times less, with "uptimes" measured in years. Nevertheless the NonStop was deliberately designed to be price-competitive with conventional systems, with a simple 2-CPU system priced at just over two times that of a competing single-processor mainframe, as opposed to four or more times of most competing solutions.

NonStop I was followed by the NonStop II in 1981, a slight improvement in speed to 0.8 MIPS, but a more measurable upgrade in memory from a maximum of 1 MB per CPU in the later versions of the NonStop I, to 2 MB in the II, and the addition of a revamped virtual memory system allowing for considerably larger address spaces. The NonStop I was limited to 4 virtual memory segments (System Data, System Code, User Data, User Code) each limited to 128 kB in size.

The NonStop II increased the number of memory maps from 4 to 16, 8 of which were used for I/O, and provided a 32 bit address mode with user-accessible "extended segments" virtually unlimited in size. The same basic system, including the physical packaging, was used in 1983's NonStop TXP system that more than doubled the speed to 2.0 MIPS, and increased the physical memory to 8 MB. In all of these machines the same Dynabus system was used, which had been overdesigned in the NonStop I so they could avoid changing it in the future.

Introduced along with the TXP was a new fibre optic bus system, FOX. FOX allowed a number of TXP and NonStop II systems to be connected together to form a larger system with up to 14 nodes. Like the CPU modules within the computers, Guardian could failover entire task sets to other machines in the network.

The company attempted to grab a piece of the rapidly-growing personal computer market in 1985 with its introduction of the MS-DOS based Dynamite PC/workstation. Sadly, numerous design compromises (include a unique 8086-based hardware platform incompatible with expansion cards of the day and extremely limited compatibility with IBM-based PC's) relegated the Dynamite to serving primarily as a smart terminal. It was quietly withdrawn from the market within a short period of time.

In 1986 a major upgrade to the system was introduced, the NonStop VLX. VLX used a new Dynabus, increasing speed from 13 Mbit/s to 40 Mbit/s (total, 20 Mbit/s per independent bus). They also introduced FOX II, increasing the size of the networks from 1 km to 4 km. Using the original FOX VLX systems could be used with the older NonStop II and TXP's, but these systems were not supported on FOX II.

VLX was partnered with the NonStop CLX, a minicomputer sized machine for smaller installations. The CLX had roughly the same performance as the earlier TXP, but was much smaller and less expensive. By the end of its lifetime the CLX had increased in speed considerably, and competed with the VLX, 1991's CLX 800 was only about 20% slower than the VLX, with the main difference being more limited expansion abilities.

In 1986 Tandem also introduced the first fault-tolerant SQL database, NonStop SQL. Developed totally in-house, NonStop SQL included a number of features based on Guardian to ensure data validity across nodes. NonStop SQL was famous for scaling linearly in performance with the number of nodes added to the system, whereas most databases of the era had performance that plateaued quite quickly, often after two CPUs. A later version released in 1989 added transactions that could be spread over nodes, a feature that remained unique for some time. Later, the SQL database group was first co-opted then absorbed into Microsoft's SQL development effort. One outcome of this collaboration was Microsoft's clustered system technology.

The NonStop Cyclone was introduced in 1989, introducing a new superscalar CPU design. It was otherwise similar to earlier systems, although much faster. In general terms the Cyclone was about four times as fast as the CLX 800, which Tandem used as their benchmark. On the downside the new CPU was complex and expensive, requiring four circuit boards to implement a single CPU.

In 1991 Tandem followed this with RISC-implementations of Guardian, running on MIPS R3000-based CPU modules in the Cyclone/R and CLX/R. Programs written for the earlier stack-based CPU design were automatically translated on the fly into R3000 code in an interpreter, although they ran considerably slower than on earlier machines. Tandem also provided a number of tools to easily port existing object code to the new systems, resulting in code that was some 25% slower than the original Cyclone. Source code compilers were also available. While slower, the new system was considerably less expensive, and it was clear that RISC performance was outpacing CISC. By making the move when they did, they were banking on increases in MIPS performance quickly wiping out any performance disadvantages the system had at the time. In 1993 the NonStop Himalaya K-Series using the MIPS R4400 was shipped.

In 1997 Tandem introduced the NonStop Himalaya S-Series. The S-Series machines were the first systems that changed the underlying architecture of the NonStop family, basing both the I/O and inter-CPU communication on their new ServerNet interconnect. Whereas Dynabus and FOX linked the CPU's together into a ring network, ServerNet was a true point-to-point network replacing both, and ran at much higher speeds. ServerNet later was used as the basis of the InfiniBand industry standard. The S-Series machines continued the use of MIPS processors, including the R4400 and R10000.

All the more recent systems were based on microprocessors, and the internal circuits of these chips are not fully checked. To assure correct computation, each logical processor had two microprocessors operating in lockstep. If the results coming out ever disagreed, the processor was considered to be faulting and instantly stopped. At that point Guardian would move that task to another processor as in earlier systems, guaranteeing that bad data was never written out due to hardware failures.

A different approach was used in a separate family of computers, the Integrity line. These computers used additional redundant CPUs running the same instruction stream. When a fault was detected (e.g. by lockstep mismatch), the failing module was disabled but the redundant module continued processing the instruction stream without interruption. Since this was handled primarily in hardware, it could be used with a slightly modified conventional operating system; Integrity used a Unix variant rather than Guardian. The line was introduced in 1989, apparently as a response to the machines of Stratus Technologies (which were remarketed by IBM as IBM System/88). Although distinct from the NonStop line, the Integrity designs were also based on the MIPS processors. With the introduction of the Integrity S4000 in 1995, the line was the first to use ServerNet and moved towards sharing hardware designs with the NonStop line.

Tandem was acquired by Compaq in 1997. Compaq was in turn acquired by HP in 2002, bringing Tandem back to its original roots. As of 2003, the NonStop product line continues to be produced, under the HP name.

After being acquired by HP, the NonStop line has moved to Itanium based processors, called Integrity NonStop Servers. The original Integrity line is no longer produced but the name 'Integrity' has been adopted by HP for all Itanium based servers.

The NonStop Kernel (NSK) can run multiple OS's. In addition to the Guardian OS, the modern NonStop platform incorporates a POSIX compatible environment (OSS) and Java. There is also an effort by HP to run Linux on the NonStop hardware.[1] Also, Linux or other Unix based operating systems could be installed on the NonStop platform via a virtual machine environment.

Culture

Tandem treated its employees with a great deal of respect, especially in the years leading to the company's first billion-dollar yearly sales figure. Innovative programs included:
  • TOPS ("Tandem Outstanding PerformerS") - every employee in the company could be nominated for this award, which was awarded to about the top 5% of employees annually. Winners (and a guest of their choosing) were treated to an all-expense paid trip to locations such as Hawaii, Vail, and similar resort areas for several days of fun and teambuilding. Management actually worked the event as hosts. TOPS was known, among other things, for its 24-hour open bar, where one could encounter senior VPs and even the company CEO dishing out drinks and stories of the company's early years.
  • Annual stock option - every employee of the company received a 100-share stock option each fall. As the company's stock rose (or split), employees could share in the company's financial success.
  • Sabbaticals - all US employees earned a six-week paid sabbatical (contiguous vacation) every four years, which could be augmented with personal vacation. Employees who chose to perform public service during their sabbatical could apply for an additional three weeks.
  • "First Friday" - the award-winning in-house Tandem TV staff produced a monthly program, broadcast live to all Tandem locations world-wide. While generally educational about some aspect of the company, the programs usually featured some member of the senior management team in a humorous way.
  • "Beer Bust" - Tandem sponsored a weekly get-together for its employees world-wide. It was called "beer bust" due to the availability of beer and wine, paid for by the company, in addition to other beverages and prepared food. This gave employees a way to cross barriers. It was not uncommon to see employees from various functions huddled in a corner, beer in hand, working to solve a problem.
  • "Third Class Mail" - Tandem was one of the first companies in which every employee had access to e-mail, which was divided into first, second, and third classes. Third Class mail allowed employees to buy and sell goods, ask questions, and share information that was not company-related. A wide variety of "SIGs" (Special Interest Groups) allowed employees to share a variety of interests with each other.
As the company entered the 90's, however, sales and profits slowed, and many of these innovative programs were either curtailed or eliminated totally. By the end, Tandem was pretty much a company like any other in the computer field, culminating in the buyout by Compaq, who wasted little time eliminating almost all of these. Only beer bust, in a greatly watered down form (literally - many sites banned alcohol), survived.

References

1. ^ CNET article on HP bringing Linux to NonStop[1]

See also

External links

Fault-tolerant design refers to a method for designing a system so it will continue to operate, possibly at a reduced level (also known as graceful degradation), rather than failing completely, when some part of the system fails.
..... Click the link for more information.
computer is a machine which manipulates data according to a list of instructions.

Computers take numerous physical forms. The first devices that resemble modern computers date to the mid-20th century (around 1940 - 1941), although the computer concept and various machines
..... Click the link for more information.
transaction processing is information processing that is divided into individual, indivisible operations, called transactions. Each transaction must succeed or fail as a complete unit; it cannot remain in an intermediate state.
..... Click the link for more information.
automated teller machine (ATM) is a computerized telecommunications device that provides the customers of a financial institution with access to financial transactions in a public space without the need for a human clerk or bank teller.
..... Click the link for more information.
bank is a commercial or state institution that provides financial services , including issuing money in various forms, receiving deposits of money, lending money and processing transactions and the creating of credit.
..... Click the link for more information.
A stock exchange, share market or bourse is a corporation or mutual organization which provides facilities for stock brokers and traders, to trade company stocks and other securities.
..... Click the link for more information.
Failover is the capability to switch over automatically to a redundant or standby computer server, system, or network upon the failure or abnormal termination of the previously active server, system, or network.
..... Click the link for more information.
Compaq Computer Corporation was an American personal computer company founded in 1982, and is now a brand name of Hewlett-Packard.

The company was formed by Rod Canion, Jim Harris and Bill Murto — former Texas Instruments senior managers.
..... Click the link for more information.
Server Computer

The inside/front of a server computer

Connects to:
  • Internet via one of

..... Click the link for more information.
Hewlett-Packard Co.

Public (NYSE:  HPQ )
Founded Palo Alto, California (1939)
Headquarters Palo Alto, California, USA

Key people Bill Hewlett, Co-founder
David Packard, Co-founder
Mark V.
..... Click the link for more information.
Hewlett-Packard Co.

Public (NYSE:  HPQ )
Founded Palo Alto, California (1939)
Headquarters Palo Alto, California, USA

Key people Bill Hewlett, Co-founder
David Packard, Co-founder
Mark V.
..... Click the link for more information.
James (Jim) Treybig founded Tandem Computers, a pioneering Silicon Valley manufacturer of fault tolerant computer systems which were marketed to the growing number of transaction processing customers who used them for ATMs, banks, stock exchanges and other similar needs.
..... Click the link for more information.
Reliability engineering is an engineering field, that deals with the study reliability: the ability of a system or component to perform its required functions under stated conditions for a specified period of time.[1] It is often reported in terms of a probability.
..... Click the link for more information.
Fault-tolerant design refers to a method for designing a system so it will continue to operate, possibly at a reduced level (also known as graceful degradation), rather than failing completely, when some part of the system fails.
..... Click the link for more information.
Citibank

Subsidiary (of Citigroup)
Founded 1812
Headquarters New York, New York

Key people Chuck Prince, CEO & Director
Industry Finance
Products Financial Services
Slogan Let's get it done.
Website www.citibank.
..... Click the link for more information.
MIPS may refer to:
  • Million instructions per second, a measure of microprocessor speed
  • MIPS architecture, Microprocessor without Interlocked Pipeline Stages, a RISC microprocessor architecture

..... Click the link for more information.
bus (bidirectional universal switch) is a subsystem that transfers data or power between computer components inside a computer or between computers, and a bus typically is controlled by device driver software.
..... Click the link for more information.
A microprogram implements a CPU instruction set. Just as a single high level language statement is compiled to a series of machine instructions (load, store, shift, etc), in a CPU using microcode, each machine instruction is in turn implemented by a series of microinstructions,
..... Click the link for more information.
16-bit integers, memory addresses, or other data units are those that are at most 16 bits (2 octets) wide. Also, 16-bit CPU and ALU architectures are those that are based on registers, address buses, or data buses of that size.
..... Click the link for more information.
In computer science, a stack machine is a model of computation in which the computer's memory takes the form of one or more stacks. The term also refers to an actual computer implementing or simulating the idealized stack machine.
..... Click the link for more information.
16-bit integers, memory addresses, or other data units are those that are at most 16 bits (2 octets) wide. Also, 16-bit CPU and ALU architectures are those that are based on registers, address buses, or data buses of that size.
..... Click the link for more information.
In computer architecture, a processor register is a small amount of storage available on the CPU whose contents can be accessed more quickly than storage available elsewhere.
..... Click the link for more information.
An operating system (OS) is the software that manages the sharing of the resources of a computer. An operating system processes system data and user input, and responds by allocating and managing tasks and internal system resources as a service to users and programs of the
..... Click the link for more information.
Mainframes (often colloquially referred to as Big Iron) are computers used mainly by large organizations for critical applications, typically bulk data processing such as census, industry and consumer statistics, ERP, and financial transaction processing.
..... Click the link for more information.
This article is about the computer term. For the TBN game show, see Virtual Memory (game show).
Virtual memory is an abstraction implemented in a computer that gives an application program the impression it has contiguous working memory, while in fact it is
..... Click the link for more information.
An optical fiber (or fibre) is a glass or plastic fiber designed to guide light along its length. Fiber optics is the overlap of applied science and engineering concerned with such optical fibers.
..... Click the link for more information.
personal computer (PC) is a computer whose original sales price, size, and capabilities make it useful for individuals.

It is unknown who coined the phrase with the intent of a small affordable computing device but John W.
..... Click the link for more information.
MS-DOS (short for Microsoft Disk Operating System) is an operating system commercialized by Microsoft. It was the most commonly used member of the DOS family of operating systems and was the dominant operating system for the PC compatible
..... Click the link for more information.
International Business Machines Corporation

Public (NYSE:  IBM )
Founded 1889, incorporated 1911
Headquarters Armonk, New York, USA

Key people Samuel J.
..... Click the link for more information.
Minicomputer (colloquially, mini) is a largely obsolete term for a class of multi-user computers that lies in the middle range of the computing spectrum, in between the largest multi-user systems (mainframe computers) and the smallest single-user systems (microcomputers or
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter