The Wonderful World of Tape Libraries: Data Storage in the Mainframe Age

Imagine rooms and corridors filled not with books but with reels of magnetic tape: thousands and thousands of tapes stored in their cases and stacked on shelves along kilometers of corridors, meticulously numbered and ordered.

In the middle of the room stood a cart or two, which the librarian used to carry batches of reels to the computer room and then back to the library.

A portion of the Columbia Computer Center tape library (1982). Photo: Bob Resnikoff. Source: Columbia University

These special places were called tape libraries, and they were iconic elements of the mainframe age between 1945 and the early 1980s. Every large institution or research facility that used IBM mainframes to perform computations and handle large amounts of data was equipped with one. At the time, the machine and the data were physically separated: people literally had to go to the next room, fetch a set of raw data recorded on tape, carry it to the computer, and remove it once the computation was done so that it could be returned to storage for later use.

Employee taking reel of computer tape off shelves at the direct mail operation of Richard A. Viguerie Company, Inc. (1979). Source: Wikimedia.
📚
And yes, there were tape librarians
Tape libraries and mainframe computers required a tremendous amount of manual labor. Tapes had to be correctly labelled, filed, and located. Tape operators, sometimes called librarians, worked among the towering shelves, fetching reels, cleaning drives, and maintaining the rhythm of the organization's memory. Later, the physical handling of magnetic tapes was automated in both software and hardware (it is now done by robotic installations), and the noble profession of tape librarian as described here disappeared. But at the time, keeping track of the whereabouts of the tapes was a formidable, high-responsibility job, essential for operations to run smoothly and for data to be preserved safely.
Data repository of the United States Social Security Administration (1975). Source: The Computer, Jens Müller, Julius Wiedemann Eds. Taschen Editions, p. 187.

The constraints of batch processing


Rather than interactive computing, the dominant method at the time was batch processing. This meant that data wasn’t worked on in real time, but processed in large, scheduled jobs that ran overnight or during low-usage hours.

📽️
For example, a company’s payroll system would store its main employee records — the master file — on a tape. When it came time to update the data, operators would physically load that master tape into the mainframe. Alongside it, they would load a second tape containing new transactions: recent hires, salary changes, deductions, or time logs.

The system would read both tapes, merge the old data with the new, and write the result to a fresh reel: a newly updated master file. That new reel would then be stored until the next update cycle. Earlier versions were often kept as well, in case errors were discovered and the entire process needed to be redone. This kind of rollback safety was critical in an era when re-running a job meant physically finding and remounting the right tapes.
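The update cycle described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a reconstruction of any real payroll program: the record layouts (`Employee`, `Transaction`), the two transaction kinds (`"hire"` and `"salary"`), and the function name `update_master` are all hypothetical, and both "tapes" are assumed to be sorted by employee number, since a tape can only be read front to back.

```python
from typing import Iterable, Iterator, NamedTuple


class Employee(NamedTuple):      # hypothetical master-file record
    emp_id: int
    name: str
    salary: int


class Transaction(NamedTuple):   # hypothetical transaction record
    emp_id: int
    kind: str                    # "hire" or "salary" (assumed kinds)
    name: str
    salary: int


def update_master(old_master: Iterable[Employee],
                  transactions: Iterable[Transaction]) -> Iterator[Employee]:
    """Merge two key-sorted sequential streams into a new master stream.

    Each input is read strictly front to back, one record at a time,
    the way a tape drive would deliver it. The old master is never modified.
    """
    master_iter = iter(old_master)
    txn_iter = iter(transactions)
    record = next(master_iter, None)
    txn = next(txn_iter, None)

    while record is not None or txn is not None:
        if txn is None or (record is not None and record.emp_id < txn.emp_id):
            # No (further) transaction for this employee: write the record out.
            yield record
            record = next(master_iter, None)
        elif record is None or txn.emp_id < record.emp_id:
            # Transaction for an employee who is not on the old master.
            if txn.kind != "hire":
                raise ValueError(f"no master record for employee {txn.emp_id}")
            yield Employee(txn.emp_id, txn.name, txn.salary)
            txn = next(txn_iter, None)
        else:
            # Keys match: apply the change, but keep the record in hand
            # in case more transactions for the same employee follow.
            if txn.kind != "salary":
                raise ValueError(f"employee {txn.emp_id} already on master")
            record = record._replace(salary=txn.salary)
            txn = next(txn_iter, None)


old_master = [Employee(101, "Ada", 5000), Employee(103, "Grace", 5200)]
transactions = [Transaction(101, "salary", "", 5500),
                Transaction(102, "hire", "Alan", 4800)]
new_master = list(update_master(old_master, transactions))
```

Because the new master is written to a separate output and `old_master` is left untouched, the previous generation stays available for a re-run, which mirrors the practice of keeping earlier reels on the shelf. For simplicity, the sketch does not handle a salary change for an employee hired in the same run.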
Dorothy Whitaker works in the National Oceanographic Data Center (NODC) magnetic tape library (source: Wikipedia).

Research institutions such as NASA, scientific and medical research centers, and universities had their own tape libraries and associated librarians, as did public administrations (census bureaus, social security administrations, health and tax services) and private businesses.

CNES (French National Centre for Space Studies) premises. March 11, 1985. General view of the CRIS (Space Image Rectification Center) premises. (source: Wikimedia)

Computer specialist John Smith arranges and examines canisters of magnetic tape used in the processing of medical data at the National Library of Medicine. (Source: Wikimedia).

Tape libraries today


Institutions such as CERN and NASA, along with almost all large private companies and big cloud providers such as Microsoft Azure, still rely on tape libraries to store massive amounts of data. This is little known and rarely advertised to the public, but magnetic tape storage, now fully automated, is a cheap (a few cents per gigabyte), reliable (a typical storage life of 30+ years, versus about 5 years for a hard drive) and secure (because it is fully offline, the data is protected from cyberattacks and data theft) way to preserve petabytes of data.

Modern-day tape libraries have sadly lost the vintage charm of the mainframe era: they are fully automated machines, operated by robot arms and managed remotely through software. They remain a low-cost, high-efficiency storage method, more sustainable and less energy-consuming than traditional data centers and disk drives.

Example of a modern-day tape library, fully automated.

Additional sources

Tape library - Wikipedia
Daniel, Eric D. Magnetic Recording: The First 100 Years - Internet Archive

Columbia University tape library archive: https://www.columbia.edu/cu/computinghistory/tapes.html