If I asked you “What is the first piece of software that Bill Gates and Paul Allen (the founders of Microsoft) made commercially?”… what would you say?
DOS? BASIC?
In fact… neither.
Before Microsoft was even a gleam in Bill Gates’ eye, he and Paul Allen started another computer company: Traf-O-Data.
It all started back in the early 1972 at Lakeside High School in Seattle, WA. Bill Gates was currently a Junior, and Paul Allen had graduated the year before.
To give you a visual, here are the two friends in those early days.
In 1972, those two boys (17 and 19 years old) had an idea for a business that just might make them some sweet, sweet moolah. Using computers. Which worked for them. They liked computers.
You see, the Washington State Department of Highways (which was later folded into the Department of Transportation in 1977) regularly counted the amount of traffic on key roads around the greater Seattle area — to optimize traffic flow and whatnot.
This was done by counting the number of axles that rolled across a series of pressure sensitive rubber tubes — then, every 15 minutes, that number would be punched onto a paper tape as a pattern representing a 16 bit number.
Those paper tapes would then be sent to local contractors who would read the data from the tape, write it out, and analyze the information — which would then be sent back to the Department of Highways.
By Paul Allen’s account, “The process was monotonous, inefficient, and murder on the eyes.”
Gates and Allen created software to analyze the traffic data using a computer at the University of Washington (Allen’s father was a librarian at the University, and both Gates and Allen had used the computers there — via their school computer club — in the past).
What computer did they use?
Well, there were two options available to them at that specific time, at the University of Washington. The first option was a Scientific Data Systems Sigma 5 mainframe via a connected teletype terminal.
Fun facts of the SDS Sigma 5:
The base memory was 16K 32-Bit words (equivalent to 64K Bytes) with a maximum memory of 128K words.
The Sigma 5 was the slowest of the Sigma series, and was one of only two models in the Sigma line that had no virtual memory.
The base model sold for $300,000. Want an extra 16K words of memory? That’ll run you an extra $50,000.
A very cool machine. One that I wouldn’t mind using to decorate my living room.
But… it wasn’t the SDS Sigma 5. (But, hey, great excuse to look at that gorgeous machine, eh?)
Instead, Gates and Allen were using the CDC (Control Data Corporation) 6400.
Fun facts of the CDC 6400:
10 Mhz 60 bit processor (you read that correctly… 60 bit CPU).
64k of 60-bit core memory (with optional extended memory available).
Optional disk storage, typically around 10 Million “Words”.
Developed by none other than the legendary Seymour Cray. Yes. That Cray. Of Cray Supercomputer.
There was just one problem:
The paper tape with the traffic data was a custom solution (storing 16 bits). The computer they had access to couldn’t read that in. (The CDC 6400 supported input via punch cards.)
How did Gates and Allen get around this problem?
They hired some kids.
Not joking.
They hired some of Gates’ fellow High School students to translate the 16 bit traffic data tape to the punch cards. By hand. They paid those kids 50 cents per tape to do the work that the two Microsoft founders really didn’t want to do.
Gates and Allen then fed the cards into the computer, and their custom software would create charts that they could then sell back to the Department of Highways.
Now, the question: How to improve on this system?
Gates and Allen needed to be able to cut out both the need to hire High Schoolers to translate the tapes (child labor just doesn’t scale well for this sort of project)… as well as remove the reliance on the University of Washington computer systems.
What they needed was… a computer of their own. One with a custom tape reader that could handle the traffic data directly.
Remember: This is the early 1970s. You couldn’t just walk down to Best Buy and grab a PC. Computers were so uncommon that people were renting computer time at Universities.
Luckily they found a schematic and instructions, in a computer magazine, on how to build a computer around the recently released Intel 8008 microprocessor — which would work nicely as the base system.
Note: It is unknown in which magazine Gates and Allen found the schematics for an Intel 8008 computer to base the Traf-O-Data system upon. If anyone has this information, I would love to find it and archive it.
Unfortunately, neither Gates nor Allen had any experience with designing hardware or building computers.
Enter: Paul Gilbert.
Gilbert was introduced to Gates and Allen by a mutual friend and he got to work building the Traf-O-Data machine. From parts. Including an 8008 processor purchased for $360 (in 1972 dollars… that’s over $2,500 in 2022 dollars).
The end result is this beauty:
Custom tape reader. Cool blinking lights. An Intel 8008… what’s not to love?
What sort of specs did the Traf-O-Data computer have? A great question! According to Gates:
“[We] took these very tiny Intel memory chips -- I forget if they were 256-bit or 1K-bit, -- I think 1K-bit Intel chips in this 8008 and created a system and we did special software. 8008 wasn't capable of running a BASIC Interpreter. That had always been my Holy Grail because I had played around writing a little bit of a BASIC Interpreter on a PDP-8, and a Data General machine. But the 8008 is basically an 8-bit machine with no programmable stack. Doing this traffic analysis software was pushing the limits.”
Side note: Gates states that the “8008 wasn't capable of running a BASIC Interpreter”.
Is that true?
Gates is correct in the sense that an 8008 is an incredibly limited chip and presents many challenges for creating many pieces of software (including a BASIC interpreter).
That said… a dialect of Tiny BASIC was developed for the 8008, in 1974, by a small team from the University of Illinois. They initially used an 8008 simulator running on an IBM System/360.
Before their custom 8008-based computer was even complete, Gates & Allen got to work building software for this new computer that could read in the tape and create traffic reports. How did they do it? By creating an 8008 simulator on an IBM System/360 — which they also used at the University of Washington.
Simulating an Intel 8008 on an IBM System/360 was apparently not uncommon in those days.
Their ultimate goal was to produce and sell these Traf-O-Data computers to government agencies who wanted to automate their traffic data analysis.
Gates and Allen scheduled a time to demo the machine with their first potential client of the computer, King County (where Seattle is located). Unfortunately, according to Gates:
“When the guy from the County that Seattle's in came to see it, it didn't work.”
That’s right.
This was Bill Gates’ very first commercial demo… and his hardware & software didn’t work.
If that isn’t epic foreshadowing, I don’t know what is.
In the end, Traf-O-Data managed to net a total of around $20,000 before closing down in 1975… when Gates and Allen formed Micro-Soft with the purpose of developing a BASIC interpreter for the Altair 8800.