Lunduke
Comedy • Gaming • News • Science & Tech
MS-DOS 4.0 Source Code Fails to Compile
Plus: Source comments, by Microsoft, calling the creator of DOS "brain-damaged" get censored.
April 27, 2024
post photo preview

Yesterday, Microsoft released the source code for MS-DOS 4.0... an action which I have encouraged Microsoft to take for many years (including when I worked at Microsoft).

And, while this source code release is most definitely a win for the preservation of computer history, there are some rather ridiculous issues with it.

Most notably:

  • The source doesn't actually fully compile.  It is not usable in its current state.
  • The source code has been modified by Microsoft -- even after the publication this week -- reducing the historical value of the code.
  • Also Microsoft claims to have lost some source code.

Yeah.  You read that first bullet point right.  It does not compile.  I'll walk you through the details (including a step-by-step guide for how you can fail to compile MS-DOS 4.0 yourself).

But, first, a little backstory.

The MS-DOS 4.0 Story (The Short, Short Version)

MS-DOS 4.0, released back in 1986, was a bit of an oddity.  It was a multitasking version of DOS (similar in that way to Wendin-DOS).  And, importantly, it was not a direct continuation of the existing MS-DOS line -- in fact "MS-DOS 4.0" was released between versions "3.1" and "3.3" (almost exactly coinciding with the "3.2" release).

Fun Side Note: There are multiple multitasking variants of DOS (or ways to multitask in DOS).  Most of which were not built or supplied by Microsoft.  In case you didn't know that... now you do.

This Multitasking MS-DOS 4.0 was not commercially successful -- to put it mildly -- and that line of "Multitasking MS-DOS" was quickly abandoned.

Luckily -- or not luckily, depending on how you look at it -- IBM co-developed a completely different version of "MS-DOS 4.0" that had almost nothing to do with the multitasking version Microsoft created.  This IBM-made version, a continuation of MS-DOS 3.x, continued to be single-tasking.  But, oh-boy, was it buggy.  Legendarily buggy.

Ultimately, when it was obvious that the Multitasking "MS-DOS 4.0" was a dead-end, Microsoft took IBM's totally unrelated "MS-DOS 4.0" and released it also as "MS-DOS 4.0".  (Two different Operating Systems, same name and same version number.  Because that's not at all confusing.)  Then -- quickly -- re-worked a bunch of it -- releasing that as "MS-DOS 4.01".

That single-tasking version ("4.01") went on to have some success -- though it is widely regarded as one of the buggier releases of MS-DOS.

The MS-DOS 4.0 Source Release

Two totally different things named "MS-DOS 4.0".  So what, exactly, is Microsoft releasing the source code for?

Well.  There are two parts.

Floppies of an early Beta of Multitasking MS-DOS 4.0

Both are available via GitHub.  And everything is released under the MIT license.

Which means that, yes, if you can get that single-tasking 4.0 code to build... you can, in theory, fork these releases and continue developing them.  (Though you'll need to change the name, as Microsoft still holds the trademarks.)

This work was announced in a joint blog post by Jeff Wilcox (Head of Open Source Programs Office) and Scott Hanselman (Vice President of Developer Community) at Microsoft.

Fun Side Note #2: That Vice President, Scott Hanselman, is the same Microsoft executive who has previously encouraged people to commit crimes against people based on their skin color and gender.  Telling people to be ready to "go to jail" for those crimes.  While that bit of information has absolutely nothing to do with the MS-DOS 4.0 source code release... it's nice to have background on the people in the story.

What code, exactly, did we get?

This release, from Microsoft, is a valuable and interesting one.  It contains a great deal of historically significant information -- and I am absolutely filled to the brim with nerdy joy as I go through it.

Unfortunately... it does not include code for the multitasking version of "4.0".

From the announcement:

"Jeff Wilcox and OSPO went to the Microsoft Archives, and while they were unable to find the full source code for MT-DOS, they did find MS DOS 4.00, which we’re releasing today, alongside these additional beta binaries, PDFs of the documentation, and disk images. We will continue to explore the archives and may update this release if more is discovered."

As a former Microsoft employee... this is... strange.  And, quite honestly, not at all believable.

During my time working at Microsoft, I knew of backed up copies of source code for darned near everything -- including almost every version of MS-DOS from 3.3 onward (that I, personally, saw).

Yet they were unable to find code for the Multitasking MS-DOS 4.0?  Knowing, intimately, how the various groups within Microsoft handled backing up source code and binaries for releases... this statement from Microsoft makes me highly skeptical.

Unless Microsoft completely forgot how to backup source code in the last few years, I'm going to call this claim utterly bogus.

Is it Actually MS-DOS 4.0?

Just to make everything far more confusing than it already is... this may not actually be MS-DOS 4.0.  It might be MS-DOS 4.01... or PC-DOS 4.01... or some strange combination.

Take a look at SETENV.BAT in the source code release and you will find the following line:

echo setting up system to build the MS-DOS 4.01 SOURCE BAK...

What files I have been able to build appear to exactly match the MS-DOS 4.0 (not 4.01) release images.  But, being as some of this source code release is mangled beyond use, unfortunately we can't really be sure that everything matches the actual 4.0 release.  It might be an interim build between 4.0 and 4.01.

Oh!  That's right.

Did I mention that this source code release of MS-DOS 4.0 doesn't successfully build?

The Code Does Not Compile

Allow me to repeat myself:

The code that has been supplied contains significant problems which will prohibit it from compiling a complete, working version of MS-DOS 4.0.

I attempted build under multiple environments (including on a released version of MS-DOS 4.01, MS-DOS 5, PC-DOS, and under DOSBox) -- and dug through the errors until I was confident of the issues (and, importantly, was confident that we weren't simply looking at an obvious case of user error).

ERROR!  ERROR!

Note: If you want to skip the "How To Build It" portion, simply scroll down to the "BOOM!  ERROR!" section below.

Want to unsuccessfully build MS-DOS 4.0 yourself?  Here are some super easy to follow steps.

  1. Download the contents of the MS-DOS 4.0 GitHub repository.
  2. Install DOSBox.  (Seriously, this works just as well in DOSBox as it does anywhere else.)
  3. Within DOSBox run the following command: "MOUNT D PATH" (replace "PATH" with the path to that folder you downloaded in step 1).

If you did everything correctly, you will now -- within DOSBox -- have a D:\ drive with a directory named "SRC" in it.

Note the D:\SRC directory.  That's important.

The BAT and make files which build MS-DOS 4.0 expect all of the files to be in D:\SRC.  So replicating that environment will make it so you don't need to tweak any files at all.

Now we actually do the build.

  1. Change to the D:\SRC directory.  "D:" then "CD SRC".
  2. Now run "SETENV.BAT".  This will setup the paths and whatnot for the build environment.
  3. Then simply run "NMAKE".  That will kick off the build for everything.

Easy, right?

BOOM!  ERROR!

At this point you will quickly see that several files compile cleanly.  Until you get to GETMSG.ASM and, later, USA.INF.  Both of these files are mangled.  I was able to force GETMSG.ASM to compile by commenting out some lines... but USA.INF is completely hosed.

I don't see how whoever uploaded this source could have possibly done a successful compile prior to releasing it.

Seriously.  Hosed, I say!  Hosed!

It's not all bad news, luckily.  The majority of the code does appear to be here -- and most of it builds without any catastrophic errors.  With some work (a replaced file here, some re-written code there) I am confident a variant on this MS-DOS 4.0 release will be able to be built... unfortunately, because of changes needed to make it compile, it won't be a historically perfect replica of the system.

Not without Microsoft figuring out what they did wrong and re-releasing the source code.  Which, considering how rarely Microsoft releases source code for these historical pieces of software... I won't be holding my breath.

Fun Side Note #3: After Microsoft announced the source code release of MS-DOS 4.0, a huge number of articles have popped up on a number of Tech News sites.  Tech Journalist after Tech Journalist praising the release.  Yet not one of them has reported that the code does not actually compile.  Which means that none of them even tried to verify the claims from Microsoft.  Not.  One.  Except for The Lunduke Journal, of course.  I'll let you draw your own conclusion about what that means.

The historical record has been compromised... a little.

It doesn't build.  That's a problem.

Also, it's kind of hard to be 100% sure what this specific release even is (is it 4.0... is it 4.01... is it from IBM or MS?  An interim build?  It looks mostly like 4.0... but there's some weird bits that could use clarification.).

But what makes this even worse... is that not only has some of the code been mangled and corrupted... but some of the code comments were actively modified in the few hours after the source code was publicly posted!

Thus further destroying the historical value of this source code.  Which, to put it mildly, kinda sucks.

Brain-damaged Tim Patterson

But, as luck would have it, that source code change... is really, really amusing.  And pretty minor.

"Brain-damaged Tim Patterson"

A modified comment.  "Brain-damaged Tim Patterson" becomes "Brain-damaged TP".

It's a simple change -- obscuring an insult of Tim Patterson (the original creator of Quick & Dirty DOS)... replacing his full name with his initials.  But, if this is a historical record, this change should not occur.

Here's a fun question: Who, exactly, made this change?  Microsoft is not accepting any changes to this source code repository from the outside world.  So, whoever made the change has the blessing of Microsoft.

Well, hold on to your butts!

This change was made by GitHub user "mzbik", with the simple comment "MZ is back!".

Ok.  Great.  But who the heck is "MZ"?

None other than the legendary Mark Zbikowski.  One of the early Microsoft employees (joining in 1981) -- and the programmer who took over the MS-DOS project (from Tim Patterson) starting with version 2.0... and leading DOS through version 4.0.

Mark Zbikowski and his epic moustache.

Clearly Mark -- who usurped Tim as the Dev Lead / Manager of MS-DOS -- did not want that little "Brain-damaged" insult of Tim to be part of the historical record.

Or, perhaps, he really wanted to call attention to it by making the change.

Either way, we now can be somewhat sure that Mark Zbikowski, himself, wrote that comment way back in the 1980s.  And, even more fascinating, Mark remembered that comment -- from the '80s -- so clearly that he knew to quickly go and change it -- almost immediately -- once the source was made public.  (I barely remember source code comments I made last week, let alone almost 40 years ago... this really stuck with him!)

And that level of irreverent whimsy -- one historically significant programmer insulting another historically significant programmer... in source code comments -- makes me smile.

Ok, sure.  That change isn't a huge deal.  In fact, I'm now glad it happened as it drew my attention to it.

But what else has been changed?  What else will be changed?  It's worth asking.  This is for the preservation of history, after all.

Lots and Lots of Questions

In fact, this release raises a lot of questions.

  • Why was building of this source code not tested prior to release?
  • What process caused these source code files to be mangled?
  • What all was changed from the original source archive?
  • Why has Microsoft only released source code for the 3 least popular versions of MS-DOS (1.25, 2.0, & 4.0)?  Microsoft does not profit from versions 3.3, 5.0, or 6.x (far more popular and/or useful releases).  Why are those being held back?
  • Microsoft loves to tell people how much they love Open Source... yet they have released source code for only a very small number of products (far less than 1% of their total software releases).  Even ancient software, unsuported for decades, remains closed source.  Why?
  • The last release of MS-DOS source code (versions 1.25 and 2.0) occurred 10 years ago (2014).  Why has it taken 10 years to release source code that Microsoft hasn't used since the 1980s?  Will the next release of source code be 10 years from now... in 2034?
  • And, shoot, why has not one other Tech News publication actually tried to compile this code... or notice the changes being made... or look into the details at all?

I don't mean to sound like a Negative Nancy, here.  This release is, without a doubt, incredibly interesting and important.

And Microsoft is under no obligation to release the source code for these pieces of software.  No obligation whatsoever.  If they wanted to keep it all locked away in their vault, that's entirely their call.

That said, Microsoft's near constant declarations of their "love for Open Source" -- including their ownership of GitHub -- would suggest to me that they would be eager to release the source for 30 and 40 year old software that they haven't earned a penny on in decades.

If they truly loved the idea of "Open Source"... they would do it.  In a heartbeat.  But they don't.  Which tells me a lot about their actual views on "Open Source".

Some things never change...

When I worked at Microsoft -- in the late 1990s and early 2000s -- I pushed, regularly, to release code, binaries, and documentation of historicaly significant Microsoft software.  The old stuff that nobody used anymore, but which should be preserved and studied for posterity.

Back in those days, I got a lot of push-back.  To put it mildly.

Microsoft management was extremely hesitant to release code -- and even free binaries -- of these historic products.  And, honestly, it looks like that situation has barely improved since then.  Shoot.  What they do release doesn't even compile.

Just the same: I applaud Microsoft for releasing this MS-DOS 4.0 code!  Truly, I do!

Now... release some more!  Preferably without mangling the code this time.

And don't give me any of that "we can't find the code for our most famous products" malarkey.  The Lunduke Journal knows better.

community logo
Join the Lunduke Community
To read more articles like this, sign up and join my community today
14
What else you may like…
Videos
Podcasts
Posts
Articles
The War for Linux

Widespread discrimination based on Ethnicity, Religion, & Politics across the Linux World. Red Hat, IBM, The Linux Foundation, GNOME, elementary, Linux Mint, and more are involved -- bullies working to exclude those they don't like.

They are at war against the very soul of the Linux and Open Source world.

This is the first part in a series of shows and articles. I'm going after these bullies.

01:05:29
On the Z-80 Holborn Computers

Remembering the (very) funky Holborn computers of the early 1980s.

The full article: https://lunduke.locals.com/post/5588902/1950s-sci-fi-style-computers-powered-by-a-z80-built-in-holland

00:14:04
On The History of Screensavers: 1961 - 1990

From Sci-Fi novels and Atari... to old Macs and Flying Toasters.

The full article: https://lunduke.locals.com/post/5588984/the-definitive-history-of-screensavers-1961-1990

00:18:01
November 22, 2023
The futility of Ad-Blockers

Ads are filling the entirety of the Web -- websites, podcasts, YouTube videos, etc. -- at an increasing rate. Prices for those ad placements are plummeting. Consumers are desperate to use ad-blockers to make the web palatable. Google (and others) are desperate to break and block ad-blockers. All of which results in... more ads and lower pay for creators.

It's a fascinatingly annoying cycle. And there's only one viable way out of it.

Looking for the Podcast RSS feed or other links? Check here:
https://lunduke.locals.com/post/4619051/lunduke-journal-link-central-tm

Give the gift of The Lunduke Journal:
https://lunduke.locals.com/post/4898317/give-the-gift-of-the-lunduke-journal

The futility of Ad-Blockers
November 21, 2023
openSUSE says "No Lunduke allowed!"

Those in power with openSUSE make it clear they will not allow me anywhere near anything related to the openSUSE project. Ever. For any reason.

Well, that settles that, then! Guess I won't be contributing to openSUSE! 🤣

Looking for the Podcast RSS feed or other links?
https://lunduke.locals.com/post/4619051/lunduke-journal-link-central-tm

Give the gift of The Lunduke Journal:
https://lunduke.locals.com/post/4898317/give-the-gift-of-the-lunduke-journal

openSUSE says "No Lunduke allowed!"
September 13, 2023
"Andreas Kling creator of Serenity OS & Ladybird Web Browser" - Lunduke’s Big Tech Show - September 13th, 2023 - Ep 044

This episode is free for all to enjoy and share.

Be sure to subscribe here at Lunduke.Locals.com to get all shows & articles (including interviews with other amazing nerds).

"Andreas Kling creator of Serenity OS & Ladybird Web Browser" - Lunduke’s Big Tech Show - September 13th, 2023 - Ep 044

Buckle up...

Not really nerdy, but something I found historically interesting. Stopped at a Walmart that’s still in the 2000s-2010s design. It’s getting a remodel to be in line with the new layouts but getting out of my car I noticed the paint was peeling off a part of the wall outside revealing the old stripes from the 90s. Not sure why, but I find that kind of thing cool.

post photo preview
post photo preview
post photo preview
Atari Coin Executive -- The Open Source Video Game Arcade management system... from 1982
Powered by an Atari 800. Plus a handheld 6507 computer. And, not kidding, it really was open source.

1982 was a big year for Atari video arcades — with the release of such classics as Gravatar, Millipede, and Space Duel (complimenting the already massive number of popular Atari games filling video game arcades).

In order to make the management (and, primarily, the accounting) of video game arcades easier — and more future-y — Atari developed and released the “Atari Coin Executive”.

And it is incredibly cool.

I wouldn't mind having that desk.

The central brain of the Atari Coin Executive was an Atari 800 computer (with 48k of RAM) with a number of accessories, including:

  • 2 x Atari 810 Disk Drives

  • An Atari 850 Interface Module (which added RS232)

  • An Atari 825 printer

  • An Amdek 13 inch color monitor

The Atari 800. Ain’t she pretty?

How the Atari Coin Executive worked was both simple… and, at the same time, incredibly cool.

I kinda want to setup an arcade... just so I can use the Atari Coin Executive.

The basic process:

  1. A “Coin Monitor” was installed in the coin slot of every arcade game.

  2. Each Coin Monitor is connected back to the Atari Coin Executive workstation (that Atari 800) via “telephone type wiring”.

  3. The arcade manager can then use that Atari 800 to see how much each game is earning.

Screenshot of the Coin Executive main menu

Fun fact: The Atari Coin Executive software was open source and written in a combination of BASIC and Assembly. Or, as Atari put it in 1982: “In Basic and 6502 Assembler - Source listings and manual supplied”.  You can find images of the Atari Coin Executive software over on the AtariAge Forum.

In addition to the above mentioned setup, the Atari Coin Executive also included a handheld computer called the “Data Recorder”.

It's a 1982 Atari handheld!  Sort of!

The “Atari Coin Executive Data Recorder” was powered by a MOS 6507 CPU with 16K of RAM (8 2k chips), and communicated with the Atari Coin Executive computer via 300 baud serial. It even had a small built-in printer.

This allowed people to manage several arcades, in separate locations, by:

  1. Plugging the Data Recorder into each arcade machine equipped with a Coin Monitor.

  2. Then taking the Data Recorder back to the Coin Executive computer and downloading the data into the Coin Executive software.

Finally, here’s a color picture of the whole setup — including the custom desk which was used for the Coin Executive.

Fern not included.
Read full Article
post photo preview
The 1948 precursor to the hard disk
A brass rotating, magnetic drum... inspired by a voice dictation machine.

Floppy disks. Zip disks. Hard disks.

These sorts of spinning, magnetic storage mediums have been critical to several decades of computers. It’s almost hard to imagine the computers of the 1970s, 80s, and 90s without floppy and hard disks (and other magnetic drives).

But how, exactly, did they come into existence?

Let’s take a quick look at the very first of such devices… and their inspiration.

1946 - The Mail-a-Voice

We’ll begin our journey with the 1946 release of the Brush Mail-a-Voice.

The Mail-a-Voice.  Be Honest.  You want one.

A truly fascinating device, the Mail-a-Voice looked like a phonograph… except it used a paper disc that was coated in magnetic material. You could then record up to 3 minutes of audio on a single paper disk (which would spin at 20 rotations per minute)… and then fold the paper disc up and mail it to someone inside a standard envelope.

Thus the “Mail-a-Voice”.

This device didn’t store computer data itself -- it was only for audio -- but it did inspire engineers who were working on cheap data storage for computers…

February, 1948 - Andrew Booth’s Spinning Drum

In a 1947 trip to the USA, Andrew Booth (who was working on his own computer design), had the chance to see the “Mail-a-Voice” in action.

Since Booth needed a good, inexpensive storage medium for his computer… he attempted to build a similar device using a flat, paper, magnetic disk. What was, essentially, a first attempt at what we would now call a “Floppy Disk”.

Unfortunately, it didn’t quite work. Booth found that he needed to spin the paper disk quite a bit faster in order to make it viable as a data storage mechanism… and he had a hard time keeping the paper disk flat.

In fact, as Booth upped the RPM to 3,000 — which is what he determined he needed — the paper disk itself started to disintegrate. Booth would later comment:

“I suppose I really invented the floppy disc, it was a real flop!”

So he abandoned that approach and, instead, decided to use a brass drum. Why brass? Because brass is a bit less likely to disintegrate than… paper.

It's brass, baby!  BRASS!

This system worked. His brass, rotating drum (with a magnetic coating), had a 2 inch diameter and could store 10 bits per inch.

Yeah. 10 bits. Per inch.

Not exactly massive amounts of storage. But, hey, it was a start! And it didn’t disintegrate! Huzzah!

Improving the magnetic drum

With the first prototype working, Booth set about improving his magnetic, rotating drum storage device. The final version ended up being able to store 256 words of either 20 or 21 bits each (different sources cite different values here and there does not appear to be consensus on if it was 20 or 21 bit words).

In modern terms: This would be equivalent to roughly 5 kilobits of data storage.  Give or take.

This storage drum was put to use on the ARC (the Automatic Relay Computer).

Booth working on the Automatic Relay Computer.

When all was said and done, the ARC could utilize that storage drum to handle 50 numbers and could load a program consisting of 300 individual instructions.

It wasn't exactly a “Hard Disk Drive”… more of a “Hard Drum Drive”.

Either way… Pretty darn cool for the 1940s.

1956 - The First “Hard Disk Drive”

Over the years that followed, this idea was refined and improved. The rotating drum was abandoned for hard, magnetic platters — ones sturdy enough to handle much higher RPMs (certainly much sturdier than paper!)... and thus leading to faster data access.

These improvements eventually leading to the 1956 release of the IBM Model 350 Disk Storage Unit for the IBM 305 RAMAC computer.

IBM Model 350 Disk Storage Unit

The Model 350 Hard Disk Drive, in a base configuration, could store roughly 3.75 MB — all contained in a cabinet 5 feet long, 2 1/2 feet deep, and 5 1/2 feet tall — with platters spinning at 1,200 RPM.

And all thanks to a voice dictation device built for mailing 3 minutes of audio on a folded-up piece of paper.

Read full Article
post photo preview
Who (really) created the "Byte"?
And what is the REAL definition of term?

Kilobytes (KB). Megabytes (MB). Gigabytes (GB).

We use these storage measurements every single, gosh-darned day. And most of us feel like we know exactly what they mean. But do we really?

Do we really — truly — know what a “Byte” is… and its origin? I mean… who came up with the term “Byte”, anyway?

Let’s take a moment to look over the history of the term. If, for no other reason, than to feel smarter than most other nerds.

What is a “Byte”?

If you ask Mr. Google, a Byte is exactly 8 Bits.

Mr. Google wouldn't lie... right?

Ok. Great. 8 Bits = 1 Byte.

So what is a Bit?

That part is simple.

A Bit is the smallest unit of information for a digital computer. A Bit can have two possible values… 0 or 1. It is a boolean. A binary.

Many people believe “Bit” is short for “Bite”. You find this in many computer history books. This little tidbit has been repeated so often, many believe it. However, like many such oft-repeated anecdotes in computing… it’s hogwash.

In fact, “Bit” is an acronym for “Binary Information Digit”. Squish that phrase together and you get “Bit”.

Fun factoids about the origin of the “Bit”

The first usage of the word “bit”, when talking about a type of data in reference to computing, was by Vannevar Bush. He published an articled entitled “Instrumental Analysis” in the October, 1936 issue of “American Mathematical Society”. In it he used the phrase “bits of information” when talking about punch cards.

However 

“Bit” was commonly used in Middle English to refer to “a mouthful” or a “morsel” of food. (This is the origin of why many believe “Bit” is short for “Bite”… even though it isn’t.) As such, Vannevar Bush may not have actually been thinking about a “Bit” as a “Binary digit”… instead he may simply have thought “this is a morsel of data”. Also worth noting… Bush never actually defines what a “bit” is. Making it likely that he was simply using the word “bit” in the Middle English way.

The first — distinctly verifiable — usage of “Bit” in this way is by John Tukey. From “A Mathematical Theory of Communication” written by C. E. Shannon in 1949:

“The choice of a logarithmic base corresponds to the choice of a unit for measuring information. If the base 2 is used the resulting units may be called binary digits, or more briefly bits, a word suggested by J. W. Tukey. A device with two stable positions, such as a relay or a flip-flop circuit, can store one bit of information.”

There you have it. More information about the origin of the term “bit” than you ever wanted to know.

You’re welcome.

Ok. Great.

So, in short, a Bit is a 0 or 1. And a Byte is a group of 8 Bits. Easy.

Not so fast there, sport!

While the Byte being 8 Bits is commonly accepted today… that was not always the case. Not by a long shot!

In fact, there are two competing stories for who created the term “Byte”… and neither of them were referring to a set of 8 Bits!

Seriously!

Werner Buchholz’s 6 Bits

The most often cited creator of the term “Byte” is Werner Buchholz — who used the term, in 1956, to refer to a grouping of 6 Bits when working on the IBM Stretch Super computer.

Man sitting at IBM Stretch console. Image source: computer-history.info.

A “6 Bit” Byte was common in those days. In fact, Braille was a 6 Bit encoding of characters for the blind. And many of the early computers (from IBM and others) used 6 Bit groupings to encode character data.

6 Bits -- not 8 Bits -- per Byte.

However (you knew there had to be a “however”)…

Louis G. Dooley’s N Bits

Around that same time (1956 or so), Louis Dooley first used the word “Byte” to refer to an undefined grouping of “Bits”. But, typically, used as “4 Bits”.

That's right.  Not 8 Bits.  Not 6 Bits.  But 4 Bits.

Dooley published the following letter in BYTE magazine:

“I would like to get the following on record: The word byte was coined around 1956 to 1957 at MIT Lincoln Laboratories within a project called SAGE (the North American Air Defense System), which was jointly developed by Rand, Lincoln Labs, and IBM. In that era, computer memory structure was already defined in terms of word size. A word consisted of x number of bits; a bit represented a binary notational position in a word. Operations typically operated on all the bits in the full word.

 

We coined the word byte to refer to a logical set of bits less than a full word size. At that time, it was not defined specifically as x bits but typically referred to as a set of 4 bits, as that was the size of most of our coded data items. Shortly afterward, I went on to other responsibilities that removed me from SAGE. After having spent many years in Asia, I returned to the U.S. and was bemused to find out that the word byte was being used in the new microcomputer technology to refer to the basic addressable memory unit.

 

Louis G. Dooley
Ocala, FL”

So… what the heck is a “Byte”?!

That’s right. We now have two very, very different definitions for the word “Byte”. Both creations of the word happened independently… and at almost the exact same moment in time.

  • The Buchholz Byte” - A grouping of 6 Bits.
  • The Dooley Byte” - A grouping of an undefined number of bits, less than a full word size. Often used to describe 4 Bits.

You’ll note that neither of these definitions — from the men who created the term — have the number “8” in them.

The shift towards 8 Bits per Byte started to happen in the 1970s… with the development and gaining popularity of 8-Bit processors, such as the legendary Intel 8008.

A revision of the Intel 8008 CPU

Interestingly, some of those early 8-Bit CPU’s had specific functions for handling 4-Bit chunks of data. Because, up until that point, 4 and 6-Bit “Bytes” were incredibly common (including in the predecessor to the Intel 8008… the 4-Bit Intel 4004).

Fun Factoid: Nowadays a 4-Bit group is called a “Nibble”. Which is adorable.

For quite some time the term “octet” or “octad” was used to denote 8 Bit groups. At some point along the way, most people phased that out as well… simply referring to all “groups of bits” as a “Byte”. Though you will still find “octet” used here and there, especially when talking about various network protocols.

All of which means…

Dooley invented the modern “Byte”… not Buchholz

While many writers, enthusiasts, and computer historians are quick to say that Werner Buchholz coined the term “Byte”… they are obviously mistaken.

Besides the fact that it’s hard to discern who (Dooley or Buchholz) actually used the term first… the Buchholz definition is no longer used at all in modern computing.

The Buchholz definition is specific. 6 Bits. Which modern computing has determined is not the amount of Bits in a modern Byte.

The Dooley definition, on the other hand, allows for wiggle room. Which means that an 8 Bit “Byte” would fit the Dooley definition. But not the Buchholz.

The facts are clear: Louis G. Dooley created the word “Byte”. At least as it has been used for the last 40+ years.

But Buchholz — an absolute legend in the computing world — gets one heck of an Honorable Mention trophy.

Read full Article
See More
Available on mobile and TV devices
google store google store app store app store
google store google store app tv store app tv store amazon store amazon store roku store roku store
Powered by Locals