Lucia Cerchie

Posted on Nov 8, 2020 • Updated on Dec 22, 2021

Why Do Most Programming Languages Index From Zero?

#computerscience #codenewbie #todayilearned #programming

This morning I was counting the number of days from an appointment on my calendar. "0, 1, 2, 3..." I muttered under my breath. Good thing I said it out loud, or I wouldn't have caught my mistake. That's what spending hours on array-based JavaScript challenges does for you!

I got a chuckle out of it, but it made me wonder why most programming languages count from zero in the first place. I'd known before and completely forgotten. So join me while I learn some computer science.

First of all, let's clarify-- we're not actually counting from zero, we're indexing from zero. This comment by mowwwalker on stackexchange was enlightening to me:

"Woah, woah, no one counts from zero, we index from zero. No >one says the "zeroth" element. We say the "first" element at >index 0. Think of the index as how far an element is offset >from the first position. Well, the first element is at the >first position, so it's not offset at all, so its index is 0. >The second element as one element before it, so it's offset 1 >element and is at index 1 – mowwwalker Apr 5 '13 at 14:32"

I found this comment helpful in adjusting my perspective, but it's not why we count from zero. The answer lies in the approach that influential computer scientists took, like Dijkstra.

Edsger W. Dijkstra made monumental contributions to computer science, including algorithms, new concepts, methods, theories, and general areas of research. He was also a proponent of starting from zero.

He wrote a short paper on the topic in August of 1982. He started by looking at four different ways to denote natural numbers 2, 3, ..., 12.

In his paper, Dijkstra eliminates c) and d) since the they lack the advantage of a) and b). That is, in a) and b) the length of the subsequence is equal to the difference between the bounds (in these cases, 11), and if you had two adjacent subsequences the upper bound of one would equal the lower bound of the other.

For example, in adjacent subsequences 1 < i ≤ 13 and 13 < i ≤ 23, the upper bound of the first is equal to the lower bound of the second (13).

Ok, but which is better, a) or b)? Djikstra points out that b) excludes the lower bound in its notation. That's inconvenient, since if you started a subsequence at 0, like 0,1,2 then you'd force the notation into using unnatural numbers, like so: -1 < i ...

As Dijkstra says, this is 'ugly', so we go with method one.

If we're going with method one, then how do we denote the elements "by subscript," or, I believe, like mowwwalker says, how far each element is offset from the first position. Dijkstra's answer is simple:

The influence of Dijkstra, and, I assume, other computer science giants, now explains why most programming languages start at zero.

My curiosity satisfied, I return to my coffee and ginger cookies on a fine Sunday morning.

Oldest comments (11)

Kirk Shillingford • Nov 8 '20

Concise, fun, and informative! Always nice to learn new things on a Sunday :)

Marian • Nov 8 '20

TIL. Thanks for sharing :)

Thomas Broyer • Nov 9 '20

Fwiw, in France, as seen in Emily in Paris, (probably many other countries), we name floors "from 0" too (floor 0 being called rez-de-chaussée), so the 5eme étage is the 6th floor.

Lucia Cerchie • Nov 9 '20

This is so cool to hear about! Thanks for letting me know about this part of French culture that's analogous to so many programming languages.

Gust van de Wal • Nov 25 '20

You've managed to not once spell Dijkstra's first or last name correctly

Lucia Cerchie • Dec 29 '20

I've edited accordingly. :)

Barry • Dec 10 '20

It is funny it bugs me a lot when languages start at 1 but I think I have only ran into one a while ago in my personal experience.

Rex Bloom • Mar 11 '21

I think pointers, and pointer arithmetic, in low level languages, is another source of influence for this decision.

Filip Filmar • Mar 14 '21

There's also this, which gives a bit of a different story: exple.tive.org/blarg/2013/10/22/ci...

Samuel Chan • Apr 8 '22

I teach math and one thing I learned is that some mathematical languages (like Matlab and R) starts at index 1.

items[1] actually returns the first item in that array / list.

So perhaps that has got to do with the founding motivation of the language — all the way back to its founding philosophy.

Lucia Cerchie • Apr 8 '22

It's really cool to hear about this from the perspective of someone who uses Matlab and R! Yeah I'd think so -- these founding fathers of computer science were philosophers as well.

DEV Community

Why Do Most Programming Languages Index From Zero?

Oldest comments (11)

Read next

Synch vs. Async Programming

Build a spreadsheet app with an AI-copilot (Next.js, gpt4, LangChain, & CopilotKit)

Episode 24/14: Angular Query, New Template Syntax

PlumeJS - No fuss web-components framework