DEV Community

Cover image for Categorical thinking
stereobooster
stereobooster

Posted on • Edited on • Originally published at stereobooster.com

Categorical thinking

This post inspired by the lecture of Stanford professor Robert Sapolsky which skims over some ideas of categorical thinking. I find it helpful to understand some aspects of software development.

What is a category?

Categorical thinking is one of the tricks our brain employs to deal with big amounts of information. Instead of dealing with each nuance of given objects brain separates in categories (buckets) and deal with less number of objects (categories). As a software developer, I can assume this "compression" improves the speed of processing and storage (it is better to refer to the scientific paper on biology to get a more authoritative opinion).

This trick is especially "useful" when we have to deal with continuous information. For example, there is the whole spectrum of colors but we tend to distinguish only 7 colors in the rainbow (at least English speakers; thanks to Newton, who believed in alchemy and mystical power of number 7).

Categories are some arbitrarily chosen buckets. It is not necessarily dictated by some "logic". Choice of categories can be dictated by culture and language, also can change over time. Returning to rainbow example - not all languages have the same basic set of words for colors. For example, Russian and Greek both consider light blue and dark blue as separate colors. People who work with colors professionally, like painters, can distinguish more colors (remember this meme "Artist vs Normal People"?). Important to understand that we have the same vision (not considering any kind of disorders), but we have an issue to distinguish (it takes us more time, or we're more likely to do an error or harder to recall) depending on how rich our color vocabulary.

Related ideas: abstract thinking, symbolic thinking, stereotypes, pattern recognition.

How it can go wrong?

When the choice of categories is arbitrary and not dictated by some universal logic (universal logic like they have in math), then categorical thinking can lead to issues.

Things can be seen as similar even if they different when they fall into one category

As we can see objects a and b are pretty distant yet they are in the same category so they can be perceived as of the same kind.

In programming, this principle manifest as "duplication is far cheaper than the wrong abstraction".

For example, Evan Czaplicki explains how lists of checkboxes, which can be seen as a similar turn out to be completely different.

Things can be seen as different even if they similar when they fall into different categories

As we can see objects a and b are pretty close yet they are in different categories so they can be perceived as of a different kind.

For example, the infinite debate about CSS vs CSS-in-JS. From those categories point of view, CSS Modules falls in CSS-in-JS, but it much closer to CSS's BEM methodologies.

Thinking in categories traps you and it is hard to see the big picture

Focusing on categories we sometimes forgot to pause and reevaluate what we are dealing with.

In programming, this principle manifests as "law of the instrument " e.g. if the only tool you have is a hammer, it is tempting to treat everything as if it were a nail.

For example, the debate about different paradigms like OOP and functional styles. It is possible to write the same application in both styles, so there is something common behind it, yet programmers argue about which category is better.

Different understanding of categories

Categories are chosen arbitrarily. And when there is no formal basis for these choices different people can choose differently and lately this will source of confusion.

In programming, this manifests as sketchy terms which have no precise meaning or "taste" for the code.

For, example strong vs weak types, some people will consider C++ as weak because of memory unsafety, some people will consider JS as weak because of implicit coercions, some people will consider Go as weak because of the absence of the polymorphism. In practice, this sketchy terminology is useless and must be replaced with a more precise one.

Conclusion

Be aware of categories you use and make sure you are not trapped in one set of buckets.

Top comments (3)

Collapse
 
swfisher profile image
Sam Fisher • Edited

Thanks for this thoughtful post.

The application of mathematical thinking to real things, like physics, always begins with some intuition from the domain. Universal logic will not help with the creation of useful qualitative constructs, and as such would never “dictate” the choice of categories.

Collapse
 
stereobooster profile image
stereobooster

Some choice of categories doesn't matter. For example, we chose base 10 numbers, because we have 10 fingers, we chose the value of 1 kg based on the fact that this is how much 1000 cubic centimeters of water weights in normal circumstances. And if we would choose a bit different measurements here nothing would basically change. Those are relative thins and it's ok.

Some choices are dictated because the information itself is discrete and applying discrete categories to discrete ideas is ok (I guess). For example, we have odd and even number and those two buckets perfectly describe all natural numbers without problems.

Problems with categories begin when you want to describe continuous information. You chose some categories, but later you forgot that essentially you are dealing with continuous information, not a discrete one.

Collapse
 
swfisher profile image
Sam Fisher • Edited

Yep, apart from the idea that those choices of unit have “nice properties” and so they do matter. Units also seem to not strictly be categories because it’s easy and natural to have a decimal and fractional # of units for something. This counteracts the danger in the “opaqueness” of thinking in discrete categories over continuous information.

There’s this other weirder problem, which is “how do you decide which dimensions or axes the information is varying over?” For example, if OOP and FP are two points that live in some continuous space, how do we choose axes to place those things on?

So not only do we not know how to make meaningful categories, we don’t know how to describe the continuous space in which we’re making a choice.

In fact, the idea of a one or many dimensional continuum to describe this is just as much an imperfect construct as the categorical distinctions we made within that continuum.