A common take on Kubernetes is that it's very complicated.
... and because it's complicated, the configuration is very verbose.
... and because there's so much config YAML, we need big toolchains just to handle that config.
I want to convince you that the arrow of blame points in the opposite direction!
Kubernetes has a simple, genius idea about how to manage configuration.
Because it's straightforward and consistent, we can manage more config than we ever could before! And now that we can manage oodles more config, we can build overcomplicated systems. Hooray!
The configs themselves may be complicated. So in this post, I'm going to skip the configs. I'll focus purely on the API machinery and how to explore that API.
Building APIs this way could benefit a lot of tools.
What is the Idea?
To explain the simple, genius idea, let's start with the simple, genius idea of Unix:
Everything is a file.
Or to be more precise, everything is a text stream. Unix programs read and write text streams. The filesystem is an API for finding text streams to read. Not all of these text streams are files!
-
~/hello-world.txt
is a text file -
/dev/null
is an empty text stream -
/proc
is a set of text streams for reading about processes
Let's take a closer look at /proc
. Here's a Julia Evans comic about it.
You can learn about what's running on your system by looking at /proc
, like:
- How many processes are running (
ls /proc
- List the processes) - What command line started process PID (
cat /proc/PID/cmdline
- Get the process specification) - How much memory process PID is using (
cat /proc/PID/status
- Get the process status)
What is the Kubernetes API?
The Kubernetes API is /proc
for distributed systems.
Everything is a resource over HTTP. We can explore every Kubernetes resource with a few HTTP GET commands.
To follow along, you'll need:
-
kind
- or any small, throwaway Kubernetes cluster -
curl
- or any CLI tool for sending HTTP requests -
jq
- or any CLI tool for exploring JSON -
kubectl
- to helpcurl
authenticate
Let's start by creating a cluster:
$ kind create cluster
Creating cluster "kind" ...
β Ensuring node image (kindest/node:v1.19.1) πΌ
β Preparing nodes π¦
β Writing configuration π
β Starting control-plane πΉοΈ
β Installing CNI π
β Installing StorageClass πΎ
Set kubectl context to "kind-kind"
You can now use your cluster with:
kubectl cluster-info --context kind-kind
Have a nice day! π
$ kubectl proxy &
Starting to serve on 127.0.0.1:8001
kubectl proxy
is a server that handles certificates for us, so that we don't need to worry about auth tokens with curl
.
The Kubernetes API has more hierarchy than /proc
. It's split into folders by version and namespace and resource type. The API path format looks like:
/api/[version]/namespaces/[namespace]/[resource]/[name]
On a fresh kind
cluster, there should be some pods already running in the kube-system
namespace we can look at. Let's list all the system processes in our cluster:
$ curl -s http://localhost:8001/api/v1/namespaces/kube-system/pods | head -n 20
{
"kind": "PodList",
"apiVersion": "v1",
"metadata": {
"selfLink": "/api/v1/namespaces/kube-system/pods",
"resourceVersion": "1233"
},
"items": [
{
"metadata": {
"name": "coredns-f9fd979d6-5zxtx",
"generateName": "coredns-f9fd979d6-",
"namespace": "kube-system",
"selfLink": "/api/v1/namespaces/kube-system/pods/coredns-f9fd979d6-5zxtx",
"uid": "a30e70cc-2b53-4511-a5de-57c80e5b68ad",
"resourceVersion": "549",
"creationTimestamp": "2021-03-04T15:51:21Z",
"labels": {
"k8s-app": "kube-dns",
"pod-template-hash": "f9fd979d6"
That's a lot of text! We can use jq
to pull out the names of objects.
$ curl -s http://localhost:8001/api/v1/namespaces/kube-system/pods | jq '.items[].metadata.name'
"coredns-f9fd979d6-5zxtx"
"coredns-f9fd979d6-bn6jz"
"etcd-kind-control-plane"
"kindnet-fcjkd"
"kube-apiserver-kind-control-plane"
"kube-controller-manager-kind-control-plane"
"kube-proxy-sn64n"
"kube-scheduler-kind-control-plane"
The /pods
endpoint lists out all the processes, like ls /proc
. If we want to look at a particular process, we can query /pods/POD_NAME
.
$ curl -s http://localhost:8001/api/v1/namespaces/kube-system/pods/kube-apiserver-kind-control-plane | head -n 10
{
"kind": "Pod",
"apiVersion": "v1",
"metadata": {
"name": "kube-apiserver-kind-control-plane",
"namespace": "kube-system",
"selfLink": "/api/v1/namespaces/kube-system/pods/kube-apiserver-kind-control-plane",
"uid": "a8f893b7-1cdb-48fd-9505-87d71c81adcb",
"resourceVersion": "458",
"creationTimestamp": "2021-03-04T15:51:17Z",
Or, again, we can use jq
to fetch a particular field.
$ curl -s http://localhost:8001/api/v1/namespaces/kube-system/pods/kube-apiserver-kind-control-plane | jq '.status.phase'
"Running"
How to unpack what kubectl
is doing
All of the things above can be done with kubectl
. kubectl
provides a more friendly interface. But if you're ever wondering what APIs kubectl
is calling, you can run it with -v 6
:
$ kubectl get -v 6 -n kube-system pods kube-apiserver-kind-control-plane
I0304 12:47:59.687088 3573879 loader.go:375] Config loaded from file: /home/nick/.kube/config
I0304 12:47:59.697325 3573879 round_trippers.go:443] GET https://127.0.0.1:44291/api/v1/namespaces/kube-system/pods/kube-apiserver-kind-control-plane 200 OK in 5 milliseconds
NAME READY STATUS RESTARTS AGE
kube-apiserver-kind-control-plane 1/1 Running 0 116m
For more advanced debugging, use -v 8
to see the complete response body.
The point isn't that you should throw away kubectl
in favor of curl
to interact with Kubernetes. Just like you shouldn't throw away ps
in favor of ls /proc
.
But I've found disecting Kubernetes like this is helpful to think of it as a process-management system built on a couple straightforward principles:
- Everything is a resource over HTTP.
- Every object is read and written the same way.
- All object state is readable.
These are powerful ideas1. They help us build tools that fit together well.
In the same way that we can pipe Unix tools together (like jq
), we can define new Kubernetes objects and combine them with existing ones.
Sometimes they're silly! Like in this Ellen KΓΆrbes talk on how to build a Useless Machine.
In future posts, I want to talk about how to write code that uses these APIs effectively. And how we're leaning into these ideas in Tilt. Stay tuned!
Originally published on the Tilt Dev Blog.
Image Credit: "Curling;--a Scottish Game, at Central Park" by John George Brown. Via Wikipedia.
Top comments (0)