Chris Noring for Microsoft Azure

Posted on Jul 1, 2019

How YOU actually use Kubernetes, starring YAML files

#kubernetes #devops #docker #tutorial

Follow me on Twitter, happy to take your suggestions on topics or improvements /Chris

I've described concepts such as Minkube, Pods, Nodes, Services and more over the course of four articles. Although educational you are likely to use Kubernetes in a different way than how I been showing you so far, you are likely to use YAML files

YAML files are files that let us define a configuration of whatever it may be.

YAML as in Y A M L, aka Y M C A? :)

No, not like that, more YAML as in Camel

Oh ok, tell me more

YAML actually stands for "YAML Ain't Markup Language" even though it from the beginning stood for Yet Another Markup Language. The reason for changing its meaning was to make clear its purpose as data-oriented, rather than document markup.

Yep, you got it, it's a recursive definition. It's also called a human-readable data serialization language.

Thanks for the history lesson, what about its role in Kubernetes?

As we said before we've been showing you Kubernetes usage by using the command line and that ain't quite how people use it.

Let me guess I use YAML files

Right, every single thing you can describe and deploy to Kubernetes can be described as a YAML file. It's actually simpler that way as well. But before we talk about specific Kubernetes usage we need to know some basics about the YAML files, so to understand the file when we are looking at it

I'm gonna need coffee for this right?

Resources

This article isn't really part of a series but I have written other parts on it. If you are completely new I recommend reading part I first below

Part I - from the beginning, Part I, Basics, Deployment and Minikube In this part, we cover why Kubernetes, some history and some basic concepts like deploying, Nodes, Pods.
Part II - introducing Services and Labeling In this part, we deepen our knowledge on Pods and Nodes. We also introduce Services and labeling using labels to query our artifacts.
Part III - Scaling
In this part, we look at scaling using desired state. We specify how many Pods we want and let Kubernetes do the heavy lifting in terms of ensuring that your Pods are scaled up to the desired number but also maintaining it by using something called self-healing.
Part IV - Auto scaling
This is how you likely will carry out scaling in Kubernetes by looking at CPU usage

Below is some more resource that I recommend you have a look at when Kubernetes is starting to click:

Horizontal Pod Auto scale This describes in detail how everything works. On a flow as well as API level
Free Azure Account If you want to try out AKS, Azure Kubernetes Service, you will need a free Azure account
Kubernetes.io One of the best resources to learn about Kubernetes is at this official Kubernetes site by Google.
Kubernetes overview An overview of Kubernetes, all its parts and how it works
Kubernetes in the Cloud Do you feel you know everything about Kubernetes already and just want to learn how to use a managed service? Then this link is for you
Documentation on AKS, Azure Kubernetes Service Azure Kubernetes Service, a managed Kubernetes
Best practices on AKS You already know AKS and want to learn how to use it better?

The YAML format

What do we need to know about the format to be able to work with it? Well, YAML is a superset of JSON which makes a JSON file a valid YAML file. That's an interesting piece of information but really want to know is the structures used within namely:

Maps
Lists

Ok, show me

Maps

Let's start with creating a file, let's call it config.yaml:

--- 
version: 1
name: some value

This above right here contains some useful insights on the YAML format. The --- is called a separator and is optional unless we have more than one structure in the file. What we see after that is version and name. The key is separated by its value with :.

This is supposed to be a superset of JSON right, aren't we missing double quotations?

Ah, sharp eyes, double quotations "" are actually optional. Why write more than you have to right? So the above corresponds to the following in JSON:

{
  "version": "1",
  "name": "some value"
}

The above is a map, but one of the keys in your Map can point to a map in turn, like the below example:

--- 
version: 1
name: some value,
address: 
  street: One Microsoft Way
  city: Redmond
  country: USA

Lists

Ok then, let's look at our next format lists. Consider the following example:

version: 1
todos:
 - shop
 - clean
 - do math
 - write essay

todos in this case is the list type and this would correspond to the following in JSON:

"version": "1",
"todos": ["shop", "clean", "do match", "write essay"]

What about list items that are maps?

That's simple:

created: 2019-01-01
items:
 - quantity: 2
   price: 10
   discounts:
     - id: 1
       displayName: 10% off  
     - id: 111
       displayName: Holiday Sale

The above would correspond to the following JSON:

{
  "created": "2019-01-01",
  "items": [{
    "quantity": 2,
    "price" : 111,
    "discounts": [{
      "id": 1,
      "displayName": "10% off"
    }, {
      "id": 111,
      "displayName": "Holiday Sale"
    }]
  }]
}

I get it, I think. What about Kubernetes?

Oh, one more thing before we turn all Kubernetes. Tabs = NOT cool. Spaces are what you use but be consistent, if you start using 2 spaces, usee that throughout.

YAML In Kubernetes

Most of the things you can deploy to a Cluster in Kubernetes can be described as a YAML file. There are some obvious advantages to that:

Saving your fingers: No more typing long constructs on the command line
Source control: YAML files can be stored in source control and thereby we can track changes
Flexibility: You’ll be able to create much more complex structures using YAML than you can on the command line

So what are we going to describe in YAML then? We will not only describe but deploy the following:

Pod, Pod is the smallest possible deployable unit. I know know, you usually don't deploy just a Pod, for the sake of practice lets do it this once
Deployment, this is more on the level of what we would want to deploy. It gives us the ability to defines replicas among other things

Describe and deploy a Pod

Lets build up YAML file for a Pod gradually and finish off by deploying to the Cluster. Ok, here we go:

—--
apiVersion: v1
kind: Pod

We start by specifying an apiVersion. Then we continue with specifying kind: Pod. Now this one is important. This can be many different types like a Deployment or a Service for example, Pod it is.

Metadata
The next part of our file is the metadata part. Here we give it a name and also a label:

metadata:
 name: My Pod
 labels:
   app: web

Specification

Now we come to the part where we describe the parts that make up the Pod.

spec:
  containers:
    - name: first-app
      image: gcr.io/google-samples/kubernetes-bootcamp:v1
      ports:
        - containerPort: 8080

All put together our pod.yaml file should now look like this:

apiVersion: v1
kind: Pod
metadata:
 name: my-pod
 labels:
   app: web
spec:
  containers:
    - name: first-app
      image: gcr.io/google-samples/kubernetes-bootcamp:v1
      ports:
        - containerPort: 8080

Deploying our POD YAML

At this point we want to deploy it to a cluster. So we will talk to our old friend again, minikube and kubectl.

Sometimes my minikube is a bit slow so I restart it with the command minikube start. Once that command has finished running let's check the pod situation with:

kubectl get pods

It should list all the Pods that are currently up and running:

As you can see above we have done a fair bit of experimenting in the past and created all sorts of Pods, this will obviously look different for you.

Thereafter let's take our pod.yaml file and create a Pod from it. To do that we run the command:

kubectl create -f pod.yaml

Which simply means create a Pod from a file -f, with file name pod.yaml. The result of this command is:

The result of the command implies we got a Pod created, let's rerun our pods listing command:

kubectl get pods

This time around our new Pod is up and running. But of course, deploying just a Pod like this is no good. The reason is that Pods are mortal, if it goes down, its gone. No service will bring it up again.

So why even show it?

It might be educational to show how you can create an artifact from a file but from a Kubernetes standpoint, it's not using all the wonderful services that could keep it alive if something happens. After all, that's why we have Kubernetes, for the resilience.

Therefore, let's look at a deployment next.

Before we continue let's clean up after ourselves with:

kubectl delete pods my-pod

This trigger something called a graceful termination, which is the recommended way of doing things. If you are impatient though you can force delete it with the following command:

kubectl delete pods my-pod --grace-period=0 --force

We of course delete the graceful way so after a while we get this:

Were you to run kubectl get pods our pod is now gone. All our app data, gone, vamose.

I get it, take it easy, I won't create Pods directly, can we continue?

Deployment

Ok then, deployments, the right way of doing things.

We will use something called a ReplicationController. Now, this is technically an older concept that is gradually being replaced by Deployments which are a high-level concept managing the deployment of ReplicaSets. There are issues on dashboard support and other things still, which makes all these similar concepts hang around to confuse us a bit more ;) Bottom line is its ok to use these older concepts for still some time.

This time around we will define a new type of object, a so-called ReplicationController. Now, this is a deployment that not only points out what image to deploy to a Pod but also how many Pods we want any given point, a so-called desired state. Let's have a look at the file, starting from the top:

apiVersion: v1
kind: ReplicationController
metadata: 
  name: my-rc

We define the kind as of type ReplicationController instead of Pod and we go on to name it in the metadata as my-rc. Now let's head down to the specification part:

spec:
  replicas: 5
  selector: 
    app: hello-world
  template:
    metadata:
      labels:
        app: hello-world
    spec:
      containers:
      - name: my-first-app
        image: gcr.io/google-samples/kubernetes-bootcamp:v1
        ports:
        - containerPort: 8080

The really interesting part here is replicas, the number of Pods and the rest of it is mainly us pointing out the same image again.

The specification looks slightly different at the top level, as it is a ReplicationController but the inner spec is what we are used to, pointing out the image, port and other things. Let's see the full file that we name rc.yaml:

apiVersion: v1
kind: ReplicationController
metadata: 
  name: my-rc
spec:
  replicas: 5
  selector: 
    app: hello-world
  template:
    metadata:
      labels:
        app: hello-world
    spec:
      containers:
      - name: my-first-app
        image: gcr.io/google-samples/kubernetes-bootcamp:v1
        ports:
        - containerPort: 8080

We create the above deployment based on the file by typing:

kubectl create -f rc.yaml

Above we see that our replication controller is created.

Lets list all our pods right now and make sure we get as many pods as expected:

kubectl get pods

Looking at the above listing we see that we have 5 replicas, 5 instances of our Pod. We could try and delete one and see what happens:

kubectl delete pods my-rc-wx6h7

It tells us the above Pod is deleted so we wait for the graceful deletion to finish and then we list our Pods again and this is what we get:

As you can see above, our specific Pod is deleted but it's being replaced by another one, the number of replicas is intact, Kubernetes is doing its job

So how do we bring it down, garlic, a stake?

Let's start by listing all replication controllers:

kubernetes get replicationcontroller

Now there are two way we can bring it down, garlic or deletion ;)

Yea yea very funny, deletion, tell me more bored

It's as simple as typing

kubectl delete replicationcontroller [name of replication controller]

What happens if you do the above is that it will not only delete the replication controller but the all the pods with it, so-called cascading delete. If you just want the replication controller gone you can specify --cascade=false. We want it all gone so we just type:

kubectl delete my-rc

That should remove all of it. If we run kubectl get replicationcontroller it should be gone. Let's run kubectl get pods and make sure the pods are gone too.

Let's take away the other replication controller as well, it's been hanging around my system for far too long:

kubectl delete replicationcontroller my-rc

Quickly hit:

kubectl get pods

and you will be able to see the Pods being terminated:

Summary

In this article, we talked about the file format YAML, what it is, how it works. We also discussed how we should be using YAML files to configure our Kubernetes experience. Furthermore, we showed how to deploy Pods but also ReplicationControllers. Hopefully, this was educational and gave you a taste on what real Kubernetes management looks like.

Top comments (1)

HyScale • Dec 9 '19

Hi Chris! Very educational on the much-dreaded YAML files while deploying to k8s. we at team Hyscale have been working on an open-source tool, that autogenerates k8s manifests for developers looking to deploy, eliminating the need to write all that YAML.

We would love to know the community's take on it and are open to feedback. :) We are at github.com/hyscale/hyscale

DEV Community

How YOU actually use Kubernetes, starring YAML files

Resources

The YAML format

Maps

Lists

YAML In Kubernetes

Describe and deploy a Pod

Deploying our POD YAML

Deployment

Summary

Top comments (1)

Read next

Retour d'expérience : Quand ECS s'impose comme une alternative pertinente à Kubernetes

My Top Cursor Tips (v0.43)

From Sunshine to Snowfall: Crafting Weather-Based UIs with DevCycle Feature Flag Challenge

Building and Deploying TypeScript Microservices to Kubernetes