Mathieu Rey

Posted on Oct 15, 2020 • Originally published at matrey.github.io

Build your own Ubuntu AMI

#aws #ubuntu

The steps in this article are run from an EC2 instance in the same region you want your AMI to be registered into. This builder VM should have the same architecture & OS as the image you intend to build.
You can use the official Ubuntu AMI the first time, then use your own Ubuntu AMI for subsequent builds.

Note that the following focuses on building an Ubuntu bionic (18.04 LTS) AMI, as it has been my workhorse for the past 2 years, and is still supported for another couple of years.
But if you are starting fresh, you should probably aim at focal (20.04 LTS).

Leverage Ubuntu Cloud Images

Rather than attempting to build an image from scratch, we will start with a ready made image from https://cloud-images.ubuntu.com

Given a codename (e.g. bionic or focal) we need to identify the most recent release available.

Ubuntu publishes release data in simple streams format. You can read more about it on https://github.com/smoser/talk-simplestreams/blob/master/Notes.txt

The following can be used to find the most recent bionic build:

$ sudo apt install simplestreams
$ sstream-query --no-verify --json --max=1 https://cloud-images.ubuntu.com/releases/streams/v1/com.ubuntu.cloud:released:download.sjson arch=amd64 release='bionic' ftype='disk1.img'
[
  {
    "aliases": "18.04,b,bionic",
    "arch": "amd64",
    "content_id": "com.ubuntu.cloud:released:download",
    "datatype": "image-downloads",
    "format": "products:1.0",
    "ftype": "disk1.img",
    "item_name": "disk1.img",
    "item_url": "https://cloud-images.ubuntu.com/releases/server/releases/bionic/release-20201014/ubuntu-18.04-server-cloudimg-amd64.img",
    "label": "release",
    "license": "http://www.canonical.com/intellectual-property-policy",
    "md5": "9aa011b2b79b1fe42a7c306555923b1b",
    "os": "ubuntu",
    "path": "server/releases/bionic/release-20201014/ubuntu-18.04-server-cloudimg-amd64.img",
    "product_name": "com.ubuntu.cloud:server:18.04:amd64",
    "pubname": "ubuntu-bionic-18.04-amd64-server-20201014",
    "release": "bionic",
    "release_codename": "Bionic Beaver",
    "release_title": "18.04 LTS",
    "sha256": "9fdd8fa3091b8a40ea3f571d3461b246fe4e75fbd329b217076f804c9dda06a3",
    "size": "359923712",
    "support_eol": "2023-04-26",
    "supported": "True",
    "updated": "Wed, 14 Oct 2020 18:20:34 +0000",
    "version": "18.04",
    "version_name": "20201014"
  }
]

However, it feels a bit overkill for what we need, and fortunately, there is also a "low tech" option available, relying on some text files at specific URLs.

There is a section at the bottom of https://help.ubuntu.com/community/UEC/Images explaining how these text files work:

Machine Consumable Ubuntu Cloud Guest images Availability Data

In order to provide information about what builds are available for download or running on ec2, a 'query' interface is exposed at http://cloud-images.ubuntu.com/query . This will allow users of the service to download images or find out the latest ec2 AMIs programmatically.

The data is laid out as follows:

There are 2 files in top level director 'daily.latest.txt' and 'released.latest.txt'. Each of these files contains tab delimited data, with 4 fields per record. daily.latest.txt has information about the daily builds, released.latest.tt about released builds:
<suite> <build_name> <label>     <serial>
hardy   server       release     20100128
For each record in the top level files another set of files will exist:

<suite>/<build_name>/released-dl.current.txt downloadable images data for the most recent released build

[...]

The downloadable image data files contain 7 tab delimited fields:
<suite>  <build_name> <label> <serial> <arch> <download_path> <suggested_name>
maverick server       daily   20100826 i386   server/maverick/20100826/maverick-server-uec-i386.tar.gz  ubuntu-maverick-daily-i386-server-20100826

We are interested in the "released" version, not the "daily" builds.
So we just need to download https://cloud-images.ubuntu.com/query/bionic/server/released-dl.current.txt and grep for our architecture:

$ curl -Ss 'https://cloud-images.ubuntu.com/query/bionic/server/released-dl.current.txt' | grep amd64 | tr '\t' '\n'
bionic
server
release
20201014
amd64
server/releases/bionic/release-20201014/ubuntu-18.04-server-cloudimg-amd64.tar.gz
ubuntu-bionic-18.04-amd64-server-20201014

Note that we could also directly take a blind shot and download from https://cloud-images.ubuntu.com/releases/bionic/release/, which always points to the latest release. But:

knowing the build date helps validate the most recent image is newer than what we already have (2 to 3 weeks can elapse between 2 "released" images)
we would need to rely on a hardcoded name fragment, e.g. "ubuntu-18.04-server-cloudimg-amd64" for bionic

Convert the image

Column 6 of "released-dl.current.txt" provides us the "download_path", with a URI ending in ".tar.gz":

server/releases/bionic/release-20201014/ubuntu-18.04-server-cloudimg-amd64.tar.gz

According to the directory listing, this is in a "Cloud Image/EC2 tarball" format.

However... I never got a booting instance this way, so instead we will go for the ".img" version, which un-helpfully reads "USB image", but is actually a qcow2 disk image.

First, we need to convert the qcow2 image into raw. Note that where qcow2 images are "sparse" (unused disk space doesn't count), raw images require the same amount of space as their size (i.e. several GB).

$ sudo apt-get install qemu-utils
$ qemu-img convert -O raw ubuntu-18.04-server-cloudimg-amd64.img ubuntu-18.04-server-cloudimg-amd64.raw

The raw image is ready to "burn" into an EBS volume. We will leverage the AWS CLI for the following steps.

Make the AMI

Here is a high-level overview of what we need to do:

Create a new EBS volume
Attach the new EBS to the current instance managing the build
Use dd to write the raw image to the volume
Detach the volume
Request a snapshot of the volume and wait until it is completed
Delete the volume
Register the snapshot as an AMI

We will need to create an IAM policy that allows these actions through AWS APIs, without exposing the account too much.
Granularity is not too great, and the way I found to lock down actions as much as possible relies on tags (on the builder instance and on the volume & snapshot).

The builder VM should have a tag "WithRole=amibuilder"
We will give the same tag to the temporary EBS volume we will write the image onto

While not as tightly locked down as I would have liked, restricting permissions this way still has the nice side-effect of preventing some mistakes (e.g. can't detach the wrong volume from a different instance)

Here is the IAM policy I ended up with:

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "ec2:RegisterImage",
                "ec2:DescribeVolumes",
                "ec2:CreateSnapshot",
                "ec2:DescribeSnapshots",
                "ec2:CreateVolume"
            ],
            "Resource": "*"
        },
        {
            "Effect": "Allow",
            "Action": [
                "ec2:DetachVolume",
                "ec2:AttachVolume",
                "ec2:DeleteVolume"
            ],
            "Resource": "*",
            "Condition": {
                "StringEquals": {
                    "ec2:ResourceTag/WithRole": "amibuilder"
                }
            }
        },
        {
            "Effect": "Allow",
            "Action": "ec2:CreateTags",
            "Resource": "arn:aws:ec2:*:*:volume/*",
            "Condition": {
                "StringEquals": {
                    "ec2:CreateAction": "CreateVolume"
                }
            }
        },
        {
            "Effect": "Allow",
            "Action": "ec2:CreateTags",
            "Resource": "arn:aws:ec2:*:*:snapshot/*",
            "Condition": {
                "StringEquals": {
                    "ec2:CreateAction": "CreateSnapshot"
                }
            }
        }
    ]
}

Subsequent steps rely on the AWS cli. Setup instructions are provided at: https://docs.aws.amazon.com/cli/latest/userguide/install-cliv2-linux.html

curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip"
unzip awscliv2.zip
sudo ./aws/install
rm -rf aws awscliv2.zip

We want to create a volume just big enough for our raw image. So for that, we need to check the image size, and the trick is to use du --apparent-size (on some bionic raw image, du returned 1.1 GB whereas du --apparent-size returned a more proper 2.2 GB). And using --block-size=1G directly gives us a value rounded up to the next GB.

$ SIZE=$( du --block-size=1G --apparent-size "ubuntu-18.04-server-cloudimg-amd64.raw" | cut -f 1 )
$ aws ec2 create-volume --tag-specifications 'ResourceType=volume,Tags=[{Key=WithRole,Value=amibuilder},{Key=Name,Value="AMI building"}]' --availability-zone "ap-southeast-1" --size "$SIZE" --volume-type gp2 --output text --query 'VolumeId'

Once the volume is created, we need to attach it to our builder instance. But it's not really obvious where it will end up: /dev/sd{x}? /dev/xvd{x}? /dev/nvme{x}n{y}?
We will use lsblk before attaching the volume, then after attaching it, and compare the two, to figure out where the new volume landed.

lsblk --json | jq --raw-output .blockdevices[].name | sort > "${TMPDIR}/volumes-before.txt"
cp "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt"

instance_id=$(curl -L -Ss http://169.254.169.254/latest/meta-data/instance-id)
aws ec2 attach-volume --device /dev/sdi --instance-id "$instance_id" --volume-id "$volumeid" --output text --query 'State'

while cmp "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt" >/dev/null 2>/dev/null; do
  lsblk --json | jq --raw-output .blockdevices[].name | sort > "${TMPDIR}/volumes-after.txt"
  sleep 3;
done

# If we are here, it means a new device appeared
NEWDEV=$( comm -13 "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt" | head -n1 )
dev=/dev/${NEWDEV}

As an extra precaution, in the case of NVMe volumes, we can actually query the "sn" attribute and confirm it matches the ID returned by ec2 create-volume.

$ sudo apt-get install nvme-cli
$ nvme id-ctrl "/dev/${DEVICE_NAME}" --output-format=json | jq --raw-output .sn | sed -e 's/vol/vol-/'

We then just dd the raw image onto the device

dd if="ubuntu-18.04-server-cloudimg-amd64.raw" of="$dev" bs=8M

Then run the rest of the API calls to detach the volume and make a snapshot of it. Note that snapshots are fairly slow, it's not uncommon to have to wait several minutes, even for a tiny 4 GB volume.
Once we have the snapshot, we can delete the volume.

# Detach the volume
aws ec2 detach-volume --volume-id "$volumeid" --output text --query 'State'
while aws ec2 describe-volumes --volume-id "$volumeid" --output text --query 'Volumes[*].State' | grep -v -q available; do
  sleep 3;
done

# Create a snapshot of the volume
snapshotid=$(aws ec2 create-snapshot --tag-specifications 'ResourceType=snapshot,Tags=[{Key=Name,Value="For AMI"}]' --description "${LABEL}" --volume-id "$volumeid" --output text --query 'SnapshotId')
while aws ec2 describe-snapshots --snapshot-id "$snapshotid" --output text --query 'Snapshots[*].State' | grep -q pending; do
  sleep 10;
done

# We can now delete the volume
aws ec2 delete-volume --volume-id "$volumeid" --output text

Finally, we register the snapshot as an AMI.

If you want to have a nice logo in the AMI list, make sure to include "ubuntu" somewhere into the AMI name.
Thanks https://www.turnkeylinux.org/comment/12501:

AWS automatically 'determine' the platform by parsing the image name. So, if Ubuntu is included in the image name, the platform will be Ubuntu. If Redhat is included in the name, the platform will be Redhat. It's just for show...

# Register the snapshot as a new AMI
block_device_mapping=$(cat <<EOF
[
  {
    "DeviceName": "/dev/sda1",
    "Ebs": {
      "DeleteOnTermination": false, 
      "SnapshotId": "$snapshotid",
      "VolumeSize": $SIZE,
      "VolumeType": "gp2"
    }
  }, {
    "DeviceName": "/dev/sdb",
    "VirtualName": "ephemeral0"
  }
]
EOF
)

amiid=$(aws ec2 register-image --name "${LABEL}" --ena-support --description "${LABEL}" --architecture x86_64 --virtualization-type hvm --block-device-mapping "$block_device_mapping" --root-device-name "/dev/sda1" --output text --query 'ImageId')
echo "Published AMI ${amiid} in region ${AWS_DEFAULT_REGION}"

That's it! You are now able to use your own AMI! Which is... pretty much identical to the official Ubuntu AMI. So what's the point?

Building our own AMIs actually allows us to customize them.

Customize the AMI

Once you have the raw image, you can actually attach it to a loop device, mount it, and edit it.
For that we will use losetup:

devloop=$( losetup -f ) # e.g. /dev/loop0
losetup -f -P "ubuntu-18.04-server-cloudimg-amd64.raw"

MOUNTPOINT=/mount/image
mkdir -p "$MOUNTPOINT"
mount "${devloop}p1" "$MOUNTPOINT"

Next, in order to run our custom script in a chroot, we need to tweak the environment a bit:

# Allow network access from chroot environment
if [[ -e "$MOUNTPOINT/etc/resolv.conf" ]] || [[ -L "$MOUNTPOINT/etc/resolv.conf" ]]; then
  mv $MOUNTPOINT/etc/resolv.conf $MOUNTPOINT/etc/resolv.conf.bak
fi
cat /etc/resolv.conf > $MOUNTPOINT/etc/resolv.conf

# Extra mounts
mount -t proc none $MOUNTPOINT/proc/
mount -t sysfs none $MOUNTPOINT/sys/
mount -o bind /dev $MOUNTPOINT/dev/

# prevent daemons from starting during apt-get
echo -e '#!/bin/sh\nexit 101' > $MOUNTPOINT/usr/sbin/policy-rc.d
chmod 755 $MOUNTPOINT/usr/sbin/policy-rc.d

We can now run our script under chroot:

cp "${CHROOT_SCRIPT}" $MOUNTPOINT/tmp/custom_user_script
chroot $MOUNTPOINT /tmp/custom_user_script
rm -f $MOUNTPOINT/tmp/custom_user_script

This is not very different from RUN lines in a Dockerfile. Here are for instance some commands I would use to setup Docker, Netdata (monitoring) and Fluentbit (logging) on the image:

# Add utilities
apt-get install --no-install-recommends -y apt-transport-https ca-certificates curl software-properties-common make zip unzip jq

# Install docker
curl -L -o /etc/apt/trusted.gpg.d/docker.asc 'https://download.docker.com/linux/ubuntu/gpg'
add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"
apt-get update
apt-get install -y docker-ce docker-ce-cli containerd.io
systemctl enable docker

# Install netdata
curl -L -o /etc/apt/trusted.gpg.d/netdata.asc https://packagecloud.io/netdata/netdata/gpgkey
add-apt-repository -s "deb https://packagecloud.io/netdata/netdata/ubuntu/ bionic main"
apt-get update
apt-get install -y netdata

# Install fluentbit
curl -L -o /etc/apt/trusted.gpg.d/fluentbit.asc http://packages.fluentbit.io/fluentbit.key
add-apt-repository "deb http://packages.fluentbit.io/ubuntu/bionic bionic main"
apt-get update
apt-get install -y td-agent-bit

Note that some packages don't like being installed that way ; e.g. I wasn't able to preinstall Percona server, because it has a post-install script waiting forever on the service to start. YMMV.

Once we are done, we cleanup out tweaks:

# Unmount extra mountpoints
for PT in dev proc sys; do
  umount "$MOUNTPOINT/$PT"
done

# Put resolv.conf symlink back in place
rm -f  $MOUNTPOINT/etc/resolv.conf
if [[ -e "$MOUNTPOINT/etc/resolv.conf.bak" ]] || [[ -L "$MOUNTPOINT/etc/resolv.conf.bak" ]]; then
  mv $MOUNTPOINT/etc/resolv.conf.bak $MOUNTPOINT/etc/resolv.conf
fi

# Clean up policy-rc.d
rm -f $MOUNTPOINT/usr/sbin/policy-rc.d

Complete script

	#!/bin/bash
	set -euo pipefail
	DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"

	# Thanks:
	# * https://github.com/alestic/alestic-git/blob/master/bin/alestic-git-build-ami for the overall approach and ec2 commands
	# * https://github.com/kickstarter/build-ubuntu-ami/blob/master/data/user_data.sh.erb for the user script run in chroot
	# * https://blog.tinned-software.net/mount-raw-image-of-entire-disc/ for how to mount the raw image with losetup


	# Required IAM policy (replace "aws-cn" by "aws" if outside of China):
	#
	# {
	# "Version": "2012-10-17",
	# "Statement": [
	# {
	# "Effect": "Allow",
	# "Action": [
	# "ec2:RegisterImage",
	# "ec2:DescribeVolumes",
	# "ec2:CreateSnapshot",
	# "ec2:DescribeSnapshots",
	# "ec2:CreateVolume"
	# ],
	# "Resource": "*"
	# },
	# {
	# "Effect": "Allow",
	# "Action": [
	# "ec2:DetachVolume",
	# "ec2:AttachVolume",
	# "ec2:DeleteVolume"
	# ],
	# "Resource": "*",
	# "Condition": {
	# "StringEquals": {
	# "ec2:ResourceTag/WithRole": "amibuilder"
	# }
	# }
	# },
	# {
	# "Effect": "Allow",
	# "Action": "ec2:CreateTags",
	# "Resource": "arn:aws-cn:ec2:::volume/*",
	# "Condition": {
	# "StringEquals": {
	# "ec2:CreateAction": "CreateVolume"
	# }
	# }
	# },
	# {
	# "Effect": "Allow",
	# "Action": "ec2:CreateTags",
	# "Resource": "arn:aws-cn:ec2:::snapshot/*",
	# "Condition": {
	# "StringEquals": {
	# "ec2:CreateAction": "CreateSnapshot"
	# }
	# }
	# }
	# ]
	# }
	#
	# Also, the VM running this script should have a tag "WithRole=amibuilder"

	# Required AWS CLI config file:
	#
	# [default]
	# aws_access_key_id=
	# aws_secret_access_key=



	# To install AWS cli
	# From https://docs.aws.amazon.com/cli/latest/userguide/install-cliv2-linux.html
	#
	# curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip"
	# unzip awscliv2.zip
	# sudo ./aws/install
	# rm -rf aws awscliv2.zip

	# To install nvme
	# apt-get install nvme-cli

	# To install qemu-img
	# apt-get install qemu-utils

	for needcommand in curl aws jq lsblk qemu-img losetup; do
	command -v "$needcommand" >/dev/null 2>&1 \|\| { echo >&2 "This script requires ${needcommand}"; exit 1; }
	done

	# Check if disks are "sdX"/"xvdX" or "nvmeXnY", require nvme-cli if needed
	if [[ "$( lsblk --json \| jq --raw-output .blockdevices[].name \| head -n1 \| cut -c 1-4 )" == "nvme" ]]; then
	command -v nvme >/dev/null 2>&1 \|\| { echo >&2 "This script requires nvme (apt-get install nvme-cli)"; exit 1; }
	fi

	CODENAME=
	AZ=
	ADDSIZE=
	MORERECENT=
	KEEPBIN=
	CHROOT_SCRIPT=
	IMGNAME=
	while [ $# -gt 0 ]; do
	case $1 in
	--codename) CODENAME=$2; shift 2 ;;
	--az) AZ=$2; shift 2 ;; # (optional) We need the AZ to use for creating the volume. It should match where this script is running. Auto detected through instance metadata if missing.
	--if-more-recent) MORERECENT=$2; shift 2 ;; # (optional) YYYYMMDD format
	--keep-binaries) KEEPBIN=1; shift ;;
	--run-script) CHROOT_SCRIPT=$2; shift 2 ;;
	--add-size) ADDSIZE=$2; shift 2 ;; # (optional) integer + unit (e.g. M or G)
	--label) IMGNAME=$2; shift 2 ;;
	*) echo "$0: Unrecognized option: $1" >&2; exit 1;
	esac
	done

	if [[ -z "$CODENAME" ]]; then
	echo "$0: Missing --codename (e.g. bionic)" >&2; exit 1;
	fi
	if [[ -z "$IMGNAME" ]]; then
	IMGNAME=Ubuntu
	fi

	if [[ -z "$AWS_SHARED_CREDENTIALS_FILE" ]]; then
	echo "$0: Missing env var AWS_SHARED_CREDENTIALS_FILE" >&2; exit 1;
	fi

	if [[ ! -z "${CHROOT_SCRIPT}" ]]; then
	if [[ ! -x "${CHROOT_SCRIPT}" ]]; then
	echo "$0: --run-script target does not exist or is not executable" >&2; exit 1;
	fi
	fi

	if [[ -z "$AZ" ]]; then
	# Try to auto-detect the AZ
	AZ=$( curl -L -Ss "http://169.254.169.254/latest/meta-data/placement/availability-zone" )
	if [[ -z "$AZ" ]]; then
	echo "$0: Missing --az (e.g. cn-north-1a) and failed to auto-detect it" >&2; exit 1;
	fi
	fi

	# Prepare temp directory
	BUILDTIME=$( date +%s )
	TMPDIR=$DIR/images/tmp-${BUILDTIME}
	mkdir -p "${TMPDIR}"
	# shellcheck disable=SC2064
	trap "rm -rf '${TMPDIR}'" EXIT


	if [[ "${AZ:0:2}" == "cn" ]]; then
	# Use China mirror
	MIRRORBASEAPI=https://mirrors.nju.edu.cn/ubuntu-cloud-images
	MIRRORBASEDL=http://mirrors.nju.edu.cn/ubuntu-cloud-images # We use http for faster download ; it's OK because we verify the file sha256sum for integrity
	else
	MIRRORBASEAPI=https://cloud-images.ubuntu.com
	MIRRORBASEDL=https://cloud-images.ubuntu.com
	fi

	# Check what is the most recent Ubuntu release available for our codename
	curl -L -Ss "${MIRRORBASEAPI}/query/${CODENAME}/server/released-dl.current.txt" \| grep amd64 > "${TMPDIR}/release.txt"
	RELEASEDATE=$( cat "${TMPDIR}/release.txt" \| cut -f 4 )
	if [[ ! -z "$MORERECENT" ]]; then
	# We verify the release is more recent than what we already have
	DAYS=$((RELEASEDATE - MORERECENT))
	if [[ "$DAYS" -le 0 ]]; then
	exit 0
	fi
	fi
	if [[ -z "$RELEASEDATE" ]]; then
	echo "Unknown codename $CODENAME" >&2
	exit 1
	fi

	# If we are here, we need to get the base image
	mkdir -p "$DIR/images"
	OUTPUT=$DIR/images/ubuntu-${CODENAME}-${RELEASEDATE}.qcow2
	DLURL=$( cat "${TMPDIR}/release.txt" \| cut -f 6 \| sed -e 's/tar\.gz$/img/' )
	if [[ ! -f "$OUTPUT" ]]; then
	# Need to download
	curl -L -Ss "${MIRRORBASEDL}/${DLURL}" -o "${OUTPUT}"
	fi
	CSURL=$( dirname "${MIRRORBASEAPI}/${DLURL}" )/SHA256SUMS
	CSNAME=$( basename "${MIRRORBASEAPI}/${DLURL}" )
	CHECKSUM_EXPECTED=$( curl -L -Ss "${CSURL}" \| grep "${CSNAME}" \| cut -f 1 -d ' ' )
	CHECKSUM_GOTTEN=$( sha256sum "${OUTPUT}" \| cut -f 1 -d ' ' )
	if [[ "$CHECKSUM_EXPECTED" != "$CHECKSUM_GOTTEN" ]]; then
	echo "$0: Bad checksum on download!" >&2; exit 1;
	fi

	LABEL="${IMGNAME} ${CODENAME} ${RELEASEDATE} (build ${BUILDTIME})"

	# Convert qcow2 image to raw
	RAWOUTPUT=$TMPDIR/image.raw
	qemu-img convert -O raw "$OUTPUT" "$RAWOUTPUT"

	if [[ -z "$KEEPBIN" ]]; then
	# No need to keep the source image
	rm -f "$OUTPUT"
	fi

	# Add space to the image if requested
	if [[ ! -z "${ADDSIZE}" ]]; then
	qemu-img resize -f raw "$RAWOUTPUT" "+${ADDSIZE}"
	fi

	# Mount the volume
	devloop=$( losetup -f ) # e.g. /dev/loop0
	losetup -f -P "$RAWOUTPUT"

	MOUNTPOINT=/mount/image
	mkdir -p "$MOUNTPOINT"
	mount "${devloop}p1" "$MOUNTPOINT"
	echo "[SIDE EFFECT] Mounted ${devloop}p1 under $MOUNTPOINT"

	# Run additional commands in a chroot
	if [[ ! -z "${CHROOT_SCRIPT}" ]]; then

	# Use all the space
	if [[ ! -z "${ADDSIZE}" ]]; then
	growpart "$devloop" 1
	resize2fs "${devloop}p1"
	fi

	# Allow network access from chroot environment
	if [[ -e "$MOUNTPOINT/etc/resolv.conf" ]] \|\| [[ -L "$MOUNTPOINT/etc/resolv.conf" ]]; then
	mv $MOUNTPOINT/etc/resolv.conf $MOUNTPOINT/etc/resolv.conf.bak
	fi
	cat /etc/resolv.conf > $MOUNTPOINT/etc/resolv.conf

	# Extra mounts
	mount -t proc none $MOUNTPOINT/proc/
	mount -t sysfs none $MOUNTPOINT/sys/
	mount -o bind /dev $MOUNTPOINT/dev/

	# prevent daemons from starting during apt-get
	echo -e '#!/bin/sh\nexit 101' > $MOUNTPOINT/usr/sbin/policy-rc.d
	chmod 755 $MOUNTPOINT/usr/sbin/policy-rc.d

	# RUN CUSTOM USER SCRIPT
	cp "${CHROOT_SCRIPT}" $MOUNTPOINT/tmp/custom_user_script
	chroot $MOUNTPOINT /tmp/custom_user_script
	rm -f $MOUNTPOINT/tmp/custom_user_script

	# Unmount extra mountpoints
	for PT in dev proc sys; do
	umount "$MOUNTPOINT/$PT"
	done

	# Put resolv.conf symlink back in place
	rm -f $MOUNTPOINT/etc/resolv.conf
	if [[ -e "$MOUNTPOINT/etc/resolv.conf.bak" ]] \|\| [[ -L "$MOUNTPOINT/etc/resolv.conf.bak" ]]; then
	mv $MOUNTPOINT/etc/resolv.conf.bak $MOUNTPOINT/etc/resolv.conf
	fi

	# Clean up policy-rc.d
	rm -f $MOUNTPOINT/usr/sbin/policy-rc.d

	fi

	umount -l "$MOUNTPOINT"
	rmdir "$MOUNTPOINT"

	losetup -d "$devloop"

	export AWS_DEFAULT_REGION=${AZ: : -1} # remove the last character to convert the AZ code into a region code

	# List the volumes currently attached
	lsblk --json \| jq --raw-output .blockdevices[].name \| sort > "${TMPDIR}/volumes-before.txt"
	cp "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt"

	# Create and attach a temporary EBS volume
	SIZE=$( du --block-size=1G --apparent-size "$RAWOUTPUT" \| cut -f 1 ) # --apparent-size is mandatory (with it 2.2 GB ; without 1.1 GB)
	volumeid=$(aws ec2 create-volume --tag-specifications 'ResourceType=volume,Tags=[{Key=WithRole,Value=amibuilder},{Key=Name,Value="AMI building"}]' --availability-zone "$AZ" --size "$SIZE" --volume-type gp2 --output text --query 'VolumeId' )
	echo "[SIDE EFFECT] Created volume $volumeid"
	while aws ec2 describe-volumes --volume-id "$volumeid" --output text --query 'Volumes[*].State' \| grep -v -q available; do
	sleep 3;
	done

	instance_id=$(curl -L -Ss http://169.254.169.254/latest/meta-data/instance-id)
	aws ec2 attach-volume --device /dev/sdi --instance-id "$instance_id" --volume-id "$volumeid" --output text --query 'State'
	echo "[SIDE EFFECT] Attached volume $volumeid to instance $instance_id under device path /dev/sdi"
	while cmp "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt" >/dev/null 2>/dev/null; do
	lsblk --json \| jq --raw-output .blockdevices[].name \| sort > "${TMPDIR}/volumes-after.txt"
	sleep 3;
	done

	# If we are here, it means a new device appeared
	NEWDEV=$( comm -13 "${TMPDIR}/volumes-before.txt" "${TMPDIR}/volumes-after.txt" \| head -n1 )
	case "$NEWDEV" in
	sdi\|xvdi)
	dev=/dev/${NEWDEV}
	;;
	nvm*)
	# we can use nvme cli to verify it's the right volume
	dev=/dev/${NEWDEV}
	if [[ "$volumeid" != "$( nvme id-ctrl "$dev" --output-format=json \| jq --raw-output .sn \| sed -e 's/vol/vol-/' )" ]]; then
	echo >&2 "NVMe volume mismatch"
	exit 1
	fi
	;;
	*)
	echo >&2 "Did not find the volume"
	exit 1
	;;
	esac

	# Write the image to disk
	dd if="$RAWOUTPUT" of="$dev" bs=8M
	rm -f "$RAWOUTPUT"

	# Detach the volume
	aws ec2 detach-volume --volume-id "$volumeid" --output text --query 'State'
	while aws ec2 describe-volumes --volume-id "$volumeid" --output text --query 'Volumes[*].State' \| grep -v -q available; do
	sleep 3;
	done

	# Create a snapshot of the volume
	snapshotid=$(aws ec2 create-snapshot --tag-specifications 'ResourceType=snapshot,Tags=[{Key=Name,Value="For AMI"}]' --description "${LABEL}" --volume-id "$volumeid" --output text --query 'SnapshotId')
	echo "[SIDE EFFECT] Created a snapshot $snapshotid"
	while aws ec2 describe-snapshots --snapshot-id "$snapshotid" --output text --query 'Snapshots[*].State' \| grep -q pending; do
	sleep 10;
	done

	# We can now delete the volume
	aws ec2 delete-volume --volume-id "$volumeid" --output text

	# Register the snapshot as a new AMI
	block_device_mapping=$(cat <<EOF
	[
	{
	"DeviceName": "/dev/sda1",
	"Ebs": {
	"DeleteOnTermination": false,
	"SnapshotId": "$snapshotid",
	"VolumeSize": $SIZE,
	"VolumeType": "gp2"
	}
	}, {
	"DeviceName": "/dev/sdb",
	"VirtualName": "ephemeral0"
	}
	]
	EOF
	)

	amiid=$(aws ec2 register-image --name "${LABEL}" --ena-support --description "${LABEL}" --architecture x86_64 --virtualization-type hvm --block-device-mapping "$block_device_mapping" --root-device-name "/dev/sda1" --output text --query 'ImageId')
	echo "[SIDE EFFECT] Created AMI ${amiid}"
	echo "Published AMI ${amiid} in region ${AWS_DEFAULT_REGION}"

	exit 0

view raw ec2-create-ubuntu-ami.sh hosted with ❤ by GitHub

	sudo env AWS_SHARED_CREDENTIALS_FILE=/home/ubuntu/.aws-creds \
	bash /home/ubuntu/ec2-create-ubuntu-ami.sh \
	--codename bionic --keep-binaries --add-size 1G \
	--run-script /home/ubuntu/ubuntu-bionic-extra.sh \
	--label ubuntu-docker-host 2>&1 \| tee log-docker-ami-2020-10-12.log

view raw sample-usage.sh hosted with ❤ by GitHub

	#!/bin/bash

	set -euo pipefail
	set -vx

	AZ=$( curl -L -Ss "http://169.254.169.254/latest/meta-data/placement/availability-zone" )

	# Ubuntu mirror
	if [[ "${AZ:0:2}" == "cn" ]]; then
	sed -i 's#://.\.ubuntu\.com[^ ]#://mirrors.tuna.tsinghua.edu.cn/ubuntu/#gi' /etc/apt/sources.list
	fi

	# Update but skipping kernel
	# From https://www.bonusbits.com/wiki/HowTo:Upgrade_Ubuntu_without_Updating_the_Kernel
	apt-mark hold linux-image-generic linux-headers-generic
	apt-get update
	apt-get -y upgrade
	apt-mark unhold linux-image-generic linux-headers-generic

	# Add utilities
	apt-get install --no-install-recommends -y apt-transport-https ca-certificates curl software-properties-common make zip unzip jq

	# Install docker
	if [[ "${AZ:0:2}" == "cn" ]]; then
	dockerkey=https://download.docker.com/linux/ubuntu/gpg
	dockerrepo=https://mirrors.tuna.tsinghua.edu.cn/docker-ce/linux/ubuntu/
	else
	dockerkey=https://download.docker.com/linux/ubuntu/gpg
	dockerrepo=https://download.docker.com/linux/ubuntu
	fi
	curl -L -o /etc/apt/trusted.gpg.d/docker.asc ${dockerkey}
	add-apt-repository "deb [arch=amd64] ${dockerrepo} $(lsb_release -cs) stable"
	apt-get update
	apt-get install -y docker-ce docker-ce-cli containerd.io
	systemctl enable docker

	# Allow ubuntu to use docker without sudo
	# NO! user ubuntu doesn't exist at this stage!
	#usermod -aG docker ubuntu

	# Remove stupid indentation rules for vim
	rm -f /usr/share/vim/vim80/indent.vim

	# Use vim as default editor
	update-alternatives --set editor /usr/bin/vim.basic

	# Install netdata
	curl -L -o /etc/apt/trusted.gpg.d/netdata.asc https://packagecloud.io/netdata/netdata/gpgkey
	add-apt-repository -s "deb https://packagecloud.io/netdata/netdata/ubuntu/ bionic main"
	apt-get update
	apt-get install -y netdata

	# Install fluentbit
	curl -L -o /etc/apt/trusted.gpg.d/fluentbit.asc http://packages.fluentbit.io/fluentbit.key
	add-apt-repository "deb http://packages.fluentbit.io/ubuntu/bionic bionic main"
	apt-get update
	apt-get install -y td-agent-bit

	exit 0

view raw ubuntu-bionic-extra.sh hosted with ❤ by GitHub

Why not... ?

Why not use the official AMI directly?

Back in 2016 AWS China had no marketplace, you had to build your own AMI
It allows customizing the instance, and contrary to cloud-config, the instance is ready to use right after boot

Why not create an instance, configure it and make its snapshot into an AMI?

From prior Windows sysadmin experience, I had to sysprep the master instance to reset the instance-specific SID
On Linux, it's not very clear what should be removed / cleaned up: bash history, SSH host keys, authorized_keys, but probably also some more arcane things like /etc/machine-id, /var/lib/systemd/random-seed, etc. It's much better if they have never been there in the first place.

Why not use packer?

Scratch your own itch? (:
What we did is probably similar to Packer's Amazon chroot builder: https://www.packer.io/docs/builders/amazon-chroot

References

https://github.com/alestic/alestic-git/blob/master/bin/alestic-git-build-ami for the overall approach and ec2 commands
https://github.com/kickstarter/build-ubuntu-ami/blob/master/data/user_data.sh.erb for the user script run in chroot
https://blog.tinned-software.net/mount-raw-image-of-entire-disc/ for how to mount the raw image with losetup

Speedy emails, satisfied customers

Are delayed transactional emails costing you user satisfaction? Postmark delivers your emails almost instantly, keeping your customers happy and connected.

DEV Community

Build your own Ubuntu AMI

Leverage Ubuntu Cloud Images

Machine Consumable Ubuntu Cloud Guest images Availability Data

Convert the image

Make the AMI

Customize the AMI

Complete script

Why not... ?

Why not use the official AMI directly?

Why not create an instance, configure it and make its snapshot into an AMI?

Why not use packer?

References

Speedy emails, satisfied customers

Top comments (0)

Read next

Tutorial Install Ubuntu 22.04.4 LTS Menggunakan balenaEtcher

Introduction to Amazon VPC and Its Fundamentals

Glue cross-account setup

Transform Your Cloud Migration Strategy: Transition Microsoft workloads to Linux on AWS with AI Solutions

Okay